Home
/
Data and Analytics
/
Senior Software Engineer - Data Scientist (OCR and Document Training)
Senior Software Engineer - Data Scientist (OCR and Document Training)-November 2024
New York
Nov 1, 2025
ABOUT CAPGEMINI
We focus on helping drive value for our customers in three key areas: customer experience, intelligent industry, and enterprise management.
10,000+ employees
Consulting, Information Technology
VIEW COMPANY PROFILE >>
About Senior Software Engineer - Data Scientist (OCR and Document Training)

  Job description:

  We are seeking highly skilled Data Scientists, specializing in Optical Character Recognition (OCR) and Document Training. Your primary mission will be to develop OCR solutions to extract information from documents and leverage Google's Document AI to train models on the underlying unstructured data. With your extensive experience in data science and data engineering expertise, you will play a pivotal role in transforming unstructured documents into actionable insights.

  Key Responsibilities:

  a) Develop OCR Solutions: One of your primary responsibilities will be to design and develop OCR solutions capable of accurately extracting textual and structural data from various types of documents.

  b) Leverage Document AI: You will harness the power of Google's Document AI to train models on the extracted unstructured data. This process will involve structuring, categorizing, and making sense of large volumes of textual information.

  c) Experience and Expertise: With a minimum of 5 years of experience in the field of Data Science, you will bring a deep understanding of machine learning, natural language processing, and computer vision. Your expertise will be instrumental in solving complex OCR and document training challenges.

  d) Data Engineering Experience: In addition to your data science skills, you should have experience in data engineering, particularly in handling key-value pairs and structuring unstructured data for effective analysis. This skill set will be essential for preprocessing and structuring data before applying machine learning models.

  Technical Requirements:

  To excel in these roles, you should have expertise in:

  • Optical Character Recognition (OCR) technologies

  • Machine learning and deep learning

  • Natural language processing (NLP)

  • Computer vision

  • Google's Document AI or similar document analysis tools

  • Data preprocessing and data engineering

  • Programming languages like Python, SQL, and relevant libraries/frameworks

  Life at Capgemini

  Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:

  Flexible work Healthcare including dental, vision, mental health, and well-being programsFinancial well-being programs such as 401(k) and Employee Share Ownership PlanPaid time off and paid holidays Paid parental leaveFamily building benefits like adoption assistance, surrogacy, and cryopreservationSocial well-being benefits like subsidized back-up child/elder care and tutoringMentoring, coaching and learning programsEmployee Resource Groups Disaster Relief

  About Capgemini

  Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided everyday by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 360,000 team members in more than 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms. The Group reported in 2022 global revenues of €22 billion.

  Get The Future You Want | www.capgemini.com

  Disclaimer

  Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

  This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.

  Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.

  Click the following link for more information on your rights as an Applicant http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law

  Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process.

  Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by Capgemini.

  Capgemini discloses salary range information in compliance with state and local pay transparency obligations. The disclosed range represents the lowest to highest salary we, in good faith, believe we would pay for this role at the time of this posting, although we may ultimately pay more or less than the disclosed range, and the range may be modified in the future. The disclosed range takes into account the wide range of factors that are considered in making compensation decisions including, but not limited to, geographic location, relevant education, qualifications, certifications, experience, skills, seniority, performance, sales or revenue-based metrics, and business or organizational needs. At Capgemini, it is not typical for an individual to be hired at or near the top of the range for their role. The base salary range for the tagged location is $145,000 to $187,000.This role may be eligible for other compensation including variable compensation, bonus, or commission. Full time regular employees are eligible for paid time off, medical/dental/vision insurance, 401(k), and any other benefits to eligible employees.Note: No amount of pay is considered to be wages or compensation until such amount is earned, vested, and determinable. The amount and availability of any bonus, commission, or any other form of compensation that are allocable to a particular employee remains in the Company's sole discretion unless and until paid and may be modified at the Company's sole discretion, consistent with the law.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Senior Payroll Country Expert - Netherlands
About Remote Remote is solving global remote organizations' biggest challenge: employing anyone anywhere compliantly. We make it possible for businesses big and small to employ a global team by handl
Big Data Engineer
If you are an analytical problem solver with a vast knowledge of Java, we have the perfect job for you. We are seeking a Big Data Developer to join our friendly team of experts. Your mission will be
APAC Payroll Associate
JOB DESCRIPTION JOB DESCRIPTION Title: APAC Payroll Associate Location: Bangalore, India Hours worked - 9 hrs 12:30 pm to 9:30 pm Onsite The Position We are seeking a dedicated and detail-oriented Pa
Technical Architect Salesforce Service Cloud
This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice.
Customer Accounts Specialist - 12 months contract
Job Summary: The purpose of this role is to handle the management and processing of transactional activity for the CRM team, including, but not restricted to, Order Entry, Documentation creation, Com
Associate Principal Business Analyst, Transparency Services
The Associate Principal Business Analyst position is responsible for collaborating with senior business analysts on gathering, drafting, and creating requirements for developers, as well as drafting
Manager of Solutions Consulting
Meet Our Team: As a Manager of our Solutions Consulting LATAM team, you will collaborate and manage a team of Solutions Consultants while working cross functionally with Consulting & Sales to ens
Technical Communications Specialist Remote
ABOUT US At HUB International, we are a team of entrepreneurs. We believe in empowering our clients, and we do so by protecting businesses and individuals in our local communities. We help businesses
Data Scientist, Geodata
Niantic is a leading mobile gaming platform company known for creating popular augmented reality (AR) games such as Pokémon GO, Ingress, Pikmin Bloom and Monster Hunter Now. Our mission is to build i
Legal Assistant - Trade
Firm Summary White & Case is an elite global law firm serving leading companies, financial institutions and governments worldwide. Our long history as an international firm means we are perfectly
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved