Home
/
Data and Analytics
/
Senior Software Engineer - Data Scientist (OCR and Document Training)
Senior Software Engineer - Data Scientist (OCR and Document Training)-March 2024
New York
Mar 29, 2026
ABOUT CAPGEMINI
We focus on helping drive value for our customers in three key areas: customer experience, intelligent industry, and enterprise management.
10,000+ employees
Consulting, Information Technology
VIEW COMPANY PROFILE >>
About Senior Software Engineer - Data Scientist (OCR and Document Training)

  Job description:

  We are seeking highly skilled Data Scientists, specializing in Optical Character Recognition (OCR) and Document Training. Your primary mission will be to develop OCR solutions to extract information from documents and leverage Google's Document AI to train models on the underlying unstructured data. With your extensive experience in data science and data engineering expertise, you will play a pivotal role in transforming unstructured documents into actionable insights.

  Key Responsibilities:

  a) Develop OCR Solutions: One of your primary responsibilities will be to design and develop OCR solutions capable of accurately extracting textual and structural data from various types of documents.

  b) Leverage Document AI: You will harness the power of Google's Document AI to train models on the extracted unstructured data. This process will involve structuring, categorizing, and making sense of large volumes of textual information.

  c) Experience and Expertise: With a minimum of 5 years of experience in the field of Data Science, you will bring a deep understanding of machine learning, natural language processing, and computer vision. Your expertise will be instrumental in solving complex OCR and document training challenges.

  d) Data Engineering Experience: In addition to your data science skills, you should have experience in data engineering, particularly in handling key-value pairs and structuring unstructured data for effective analysis. This skill set will be essential for preprocessing and structuring data before applying machine learning models.

  Technical Requirements:

  To excel in these roles, you should have expertise in:

  • Optical Character Recognition (OCR) technologies

  • Machine learning and deep learning

  • Natural language processing (NLP)

  • Computer vision

  • Google's Document AI or similar document analysis tools

  • Data preprocessing and data engineering

  • Programming languages like Python, SQL, and relevant libraries/frameworks

  Life at Capgemini

  Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:

  Flexible work Healthcare including dental, vision, mental health, and well-being programsFinancial well-being programs such as 401(k) and Employee Share Ownership PlanPaid time off and paid holidays Paid parental leaveFamily building benefits like adoption assistance, surrogacy, and cryopreservationSocial well-being benefits like subsidized back-up child/elder care and tutoringMentoring, coaching and learning programsEmployee Resource Groups Disaster Relief

  About Capgemini

  Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided everyday by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organization of over 360,000 team members in more than 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms. The Group reported in 2022 global revenues of €22 billion.

  Get The Future You Want | www.capgemini.com

  Disclaimer

  Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

  This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.

  Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.

  Click the following link for more information on your rights as an Applicant http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law

  Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process.

  Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by Capgemini.

  Capgemini discloses salary range information in compliance with state and local pay transparency obligations. The disclosed range represents the lowest to highest salary we, in good faith, believe we would pay for this role at the time of this posting, although we may ultimately pay more or less than the disclosed range, and the range may be modified in the future. The disclosed range takes into account the wide range of factors that are considered in making compensation decisions including, but not limited to, geographic location, relevant education, qualifications, certifications, experience, skills, seniority, performance, sales or revenue-based metrics, and business or organizational needs. At Capgemini, it is not typical for an individual to be hired at or near the top of the range for their role. The base salary range for the tagged location is $145,000 to $187,000.This role may be eligible for other compensation including variable compensation, bonus, or commission. Full time regular employees are eligible for paid time off, medical/dental/vision insurance, 401(k), and any other benefits to eligible employees.Note: No amount of pay is considered to be wages or compensation until such amount is earned, vested, and determinable. The amount and availability of any bonus, commission, or any other form of compensation that are allocable to a particular employee remains in the Company's sole discretion unless and until paid and may be modified at the Company's sole discretion, consistent with the law.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
ML Engineer
Who We Are Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business st
SCM bar turning manager F/M
Eaton Souriau recherche un(e) : Responsable Supply Chain Manager Decolletage H/F Lieu : Cluses, Haute-Savoie (74), France Rejoignez Eaton et aidez-nous à développer des solutions innovantes en gestio
Data Engineer, Client Ops - X Delivery
Who We Are Boston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business st
Acheteur frais généraux
Désirez-vous travailler au sein d'une entreprise globale dans laquelle nous donnons de l'importance à l'éthique, l'inclusion, la diversité et nos employés ? Rejoignez Eaton et aidez-nous à développer
Data Architect Manager
Overview PepsiCo operates in an environment undergoing immense and rapid change. Big-data and digital technologies are driving business transformation that is unlocking new capabilities and business
Principal Engineer - Data Engineering
At Wells Fargo, we are looking for talented people who will put our customers at the center of everything we do. We are seeking candidates who embrace diversity, equity and inclusion in a workplace w
Senior Manager, Security Compliance & Audit
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Finance Job Details About Salesforce W
Generative AI Analyst, Trust and Safety
Minimum qualifications: Bachelor's degree in Computer Science, Engineering, Statistics, Mathematics, or related discipline, or equivalent practical experience 4 years of experience working with analy
Architect,FOBO Data Analyst
Overview The main objective for the FOBO Sr. Data Analyst is to support the IT Product Owner and Business Product Owner to help advance the data discovery efforts for new implementation and improveme
Vendor Operations Manager, Trust and Safety
Minimum qualifications: Bachelor's degree or equivalent practical experience. Experience with process management systems such as Lean and Six Sigma. Experience with SQL, other reporting tools, and te
Copyright 2023-2026 - www.zdrecruit.com All Rights Reserved