Senior Data Scientist

7 hours ago


Noida, Uttar Pradesh, India | EXL SERVICES | Full time | ₹ 12,00,000 - ₹ 36,00,000 per year

Manager

Job Summary:
As a Senior Data Scientist specializing in NLP, Generative AI, and Cloud technologies, you will be responsible for driving the development of data extraction solutions from documents at scale. This role requires advanced technical expertise in machine learning, NLP, and cloud computing, with a focus on automating document understanding processes and enhancing the quality of data extraction through state-of-the-art techniques.

You will lead the design, implementation, and deployment of scalable NLP and AI models, mentor junior data scientists, and work collaboratively with cross-functional teams to deliver innovative solutions. This is a strategic role that requires both deep technical knowledge and leadership capabilities to shape the future of document data extraction within the organization.

Key Responsibilities:

  • Lead Data Extraction Solutions: Design, implement, and scale advanced NLP and machine learning models for automating the extraction of structured data from a wide range of unstructured documents (e.g., PDFs, scanned images, contracts, and reports).
  • Generative AI Expertise: Leverage Generative AI models (e.g., GPT, BERT, and related architectures) for tasks such as document summarization, content generation, and enhancing extracted data.
  • Cloud-Based Deployment: Architect and deploy data extraction models and workflows in cloud environments (AWS, Azure, GCP), ensuring scalability, reliability, and cost-efficiency.
  • Model Development & Optimization: Develop and fine-tune machine learning and NLP models, ensuring high performance in accuracy, efficiency, and robustness for real-world data extraction tasks.
  • Data Pipeline Design: Build and optimize end-to-end data pipelines, including data preprocessing, feature engineering, and model deployment, to process large-scale document datasets in the cloud.
  • Cross-Functional Collaboration: Work closely with product, engineering, and business teams to understand requirements, provide technical solutions, and deliver impactful data-driven results.
  • Research & Innovation: Stay up to date with the latest advancements in NLP, machine learning, and AI, applying cutting-edge research to improve data extraction methodologies.
  • Mentorship & Leadership: Lead and mentor a team of junior data scientists, providing guidance on best practices, model development, and cloud deployment.
  • Model Monitoring & Maintenance: Establish systems for monitoring model performance in production and ensure models are maintained and updated based on new data or changing requirements.
  • Compliance & Security: Ensure data processing and extraction workflows adhere to industry standards, data privacy regulations, and security protocols, particularly when working with sensitive information.

Required Skills & Qualifications:

  • Experience: A minimum of 6-8 years of experience as a Data Scientist or in a similar role, with a focus on NLP, machine learning, and AI, including at least 3 years in a senior or lead capacity.
  • NLP & Document Processing Expertise: Proven experience applying NLP techniques such as Named Entity Recognition (NER), Optical Character Recognition (OCR), information extraction, document classification, and semantic analysis for data extraction from unstructured text.
  • Generative AI: Advanced knowledge of Generative AI models (e.g., GPT-3, BERT, T5) and experience applying them to real-world document and text processing tasks.
  • Cloud Technologies: Extensive experience with cloud platforms (AWS, Azure, or GCP) for deploying data pipelines, managing machine learning models, and processing large datasets.
  • Programming Skills: Proficiency in Python and libraries such as spaCy, Hugging Face Transformers, TensorFlow, PyTorch, and scikit-learn.
  • Data Pipeline & DevOps Tools: Hands-on experience building, optimizing, and deploying data pipelines in cloud environments, including tools like Docker, Kubernetes, Apache Airflow, and MLflow.
  • Data Handling & Analysis: Expertise in data manipulation and analysis using tools such as Pandas, NumPy, and SQL, and the ability to work with large datasets.
  • Leadership & Communication: Strong leadership and mentoring abilities, with excellent written and verbal communication skills to explain complex technical concepts to non-technical stakeholders.
  • Problem Solving: Exceptional problem-solving skills with a creative approach to tackling challenges related to document data extraction.
  • Collaboration: Experience working in a collaborative, cross-functional team environment to deliver end-to-end solutions.

Preferred Qualifications:

  • Advanced Degree: Master's or PhD in Computer Science, Data Science, Artificial Intelligence, or a related field.
  • Advanced NLP Techniques: Experience with state-of-the-art NLP methods such as transfer learning, attention mechanisms, and reinforcement learning applied to document data extraction.
  • Compliance Experience: Familiarity with legal, financial, or healthcare industry regulations regarding data privacy and document processing.
  • Industry Experience: Previous experience in industries such as finance, legal, healthcare, or other sectors that heavily rely on document data extraction.

