Senior Data Scientist
2 days ago
Manager
Job Summary:
     As a Senior Data Scientist specializing in NLP, Generative AI, and Cloud technologies, you will be responsible for driving the development of data extraction solutions from documents at scale. This role requires advanced technical expertise in machine learning, NLP, and cloud computing, with a focus on automating document understanding processes and enhancing the quality of data extraction through state-of-the-art techniques.
You will lead the design, implementation, and deployment of scalable NLP and AI models, mentor junior data scientists, and work collaboratively with cross-functional teams to deliver innovative solutions. This is a strategic role that requires both deep technical knowledge and leadership capabilities to shape the future of document data extraction within the organization.
Key Responsibilities:
- Lead Data Extraction Solutions: Design, implement, and scale advanced NLP and machine learning models for automating the extraction of structured data from a wide range of unstructured documents (e.g., PDFs, scanned images, contracts, reports, etc.).
- Generative AI Expertise: Leverage Generative AI models (such as GPT, BERT, and related architectures) for tasks such as document summarization, content generation, and enhancing extracted data.
- Cloud-Based Deployment: Architect and deploy data extraction models and workflows in cloud environments (AWS, Azure, GCP), ensuring scalability, reliability, and cost-efficiency.
- Model Development & Optimization: Develop and fine-tune machine learning and NLP models, ensuring high performance in accuracy, efficiency, and robustness for real-world data extraction tasks.
- Data Pipeline Design: Build and optimize end-to-end data pipelines, including data preprocessing, feature engineering, and model deployment, to process large-scale document datasets in the cloud.
- Cross-Functional Collaboration: Work closely with product, engineering, and business teams to understand requirements, provide technical solutions, and deliver impactful data-driven results.
- Research & Innovation: Stay up-to-date with the latest advancements in NLP, machine learning, and AI, applying cutting-edge research to improve data extraction methodologies.
- Mentorship & Leadership: Lead and mentor a team of junior data scientists, providing guidance on best practices, model development, and cloud deployment.
- Model Monitoring & Maintenance: Establish systems for monitoring model performance in production and ensure models are maintained and updated based on new data or changing requirements.
- Compliance & Security: Ensure data processing and extraction workflows adhere to industry standards, data privacy regulations, and security protocols, particularly when working with sensitive information.
Required Skills & Qualifications:
- Experience: Minimum 6-8 years of experience as a Data Scientist or similar role, with a focus on NLP, machine learning, and AI. At least 3 years in a senior or lead capacity.
- NLP & Document Processing Expertise: Proven experience applying NLP techniques such as Named Entity Recognition (NER), Optical Character Recognition (OCR), information extraction, document classification, and semantic analysis for data extraction from unstructured text.
- Generative AI: Advanced knowledge of Generative AI models (e.g., GPT-3, BERT, T5) and experience applying them to real-world document and text processing tasks.
- Cloud Technologies: Extensive experience with cloud platforms (AWS, Azure, or GCP) for deploying data pipelines, managing machine learning models, and processing large datasets.
- Programming Skills: Proficiency in Python and libraries such as SpaCy, Hugging Face Transformers, TensorFlow, PyTorch, and scikit-learn.
- Data Pipeline & DevOps Tools: Hands-on experience with building, optimizing, and deploying data pipelines in cloud environments, including tools like Docker, Kubernetes, Apache Airflow, and MLFlow.
- Data Handling & Analysis: Expertise in data manipulation and analysis using tools such as Pandas, NumPy, and SQL, and ability to work with large datasets.
- Leadership & Communication: Strong leadership and mentoring abilities, with excellent written and verbal communication skills to explain complex technical concepts to non-technical stakeholders.
- Problem Solving: Exceptional problem-solving skills with a creative approach to tackling challenges related to document data extraction.
- Collaboration: Experience working in a collaborative, cross-functional team environment to deliver end-to-end solutions.
Preferred Qualifications:
- Advanced Degree: Master's or PhD in Computer Science, Data Science, Artificial Intelligence, or a related field.
- Advanced NLP Techniques: Experience with state-of-the-art NLP methods such as transfer learning, attention mechanisms, and reinforcement learning applied to document data extraction.
- Compliance Experience: Familiarity with legal, financial, or healthcare industry regulations regarding data privacy and document processing.
- Industry Experience: Previous experience in industries such as finance, legal, healthcare, or other sectors that heavily rely on document data extraction.
- 
					  Senior Data Scientist1 week ago 
 Noida, Uttar Pradesh, India Infoorigin Inc Full time ₹ 8,00,000 - ₹ 12,00,000 per yearNEW OPPORTUNITY || IMMEDIATE TO 45 DAYS JOINERS REQUIRED ||Position Title:- Data ScientistExperience Required:- 5+ YearsLocation:- Noida, UP (Hybrid)About the RoleWe're seeking a seasoned Senior Data Scientist to lead NLP/ML research efforts and architect end-to-end solutions. You will be instrumental in driving innovation, mentoring junior team members, and... 
- 
					  Data Scientist2 weeks ago 
 Noida, Uttar Pradesh, India DevloIT Full time ₹ 20,00,000 - ₹ 25,00,000 per year*Need Financial Domain ExpPosition: Data ScientistLocation: NoidaWork Mode: Onsite SetupAs an Applied/ Data scientist, you will contribute to the design, development, and improvement of AI/ML models and systems. You will be expected to:Work with applied scientists, data scientist, software engineers and product partners to design and deliver AI/ML solutions... 
- 
					Data Scientist2 weeks ago 
 Noida, Uttar Pradesh, India Rezo Full time ₹ 20,00,000 - ₹ 25,00,000 per yearPosition:Senior Data ScientistJob DescriptionKey Responsibilities:Design, develop, and deploy speech-to-text and text-to-speech models for various applications.Work with large language models (LLMs) and implement them in real-world projects.Analyze and interpret contact center calling data to generate meaningful business insights.Apply advanced statistical... 
- 
					  Data Scientist2 hours ago 
 Noida, Uttar Pradesh, India HCL Technologies Full time ₹ 10,00,000 - ₹ 25,00,000 per yearSpecialist Data Analyst Skill (Primary) Technical Skills (ERS)-Emerging Technologies-Machine Learning Location Noida Job Family Development Job Description (Posting). About HCLTech HCLTech is a global technology company, spread across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and... 
- 
					Senior Data Scientist1 week ago 
 Noida, Uttar Pradesh, India Clarivate Full time ₹ 15,00,000 - ₹ 20,00,000 per yearWe are looking for a Senior Data Scientist to join our Technology team at Clarivate. You will get the opportunity to work on interesting IP data and interesting challengesto create insights and drive business acumen to add value to our world class products and servicesAbout You experience, education, skills, and accomplishmentsAdvanced degree in Computer... 
- 
					Senior Data Scientist7 days ago 
 Noida, Uttar Pradesh, India Clarivate Full time ₹ 8,00,000 - ₹ 24,00,000 per yearWe are looking for a 'Senior Data Scientist' to join our Technology team at Clarivate. You will get the opportunity to work on interesting IP data and interesting challenges to create insights and drive business acumen to add value to our world class products and servicesAbout You – experience, education, skills, and accomplishmentsAdvanced degree in... 
- 
					Senior Data Scientist1 week ago 
 Noida, Uttar Pradesh, India ShyftLabs Full time US$ 1,25,000 - US$ 1,75,000 per yearPosition OverviewHere at ShyftLabs, we are searching for an experienced Data Scientist who can derive performance improvement and cost efficiency in our product through a deep understanding of the ML and infra system, and provide a data driven insight and scientific solutionShyftLabs is a growing data product company that was founded in early 2020 and works... 
- 
					  Data Scientist2 weeks ago 
 Noida, Uttar Pradesh, India LanceSoft Inc Full time ₹ 15,00,000 - ₹ 35,00,000 per yearData ScientistLocation: NoidaNeed Immediate JoinersJD 1 – Data ScientistStatistical Knowledge, Good communication skillsCoding Skills in Python, SQLKnowledge on GenAiPowerBI Tableu,Good ComminicationHands on testing, coding. APIKey responsibilities:Analyze large and complex data sets to identify patterns, trends, and actionable insights using AI... 
- 
					Senior Data Scientist1 week ago 
 Greater Noida, Uttar Pradesh, India Clarivate Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Description We are looking for a Senior Data Scientist to join our Technology team at Clarivate. You will get the opportunity to work on interesting IP data and interesting challenges to create insights and drive business acumen to add value to our world-class products and services.About You experience, education, skills, and accomplishmentsAdvanced... 
- 
					Sr Data Scientist5 days ago 
 Noida, Uttar Pradesh, India Birlasoft Full time ₹ 20,00,000 - ₹ 25,00,000 per yearCountry/Region: INRequisition ID: 30111Work Model:Position Type:Salary Range:Location: INDIA - NOIDA- BIRLASOFT OFFICETitle: Sr Data ScientistDescription:Area(s) of responsibilityData Scientist experience (8 to 10 years)5 years of relevant work experience as a data scientistExperience designing and building statistical forecasting models.Experience in...