NLP Data Scientist

3 days ago


Multiple Locations, India Norstella Full time ₹ 20,00,000 - ₹ 25,00,000 per year

About Norstella :

At Norstella, our mission is simple : to help our clients bring life-saving therapies to market quickerand help patients in need.

Founded in 2022, but with history going back to 1939, Norstella unites best-in-class brands to help clients navigate the complexities at each step of the drug development life cycle and get the right treatments to the right patients at the right time.

Each Organization (Citeline, Evaluate, MMIT, Panalgo, The Dedham Group) Delivers Must-have Answers For Critical Strategic And Commercial Decision-making.

Together, Via Our Market-leading Brands, We Help Our Clients.

Citeline : accelerate the drug development cycle.

Evaluate bring the right drugs to market.

MMIT identify barrier to patient access.

Panalgo turn data into insight faster.

The Dedham Group think strategically for specialty therapeutics.

By combining the efforts of each organization under Norstella, we can offer an even wider breadth of expertise, cutting-edge data solutions and expert advisory services alongside advanced technologies such as real-world data, machine learning and predictive analytics.

As one of the largest global pharma intelligence solution providers, Norstella has a footprint across the globe with teams of experts delivering world class solutions in the USA, UK, The Netherlands, Japan, China and India.

The Role : NLP Data Scientist, AI & Life Sciences :

We are seeking a skilled NLP Data Scientist with a focus on cutting-edge Language Models to join our AI & Life Sciences Solutions team.

Your expertise in processing and understanding natural language, paired with your experience in Electronic Health Records (EHR) and clinical data analysis, will be crucial in driving our data science initiatives.

You will be instrumental in developing rich, multimodal real-world datasets that will accelerate RWD-driven drug development within the pharmaceutical industry.

Responsibilities :

- Lead the application of advanced NLP and Large Language Models (LLMs), including state-of-the-art open-source models (e.g., Llama3, Mixtral, Gemma) and other foundational models, to extract and interpret complex, unstructured medical data from diverse sources such as EHRs, clinical notes, and laboratory reports.

- Architect and deploy innovative and scalable NLP solutions that leverage the latest in deep learning to solve complex healthcare challenges, working closely with clinical scientists and data scientists.

- Design and implement robust data pipelines for cleaning, preprocessing, and validating unstructured data, ensuring the accuracy and reliability of all extracted insights.

- Develop and optimize prompt engineering strategies for fine-tuning LLMs and enhancing their performance on specialized clinical tasks.

- Translate complex findings into clear, actionable insights for both technical and non-technical stakeholders, driving data-informed decisions across the organization.

Qualifications :

- Advanced Degree : Master's or Ph.D. in Computer Science, Data Science, Computational Linguistics, Computational Biology, Physics, or a related analytical field.

- Clinical Data Expertise : Proven experience (3 years) in handling and interpreting Electronic Health Records (EHRs) and clinical laboratory data.

- Advanced NLP & Generative AI : Deep experience (3 years) with modern NLP techniques like semantic search, knowledge graph construction, and few-shot learning.

- LLM Proficiency : Practical, hands-on experience (2 years) with fine-tuning, prompt engineering, and inference optimization for LLMs.

- Technical Stack : Expert proficiency in Python and SQL, with strong experience using Hugging Face Transformers, PyTorch, and/or TensorFlow.

- Experience in a cloud environment, specifically AWS, with large-scale data systems.

- MLOps & Workflow Automation : Familiarity with modern MLOps practices (e.g., Git) and a proven track record of developing automated, scalable workflows.

- Analytical Prowess : A strong analytical mindset with excellent problem-solving skills and a detail-oriented approach to data.

- Communication : Exceptional verbal and written communication skills with the ability to articulate complex technical findings to a diverse audience.

Preferred Qualifications :

- Healthcare Compliance : Experience managing Protected Health Information (PHI) and a working knowledge of healthcare data privacy laws such as HIPAA.

- Medical Terminologies : Familiarity with standard healthcare codes and terminologies, including ICD-10, CPT, LOINC, and SNOMED CT.

- Advanced Retrieval Systems : Practical experience with Retrieval-Augmented Generation (RAG) systems and vector databases for managing and querying large volumes of unstructured medical documents.

Location : Remote India.


  • Data Scientist

    2 days ago


    Multiple Locations, India Techmora Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Job Title : Data ScientistExperience : 6 - 9 yearsLocation : Fully RemoteEmployment Type : Full-timeNotice Period : 30 days Job Overview : We are seeking an experienced Data Scientist to join our team and help transform complex datasets into actionable insights. The ideal candidate will be responsible for designing and implementing advanced...

  • Data Scientist

    2 weeks ago


    Multiple Locations, India Zorba Consulting Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Description : About the Role : We are seeking a results-driven Data Scientist to design, build, and deploy predictive models that solve complex business problems, focusing on customer behavior and personalization. This role requires a strong blend of theoretical knowledge in Machine Learning and practical experience in deploying models in a...

  • Data Scientist

    7 days ago


    Multiple Locations, India CROSSDEV TECHNOLOGIES PRIVATE LIMITED Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Description : We are seeking a talented Data Scientist with around 3 - 5 years of experience, preferably with strong expertise in Azure Databricks and Azure Database services. The ideal candidate will have hands-on experience in building and deploying data models, working with large datasets, and leveraging cloud platforms (Azure preferred; AWS is also...

  • Data Scientist

    3 days ago


    Anywhere in India/Multiple Locations Aays Analytics Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Description : About the Role : We are seeking a highly skilled and motivated Data Scientist to join our growing data consulting team. In this role, you will work closely with clients' senior stakeholders and cross-functional teams to uncover actionable insights, build predictive models, and deliver data-driven solutions that drive business value. ...

  • AI Research Scientist

    2 weeks ago


    Multiple Locations, India CodeZio Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Notice Period : 30 Days (max) About the Role : We are seeking an experienced and highly motivated AI Research Scientist to join our fully remote, innovative research and development team. This mid to senior-level role is crucial for advancing our core AI technology and translating cutting-edge academic research into robust, scalable product features....

  • Data Scientist

    1 day ago


    Multiple Locations, India TESTQ Technologies Limited Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Role Overview : We are looking for a highly skilled GenAI Data Scientist proficient in cutting-edge Generative AI frameworks and tools such as Agentic AI, LangGraph, LlamaIndex, and OpenAI. The ideal candidate will translate complex business requirements into scalable AI solutions, creating production-ready systems that leverage the latest advances in...


  • Multiple Locations, India NS Global Corporation Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    About the Role : We are seeking a GenAI Data Engineer to design, build, and optimize data pipelines for unstructured and semi-structured content, integrating advanced AI/ML capabilities. This role combines modern ETL expertise with Vector Database & GenAI integration to support intelligent document processing and semantic search applications.Key...


  • Anywhere in India/Multiple Locations TESTQ Technologies Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Opportunity : We are looking for an exceptional Senior / Lead Data Scientist with expertise in Machine Learning (ML), Deep Learning (DL), Natural Language Processing (NLP), and Generative AI (LLMs) to design and deploy AI-driven solutions at scale. This role offers the opportunity to work on cutting-edge GenAI use cases from building...


  • Multiple Locations, India Forage AI Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Description : Data Pipeline Engineer Web Services, WebCrawling, ETL, NLP(spaCy/LLM), AWS. Experience Level : 5-7 years of relevant experience in data engineering. About Forage AI : Forage AI is a pioneering AI-powered data extraction and automation company that transforms complex, unstructured web and document data into clean, structured intelligence....


  • Anywhere in India/Multiple Locations Gowin Search LLC Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Description : Role : Senior Data Scientist (Gen AI Developer). Experience : 5 to 7 Years. Location : Hyderabad. Employment Type : Full-Time. Work Mode : Hybrid (4 days in office, 1 day from home).Job Brief : We are looking for a talented AI Engineer with hands-on experience in Speech-to-Text and Text Generation technologies to tackle a...