NLP Data Scientist
1 day ago
About Norstella :
At Norstella, our mission is simple : to help our clients bring life-saving therapies to market quickerand help patients in need.
Founded in 2022, but with history going back to 1939, Norstella unites best-in-class brands to help clients navigate the complexities at each step of the drug development life cycle and get the right treatments to the right patients at the right time.
Each Organization (Citeline, Evaluate, MMIT, Panalgo, The Dedham Group) Delivers Must-have Answers For Critical Strategic And Commercial Decision-making.
Together, Via Our Market-leading Brands, We Help Our Clients.
Citeline : accelerate the drug development cycle.
Evaluate bring the right drugs to market.
MMIT identify barrier to patient access.
Panalgo turn data into insight faster.
The Dedham Group think strategically for specialty therapeutics.
By combining the efforts of each organization under Norstella, we can offer an even wider breadth of expertise, cutting-edge data solutions and expert advisory services alongside advanced technologies such as real-world data, machine learning and predictive analytics.
As one of the largest global pharma intelligence solution providers, Norstella has a footprint across the globe with teams of experts delivering world class solutions in the USA, UK, The Netherlands, Japan, China and India.
The Role : NLP Data Scientist, AI & Life Sciences :
We are seeking a skilled NLP Data Scientist with a focus on cutting-edge Language Models to join our AI & Life Sciences Solutions team.
Your expertise in processing and understanding natural language, paired with your experience in Electronic Health Records (EHR) and clinical data analysis, will be crucial in driving our data science initiatives.
You will be instrumental in developing rich, multimodal real-world datasets that will accelerate RWD-driven drug development within the pharmaceutical industry.
Responsibilities :
- Lead the application of advanced NLP and Large Language Models (LLMs), including state-of-the-art open-source models (e.g., Llama3, Mixtral, Gemma) and other foundational models, to extract and interpret complex, unstructured medical data from diverse sources such as EHRs, clinical notes, and laboratory reports.
- Architect and deploy innovative and scalable NLP solutions that leverage the latest in deep learning to solve complex healthcare challenges, working closely with clinical scientists and data scientists.
- Design and implement robust data pipelines for cleaning, preprocessing, and validating unstructured data, ensuring the accuracy and reliability of all extracted insights.
- Develop and optimize prompt engineering strategies for fine-tuning LLMs and enhancing their performance on specialized clinical tasks.
- Translate complex findings into clear, actionable insights for both technical and non-technical stakeholders, driving data-informed decisions across the organization.
Qualifications :
- Advanced Degree : Master's or Ph.D. in Computer Science, Data Science, Computational Linguistics, Computational Biology, Physics, or a related analytical field.
- Clinical Data Expertise : Proven experience (3 years) in handling and interpreting Electronic Health Records (EHRs) and clinical laboratory data.
- Advanced NLP & Generative AI : Deep experience (3 years) with modern NLP techniques like semantic search, knowledge graph construction, and few-shot learning.
- LLM Proficiency : Practical, hands-on experience (2 years) with fine-tuning, prompt engineering, and inference optimization for LLMs.
- Technical Stack : Expert proficiency in Python and SQL, with strong experience using Hugging Face Transformers, PyTorch, and/or TensorFlow.
- Experience in a cloud environment, specifically AWS, with large-scale data systems.
- MLOps & Workflow Automation : Familiarity with modern MLOps practices (e.g., Git) and a proven track record of developing automated, scalable workflows.
- Analytical Prowess : A strong analytical mindset with excellent problem-solving skills and a detail-oriented approach to data.
- Communication : Exceptional verbal and written communication skills with the ability to articulate complex technical findings to a diverse audience.
Preferred Qualifications :
- Healthcare Compliance : Experience managing Protected Health Information (PHI) and a working knowledge of healthcare data privacy laws such as HIPAA.
- Medical Terminologies : Familiarity with standard healthcare codes and terminologies, including ICD-10, CPT, LOINC, and SNOMED CT.
- Advanced Retrieval Systems : Practical experience with Retrieval-Augmented Generation (RAG) systems and vector databases for managing and querying large volumes of unstructured medical documents.
Location : Remote India.
-
Data Scientist
1 day ago
Multiple Locations, India Techmora Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Title : Data ScientistExperience : 6 - 9 yearsLocation : Fully RemoteEmployment Type : Full-timeNotice Period : 30 days Job Overview : We are seeking an experienced Data Scientist to join our team and help transform complex datasets into actionable insights. The ideal candidate will be responsible for designing and implementing advanced...
-
Senior Product Manager
1 day ago
Multiple Locations, India Norstella Full time ₹ 8,00,000 - ₹ 20,00,000 per yearJob Description : Norstella is seeking a skilled and dynamic professional with a background in Market Access to join our team. Are you passionate about turning emerging technologies into real-world value, fast? Do you thrive in spaces where ideas move quickly from concept to customer? We are looking for a creative innovator who believes building,...
-
Data Scientist/Architect
1 day ago
Anywhere in India/Multiple Locations Indihire Private Limited Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Description : Required Skills : - Strong expertise in Python for building, training, and deploying machine learning models. - Deep understanding of transformer architectures, large language models (LLMs), and generative AI techniques. - Experience with frameworks such as PyTorch / TensorFlow. - Knowledge of fine-tuning, prompt...
-
Senior AI/ML Architect
1 day ago
Multiple Locations, India Zorba Consulting Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Summary : We are seeking a senior AI/ML Architect to lead the design and implementation of our AI and machine learning initiatives. You will be responsible for creating the architectural vision and blueprints for scalable, secure, and performant ML systems. This is a strategic role that requires a blend of deep technical knowledge, leadership, and...
-
Agentic AI Developer
1 day ago
Multiple Locations, India VUPICO Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Title: Agentic AI Developer - Java & Spring Boot Experience : 8- 10 Years Location : Remote / Hybrid (as per requirement) Employment Type : Contract / Full-time About the Role : We are looking for a hands-on Agentic AI Developer with strong expertise in Java, Spring Boot, and AI frameworks. The role involves designing and building agentic...
-
Machine Learning Engineer
1 day ago
Anywhere in India/Multiple Locations Cyanous Software Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Description : We are seeking a highly skilled Machine Learning Engineer with expertise in building and deploying end-to-end ML solutions. The ideal candidate will have strong experience in model development, deployment, and monitoring in cloud environments (preferably Azure). You will be responsible for the full ML lifecycle, ensuring robust,...