Data Scientist

3 weeks ago


India Circuit Compilers Full time

Role : Data Scientist

Job Responsibilities :

LLM Architecture :


- Good understanding of the architecture underlying large language models, such as Transformer-based models and their variants.


- Design and implement deep learning model architectures using PyTorch.

Language Model Training and Fine-Tuning :


- Experience in training large-scale language models from scratch, as well as fine-tuning pre-trained models on domain data.

Data Preprocessing for NLP :


- Skilled in preprocessing textual data, including tokenization, stemming, lemmatization, and handling of different text encodings.

Transfer Learning and Adaptation :


- Proficiency in applying transfer learning techniques to adapt existing LLMs to new languages, domains, or specific business needs.

Data Annotation and Evaluation :


- Skills in designing and implementing data annotation strategies for training LLMs and evaluating their performance using appropriate metrics.

Scalability and Deployment :


- Experience in scaling LLMs for production environments, ensuring efficiency and robustness in deployment.

Model Training, Optimization, and Evaluation :


- Evaluate the performance of PyTorch models using appropriate metrics and techniques like cross-validation, holdout sets, or online evaluation.


- This encompasses the complete cycle of training, fine-tuning, and validating language models.


- You will design and adapt LLMs for use in virtual assistants, information retrieval, information extraction, etc.


Experimentation with Emerging Technologies and Methods :


- Actively exploring new technologies and methodologies in language model development, including experimental frameworks and software tools.

LLM Alignment :


- Understanding of alignment methods such as DPO, PPO, KPO, and RLHF, and of how to use them to build guardrails.

AI Data Retrieval :


- Retrieve data from unstructured sources and extract key-value pairs using models such as Donut, LayoutLM, and Table Transformer.

- Analyze data and perform exploratory data analysis (EDA) to identify patterns.


- Hands-on experience with, and a strong understanding of, concepts in deep learning and NLP. Proficient in TensorFlow and similar libraries.

Required Qualifications :

- 5+ years of hands-on experience developing and deploying large language models and machine learning systems, and working with PyTorch.

- A thorough understanding of machine learning, particularly deep learning techniques, including knowledge of neural network architectures, training methods, and optimization algorithms.

- Proficiency in AI technology and Python, including experience with NLP libraries (e.g., Hugging Face Transformers, NLTK, spaCy) and text classification.

- Experience with frameworks such as PyTorch or TensorFlow.

- Experience with cloud services (AWS, Azure) and ML deployment tools such as Docker.

- Familiarity with model fine-tuning and optimization techniques for LLMs.

- Proven track record of innovative solutions in the field of LLMs.

- Strong communication skills, with the ability to explain complex AI concepts to non-expert audiences.

Additional good to have qualifications :

- 4+ years' experience in data analytics, data science, or quantitative analysis, using statistical programming languages to draw insights from large data sets.

- 3+ years' experience in Python development, preferably delivering production code for data applications.

- Experience with unstructured data or computer vision models is a plus.

- Experience with SQL is a big plus.

- Extensive model implementation experience using scikit-learn.

- Experience designing and developing security-critical applications; experience with the specifics of HIPAA, PHI, PII, or GDPR is a big plus.

- Basic experience with Linux, Git, and Jupyter Notebooks is a must.

- Knowledge of Agile development practices.

- Flexibility and adaptability to respond to a rapidly changing environment.

- Experience with distributed computing techniques and job orchestration tools and platforms (e.g., Airflow) is very valuable.

(ref:hirist.tech)
  • Data Scientist

    3 weeks ago


    India Aventurine Technologies Inc Full time

    Job Description: Data Scientist. Location: Sunnyvale, CA (Hybrid). We are looking for a Data Scientist for our client: a Data Scientist with a PhD and Mandarin language experience. Must have 9+ years of experience.

  • Data Scientist

    2 months ago


    India Aventurine Technologies Inc Full time

    Job Description: Data Scientist. Location: Sunnyvale, CA (Hybrid). We are looking for a Data Scientist for our client: a Data Scientist with a PhD and Mandarin language experience. Must have 9+ years of experience.

  • Data Scientist

    2 months ago


    India Bloom Consulting Services Full time

    Data Scientist (Job ID: 000001151): NA. Experience: 12 - year. Offered Salary, Notice Period: Not Disclosed. Data Scientist. We need a professionally qualified data scientist, preferably with working experience in a bank. Keywords: Data scientist and the toolsets like SAS, KXEN, R are the words I would associate with data...

  • Data Scientist

    5 days ago


    India Bloom Consulting Services Full time

    Data Scientist (Job ID): NA. Experience: 1- year. Offered Salary, Notice Period: Not Disclosed. Data Scientist. We need a professionally qualified data scientist, preferably with working experience in a bank. Keywords: Data scientist and the toolsets like SAS, KXEN, R are the words I would associate with data scientists. Data exploration, mathematical/statistical modeling,...

  • Data Scientist

    5 days ago


    India Artefact Full time

    Data Scientist Artefact India Artefact is a new generation of data service providers specialising in data consulting and data-driven digital marketing. It is dedicated to transforming data into business impact across the entire value chain of organisations. We are proud to say we're enjoying skyrocketing growth. The backbone of our consulting missions,...

  • Data Scientist

    3 weeks ago


    India Artefact Full time

    Artefact is a new generation of data service providers specialising in data consulting and data-driven digital marketing. It is dedicated to transforming data into business impact across the entire value chain of organisations. We are proud to say we’re enjoying skyrocketing growth. The backbone of our consulting missions, today our Data consulting team...

  • Data Scientist

    2 months ago


    India Artefact Full time

    Artefact is a new generation of data service providers specialising in data consulting and data-driven digital marketing. It is dedicated to transforming data into business impact across the entire value chain of organisations. We are proud to say we’re enjoying skyrocketing growth. The backbone of our consulting missions, today our Data consulting team...

  • Data Scientist

    3 weeks ago


    India Artefact Full time

    Data Scientist Artefact India Artefact is a new generation of data service providers specialising in data consulting and data-driven digital marketing. It is dedicated to transforming data into business impact across the entire value chain of organisations. We are proud to say we’re enjoying skyrocketing growth. The backbone of our consulting missions,...

  • Data Scientists

    1 month ago


    India Xtage Labs Full time

    People at Xtage Labs are often found at the intersection of machines and humans. Conversations involve data and how our work impacts businesses, bringing data science as a competitive advantage for our clients. The primary force shaping Xtage Labs is not simply technological innovation - but also how we could use data science to improve decision making and...

  • Data Scientist

    12 hours ago


    India D2N Solutions Full time

    Company Description D2N Solutions specializes in providing talent and solutions for companies. Our team consists of enthusiastic recruitment, compliance, finance, and marketing experts who understand the challenges faced by both clients and candidates in today's employment climate. We offer all-inclusive support, focusing on their needs and requirements....

  • Data Scientist

    5 days ago


    India Dreamwave AI Full time

    Company Description Dreamwave AI is an AI research lab that provides next-gen creative tools powered by AI to augment human creativity. The company is dedicated to pushing the boundaries of Artificial Intelligence and Machine Learning in order to create new ways of thinking about creativity. Role Description This is a full-time remote role for a Data...

  • Data Scientist

    2 months ago


    India Dreamwave AI Full time

    Company Description Dreamwave AI is an AI research lab that provides next-gen creative tools powered by AI to augment human creativity. The company is dedicated to pushing the boundaries of Artificial Intelligence and Machine Learning in order to create new ways of thinking about creativity. Role Description This is a full-time remote role for a Data...

  • Data Scientist

    2 months ago


    India Dreamwave AI Full time

    Company Description Dreamwave AI is an AI research lab that provides next-gen creative tools powered by AI to augment human creativity. The company is dedicated to pushing the boundaries of Artificial Intelligence and Machine Learning in order to create new ways of thinking about creativity. Role Description This is a full-time remote role for a Data...

  • Data Scientist

    6 days ago


    India DigiMoksha Solutions Full time

    Position: Data Scientist. Company: Tech Solutions Inc. Location: Remote. Experience: 4-6 Years. Notice Period: Immediate Joiners. Job Description: Data Scientist Opportunity with Focus on Gen AI: Bachelor's degree in Computer Science, Artificial Intelligence, Data Science, or related field. Master's or Ph.D. preferred. Demonstrated ability to work with both structured...

  • Data Scientist

    5 days ago


    India Ara Resources Pvt Ltd Full time

    About The Company: Ara's Client is a leading company that specializes in assisting businesses with integrating generative AI into their operations through application development, LLM training, data enhancement, and providing on-demand talent. Job Title: Data Scientist & Analyst. The Role: We are currently looking for skilled Data Scientists & Analysts proficient...

  • Data Scientist

    4 weeks ago


    India Flexi Analyst Full time

    Company Description Flexi Analyst is a dynamic and growing company that specializes in business-quality data and content analysis. Our leadership team has extensive experience in top-tier companies, including Accenture, Amazon, Flipkart, Apple, and Inmobi. We are proud to be building the world's largest community of analysts and are committed to adding value...

  • Data Scientist

    4 weeks ago


    India Flexi Analyst Full time

    Company Description Flexi Analyst is a dynamic and growing company that specializes in business-quality data and content analysis. Our leadership team has extensive experience in top-tier companies, including Accenture, Amazon, Flipkart, Apple, and Inmobi. We are proud to be building the world's largest community of analysts and are committed to adding value...

  • Data Scientist

    2 months ago


    India Turing Full time

    Job Requirements: - Bachelor’s/Master’s degree in Engineering, Computer Science (or equivalent experience) - At least 2 years of relevant experience as a data scientist - 2+ years of data analysis experience and a desire to have a significant impact on the field of artificial intelligence - 2+ years of experience working with Python programming - Strong...

  • Data Scientist

    3 weeks ago


    India Birlasoft Full time

    About the Job – Data Scientist with 6-8 years of experience Job Title – Data Scientist Location - Pune, Bangalore, Mumbai, Chennai, Hyderabad, Noida Educational Background - UG - B.Tech /B.E in any specialization & PG. MCA/MSC in Computers. Key Responsibilities – Experience in the IT infrastructure domain, such as network, cloud, or database...

  • Data Scientist

    4 weeks ago


    India Thoucentric Full time

    Job Description: At Thoucentric, we work on various problem statements. The most popular ones are: building capabilities that address a market need, based on our ongoing research efforts; solving a specific use case for a current or potential client based on challenges on-ground; developing new systems that help be a better employer and a better partner to...