Senior Distributed Training Research Engineer

2 weeks ago


Delhi, Delhi, India Krutrim Full time
Senior Distributed Training Research Engineer (Frontier LLMs)

Location: Bangalore (India)
Type of Job: Full-time

About Krutrim:
Krutrim is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's first AI unicorn and built the country's first foundation model.
Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.
The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco.

Job Description:
We are seeking an experienced Senior Generative AI Model Research Engineer to efficiently train frontier and foundation multimodal large language models. In this critical role, you will be responsible for scalable training methodologies to develop a variety of generative AI models, such as large language models, voice/speech foundation models, and vision and multimodal foundation models, using cutting-edge techniques and frameworks. In this hands-on role, you will implement and optimize state-of-the-art neural architectures and robust training and inference infrastructure to efficiently take complex models with hundreds of billions to trillions of parameters to production while optimizing for low latency, high throughput, and cost efficiency.

Key Responsibilities:
Architect Distributed Training Systems: Design and implement highly scalable distributed training pipelines for LLMs and frontier models, leveraging model parallelism (tensor, pipeline, expert) and data parallelism techniques.
Optimize Performance: Utilize deep knowledge of CUDA, C++, and low-level optimizations to enhance model training speed and efficiency across diverse hardware configurations.
Implement Novel Techniques: Research and apply cutting-edge techniques such as Flash Attention to accelerate model training and reduce computational costs.
Framework Expertise: Demonstrate proficiency in deep learning frameworks such as PyTorch, TensorFlow, and JAX, and tailor them for distributed training scenarios.
Scale to Hundreds of Billions of Parameters: Work with massive models, ensuring stable and efficient training across distributed resources.
Evaluate Scaling Laws: Design and conduct experiments to analyze the impact of model size, data, and computational resources on model performance.
Collaborate: Partner closely with research scientists and engineers to integrate research findings into production-ready training systems.

Qualifications:
Advanced Degree: Ph.D. or Master's degree in Computer Science, Machine Learning, or a related field.
Proven Experience: 5+ years of experience in distributed training of large-scale deep learning models, preferably LLMs or similar models.
Deep Learning Expertise: Strong theoretical and practical understanding of deep learning algorithms, architectures, and optimization techniques.
Parallelism Mastery: Extensive experience with various model and data parallelism techniques, including tensor parallelism, pipeline parallelism, and expert parallelism.
Framework Proficiency: Expert-level knowledge of PyTorch, TensorFlow, or JAX, with a demonstrated ability to extend and customize these frameworks.
Performance Optimization: Proven track record of optimizing deep learning models for speed and efficiency using CUDA, C++, and other performance-enhancing tools.
Research Acumen: Familiarity with current research trends in large model training and the ability to apply new techniques to real-world problems.
Join Krutrim to shape the future of AI and make a significant impact on hundreds of millions of lives across India and the world. If you're passionate about pushing the boundaries of AI and want to work with a team at the forefront of innovation, we want to hear from you.