Senior Distributed Training Research Engineer

4 weeks ago


india Krutrim Full time
Location:

Bangalore (India), Singapore and Palo Alto (CA, US)

About Krutrim:Krutrim

is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India’s first AI unicorn and built the first foundation model from the country.Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco.

Job Description:We are seeking an experienced Senior Generative AI Model Research Engineer to efficiently train frontier and foundation multimodal large language models. In this critical role, you will be responsible for scalable training methodologies to develop a variety of generative AI models such as large language models, voice/speech foundation models, vision and multi-modal foundation models using cutting-edge techniques and frameworks. In this hands-on role, you will optimize and implement state of art neural architecture, robust training and inference infrastructure to efficiently take complex models with hundreds of billions and trillions of parameters to production while optimizing for low latency, high throughput, and cost efficiency.

Key Responsibilities:Architect Distributed Training Systems: Design and implement highly scalable distributed training pipelines for LLMs and frontier models, leveraging model parallelism (tensor, pipeline, expert) and data parallelism techniques.Optimize Performance: Utilize deep knowledge of CUDA, C++, and low-level optimizations to enhance model training speed and efficiency across diverse hardware configurations.Implement Novel Techniques: Research and apply cutting-edge parallelism techniques like Flash Attention to accelerate model training and reduce computational costs.Framework Expertise: Demonstrate proficiency in deep learning frameworks such as PyTorch, TensorFlow, and JAX, and tailor them for distributed training scenarios.Scale to Hundreds of Billions of Parameters: Work with massive models, ensuring stable and efficient training across distributed resources.Evaluate Scaling Laws: Design and conduct experiments to analyze the impact of model size, data, and computational resources on model performance.Collaborate: Partner closely with research scientists and engineers to integrate research findings into production-ready training systems.Qualifications:Advanced Degree: Ph.D. or Master's degree in Computer Science, Machine Learning, or a related field.Proven Experience: 5+ years of experience in distributed training of large-scale deep learning models, preferably LLMs or similar models.Deep Learning Expertise: Strong theoretical and practical understanding of deep learning algorithms, architectures, and optimization techniques.Parallelism Mastery: Extensive experience with various model and data parallelism techniques, including tensor parallelism, pipeline parallelism, and expert parallelism.Framework Proficiency: Expert-level knowledge of PyTorch, TensorFlow, or JAX, with a demonstrated ability to extend and customize these frameworks.Performance Optimization: Proven track record of optimizing deep learning models for speed and efficiency using CUDA, C++, and other performance-enhancing tools.Research Acumen: Familiarity with current research trends in large model training and the ability to apply new techniques to real-world problems.Join Krutrim to shape the future of AI and make a significant impact on 100s of millions of lives across India and the world. If you're passionate about pushing the boundaries of AI and want to work with a team at the forefront of innovation, we want to hear from you



  • India Microsoft Full time

    Role OverviewThe Azure Storage team is responsible for designing and building a distributed storage system that serves billions of users worldwide. As a senior engineer on this team, you will play a key role in shaping the future of Azure Storage by driving innovation and scalability. Your primary focus will be on developing and optimizing software...


  • India Tower Research Capital Full time

    Tower Research Capital is seeking a highly skilled Research Platform Engineer Leader to drive the design and implementation of our next generation research platform. As a key member of our team, you will work closely with our researchers and HPC infrastructure team to develop innovative solutions that meet the needs of our global trading operations.Key...


  • India Tower Research Capital Full time

    At Tower Research Capital, we are committed to developing innovative solutions that drive our global trading operations forward. We are seeking a highly skilled Distributed Systems Specialist to join our research platform development team and contribute to the design and implementation of our next generation research platform.Main ObjectivesDesign and...


  • India EdgeVerve Full time

    About the Role: We are seeking an exceptional Senior AI Researcher to join our Applied AI research team. As a key member of the team, you will work on designing, developing, and training state-of-the-art Models that drive cutting-edge multi-modal understanding and generation capabilities.Key Responsibilities:- Design and develop transformer-based models for...


  • india Hansa Research Group Full time

    Responsibilities-Assist senior staff in delivering quality services to clients and ensure the services provided to clients are timely and precise according to client business needs and specifications and at the same time meeting the company's quality standardsTo support research and client service operationsSupport research with the spade work and critical...

  • Senior AI Engineer

    4 weeks ago


    Gurugram, India Siemens Full time

    Job Description Can we energize society and fight climate change at the same time At Siemens Energy, we can. Our technology is key, but our people make the difference. Brilliant minds innovate. They connect, create, and keep us on track towards changing the world's energy systems. Their spirit fuels our mission. Our culture is defined by caring, agile,...


  • India Amazon Music Full time

    **Why Work at Amazon Music?**We're a dynamic and innovative company that's passionate about delivering exceptional customer experiences. Our teams are collaborative, fast-paced, and always looking for ways to improve. As a Distributed Applications Engineer, you'll have the opportunity to work on cutting-edge projects, develop new skills, and make a real...


  • india Quantiphi Full time

    About the CompanyQuantiphi is an award-winning AI-first digital engineering company, driven by a deep desire tosolve transformational problems at the heart of businesses. Our signature approach combinesgroundbreaking machine-learning research with disciplined cloud and data-engineeringpractices to create breakthrough impact at unprecedented speed.Quantiphi...


  • India ValueAdd Research & Analytics Solutions Full time

    Company Description:- ValueAdd Research and Analytics Solutions (ValueAdd) is a growing research and analytics solutions provider. - We offer capital markets research, and business strategy and consulting research solutions to global clients including buy-side and sell-side firms, banks and financial institutions, corporations, consulting firms, and private...

  • AI Researcher

    2 days ago


    India EdgeVerve Full time

    Location : India (Multiple Locations) Job Type : Full-time Infosys Power Programmers are a select group of highly skilled software engineers within Infosys who are passionate about technology and problem-solving. They are known for their expertise in various programming languages, data structures, and algorithms, and their ability to tackle complex technical...


  • India Pylon Management Consulting Full time

    Pylon Management Consulting seeks a seasoned Senior LLM Engineer to drive innovation and deliver impactful solutions in the field of artificial intelligence. The ideal candidate will possess a deep understanding of large language models, natural language processing, and machine learning, with a proven track record of developing and deploying advanced...


  • India IX7 Research Full time

    Job Title: Senior Backend Software EngineerAbout IX7 ResearchWe are a leading fintech firm seeking highly skilled Backend Developers to contribute to building high-performance backend systems.Key ResponsibilitiesDesign and implement scalable backend infrastructure with a focus on performance and efficiency using system design principles.Develop and manage...

  • m360 Research

    1 day ago


    India M-PANELS RESEARCH SERVICES PRIVATE LIMITED Full time

    Position Title : Senior Data Engineer (with DBA Expertise) Department : Software Development M3GR Business Unit Mission : M3 Global Research, an M3 company, is seeking a Senior Data Engineer with DBA expertise to join our data engineering team. This role will focus on building and maintaining robust data pipelines while also managing database administration...


  • India ValueAdd Research & Analytics Solutions Full time

    COMPANY DESCRIPTION:ValueAdd Research and Analytics Solutions (ValueAdd) is a growing research and analytics solutions provider. We offer capital markets research, and business strategy and consulting research solutions to global clients including buy-side and sell-side firms, banks and financial institutions, corporations, consulting firms, and private...


  • India Vantage Point Consulting Inc. Full time

    Job DescriptionJob Title: Distributed Computing EngineerLocation: PAN IndiaType: Full-TimeJob Description:- Candidate should have strong knowledge and experience working on Distributed Computing systems- Hands-on Kubernetes experience for deployment of resources in Distributed Architecture in AWS cloud- Knowledge on Streaming tools like Kafka on Distributed...


  • India Anveta Full time

    Anveta is a leading technology company that provides innovative solutions to its clients. We are currently seeking a highly skilled Distributed Systems Engineer to join our team.The ideal candidate will have a strong background in distributed systems and the ability to design and implement scalable, secure, and efficient systems. The Distributed Systems...


  • Bengaluru, India CliniLaunch Research Institute Full time

    Job Description Company Description CliniLaunch Research Institute is an advanced clinical research institute and professional training center located in Bengaluru. The institute aims to bridge the gap between aspiring professionals and the industry in the fields of Pharmacy, Lifesciences, Medicine, and Paramedical. Clini Launch Research Institute (CLRI)...

  • AI Researcher

    3 weeks ago


    India EdgeVerve Full time

    Location : India (Multiple Locations)Job Type : Full-timeInfosys Power Programmers are a select group of highly skilled software engineers within Infosys who are passionate about technology and problem-solving. They are known for their expertise in various programming languages, data structures, and algorithms, and their ability to tackle complex technical...

  • Events Coordinator

    2 weeks ago


    India Springer Nature Full time

    **Job Title: Events Coordinator - Researcher Training Solutions** **Location**: New Delhi/Pune, India (Hybrid Working)** **Full Time, Permanent Role** **Job Title: Events Coordinator - Researcher Training Solutions** **Location**: New Delhi/Pune, India (Hybrid Working)** **Full Time, Permanent Role** We are looking for an Events Coordinator to support...


  • India Sony Research India Full time

    Company OverviewSony Research India is a leading research and development center that drives innovation in cutting-edge technologies, including artificial intelligence and data analytics. We aim to foster a diverse pool of research and engineering talent to create a technology talent bank and drive research excellence worldwide.Our team at Sony Research...