Senior Distributed Training Research Engineer
1 month ago
Location: Bangalore (India), Singapore and Palo Alto (CA, US)
About Krutrim:
Krutrim is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's first AI unicorn and built the first foundation model from the country.
Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.
The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco.
Job Description:
We are seeking an experienced Senior Generative AI Model Research Engineer to efficiently train frontier and foundation multimodal large language models. In this critical role, you will be responsible for developing scalable training methodologies for a variety of generative AI models, such as large language models, voice/speech foundation models, and vision and multimodal foundation models, using cutting-edge techniques and frameworks. In this hands-on role, you will implement and optimize state-of-the-art neural architectures and robust training and inference infrastructure to efficiently take complex models with hundreds of billions to trillions of parameters to production while optimizing for low latency, high throughput, and cost efficiency.
Key Responsibilities:
- Architect Distributed Training Systems: Design and implement highly scalable distributed training pipelines for LLMs and frontier models, leveraging model parallelism (tensor, pipeline, expert) and data parallelism techniques.
- Optimize Performance: Utilize deep knowledge of CUDA, C++, and low-level optimizations to enhance model training speed and efficiency across diverse hardware configurations.
- Implement Novel Techniques: Research and apply cutting-edge techniques such as Flash Attention to accelerate model training and reduce computational costs.
- Framework Expertise: Demonstrate proficiency in deep learning frameworks such as PyTorch, TensorFlow, and JAX, and tailor them for distributed training scenarios.
- Scale to Hundreds of Billions of Parameters: Work with massive models, ensuring stable and efficient training across distributed resources.
- Evaluate Scaling Laws: Design and conduct experiments to analyze the impact of model size, data, and computational resources on model performance.
- Collaborate: Partner closely with research scientists and engineers to integrate research findings into production-ready training systems.
Qualifications:
- Advanced Degree: Ph.D. or Master's degree in Computer Science, Machine Learning, or a related field.
- Proven Experience: 5+ years of experience in distributed training of large-scale deep learning models, preferably LLMs or similar models.
- Deep Learning Expertise: Strong theoretical and practical understanding of deep learning algorithms, architectures, and optimization techniques.
- Parallelism Mastery: Extensive experience with various model and data parallelism techniques, including tensor parallelism, pipeline parallelism, and expert parallelism.
- Framework Proficiency: Expert-level knowledge of PyTorch, TensorFlow, or JAX, with a demonstrated ability to extend and customize these frameworks.
- Performance Optimization: Proven track record of optimizing deep learning models for speed and efficiency using CUDA, C++, and other performance-enhancing tools.
- Research Acumen: Familiarity with current research trends in large model training and the ability to apply new techniques to real-world problems.
Join Krutrim to shape the future of AI and make a significant impact on hundreds of millions of lives across India and the world. If you're passionate about pushing the boundaries of AI and want to work with a team at the forefront of innovation, we want to hear from you.