Lead Generative AI Research Engineer
2 weeks ago
Location:
Bangalore (India)
Type of Job:
Full-time
About Krutrim:
is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's first AI unicorn and built the first foundation model from the country.
Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.
The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco.
Job Description:
We are looking for an experienced Lead Generative AI Engineer to train, optimize, scale, and deploy a variety of generative AI models such as large language models, voice/speech foundation models, vision and multi-modal foundation models using cutting-edge techniques and frameworks. In this hands-on role, you will architect and implement state of art neural architecture, robust training and inference infrastructure to efficiently take complex models with billions of parameters to production while optimizing for low latency, high throughput, and cost efficiency.
Key Responsibilities:
Architect and refine foundation model infrastructure to support the deployment of optimized AI models with a focus on
C/C++, CUDA,
and kernel-level programming enhancements.
Implement state-of-the-art optimization techniques, including quantization, distillation, sparsity, streaming, and caching, for model performance enhancements.
Spearhead the development of Vision pipelines, ensuring scalable training and inference workflows of 10s and 100s of billions of parameter foundation models.
Should be able to innovate for the state-of-the-art architectures involving Panoptic Segmentation, Image Classification and Image Generation. It is expected that the candidate experiments with the internals of Vision Transformers and convolutional Models like
ConvNext, CLIP, Visual Question Answering (VQA) and Diffusion Models.
Practice around AI Arts, Image Prompts, Conditional Image Generation will be an additional advantage.
Design, develop, and innovate state-of-the-art in large multimodal models.
Make architectural choices across dense /
Mixture-of-experts, early fusion / deep fusion, choice of modality encoders (VQ-GAN, ViT, CLIP/SigLIP),
decoders (Stable diffusion, Stable cascade, AudioLDM).
Proven track record of developing and applying novel neural network architectures such as
Mixture of Experts, Diffusion Models, and State Space Machines (MAMBA, SAMBA)
Execute training and inference processes with a key emphasis on minimizing latency and maximizing throughput, utilizing GPU clusters and custom hardware.
Innovate on current model deployment platforms, employing AWS, GCP, and GPU clusters, to enable high scalability and responsiveness.
Integrate and tailor frameworks such as
PyTorch, TensorFlow, DeepSpeed, Lightening, FSDP,
and Habana for the advancement of super-fast model training and inference.
Advance the deployment infrastructure with MLOps frameworks such as KubeFlow, MosaicML, Anyscale, Terraform, ensuring robust development and deployment cycles.
Enhance post-deployment mechanisms with exhaustive testing, real-time monitoring, and sophisticated explainability and robustness checks.
Drive continuous improvement initiatives for deployed models with automated pipelines for drift detection and performance degradation.
Lead the charge in model management, encompassing version control, reproducibility, and lineage tracking.
Cultivate a culture of high-performance computing and optimization within the AI/ML domain, propagating best practices and knowledge sharing.
Qualifications:
Ph.D. with 5+ years or MS with 8+ years of experience in ML Engineering, Data Science, or related fields.
Demonstrated expertise in high-performance computing with proficiency in
Python, C/C++, CUDA, and kernel-level programming
for AI applications.
Extensive experience in the optimization of training and inference for large-scale AI models, including practical knowledge of quantization, distillation, and Vision Pipelines.
It will be of additional benefit if the Candidate understands Diffusion Models (DDPM), Variational Autoencoders, Bayesian Modelling, Stochastic Variational Inference (SVI) and Reinforcement Learning.
Experience in building 10s and 100s of billions of parameters generative AI foundation models
AI training job scheduling, orchestration, and management via SLURM and Kubeflow.
Proven success in deploying optimized ML systems on a large scale, utilizing cloud infrastructures and GPU resources.
In-depth understanding and hands-on experience with advanced model optimization frameworks such as
DeepSpeed, FSDP, PyTorch, TensorFlow,
and corresponding MLOps tools.
Familiarity with contemporary MLOps frameworks like MosaicML, Anyscale, Terraform, and their application in production environments.
Strong grasp of state-of-the-art ML infrastructures, deployment strategies, and optimization methodologies.
An innovative problem-solver with strategic acumen and a collaborative mindset.
Exceptional communication and team collaboration skills, with an ability to lead and inspire.
-
Software/AI Research Engineer
1 week ago
Delhi, Delhi, India Scale AI Full timeJob Description : Software/AI Research EngineerCompany : Scale AIRole : AI Research Engineer (Part-Time / Full-Time) - RemoteAbout Scale AI : Scale AI is a leading data infrastructure and services company that empowers organizations to accelerate the development of AI applications. We are at the forefront of AI innovation, providing high-quality training...
-
AI Lead Engineer
1 week ago
Delhi, Delhi, India Sparsa AI Full timePLEASE ONLY APPLY FOR THIS ROLE IF:You can implement a transformer by hand in PyTorch You can implement reinforcement learning algorithms from the ground up with no/minimal use of AI frameworks You can code/tune a foundational model from the ground up You have atleast 5 years of deep foundational AI architecture and/or development experienceCompany...
-
Research and Insights Lead
2 weeks ago
Delhi, Delhi, India Bloom AI Full timeBloom AI is a modern intelligence firm that accelerates decisions through AI-driven synthesized insights. We empower enterprises to unlock the value of data with human-like synthesis and decision intelligence at scale. Our proprietary tools and solutions are trusted by investment managers, insurance, private equity, and Fortune 1000 companies for more...
-
Lead Generative AI Engineer
1 week ago
Delhi, Delhi, India REPLACI Full timeAbout the RoleWe are seeking a highly skilled Generative AI Engineer with 5 years of experience in designing andimplementing diffusion models for furniture replacement in images and achieving ultra-realistic re-rendering. If you thrive on solving challenging AI problems and are passionate about creatinginnovative solutions in the field of computer vision and...
-
AI Project Lead
18 hours ago
Delhi, Delhi, India AI Regent Full timeProject OverviewAI Regent India is a pioneering organization at the forefront of artificial intelligence (AI) innovation. We are seeking a highly skilled and motivated Project Manager to lead the planning, development, and execution of AI projects within our organization.Key Responsibilities:· Project Planning & Strategy:• Define and manage AI project...
-
Generative AI Engineer
3 weeks ago
Delhi, Delhi, India Techno Wise Full timePosition Overview : As a Generative AI Engineer, you will be responsible for designing, implementing, and optimizing generative AI models and algorithms.You will primarily work in building new b2c applications and integrating AI features into existing apps. You will work closely with cross-functional teams to integrate AI capabilities into our products. The...
-
Generative AI Engineer
2 weeks ago
Delhi, Delhi, India Techno Wise Full timePosition Overview : As a Generative AI Engineer, you will be responsible for designing, implementing, and optimizing generative AI models and algorithms.You will primarily work in building new b2c applications and integrating AI features into existing apps. You will work closely with cross-functional teams to integrate AI capabilities into our products. The...
-
Director - Generative AI
1 week ago
Delhi, Delhi, India VIDPRO CONSULTANCY SERVICES Full timeJob Description : The Director of Generative AI plays a pivotal role in leading the development and implementation of generative AI strategies, algorithms, and models for the organization.This role is critical in driving innovation, solving complex problems, and contributing to the advancement of AI technology.Key Responsibilities :- Lead the research and...
-
Generative AI Engineer Lead
3 weeks ago
Delhi, Delhi, India EduRun Full timeJob Description :- Should have 6 to 10 years of experience. - Bachelor's or Master's degree in Computer Science, Engineering, or a related field. - Proven experience in Python programming and developing AI models. - Strong expertise in LangChain and its applications in generative AI. - Solid understanding of large language models (LLMs) and their...
-
Generative AI Engineer Lead
2 weeks ago
Delhi, Delhi, India EduRun Full timeJob Description :- Should have 6 to 10 years of experience. - Bachelor's or Master's degree in Computer Science, Engineering, or a related field. - Proven experience in Python programming and developing AI models. - Strong expertise in LangChain and its applications in generative AI. - Solid understanding of large language models (LLMs) and their...
-
Generative AI Engineer
1 day ago
Delhi, Delhi, India Heliverse Technologies Full timeAt Heliverse, we don't just follow trends — we build the future of intelligent systems. We're looking for a Generative AI Engineer who thrives on solving hard problems with deep learning, and isn't afraid to push the limits of what's possible with models like GPT, VAE, and GANs.You'll work on real-world applications that demand creativity, precision, and...
-
Software Engineer, Generative AI
3 weeks ago
Delhi, Delhi, India ClearML Full timeGenerative AI Software EngineerOverview:ClearML is a unified, open source platform for continuous AI/ML, trusted by forward-thinking Data Scientists, ML Engineers, DevOps, and decision makers at leading Fortune 500, enterprises, academia, and innovative start-ups worldwide. We enable customers to achieve the fastest time to production, fastest time to value,...
-
Leading Edge AI Technologist
3 days ago
Delhi, Delhi, India Altrum AI Full timeJob Overview:AltrumAI is at the forefront of building cutting-edge Generative AI systems. As a skilled Machine Learning Engineer, you will be part of our product-focused team developing innovative solutions that empower our AltrumAI platform to understand and transform natural language in dynamic and novel ways.We are seeking an experienced professional with...
-
Generative AI Engineer
3 weeks ago
Delhi, Delhi, India Crimson Energy Experts Pvt Ltd Full timeJob Title: Generative AI Engineer (Immediate Joining)Location: New Delhi, IndiaJoining Timeline: Immediate (1-2 weeks)Salary: ₹10-18 LPAAbout Us:Crimson Energy Experts Pvt. Ltd. is a leading technology-driven organization specializing in advanced energy solutions and cutting-edge innovations. Our focus is on integrating AI and technology to revolutionize...
-
Bioinformatics AI Specialist
5 days ago
Delhi, Delhi, India PharmSight Research and Analytics Full timeAbout PharmSight Research and AnalyticsPharmSight Research and Analytics is a leading innovator in bio-pharma analytics, providing cutting-edge AI-powered solutions that transform product research, market intelligence, and healthcare decision-making. Our mission is to improve patient outcomes and drive advancements in the pharmaceutical industry through the...
-
AI Developer/Engineer
5 days ago
Delhi, Delhi, India PharmSight Research and Analytics Full timeAbout PharmSightPharmSight is a leading innovator in bio-pharma analytics, providing cutting-edge AI-powered solutions that transform product research, market intelligence, and healthcare decision-making. We are dedicated to improving patient outcomes and driving advancements in the pharmaceutical industry through the application of advanced artificial...
-
AI Alignment Research Engineer
2 weeks ago
Delhi, Delhi, India Krutrim Full timePrincipal Research Scientist, AI Alignment (Reinforcement Learning, Red Teaming, Explainability)Location:Bangalore (India)About Us:is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India's...
-
Full Stack Engineer
3 weeks ago
Delhi, Delhi, India Lead Masters AI Full timeLeadMasters AI is an ai-driven lead generation and ad optimization platform designed to help businesses automate their marketing campaigns. We are looking for a talented mern stack engineer to join our team and contribute to the development of intelligent, scalable, and high-performance applications.ResponsibilitiesDevelop, test, and maintain full-stack...
-
Content Generation Strategist
5 days ago
Delhi, Delhi, India INDEEP AI Full timeAbout the RoleWe're seeking a highly skilled Content Generation Strategist to join our innovative team at Indeep AI, the creators of cutting-edge AI-powered writing mentors.In this pivotal role, you'll report directly to the Head of Operations and play a crucial part in building advanced content-generation capabilities.Key Responsibilities:Create structured...
-
Generative ai engineer
1 day ago
Delhi, Delhi, India Claw Legaltech Full timeOur Company Claw Legal Tech is at the forefront of AI-driven legal solutions, transforming the way legal professionals draft, analyze, and automate legal documents. We are looking for a Generative AI Engineer Intern who is passionate about working with large language models (LLMs), Retrieval-Augmented Generation (RAG) models, fine-tuning AI models,...