AI Research Engineer

1 week ago


bangalore, India Krutrim Full time

Location: Bangalore, India.


Type of Job: Full-time


About Krutrim:

Krutrim is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India’s first AI unicorn and built the first foundation model from the country.


Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains.

The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages.


The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco.


Job Description:

We are looking for experienced Generative AI Engineers to train, optimize, scale, and deploy a variety of generative AI models such as large language models, voice/speech foundation models, vision and multi-modal foundation models using cutting-edge techniques and frameworks. In this role, you will conduct advanced research and development to push the boundaries of what is possible with generative AI and language models.


Responsibilities:

  • Research, architect, and deploy new generative AI methods such as autoregressive models, causal models, and diffusion models.
  • Refine foundation model infrastructure to support the deployment of optimized AI models with a focus on C/C++, CUDA, and kernel-level programming enhancements.
  • Implement state-of-the-art optimization techniques, including quantization, distillation, sparsity, streaming, and caching, for model performance enhancements.
  • Design and develop novel large language models and corresponding architectures by leveraging transformers, Mixture-of-experts, attention mechanisms (FlashAttention-2 (w/ MQA, GQA), MLA (Multi-head Latent Attention) and state-of-the-art architectures.
  • Implement large multimodal models following latest architectures - early fusion (e.g. NExT-GPT, Unified IO-2) or deep fusion (e.g. Zipper, Mirasol 3B) or similar.
  • Train or finetune speech / audio models for representation (like, W2V-BERT, SONAR, AST), generation (like, Hi-Fi GAN, VQ-GAN, AudioLDM), multilingual multitask models (like, SeamlessM4T).
  • Train or fine-tune vision models for representation (like, ViT, Q-Former, CLIP, SigLIP.), generation (like, Stable diffusion, Stable cascade), video representation (like, Video-Swin transformer).
  • Drive innovations in NLP techniques like text generation, summarization, translation, question answering, etc. enabled by generative models.
  • Integrate and tailor frameworks such as PyTorch, TensorFlow, DeepSpeed, Lightening, Habana and FSDP for the advancement of super-fast model training and inference.
  • Advance the deployment infrastructure with MLOps frameworks such as KubeFlow, MosaicML, Anyscale, and Terraform, ensuring robust development and deployment cycles
  • Publish papers at top-tier AI/ML conferences like NeurIPS, ICML, ICLR on new research contributions.
  • Collaborate with engineering teams to productionize research advancements into scalable services and products.


Qualifications:

  • Ph.D. or MS with 2+ years of research / applied research experience in LLMs, NLP, CV, Reinforcement Learning, Voice, and Generative models.
  • Demonstrated expertise in high-performance computing with proficiency in Python, C/C++, CUDA, and kernel-level programming for AI applications.
  • Extensive experience in the optimization of training and inference for large-scale AI models, including practical knowledge of quantization, distillation, and LLMOps,
  • Prior experience with large-scale distributed training and fine-tuning of foundation models such as GPT-3, LLaMA2, AlphaFold, and DALL-E.
  • Experience with language modeling evaluation, prompt tuning and engineering, instruction tuning, and/or RLHF.
  • Research contributions in NLP, generative modeling, LLMs demonstrated through publications and products.
  • Strong programming skills and proficiency in Python, TensorFlow/PyTorch, and other ML frameworks and tools.
  • Experience in Information Extraction, Question Answering, Conversational Agents (Chatbots), Data Visualization and/or text-to-image models.
  • Excellent communication and collaboration skills to work cross-functionally with various teams.

  • Research Scientist

    3 weeks ago


    bangalore, India Murf AI Full time

    At Murf AI , we're simplifying multimedia creation by harnessing the power of artificial intelligence. Our platform empowers users to craft high-quality voiceovers effortlessly, without the need for recording equipment.Some interesting facts about Murf AI:Customers in 100+ countries1Mn+ registered users6X growth in revenue in last 12 months120+ voices in...


  • bangalore, India Fractal Full time

    It's fun to work in a company where people truly BELIEVE in what they are doing! AI Research Engineer At Fractal Analytics, we are leveraging cutting-edge deep-learning and machine-learning based solutions to address various business problems for leading Fortune 500 companies. We are looking to hire Deep-learning / Machine-learning AI Research engineers to...

  • Research Scientist

    3 weeks ago


    bangalore, India Murf AI Full time

    At Murf AI, we're simplifying multimedia creation by harnessing the power of artificial intelligence. Our platform empowers users to craft high-quality voiceovers effortlessly, without the need for recording equipment. Some interesting facts about Murf AI: Customers in 100+ countries 1Mn+ registered users 6X growth in revenue in last 12 months 120+ voices...


  • Bengaluru/ Bangalore, India timesjobs Full time

    JOB DETAILS1.Prior experience in research (Academic/Industrial) in Artificial Intelligence or Mathematics.2.Comfortable in using multiple deep learning frameworks - tensorflow, pytorch, OpenCV etc.3.Strong understanding of fundaments in Deep Learning4.Passionate to learn new methodologies and implementation5.Hands on experience in at least one of the...


  • bangalore, India Quantiphi Full time

    Quantiphi is an award-winning AI-first digital engineering company, driven by a deep desire to solve transformationalproblems at the heart of businesses. Our signature approach combines groundbreaking machine-learning research withdisciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed.Quantiphi has seen 2.5x...


  • bangalore, India Quantiphi Full time

    Quantiphi is an award-winning AI-first digital engineering company, driven by a deep desire to solve transformational problems at the heart of businesses. Our signature approach combines groundbreaking machine-learning research with disciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed. Quantiphi has seen 2.5x...


  • Bangalore, India Autonomize AI Full time

    About Autonomize, Inc. : At Autonomize AI, we're on a mission to help healthcare organizations unlock the potential of dark data to significantly impact human health outcomes. We strive to make the process of deriving insights from unstructured data effortless and accessible. Join our ambitious team and help create AI solutions that make a real...


  • bangalore, India Autonomize AI Full time

    About Autonomize, Inc. : At Autonomize AI, we're on a mission to help healthcare organizations unlock the potential of dark data to significantly impact human health outcomes. We strive to make the process of deriving insights from unstructured data effortless and accessible. Join our ambitious team and help create AI solutions that make a real...


  • Bangalore, Karnataka, India Autonomize AI Full time

    About Autonomize, Inc. : At Autonomize AI, we're on a mission to help healthcare organizations unlock the potential of dark data to significantly impact human health outcomes. We strive to make the process of deriving insights from unstructured data effortless and accessible. Join our ambitious team and help create AI solutions that make a real...

  • Multimodal AI Intern

    3 weeks ago


    bangalore, India Sony Research India Full time

    Sony Research India is driving cutting-edge research and development in various locations around the globe, including laboratories in Japan, the United States, Europe, and Asia. We endeavor to create new technology, products, and services while sustaining Sony Group’s diverse businesses in electronics, entertainment, and financial fields. For our research...


  • bangalore, India Microsoft Full time

    Overview At Microsoft, we operate the largest collaboration services in the world with 100s of millions of consumer/enterprise mailboxes, documents, and conversations. We are an Applied Research team driving medium and long-term product innovations. We closely collaborate with multiple research teams and product groups across the globe who bring a...

  • AI Engineer

    2 weeks ago


    bangalore, India Lilly Full time

    We’re looking for people who are determined to make life better for people around the world. Position Overview: We are seeking a highly skilled AI Engineer to join our team in designing, developing, and researching AI systems and solutions. The ideal candidate will have a strong background in data science, data analytics, and a deep understanding of...


  • bangalore, India Accolite Full time

    **Job Title: Generative AI Engineer** **Location:** Bangalore / Hyderabad/Chennai/Gurgaon **Experience:** 8+years of overll datascience experience with hands on experience in Generative AI project is must **Notice Period:** Immediate to 30 days **Job Description:** We are looking for a skilled and innovative Generative AI Engineer to join our team. The...


  • Bengaluru/ Bangalore, India timesjobs Full time

    1.Prior experience in research (Academic/Industrial) in Artificial Intelligence or Mathematics.2.Comfortable in using multiple deep learning frameworks - tensorflow, pytorch, OpenCV etc.3.Strong understanding of fundaments in Deep Learning4.Passionate to learn new methodologies and implementation5.Hands on experience in at least one of the following: Machine...


  • bangalore, India IDC AsiaPacific Full time

    Company ProfileIDC is the premier global provider of market intelligence, research, advisory services, and events for the information technology, telecommunications, and consumer technology markets.Position OverviewIDC is seeking a highly motivated and resourceful individual in Analytics and AI research based in India, preferably Bengaluru. The primary...

  • AI Palette

    3 weeks ago


    bangalore, India AI Palette Full time

    We are Ai Palette : We think the world would be a nicer place to be if Food Companies could create products that the consumers really want. So we are making it happen. We want to be the most preferred Food AI company in the world. We're making it possible by building an AI-powered SaaS platform based on our founders' experience in the Food Industry & AI.We...

  • AI Palette

    4 weeks ago


    Bangalore, India AI Palette Full time

    We are Ai Palette : We think the world would be a nicer place to be if Food Companies could create products that the consumers really want. So we are making it happen. We want to be the most preferred Food AI company in the world. We're making it possible by building an AI-powered SaaS platform based on our founders' experience in the Food Industry...

  • AI Palette

    1 month ago


    Bangalore, Karnataka, India AI Palette Full time

    We are Ai Palette : We think the world would be a nicer place to be if Food Companies could create products that the consumers really want. So we are making it happen. We want to be the most preferred Food AI company in the world. We're making it possible by building an AI-powered SaaS platform based on our founders' experience in the Food Industry &...

  • Lead AI Engineer

    4 weeks ago


    bangalore, India JLL Full time

    Lead AI Engineer JLL Technologies COE, Bangalore About JLL and JLL Technologies JLL is a leading professional services firm that specializes in real estate and investment management. Our vision is to reimagine the world of real estate, creating rewarding opportunities and amazing spaces where people can achieve their ambitions. In doing so,...


  • bangalore, India Super AI Full time

    Mission: Develop and optimize predictive models that select the best AI models/platforms/tools for diverse use cases. Outcomes :1. Model Development: - Develop and deploy an initial predictive model within the first 2 months. - Achieve high accuracy on test datasets.2. Data Collection and Preparation: - Establish robust data pipelines within 2 months. -...