LLM Ops Engineer

16 hours ago


Hyderabad, Telangana, India Apple Full time ₹ 12,00,000 - ₹ 36,00,000 per year

We work on Apple scale opportunities and challenges. We are engineers at heart. We like solving technical problems. We believe a good engineer has the curiosity to dig into inner workings of technology and is always experimenting, reading and in constant learning mode. If you are a software engineer with passion to code and dig deeper into any technology, love knowing the internals, fascinated by distributed systems architecture, we want to hear from you.

Description

We are seeking a highly skilled LLM Ops and ML Ops Engineer to lead the deployment, scaling, monitoring, and optimization of large language models (LLMs) across diverse environments. This role is critical to ensuring our machine learning systems are production-ready, high-performing, and resilient. The ideal candidate will have deep expertise in Python programming / Go Programming, a comprehensive understanding of LLM internals, and hands-on experience with various inference engines and deployment strategies. The person should be capable of exhibiting deftness to balance multiple simultaneous competing priorities and deliver solutions in a timely manner. The person should be able to understand complex architectures and be comfortable working with multiple teams KEY RESPONSIBILITIES: - Design and build scalable infrastructure for fine-tuning, and deploying large language models. - Develop and optimize inference pipelines using popular frameworks and engines (e.g. TensorRT, vLLM, Triton Inference Server). - Implement observability solutions for model performance, latency, throughput, GPU/TPU utilization, and memory efficiency. - Own the end-to-end lifecycle of LLMs in production-from experimentation to continuous integration and continuous deployment (CI/CD). - Collaborate with research scientists, ML engineers, and backend teams to operationalize groundbreaking LLM architectures. - Automate and harden model deployment workflows using Python, Kubernetes, Containers and orchestration tools like Argo Workflows and GitOps. - Design reproducible model packaging, versioning, and rollback strategies for large-scale serving. - Stay current with advances in LLM inference acceleration, quantization, distillation, and model compilation techniques (e.g., GGUF, AWQ, FP8).

Minimum Qualifications

  • 5+ years of experience in LLM/ML Ops, DevOps, or infrastructure engineering with a focus on machine learning systems.
  • Advance level proficiency in Python/Go, with ability to write clean, performant, and maintainable production code.
  • Deep understanding of transformer architectures, LLM tokenization, attention mechanisms, memory management, and batching strategies.
  • Proven experience deploying and optimizing LLMs using multiple inference engines.
  • Strong background in containerization and orchestration (Kubernetes, Helm).
  • Familiarity with monitoring tools (e.g., Prometheus, Grafana), logging frameworks, and performance profiling.

Preferred Qualifications

  • Experience integrating LLMs into micro-services or edge inference platforms.
  • Experience with Ray distributed inference
  • Hands-on with quantization libraries
  • Contributions to open-source ML infrastructure or LLM optimization tools.
  • Familiarity with cloud platforms (AWS, GCP) and infrastructure-as-code (Terraform).
    Exposure to secure and compliant model deployment workflows

Submit CV


  • llm

    6 days ago


    Hyderabad, Telangana, India procallisto solutions pvt Full time ₹ 15,60,000 - ₹ 18,00,000 per year

    ob Title: Senior AI/ML Engineer – LLMsExperience: 6+ yearsLocation: [ Bangalore / HyderabadWork Mode: OnsiteRole Overview:We are seeking a highly skilled Senior AI/ML Engineer with expertise in Large Language Models (LLMs), Artificial Intelligence, and Machine Learning. The candidate will design, develop, and optimize AI-driven solutions, with a strong...

  • Senior LLM Engineer

    1 week ago


    Hyderabad, Telangana, India Avisoft Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Profile Summary :We are seeking a Senior LLM Engineer with deep expertise in transformer-based NLP models (GPT, BERT, T5, RoBERTa, etc.) and a strong command of prompt engineering, fine-tuning, and instruction-based learning. The ideal candidate will design, optimize, and deploy large language models (LLMs) for real-world applications in text generation,...

  • Senior LLM Engineer

    1 week ago


    Hyderabad, Telangana, India Avisoft Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Description :Profile Summary : We are seeking a Senior LLM Engineer with deep expertise in transformer-based NLP models (GPT, BERT, T5, RoBERTa, etc.) and a strong command of prompt engineering, fine-tuning, and instruction-based learning. The ideal candidate will design, optimize, and deploy large language models (LLMs) for real-world applications in...

  • AIOps Engineer

    2 weeks ago


    Hyderabad, Telangana, India T3 strategic Partners Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Description: AI Ops Engineer - (ML Ops & LLM Ops).Location: Hyderabad or Remote (within India).Experience: Years. IMMEDIATE JOINERS PREFERRED FROM IT SERVICES ORGANIZATION.Role Overview: We are looking for experienced AI Ops Engineers with deep expertise in MLOps, LLM deployment, and AI infrastructure management. The ideal candidate will design and...


  • Hyderabad, Telangana, India Apple Full time US$ 1,50,000 - US$ 2,00,000 per year

    At Apple, new ideas have a way of becoming extraordinary products, services and customer experiences very quickly. Bring passion and dedication to your job, and there's no telling what you could accomplish. The people here at Apple don't just craft products - they build the kind of wonder that's revolutionised entire industries. It's the diversity of those...


  • Hyderabad, Telangana, India Apple Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    At Apple, new ideas have a way of becoming extraordinary products, services and customer experiences very quickly. Bring passion and dedication to your job, and there's no telling what you could accomplish. The people here at Apple don't just craft products - they build the kind of wonder that's revolutionised entire industries. It's the diversity of those...


  • Hyderabad, Telangana, India Avirasoft Digital Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    We are looking for both lead AILLM ENg-1, Mid lever AILLM Eng-2Preferred candidJob Title: Lead AI / LLM EngineerLocation: Hybrid / Remote Department: AI & Data Science Company: Avira DigitalAbout the RoleWe are looking for an experienced Lead AI / LLM Engineer to drive the design, development, and deployment of advanced AI and GenAI solutions. This role will...

  • Senior LLM Engineer

    4 days ago


    Hyderabad, Telangana, India Nomiso Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Senior LLM EngineerWhat You Can Expect from Us:Here at NomiSo, we work hard to provide our team with the best opportunities to grow their careers.  You can expect to be a pioneer of ideas, a student of innovation, and a leader of thought. Innovation and thought leadership is at the center of everything we do, at all levels of the company. Let's make your...

  • LLM Engineer

    2 weeks ago


    Hyderabad, Telangana, India AVE-Promagne Business Solutions Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Bachelors or Masters degree in computer science / AIML / Data Science.5 to 8 years of overall experience and hands-on experience with the design and implementation of Machine Learning models, Deep Learning models, andLLM models for solving business problems.Proven experience working withGenerative AI technologies, including prompt engineering, fine-tuning...


  • Hyderabad, Telangana, India Ekshvaku Tech Innovations Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Generative AI Engineer (LLMs & RAG) Healthcare SaaSLocation: Remote / Hyderabad (preferred)Experience: 5+ yearsEmployment Type: Full-timeDomain: Generative AI, Healthcare, SaaSAbout the RoleWe are looking for a hands-on Generative AI Engineer with expertise in Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). In this role, you will...