LLM Ops Engineer

2 days ago


Hyderabad, Telangana, India Apple Full time ₹ 12,00,000 - ₹ 36,00,000 per year

We work on Apple scale opportunities and challenges. We are engineers at heart. We like solving technical problems. We believe a good engineer has the curiosity to dig into inner workings of technology and is always experimenting, reading and in constant learning mode. If you are a software engineer with passion to code and dig deeper into any technology, love knowing the internals, fascinated by distributed systems architecture, we want to hear from you.

Description

We are seeking a highly skilled LLM Ops and ML Ops Engineer to lead the deployment, scaling, monitoring, and optimization of large language models (LLMs) across diverse environments. This role is critical to ensuring our machine learning systems are production-ready, high-performing, and resilient. The ideal candidate will have deep expertise in Python programming / Go Programming, a comprehensive understanding of LLM internals, and hands-on experience with various inference engines and deployment strategies. The person should be capable of exhibiting deftness to balance multiple simultaneous competing priorities and deliver solutions in a timely manner. The person should be able to understand complex architectures and be comfortable working with multiple teams KEY RESPONSIBILITIES: - Design and build scalable infrastructure for fine-tuning, and deploying large language models. - Develop and optimize inference pipelines using popular frameworks and engines (e.g. TensorRT, vLLM, Triton Inference Server). - Implement observability solutions for model performance, latency, throughput, GPU/TPU utilization, and memory efficiency. - Own the end-to-end lifecycle of LLMs in production-from experimentation to continuous integration and continuous deployment (CI/CD). - Collaborate with research scientists, ML engineers, and backend teams to operationalize groundbreaking LLM architectures. - Automate and harden model deployment workflows using Python, Kubernetes, Containers and orchestration tools like Argo Workflows and GitOps. - Design reproducible model packaging, versioning, and rollback strategies for large-scale serving. - Stay current with advances in LLM inference acceleration, quantization, distillation, and model compilation techniques (e.g., GGUF, AWQ, FP8).

Minimum Qualifications

  • 5+ years of experience in LLM/ML Ops, DevOps, or infrastructure engineering with a focus on machine learning systems.
  • Advance level proficiency in Python/Go, with ability to write clean, performant, and maintainable production code.
  • Deep understanding of transformer architectures, LLM tokenization, attention mechanisms, memory management, and batching strategies.
  • Proven experience deploying and optimizing LLMs using multiple inference engines.
  • Strong background in containerization and orchestration (Kubernetes, Helm).
  • Familiarity with monitoring tools (e.g., Prometheus, Grafana), logging frameworks, and performance profiling.

Preferred Qualifications

  • Experience integrating LLMs into micro-services or edge inference platforms.
  • Experience with Ray distributed inference
  • Hands-on with quantization libraries
  • Contributions to open-source ML infrastructure or LLM optimization tools.
  • Familiarity with cloud platforms (AWS, GCP) and infrastructure-as-code (Terraform).
    Exposure to secure and compliant model deployment workflows

Submit CV


  • LLM Engineer

    3 weeks ago


    Hyderabad, Telangana, India Ampstek Full time

    Job Role : Sr. LLM Engineer : Gen AILocation : the Role :LLM Engineer : Gen AI. Turing is looking for people with LLM experience to join us in solving business problems for our Fortune 500 customers. You will be a key member of the Turing GenAI delivery organization and part of a GenAI project. You will be required to work with a team of other Turing...

  • LLM Engineer

    3 weeks ago


    Hyderabad, Telangana, India E Solutions Full time

    Job DescriptionRequired skills- 4+ years of professional experience in buildingMachine Learning models& systems- 1+ years of hands-on experience in howLLMswork &Generative AI (LLM)techniques particularly prompt engineering,RAG,and agents.- Experience in driving the engineering team toward a technical roadmap.- Expert proficiency in programming skills...

  • llm

    1 week ago


    Hyderabad, Telangana, India procallisto solutions pvt Full time ₹ 15,60,000 - ₹ 18,00,000 per year

    ob Title: Senior AI/ML Engineer – LLMsExperience: 6+ yearsLocation: [ Bangalore / HyderabadWork Mode: OnsiteRole Overview:We are seeking a highly skilled Senior AI/ML Engineer with expertise in Large Language Models (LLMs), Artificial Intelligence, and Machine Learning. The candidate will design, develop, and optimize AI-driven solutions, with a strong...

  • GEN AI Engineer-LLM

    2 weeks ago


    Hyderabad, Telangana, India GEMRAJ TECHNOLOGIES LIMITED Full time

    Sr. LLM Engineer: Gen AI Location-Work from Office - Hyderabad/Gurgaon - 3 days hybrid YoE-5 - 9 Years Start Date: Immediate/Full time Must Have Skills: Machine Learning models & systems Generative AI (LLM) / LLM prompt engineering Building LLM RAG Experience in using langchain or similar tool Turing is looking for people with LLM experience to join us in...

  • LLM Engineer

    2 days ago


    Hyderabad, Telangana, India Huemn Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job OverviewWe are seeking a Junior LLM Engineer to join our dynamic team at Huemn in Hyderabad. This full-time position requires 1 to 3 years of work experience. The ideal candidate will have a strong background in natural language processing and large language models, crucial for advancing our AI-powered tools. You will collaborate with cross-functional...


  • Hyderabad, Telangana, India Kiash Solutions LLP Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Exp -- 4+ YearsShift--2.00 PM to 11:30 PM ISTMandatory--Python with LLM OpsJob Description--We are looking for ahands-on AI Engineerwith strong expertise inLLM integration, platform observability, performance optimization, and API development. The ideal candidate will work on critical platform enhancements, includingLLM API integrations, observability...

  • AI / LLM Specialist

    2 days ago


    Hyderabad, Telangana, India Codehive Labs Hyderabad Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Role & responsibilitiesThe AI/LLM Specialist will focus on developing, fine-tuning, and embedding large language models into GCP-based data processing and automation pipelines. This role ensures accuracy, scalability, and cost efficiency of AI-driven automation.ResponsibilitiesTrain, fine-tune, and optimize LLMs for automation, mapping, and anomaly...

  • Senior LLM Engineer

    4 weeks ago


    Hyderabad, Telangana, India Altysys Full time

    Experience : 7+ YearsRelevant Experience : 4+ YearsWork Mode : GurgaonBudget : 2.6lpmKey Responsibilities:Model Expertise: Work with transformer models (GPT, BERT, T5, RoBERTa, etc.) across NLP tasks including text generation, summarization, classification, and translation.Model Fine-Tuning: Fine-tune pre-trained models on domain-specific datasets to...

  • Senior LLM Engineer

    4 weeks ago


    Hyderabad, Telangana, India Altysys Full time

    Experience : 7+ YearsRelevant Experience : 4+ YearsWork Mode : HyderabadBudget : 2.6lpmKey Responsibilities:Model Expertise: Work with transformer models (GPT, BERT, T5, RoBERTa, etc.) across NLP tasks including text generation, summarization, classification, and translation.Model Fine-Tuning: Fine-tune pre-trained models on domain-specific datasets to...

  • Senior LLM Engineer

    3 weeks ago


    Hyderabad, Telangana, India Altysys Full time

    Experience : 7+ YearsRelevant Experience : 4+ YearsWork Mode : HyderabadBudget : 2.6lpmKey Responsibilities:Model Expertise: Work with transformer models (GPT, BERT, T5, RoBERTa, etc.) across NLP tasks including text generation, summarization, classification, and translation.Model Fine-Tuning: Fine-tune pre-trained models on domain-specific datasets to...