LLMOps Architect

2 weeks ago


Bengaluru, Karnataka, India Enterpret Full time US$ 1,50,000 - US$ 2,50,000 per year

About Enterpret
Enterpret is at the forefront of
AI-native
applications, unlocking the power of customer feedback for businesses. We centralize feedback from every channel and transform it into actionable insights that drive customer-centric decisions for teams at the world's leading companies like Perplexity, Notion, Canva, and Figma. Backed by investors such as Kleiner Perkins and Peak XV, we're redefining how businesses understand and act on the voice of their customers.

About the Role
As LLMOps Architect at Enterpret, you'll be responsible for how LLM models are fine-tuned, how prompts are managed, how we run evals, how we optimize for cost, and how we optimize for speed—both at the experimentation stage and in production. This is a foundational, high-ownership role where you'll work directly with the
OpenAI and Anthropic teams
(whom we partner with closely), as well as
AWS
(whom we partner with closely), to build world-class ML infrastructure.

You'll work closely with ML researchers, backend engineers, and product teams to ensure our AI systems are resilient, secure, and cost-efficient as we grow 10x. Key success metrics include improving the speed of experimentation, time to productionization, and the quality of models. You'll report directly to the CTO.

What You'll Do

  • Design and evolve Enterpret's ML platform for training, serving, and retraining our encoders and LLM models using AWS/Terraform/OpenAI/Anthropic.
  • Build CI/CD pipelines tailored for ML—including model versioning, testing, canary releases, rollbacks, and gated production deploys.
  • Deploy and manage model serving systems for both real-time inference (e.g., tagging support tickets on the fly) and batch pipelines (e.g., analyzing historical product feedback).
  • Set up observability for model performance and data drift—using Braintrust and custom alerts to catch issues before they affect customers.
  • Lead incident response, root cause analysis, and postmortems for ML systems—ensuring uptime for insights that product teams rely on, alongside governance and security.
  • Track and optimize cloud usage for ML workflows, making model delivery cost-aware and aligned with product usage.
  • Implement governance and security across the stack—owning IAM, data access, auditability, and model explainability where needed.
  • Partner with ML and product teams to productionize GenAI and AI models powering our Knowledge Graph and Adaptive Taxonomy engine, tackling problems on retrieval, encoder LLM fine-tuning, and reinforcement learning.
  • Evaluate tools for model registry, feature stores, and orchestration—and build where needed to keep the feedback loop fast.
  • Champion best practices in MLOps across the org—mentoring engineers and setting scalable foundations for the future.
  • Act as a coach to our team of researchers who are transitioning into engineering, helping them self-serve their capabilities and self-service these tools rather than doing it yourself.

What It Takes

  • A minimum of 6 years' experience in MLOps and ML infrastructure, ideally with exposure to designing, deploying, and scaling machine learning systems in fast-paced, product-driven environments such as startups or high-growth companies.
  • Deep expertise with AWS (SageMaker, EC2, EKS, S3, IAM), infrastructure-as-code (Terraform), and container orchestration (Docker, Kubernetes).
  • Strong Python skills, with bonus points for Go, Bash, or Rust scripting where appropriate.
  • Hands-on experience with CI/CD systems like GitHub Actions, ArgoCD, or Jenkins—especially for ML model delivery.
  • Proven ability to monitor and maintain production ML systems, including model drift, latency, uptime, and alerting.
  • Comfort with cloud cost optimization, resource provisioning, and auto-scaling for ML-heavy environments.
  • Familiarity with model serving stacks and experimentation tools (MLflow, Langsmith, etc.).
  • Bonus: exposure to GenAI workflows (LangChain, vector DBs, RAG), encoder/LLM model tuning, reinforcement learning, or responsible AI practices.
  • Track record of mentoring, collaborating across functions, and taking full ownership of systems in production.
  • You hate repeated manual work and have a strong drive to automate everything.
  • Proficiency with AI coding agents like Claude and Cursor to work multiple times more effectively than normal.

Why Enterpret

  • ML at the Core: You won't be supporting ML—you'll be enabling the core product.
  • High Impact, Early Ownership: Define our MLOps foundation, influence every model's path to production, and shape how product teams experience insights.
  • Work With Sharp People: Collaborate with researchers, engineers, and product builders solving complex problems every week.
  • Focused, Fast Environment: No heavy process—just smart, principled builders shipping high-quality infrastructure.
  • Comp + Culture: Competitive salary, meaningful equity, full-stack healthcare, generous leave, and a team-first culture built on trust and ownership.

What We Value

At Enterpret, we operate with a deep sense of ownership — we play for the team and do what it takes to win together. We care personally for our teammates while pushing each other with honest, actionable feedback. Above all, we approach everything with humility and a drive to keep learning and getting better.

Equal Opportunities
We are an equal opportunity employer. We ensure that none of our employees or prospective employees receives less favourable treatment as a result of age, sex, disability, marital status, colour, race, religion or ethnic origin. Equally we aim to ensure that no such employee is disadvantaged by terms and conditions of employment which cannot be justified.


  • AI LLMOps Architect

    16 hours ago


    Bengaluru, Karnataka, India Collabrah Tech Solutions Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Company DescriptionCollabrah Tech Solutions is an innovative IT company specializing in IT consulting, staffing, and recruiting. Our expertise encompasses a wide range of services including analytics, data science, data engineering, machine learning, artificial intelligence, cloud migration, managed services, and more. We serve clients across various...


  • Bengaluru, Karnataka, India Arting Digital Private Limited Full time US$ 1,50,000 - US$ 2,00,000 per year

    Job Title: AI/ML Technical Architect (Creative Cloud)Location: Bangalore / NoidaWork Mode: HybridExperience: 10+ yearsNotice Period: Immediate joiners to 15 days.Job OverviewWe are seeking an experienced AI/ML Technical Architect with a strong background in designing and scaling AI platforms and delivering enterprise-grade solutions. The ideal candidate will...


  • Bengaluru, Karnataka, India Arting Digital Full time

    Job Title : AI/ML Technical Architect (Creative Cloud)Location : Bangalore / NoidaWork Mode : HybridExperience : 10+ yearsNotice Period : Immediate joiners to 15 days.Job Overview :We are seeking an experienced AI/ML Technical Architect with a strong background in designing and scaling AI platforms and delivering enterprise-grade solutions. The ideal...


  • Bengaluru, Karnataka, India Delphi Consulting Middle East Full time

    Ready to embark on a journey where your growth is intertwined with our commitment to making a positive impact? Join the Delphi family - where Growth Meets Values.At Delphi Consulting Pvt. Ltd., we foster a thriving environment with a hybrid work model that lets you prioritize what matters most. Interviews and onboarding are conducted virtually, reflecting...

  • Associate Architect

    2 weeks ago


    Bengaluru, Karnataka, India Quantiphi Full time

    Role : Associate Architect - MLOps / LLMOps Experience : 6 to 8 Years Location : Bangalore / Mumbai (Hybrid) Job Summary: Join our dynamic team as a Platform Architect and leverage your expertise in production-scale platforms within the GenAI or ML domain . In this role, you'll be instrumental in designing, developing and maintaining...


  • Bengaluru, Karnataka, India Codilar Technologies Pvt. Ltd. Full time US$ 1,50,000 - US$ 2,00,000 per year

    Role Title: AI Platform EngineerLocation: Bangalore (In Person in office when required)Part of the GenAI COE TeamKey Responsibilities· Platform Development and Evangelism:Build scalable AI platforms that are customer-facing.Evangelize the platform with customers and internal stakeholders.Ensure platform scalability, reliability, and performance to meet...


  • Bengaluru, Karnataka, India beBeeGenerativeai Full time ₹ 1,80,00,000 - ₹ 2,00,00,000

    Job Title: AI Solutions ArchitectBachelor's degree with 6-8 years of AI/ML experience.Experienced in developing GenAI applications and integration of GenAI with existing applications.Strong understanding of machine learning concepts and algorithms, especially related to Generative AI and LLMs.Experienced with Open-source frameworks like langchain and...


  • Bengaluru, Karnataka, India Collabrah Tech Solutions Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Responsibilities:* Design, develop & maintain AI solutions using LLMOps, Agentic AI & NLP.* Collaborate with cross-functional teams on AI platform strategy & implementation.Model development, training, deployment at scale, monitoring performance


  • Bengaluru, Karnataka, India Butterfly Groups Full time ₹ 47,400 - ₹ 17,93,580 per year

    AI Platform EngineerLocation: Bangalore (In Person in office when required)Key ResponsibilitiesPlatform Development and Evangelism:Build scalable AI platforms that are customer-facing.Evangelize the platform with customers and internal stakeholders.Ensure platform scalability, reliability, and performance to meet businessneeds.Machine Learning Pipeline...

  • Advanced AI Architect

    2 weeks ago


    Bengaluru, Karnataka, India beBeeGenAi Full time ₹ 1,50,00,000 - ₹ 2,25,00,000

    Artificial Intelligence Developer PositionThis senior-level role involves designing and developing advanced artificial intelligence applications. The ideal candidate will possess a deep understanding of cutting-edge technologies, including generative AI, natural language processing, and computer vision.Key Responsibilities:Implementing Gen AI models using...