AI Infrastructure Architect

2 days ago


Anantapur, Andhra Pradesh, India beBeeMLOps Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

As a seasoned AI infrastructure professional, you will lead the end-to-end design, implementation, and scaling of our AI infrastructure. You will partner with researchers, product teams, and DevOps to turn prototypes into production services that meet strict service-level agreements (SLAs) for latency, reliability, and cost efficiency.

The ideal candidate will possess strong expertise in cloud platforms (AWS/GCP/Azure), Kubernetes, Docker, Terraform, Helm, Kubeflow, and MLflow. Additionally, experience with inference frameworks (Triton, TensorFlow Serving, BentoML, TorchServe) and familiarity with distributed training, workload schedulers, and GPU-cluster orchestration are highly desirable. Proficiency in Python, TypeScript, and infrastructure-as-code (Terraform, Helm, etc.) is essential.

About the Role

Responsibilities:
  • Design and implement scalable ML pipelines (training, evaluation, deployment) for LLMs, CV, and multimodal models.
  • Lead efforts in model serving, versioning, automated CI/CD, and real-time monitoring of AI workflows.
  • Build and optimize GPU-backed serving infrastructure targeting p99 latency < 100 ms, 99.9% uptime, and > 80% GPU utilization.
  • Drive initiatives on model governance, automated drift detection (≤ 10% false positives), and data-management best practices.
  • Integrate vector databases (Qdrant, Pinecone) for low-latency semantic retrieval, and build agentic workflows using LangChain or similar frameworks.
  • Architect RBAC-driven, isolated ML services to securely serve 100–500+ organizations.
  • Design Prometheus/Grafana dashboards, ELK/Fluentd logging pipelines, and alerting for all ML workloads.
  • Maintain CI/CD pipelines for Python (FastAPI) and TypeScript (NestJS) inference services.
  • Define and track SLAs/SLOs, optimize cloud spend by ≥ 20% year-over-year, and ensure GPU clusters operate at > 80% utilization.
  • Partner with AI researchers, product managers, and legal to align MLOps standards with compliance and roadmap goals.
  • Mentor junior engineers, run quarterly brown-bags, own onboarding docs (upskill 5+ engineers/quarter), and publish ≥ 1 open-source contribution or talk annually.

Requirements

Must-Haves:

  • 9–14 years in software engineering, including ≥ 4 years in MLOps or ML infrastructure.
  • Strong expertise in cloud platforms (AWS/GCP/Azure), Kubernetes, Docker, Terraform, Helm, Kubeflow, and MLflow.
  • Experience with inference frameworks (Triton, TensorFlow Serving, BentoML, TorchServe).
  • Familiarity with distributed training, workload schedulers, and GPU-cluster orchestration.
  • Proficiency in Python, TypeScript, and infrastructure-as-code (Terraform, Helm, etc.).
  • A proven track record building reliable, scalable ML systems in production.

Critical Skills:

  • Vector DB integration (Qdrant, Pinecone).
  • Agent orchestration (LangChain, LlamaIndex).
  • Multi-tenant security and RBAC.
  • Observability stacks (Prometheus/Grafana, ELK).
  • CI/CD for FastAPI/NestJS services.

Nice-to-Haves:

  • Master's/PhD in CS/AI and certifications such as AWS ML Specialty, Google Cloud Professional ML Engineer, or CNCF CKA/CKAD.
  • Prior experience at AI-focused startups or enterprises scaling ML for 100–500 orgs.
  • Understanding of low-latency streaming inference or agent-based LLM systems.
  • Excellent written and verbal communication, and a proven ability to drive consensus across functions.


  • Anantapur, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 1,91,22,000 - ₹ 2,13,33,000

    Job Title: Cloud Infrastructure Architect. Key Responsibilities: We are looking for an experienced Cloud Infrastructure Architect to lead the design and implementation of our cutting-edge infrastructure. This role will be pivotal in designing comprehensive automated testing strategies and frameworks across unit, integration, API, and end-to-end levels for...


  • Anantapur, Andhra Pradesh, India beBeeEnterprise Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Title: Enterprise AI Solutions Architect. We are seeking an experienced and skilled Solutions Architect to lead the design, implementation, and delivery of our enterprise-grade AI infrastructure solutions. About the Role: The ideal candidate will have expertise in Kubernetes, High-Performance Computing (HPC), and Artificial Intelligence/Machine Learning...

  • Chief Architect

    6 days ago


    Anantapur, Andhra Pradesh, India beBeeEngineering Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Title: Lead Engineer - Scalable AI Solutions. We are seeking a highly skilled and experienced engineer to lead the development of our scalable AI platform. This is a hands-on leadership role that requires strong technical expertise, excellent communication skills, and the ability to work collaboratively with cross-functional teams. Key...

  • AI Architect

    4 days ago


    Anantapur, Andhra Pradesh, India beBeeGenerative Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Title: Artificial Intelligence Architect. About the Role: We are seeking a highly skilled Artificial Intelligence Architect with extensive expertise in designing and deploying AI solutions. The ideal candidate will combine hands-on AI/ML engineering skills with the ability to design scalable architectures, implement retrieval-augmented generation (RAG)...


  • Anantapur, Andhra Pradesh, India beBeeArtificialIntelligence Full time US$ 20,000 - US$ 40,000

    Artificial Intelligence Developer Position. We are seeking a highly skilled AI developer to lead the development of core intelligence systems. This role requires expertise in information retrieval, scalable backend architecture, and generative AI technology. The ideal candidate will: Design and implement semantic search pipelines, graph-based retrieval, and...


  • Anantapur, Andhra Pradesh, India beBeeLeadership Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

    GenAI Leadership Role. We are seeking a highly skilled GenAI Lead Engineer to spearhead the development of our AI Agent platforms. This is a leadership position that demands strong technical expertise, exceptional communication skills, and the ability to collaborate effectively across teams. Responsibilities: Platform Architecture Design: Develop cost-efficient...


  • Anantapur, Andhra Pradesh, India beBeeMachineLearning Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Role: Associate Architect - Machine Learning (Gen AI). Responsibilities: Develop domain-adaptive AI systems to automate business processes. Fine-tune large-scale pre-trained models using PEFT and SFT techniques for specific applications and domains. Enhance model responses with chain-of-thought and few-shot prompts. Design end-to-end workflows for AI solutions from...


  • Anantapur, Andhra Pradesh, India beBeeinfrastructure Full time ₹ 1,75,00,000 - ₹ 2,25,00,000

    Growing eSIM services platforms require scalable cloud infrastructure. We simplify connectivity with powerful APIs and seamless integrations. Our platform enables global eSIM lifecycle management and user onboarding, creating opportunities for innovative collaboration. About the Role: We're seeking a Cloud Infrastructure Architect to maintain up-to-date system...


  • Anantapur, Andhra Pradesh, India beBeecybersecurity Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Title: Cybersecurity and AI Solutions Architect


  • Anantapur, Andhra Pradesh, India beBeeArtificialIntelligence Full time ₹ 15,00,000 - ₹ 20,00,000

    Senior AI Strategist. We are seeking a seasoned AI professional to spearhead the development and deployment of cutting-edge AI solutions. Key Responsibilities: Model Development: Develop, integrate, and deploy AI/ML models, including large language models and agentic architectures. Technical Expertise: Proficient in Python frameworks and libraries, such as...