
Senior AI Infrastructure Engineer
7 days ago
About this Role:
We are seeking a strategic Senior MLOps Engineer to lead the end-to-end design, implementation, and scaling of our AI infrastructure.
The ideal candidate will partner with researchers, product teams, and DevOps to turn prototypes into production services that meet strict SLAs for latency, reliability, and cost efficiency.
Key Responsibilities:
- Design and implement scalable ML pipelines (training, evaluation, deployment) for LLMs, CV, and multimodal models.
- Lead efforts in model serving, versioning, automated CI/CD, and real-time monitoring of AI workflows.
- Build and optimize GPU-backed serving infrastructure targeting p99 latency < 100 ms, 99.9% uptime, and > 80% GPU utilization.
- Drive initiatives on model governance, automated drift detection (≤10% false positives), and data-management best practices.
- Integrate vector databases (Qdrant, Pinecone) for low-latency semantic retrieval, and build agentic workflows using LangChain or similar frameworks.
- Architect RBAC-driven, isolated ML services to securely serve 100–500+ organizations.
- Design Prometheus/Grafana dashboards, ELK/Fluentd logging pipelines, and alerting for all ML workloads.
- Maintain CI/CD pipelines for Python (FastAPI) and TypeScript (NestJS) inference services.
- Define and track SLAs/SLOs, optimize cloud spend by ≥ 20% year-over-year, and ensure GPU clusters operate at > 80% utilization.
- Partner with AI researchers, product managers, and legal to align MLOps standards with compliance and roadmap goals.
- Mentor junior engineers, run quarterly brown-bags, own onboarding docs (upskill 5+ engineers/quarter), and publish ≥ 1 open-source contribution or talk annually.
Requirements:
- 9–14 years in software engineering, including ≥ 4 years in MLOps or ML infrastructure.
- Strong expertise in cloud platforms (AWS/GCP/Azure), Kubernetes, Docker, Terraform, Helm, Kubeflow, and MLflow.
- Experience with inference frameworks (Triton, TensorFlow Serving, BentoML, TorchServe).
- Familiarity with distributed training, workload schedulers, and GPU-cluster orchestration.
- Proficiency in Python, TypeScript, and infrastructure-as-code (Terraform, Helm, etc.).
- Proven track record building reliable, scalable ML systems in production.
Critical Skills:
- Vector DB integration (Qdrant, Pinecone).
- Agent orchestration (LangChain, LlamaIndex).
- Multi-tenant security and RBAC.
- Observability stacks (Prometheus/Grafana, ELK).
- CI/CD for FastAPI/NestJS services.
Prior Experience:
- A strong portfolio of projects demonstrating expertise in MLOps, AI infrastructure, and cloud-based solutions.
- A Master's or Ph.D. in CS/AI and relevant certifications.
- Excellent communication and interpersonal skills, with the ability to drive consensus across functions.
-
AI Development Engineer Opportunity
5 days ago
Dindigul, Tamil Nadu, India beBeeDevelopment Full time ₹ 1,80,00,000 - ₹ 2,40,00,000Senior Engineering SpecialistThe role involves hands-on coding and contributing to the development of next-generation AI workflows. Key responsibilities include architecting and coding agentic workflows, building AI agents using LLM frameworks, cloud and infrastructure optimization, product architecture and coding, improving API performance and reliability,...
-
Global AI Infrastructure Developer
1 week ago
Dindigul, Tamil Nadu, India beBeeEngineering Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job Description">We are seeking a skilled SRE and DevOps Engineer to support large-scale AI/ML infrastructure.The ideal candidate will have expertise in JVM debugging, Kubernetes, Docker, Linux fundamentals, CI/CD, automation, monitoring, and testing frameworks.">Key Responsibilities">Supporting and scaling AI platform services for global teamsEnsuring...
-
Senior AI Engineer
5 days ago
Dindigul, Tamil Nadu, India beBeeDataPrivacy Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job Summary:As a senior AI engineer, you will play a crucial role in ensuring the secure usage of AI models.Description:The primary responsibility of this position is to manage and configure data privacy controls to prevent sensitive data from being used in AI model training.Key Responsibilities:Configure and manage data privacy controls to safeguard...
-
AI Infrastructure Architect
1 week ago
Dindigul, Tamil Nadu, India beBeeSolution Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Our organization is seeking an experienced architect to design and build platforms for AI/ML and HPC workloads.Key Responsibilities:We are looking for a highly skilled professional to lead the development of infrastructure projects, including platform build-outs for AI/ML and HPC workloads.The ideal candidate will have a strong background in designing and...
-
Senior AI Engineer
3 days ago
Dindigul, Tamil Nadu, India ARC-Net | Applied Research Capability Network Full timeAbout ARC-NetARC-Net, backed by ARTPARK at IISc Bengaluru, drives industry-oriented AI research into scalable solutions. We bring together top-tier talent, IISc's research potential, and industry partners to take on projects with industry relevance. We have delivered successful projects in space technology and AI-based interview assessment, alongside ongoing...
-
Building Scalable AI Solutions
7 days ago
Dindigul, Tamil Nadu, India beBeeTechnical Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Senior Technical Lead - AI InfrastructureWe are seeking an experienced Senior Technical Lead to lead our AI Platform Team. This role will be responsible for building a trusted, scalable, and compliant platform to operate with speed, efficiency, and quality.The successful candidate will have 10+ years of experience in software engineering, including at least...
-
Cloud and AI Infrastructure Specialist
1 week ago
Dindigul, Tamil Nadu, India beBeeCloudInfrastructure Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Job Title: Cloud and AI Infrastructure SpecialistAbout the Role:We are seeking a skilled Cloud and AI Infrastructure Specialist to join our team. As a key member of our infrastructure group, you will be responsible for designing, implementing, and maintaining cloud platforms and DevOps tools to ensure efficient operations.Key Responsibilities:Design and...
-
Seasoned AI Engineer
2 days ago
Dindigul, Tamil Nadu, India beBeeAIENGINEER Full time ₹ 1,80,00,000 - ₹ 2,00,00,000Our team seeks a seasoned AI Engineer to build and optimize AI/ML infrastructure for model training, deployment, and monitoring. This role requires a deep understanding of software development, AI/ML infrastructure, and full-stack development.Key Responsibilities:Design and implement end-to-end MLOps pipelines with CI/CD automation.Develop scalable AI...
-
Senior Infrastructure Automation Engineer
2 days ago
Dindigul, Tamil Nadu, India beBeeAutomation Full time ₹ 97,30,000 - ₹ 1,46,90,000Infrastructure Automation SpecialistAbout this role:This is a senior-level position for an experienced automation engineer to join our team.Key ResponsibilitiesDevelop and implement automated deployment pipelines using Ansible, Python, and related infrastructure tooling.Build robust workflows to manage end-to-end infrastructure lifecycle from provisioning to...
-
Top Backend Infrastructure Developer Wanted
7 days ago
Dindigul, Tamil Nadu, India beBeeBackend Full time ₹ 1,00,00,000 - ₹ 1,10,00,000Job Title:Senior Backend Software Engineer About the RoleThis is a high-ownership position at a fast-paced AI company where you will work alongside YC-backed founders and a seasoned founding engineer to drive projects from design to deployment in a startup environment.You will be responsible for building and scaling the core backend infrastructure that...