MLOps Engineer

9 hours ago

Guntur, Andhra Pradesh, India
maawaabro it solutions pvt ltd
Full time
₹ 1,20,00,000 - ₹ 1,80,00,000 per year

Job Description – MLOps Engineer (Triton + GPU + Production AI)

Immediate joining.

Employment Type: Full-time

Project: OTRAS – Next-Gen AI-based Government Exam & Recruitment Platform

Role: MLOps Engineer

Experience: 5–10 Years

Location: Andhra Pradesh

Salary: ₹1,00,000 – ₹1,50,000 per month

About the Role

We are building OTRAS, India's largest next-gen AI-based examination platform serving 250M+ candidates per year.

We need an experienced MLOps Engineer who can productionize large AI/ML models (OMR, OCR, face recognition, fraud detection) using NVIDIA Triton, ONNX, TensorRT, and GPU pipelines.

You will be responsible for deploying, scaling, monitoring, and optimizing AI workloads in a distributed Kubernetes environment.

Key Responsibilities

Model Deployment & Serving

Deploy PyTorch/TensorFlow models on NVIDIA Triton Inference Server
Convert models to ONNX and optimize using TensorRT
Implement batching, dynamic batching, and GPU scheduling
Build scalable inference APIs (HTTP/gRPC)

Infrastructure & Automation

Deploy and manage AI workloads on Kubernetes (GPU node pools)
Automate model CI/CD using GitHub Actions + ArgoCD
Set up model versioning, canary deployments, and rollback workflows
Manage the Triton model repository & configs

Monitoring & Optimization

Implement inference metrics (latency, TPS, GPU utilization)
Set up monitoring using Prometheus + Grafana
Optimize inference speed and memory with TensorRT
Run load tests for 10M+ inference events

Data & Pipelines

Build ETL workflows for AI datasets
Automate dataset cleaning and preprocessing
Integrate with ClickHouse / S3 storage
Create pipelines for:
OMR data ingestion
ID card OCR
Face detection & liveness scoring

Security & Reliability

Ensure secure model access (token-based + mTLS)
Handle production failures, logs, distributed tracing
Implement AI/ML model audit trails

Required Skills

4+ years of experience in MLOps or ML Engineering
Strong hands-on experience with:
NVIDIA Triton Inference Server
ONNX / ONNX Runtime
TensorRT
PyTorch or TensorFlow
CUDA (basic understanding)
Strong Docker and Kubernetes skills
Experience with CI/CD
Knowledge of GPU scaling, batching, and memory optimization
Experience working with large-scale ML systems

Bonus Skills

Experience with Airflow or Kubeflow
Experience with model quantization
Familiarity with computer vision
Knowledge of message queues (Kafka)
Prior work on AI for ID verification / OMR / OCR

Why Join OTRAS?

Build India's first AI-powered exam infrastructure
Work with Go microservices + Kubernetes + Triton
Massive impact (250M candidates)
Fast-moving, high-performance engineering culture
High-visibility role with strong growth potential

Job Types: Full-time, Permanent, Volunteer

Pay: ₹180, ₹1,080,070.03 per year

Benefits:

Health insurance
Life insurance
Provident Fund

Ability to commute/relocate:

Guntur, Andhra Pradesh: Reliably commute or planning to relocate before starting work (Required)

Work Location: In person
