Senior DevOps Engineer - ML Engineering Support
1 week ago
Roku is changing how the world watches TV
Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television in the world. Roku pioneered streaming to the TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect consumers to the content they love, enable content publishers to build and monetize large audiences, and provide advertisers unique capabilities to engage consumers.
From your first day at Roku, you'll make a valuable - and valued - contribution. We're a fast-growing public company where no one is a bystander. We offer you the opportunity to delight millions of TV streamers around the world while gaining meaningful experience across a variety of disciplines.
About the Role
We are seeking a talented and experienced Senior Software Engineer, DevOps/SRE to join our dynamic team and play a critical role in supporting Machine Learning Engineering activities. The ideal candidate will have a strong background in DevOps practices, cloud infrastructure management, automation, and MLOps tooling, along with team leadership skills.
If you have a proven track record architecting and scaling ML/AI platforms, enjoy solving intriguing system challenges at internet scale, are innovative at heart, and thrive in building infrastructure that accelerates ML experimentation and deployment, this role might be a great fit for you.
What You'll Be Doing
- Provide technical leadership and guidance to DevOps/SRE engineers supporting ML Engineering initiatives; mentor team members in best practices, technologies, and methodologies.
- Design, implement, and maintain scalable and resilient cloud infrastructure (AWS & GCP) optimized for ML workloads, including GPU/TPU orchestration and distributed training.
- Partner with ML Engineers to streamline the end-to-end ML lifecycle: data ingestion, feature engineering, training, evaluation, deployment, and monitoring.
- Build and maintain CI/CD pipelines for ML applications and models using GitHub Actions, GitLab CI/CD, Argo, or Tekton.
- Integrate with MLOps platforms (e.g., MLflow, Kubeflow, Airflow, SageMaker, Vertex AI) to ensure reproducibility and traceability of experiments.
- Lead incident response efforts for ML-serving and training infrastructure, minimizing downtime and ensuring high availability.
- Implement observability practices for ML workloads, including model performance monitoring, drift detection, and metrics via Prometheus, Grafana, and Datadog.
- Collaborate with security and compliance teams to ensure adherence to data governance, PCI, SOX, and AI/ML data security standards.
- Optimize system resources for large-scale ML jobs, including auto-scaling GPU clusters, cost optimization, and quota management.
- Drive continuous improvement across DevOps + MLOps processes; proactively identify areas for enhancement.
- Maintain clear documentation and foster a culture of knowledge sharing across DevOps, ML, and Data Engineering teams.
- Participate in 24x7 on-call rotation, with availability to work with global teams in the event of critical outages.
We're Excited if You Have
- 8+ years of experience in DevOps/SRE roles, including at least 2–3 years supporting ML or data-intensive workloads.
- Strong programming skills in Python or Go; experience building internal tools and automation for ML pipelines.
- Hands-on experience with Kubernetes, Docker, ECS/EKS/GKE, and service mesh tools such as Istio or Envoy.
- Familiarity with GPU/accelerator orchestration (NVIDIA GPU Operator, Kubeflow, Slurm, Ray, or similar).
- Experience with Infrastructure as Code (IaC): Terraform, Helm, Ansible, or CloudFormation.
- Deep understanding of distributed systems, microservices architecture, and cloud-native design patterns.
- Exposure to MLOps tools: MLflow, Kubeflow Pipelines, Airflow, Argo, Vertex AI, or SageMaker.
- Strong proficiency in cloud platforms (AWS and GCP required; Azure a plus).
- Knowledge of data engineering concepts (object storage like S3/GCS, parquet/ORC, data versioning with DVC or Delta Lake).
- Experience with networking, security, and compliance (role-based access, VPC design, encryption, auditing).
- Demonstrated success in cross-functional collaboration with ML, Data, and Product teams.
- Preferred certifications: Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer, NVIDIA Deep Learning Institute courses.
- AI literacy and curiosity: you have either tried Gen AI in your previous work or outside of work, or you are curious about Gen AI and have explored it.
- BS Degree in Computer Science or equivalent experience.
Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Our employees can take time off work for vacation and other personal reasons to balance their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.
The Roku Culture
Roku is a great place for people who want to work in a fast-paced environment where everyone is focused on the company's success rather than their own. We try to surround ourselves with people who are great at their jobs, who are easy to work with, and who keep their egos in check. We appreciate a sense of humor. We believe a smaller number of very talented folks can do more for less cost than a larger number of less talented teams. We're independent thinkers with big ideas who act boldly, move fast, and accomplish extraordinary things through collaboration and trust. In short, at Roku you'll be part of a company that's changing how the world watches TV.
We have a unique culture that we are proud of. We think of ourselves primarily as problem-solvers, which itself is a two-part idea. We come up with the solution, but the solution isn't real until it is built and delivered to the customer. That penchant for action gives us a pragmatic approach to innovation, one that has served us well since 2002.
To learn more about Roku, our global footprint, and how we've grown, visit
By providing your information, you acknowledge that you want Roku to contact you about job roles, that you have read Roku's Applicant Privacy Notice, and understand that Roku will use your information as described in that notice. If you do not wish to receive any communications from Roku regarding this role or similar roles in the future, you may unsubscribe here at any time.
-
Azure DevOps Engineer
4 weeks ago
Bengaluru, Karnataka, India ALIQAN Technologies Full time
Job Title: Senior DevOps Engineer - AI/ML
Experience: 4-7 years
Relevant Experience: 5 years
Location: Bengaluru
Mode: 6 month + ext.
Job Summary: We are seeking an experienced Senior DevOps Engineer to support the deployment, monitoring, and scaling of AI/ML models and infrastructure. The successful candidate will bridge the gap between data science and...
-
ML/DevOps Engineer
4 weeks ago
Bengaluru, Karnataka, India Saarthee Full time
About Saarthee: Saarthee is a Global Strategy, Analytics, Technology and AI consulting company, where our passion for helping others fuels our approach and our products and solutions. Our diverse and global team works with one objective in mind: our customers' success. At Saarthee, we are passionate about guiding organizations towards insights fueled...
-
Senior DevOps Engineer
3 days ago
Bengaluru, Karnataka, India Navikenz Full time US$ 1,25,000 - US$ 1,75,000 per year
Job Summary: We're looking for an experienced Senior DevOps Engineer who loves working with Kubernetes and AI-driven applications. In this role, you'll be responsible for designing, implementing, and maintaining scalable cloud infrastructure while supporting MLOps pipelines for AI workloads.
What You'll Be Doing:
Building Scalable Infrastructure: You'll design,...
-
Senior Devops Engineer
3 days ago
Bengaluru, Karnataka, India Techno Facts Solutions Full time ₹ 20,00,000 - ₹ 25,00,000 per year
Job Title: Senior / Lead DevOps Engineer (GenAI)
Level: A4 & A5 (Experience: 5.1-9 Years)
Key Responsibilities
- Design, build, and maintain CI/CD pipelines for GenAI model training and deployment.
- Automate infrastructure provisioning & scaling using Terraform, Ansible, Pulumi.
- Optimize GPU/TPU utilization and monitor model performance in production.
- Integrate GenAI...
-
Devops Engineer
1 week ago
Bengaluru, Karnataka, India Cigres Technologies Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Bengaluru, Karnataka, India
Job Type: Full Time
About the Role
The job consists of being a DevOps engineer within the AI Hub Technology ML Engineering team, in charge of delivering Infrastructure-as-Code cloud services and developing and maintaining the CI/CD stack.
Activities:
R&D Develop, Test and Document Infrastructure-as-Code to manage Cloud Services for ML...
-
Senior Azure DevOps Engineer
4 weeks ago
Bengaluru, Karnataka, India Talpro Full time
Role Overview: We are seeking a highly skilled Senior Microsoft Azure DevOps AI Engineer with proven expertise in Azure DevOps and Azure Databricks. The ideal candidate should have strong experience in building, deploying, and automating infrastructure while integrating AI-driven solutions. This role is perfect for someone passionate about combining cloud...
-
Devops Engineer
1 day ago
Bengaluru, Karnataka, India Talent Hired-the Job Store Full time ₹ 20,00,000 - ₹ 25,00,000 per year
We are seeking a highly skilled DevOps Engineer with experience in Generative AI (GenAI) to support the development, deployment, and scaling of AI/ML infrastructure, pipelines, and applications. The ideal candidate will have a strong foundation in traditional DevOps practices, cloud infrastructure, and CI/CD automation, along with hands-on experience working...
-
DevOps AI/ML Engineer
2 weeks ago
Bengaluru, Karnataka, India Cisco Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Meet the team:
The Office of the CEO at Cisco is looking for a highly expert, innovative, and self-motivated DevOps AI/ML Engineer to join our Customers Insights & Action (CIA) organization. In this critical role, you will drive the development, deployment, and production readiness of scalable AI and machine learning models, with a particular focus on Agentic...
-
Senior Devops Engineer
3 days ago
Bengaluru, Karnataka, India HYI Full time ₹ 6,00,000 - ₹ 18,00,000 per year
We Are Hiring: DevOps Engineer (5+ Years Experience)
Location: Koramangala, Bengaluru (Work from Office)
Company: HYI.AI
Apply Now:
Notice Period: Immediate Joiners Preferred
HYI.AI is looking for a passionate DevOps Engineer to join our growing team. If you're ready to work on cutting-edge AI/ML deployments, scalable cloud infrastructure, and CI/CD automation –...
-
Senior DevOps Engineer
5 days ago
Bengaluru, Karnataka, India Optum Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers,...