SRE / DevOps engineer (with Python and ML frameworks)

8 hours ago


Bengaluru, Karnataka, India | N-iX | Full time | ₹ 12,00,000 - ₹ 36,00,000 per year

N-iX is a global software development service company that helps businesses across the world develop successful software products. Founded in 2002, N-iX has come a long way, expanding its presence across Europe, the US, and Latin America. Today, we are a strong community of 2,000+ professionals and a reliable partner for global industry leaders and Fortune 500 companies. 

Our client is a global commerce leader where you can influence how the world buys, sells, and gives. You'll be part of a work culture that's been genuinely committed to diversity and inclusion since its founding over twenty-five years ago. Here, you can be yourself, do your best work along with a team of professionals, and have a meaningful impact on people across the globe. We seek people with drive, ideas, and a passion for helping small businesses succeed.

About the team:
We are the AI Platform Team, providing highly available, scalable, and automated machine learning infrastructure for researchers and data scientists globally. We are looking for a motivated, self-reliant SRE / DevOps engineer with Python and ML framework experience to drive operational excellence, automation, and platform reliability.

Role Overview:
This role focuses on maintaining, deploying, and improving AI/ML platform services with strong emphasis on DevOps, SRE practices, and automation. You will collaborate closely with developers, researchers, and infrastructure teams to ensure robust, scalable, and highly available ML systems.

Responsibilities:

DevOps (~60%):

  • Design, implement, and maintain CI/CD pipelines for AI/ML platform services.
  • Manage and troubleshoot Kubernetes clusters, Docker containers, and cloud infrastructure.
  • Ensure high availability, system reliability, and security across platforms.
  • Automate operational tasks, monitoring, and deployment workflows.
  • Collaborate with AI platform developers to deploy and scale ML frameworks efficiently.
  • Analyze and resolve production issues, performance bottlenecks, and functional problems.
  • Define operational standards and versioning practices, and advise teams on DevOps best practices.
  • Prepare documentation and training materials, and provide technical support to platform users.

Development (~40%):

  • Design, build, and refactor Python services and ML framework integrations.
  • Work with ML frameworks such as PyTorch, TensorFlow, and Triton.
  • Handle framework-related issues, version upgrades, and environment compatibility.
  • Work with Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data.
  • Integrate Ray with tools such as Airflow, MLflow, Dask, and DeepSpeed (a plus).
  • Support AI/ML model training, inferencing platforms, and LLM fine-tuning systems.
  • Collaborate with developers to integrate ML pipelines into automated CI/CD workflows.

Requirements:

  • Strong Python development experience (2–4 years).
  • Overall 3–5 years of relevant DevOps / SRE experience.
  • Hands-on experience with ML frameworks (PyTorch, TensorFlow, Triton).
  • Hands-on experience with cluster deployment, workload management, and distributed task scheduling.
  • Familiarity with Ray ecosystem libraries (Train, Tune, Serve, Data) and integration with ML tooling.
  • Experience with AI/ML model training and inferencing platforms is a plus.
  • Familiarity with LLM fine-tuning systems is a plus.
  • Solid understanding of Kubernetes, Docker, Linux fundamentals, and DevOps practices.
  • Experience with CI/CD pipelines (Jenkins or similar), test automation, and monitoring.
  • Strong debugging and triaging skills.
  • Excellent communication and collaboration skills with cross-functional teams.
  • Strong organizational skills to manage multiple projects in a fast-paced environment.
  • Fluent in English (spoken and written).

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers


