ML Ops Engineer

6 days ago


bangalore, India SatSure Full time

We are looking for a Machine Learning Operations Engineer to join our team, to design, build, and integrate ML Ops for large-scale, distributed machine learning systems, focusing on cutting-edge tools, distributed GPU training, and enhancing research experimentation.About SatSure:SatSure is a deep tech, decision Intelligence company that works primarily at the nexus of agriculture, infrastructure, and climate action creating an impact for the other millions, focusing on the developing world. We want to make insights from earth observation data accessible to all.Join us to be at the forefront of building a deep tech company from India that solves problems for the globe. Roles & Responsibilities:Architect, build and integrate end-to-end life cycles of large-scale, distributed machine learning systems i.e. ML Ops using cutting-edge tools/frameworks.Develop tools and services for explainability of ML solutions.Implement distributed cloud GPU training approaches for deep learning models.Build software/tools that improve the rate of experimentation for the research team and extract insights from it.Identify and evaluate new patterns and technologies to improve the performance, maintainability, and elegance of our machine learning systems.Contribute to and execute technical projects to completion. Collaborate with peers to develop requirements and monitor progress.Collaborate with engineers across various functions to solve complex data problems at scale.Qualification:5 - 8 years of professional experience in implementing MLOps framework to scale up ML in production.Master’s degree or PhD in Computer Science, Machine Learning / Deep Learning domainsMust-haves:Hands-on experience with orchestration and pipeline tools like Kubernetes, Apache Airflow, etc., and ML lifecycle management tools such as MLflow, SageMaker, or similar, covering model training, inference, evaluation, and deployment.Proficient in deploying ML models using frameworks like Ray Serve, TorchServe, TensorFlow Serving, or NVIDIA Triton Inference Server.Strong foundation in ML model training frameworks such as PyTorch, PyTorch Lightning, TensorFlow, etc.Experience leveraging GPU computing for parallel processing of data and model training.Solid software engineering skills with a track record of building production-grade systems.Advanced programming skills in Python.Proven experience in designing and implementing end-to-end data systems in roles like ML Engineer, ML Platform Engineer, or similar.Familiarity with cloud-based data processing tools and services such as AWS (S3, ECR, Lambda), Spark, Dask, Elasticsearch, Presto, and SQL.Exposure to geospatial or remote sensing data is an added advantage.Core Competencies:Strong debugging and critical thinking capabilities.Excellent analytical and problem-solving skills.Ability to thrive in fast-paced, collaborative team environments. Benefits:Medical Health Cover for you and your family, including unlimited online doctor consultationsAccess to mental health experts for you and your familyDedicated allowances for learning and skill developmentComprehensive leave policy with casual leaves, paid leaves, marriage leaves, bereavement leavesTwice a year appraisalInterview Process:Intro callAssessmentPresentationInterview rounds (ideally up to 3-4 rounds)Culture Round / HR round


  • ML Ops Engineer

    1 week ago


    bangalore, India SatSure Full time

    We are looking for a Machine Learning Operations Engineer to join our team, to design, build, and integrate ML Ops for large-scale, distributed machine learning systems, focusing on cutting-edge tools, distributed GPU training, and enhancing research experimentation.About SatSure:SatSure is a deep tech, decision Intelligence company that works primarily at...

  • ML Ops

    4 days ago


    bangalore, India EXL Full time

    Prior ~2+ years of experience working with ML Ops & DSResponsibilities & Skills:Deploy, monitor, and scale ML models on AWS (SageMaker, EKS, Lambda) or GCP (Vertex AI, GKE, Cloud Functions).Build and maintain CI/CD pipelines for ML workflows using GitHub Actions / Jenkins / cloud-native tools.Containerize and orchestrate workloads with Docker & Kubernetes;...

  • ML Ops

    3 weeks ago


    bangalore, India EXL Full time

    Prior ~2+ years of experience working with ML Ops & DS Responsibilities & Skills: Deploy, monitor, and scale ML models on AWS (SageMaker, EKS, Lambda) or GCP (Vertex AI, GKE, Cloud Functions) . Build and maintain CI/CD pipelines for ML workflows using GitHub Actions / Jenkins / cloud-native tools . Containerize and orchestrate workloads with Docker &...

  • ML Ops

    21 hours ago


    bangalore, India EXL Full time

    Prior ~2+ years of experience working with ML Ops & DS Responsibilities & Skills : Deploy, monitor, and scale ML models on AWS (SageMaker, EKS, Lambda) or GCP (Vertex AI, GKE, Cloud Functions) . Build and maintain CI/CD pipelines for ML workflows using GitHub Actions / Jenkins / cloud-native tools . Containerize and orchestrate workloads with Docker &...

  • ML Ops Engineer

    1 week ago


    bangalore district, India SatSure Full time

    We are looking for a Machine Learning Operations Engineer to join our team, to design, build, and integrate ML Ops for large-scale, distributed machine learning systems, focusing on cutting-edge tools, distributed GPU training, and enhancing research experimentation. About SatSure: SatSure is a deep tech, decision Intelligence company that works primarily at...

  • SRE ML Ops

    6 days ago


    bangalore, India People Prime Worldwide Full time

    SRE ML Ops Role – Primarily development-focused with exposure to DevOps practices. Involves managing ML pipelines using frameworks like PyTorch and TensorFlow, implementing CI/CD deployments, and contributing to infrastructure within the CDT group, which handles both infra management and development.Pipeline management using ML frameworkexposure on SRE,...


  • bangalore, India Amicon Hub Services Full time

    We’re Hiring: Senior MLOps / DevOps Engineer (for one of our esteemed clients) Location: PAN India (Gurugram, Noida, Bangalore, Chennai, Hyderabad, Pune) Experience: 7+ Years Domain: AI / ML Infrastructure & Automation About the Company We’re looking for a passionate and experienced Senior MLOps / DevOps Engineer to join our client’s growing team....

  • ML Ops Engineer

    1 week ago


    bangalore, India DAT Full time

    About DATDAT is an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on...


  • bangalore, India Insight Global Full time

    Agentic & AI Tech Ops Engineer Location: AI Center of Excellence Role Overview: We seek a proactive Agentic & AI Tech Ops Engineer to ensure reliability, scalability, and efficiency of AI and Agentic AI systems in production. You will manage deployments, monitor performance, troubleshoot issues, and implement best practices for Tech Ops/MLOps/LLMOps . Key...


  • bangalore, India Insight Global Full time

    Agentic & AI Tech Ops EngineerLocation: AI Center of ExcellenceRole Overview:We seek a proactive Agentic & AI Tech Ops Engineer to ensure reliability, scalability, and efficiency of AI and Agentic AI systems in production. You will manage deployments, monitor performance, troubleshoot issues, and implement best practices for Tech Ops/MLOps/LLMOps.Key...