ML Ops Engineer

4 weeks ago


Bengaluru, India Aurigo Software Technologies Full time

Job Description

Role Brief: We are seeking a skilled ML Ops Engineer to design, implement, and maintain scalable machine learning and large language model (LLM) pipelines in cloud environments, primarily using AWS services. This role is critical to ensuring the reliability, efficiency, and performance of ML systems in production. The ideal candidate will have hands-on experience with AWS tools such as SageMaker, Lambda, Bedrock, and Batch with Fargate, and with infrastructure components like RDS, DynamoDB, and SQS. You will be responsible for automating CI/CD workflows, managing auto-scaling APIs, and provisioning cloud resources to support high-performance ML workloads, including RAG systems.

Primary Responsibilities:
- Strategize and implement scalable infrastructure for ML or LLM model pipelines using cloud services such as AWS (e.g., AWS Batch, Fargate, Bedrock).
- Manage auto-scaling mechanisms to handle varying workloads and ensure high availability of REST APIs.
- Automate CI/CD pipelines and Lambda functions for model testing, deployment, and updates, reducing manual errors and improving efficiency.
- Use Amazon SageMaker Pipelines for end-to-end ML workflow automation, and optimize workflows using Step Functions.
- Conduct drift analysis to detect and respond to data drift, concept drift, and label drift. Implement mitigation strategies such as automated alerts, model retraining triggers, and performance audits.
- Set up reproducible workflows for data preparation, model training, and deployment.
- Provision and optimize cloud resources (e.g., GPUs, memory) to meet the computational demands of large models like those used in RAG systems.
- Automate retraining workflows to keep models updated as data evolves.
- Work closely with data scientists, ML engineers, and DevOps teams to integrate models into production environments.
- Implement monitoring tools to track model performance and detect issues like drift or degradation in real time.
- Build monitoring dashboards with real-time alerts for pipeline failures or performance issues.
- Implement model observability frameworks.

Required Skills:
- Education: any engineering degree (BE/BTech/ME/MTech).
- Minimum 4 years of experience with AWS services such as Lambda, Bedrock, Batch with Fargate, RDS (PostgreSQL), DynamoDB, SQS, CloudWatch, API Gateway, and SageMaker.
- Hands-on experience in drift analysis, including detecting and mitigating data, concept, and label drift in production ML systems.
- Knowledge of ML frameworks (e.g., PyTorch, TensorFlow) to understand model requirements during deployment.
- Experience with REST API frameworks such as FastAPI and Flask.
- Familiarity with model observability tools such as Evidently, NannyML, and Phoenix; monitoring tools (Grafana, etc.); and retraining tools such as MLflow, Kubeflow, and Airflow.
- AWS Certified Machine Learning Specialty certification is good to have.


  • ML Ops Engineer

    4 weeks ago


    Bengaluru, India Aurigo Software Technologies Full time

    Role Brief: We are seeking a skilled ML Ops Engineer to design, implement, and maintain scalable machine learning and large language model (LLM) pipelines in cloud environments, primarily using AWS services. This role is critical to ensuring the reliability, efficiency, and performance of ML systems in production. The ideal candidate will have hands-on...

  • AI ML Ops Engineers

    3 weeks ago


    Pune, Maharashtra, India Amazure Technologies Pvt Ltd Full time

    Detailed JD (Roles and Responsibilities): Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field. 3+ years of experience in AI/ML engineering, preferably in IT operations or DevOps environments. Strong programming skills in Python. Experience with implementing GenAI and AI Python SDKs. Experience with time-series...

  • ML Ops

    4 hours ago


    India EXL Full time

    Prior ~2+ years of experience working with ML Ops & DS. Responsibilities & Skills: Deploy, monitor, and scale ML models on AWS (SageMaker, EKS, Lambda) or GCP (Vertex AI, GKE, Cloud Functions). Build and maintain CI/CD pipelines for ML workflows using GitHub Actions / Jenkins / cloud-native tools. Containerize and orchestrate workloads with Docker & Kubernetes;...

  • ML Ops Engineer

    4 weeks ago


    Bengaluru, Karnataka, India Aurigo Software Technologies Full time

    Role Brief: We are seeking a skilled ML Ops Engineer to design, implement, and maintain scalable machine learning and large language model (LLM) pipelines in cloud environments, primarily using AWS services. This role is critical to ensuring the reliability, efficiency, and performance of ML systems in production. The ideal candidate will have hands-on...

  • ML Ops Engineer

    1 week ago


    Bengaluru, Karnataka, India foundit Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Position: ML Ops Engineer. Experience: 6+ years. Location: Bangalore. Required Skills: ● Demonstrated ability in designing, building, refactoring and releasing software written in Python. ● Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton. ● Ability to handle framework-related issues, version upgrades, and compatibility with data...

  • ML Ops

    4 weeks ago


    India EXL Full time

    Job Description Prior 2+ years of experience working with ML Ops & DS Responsibilities & Skills: Deploy, monitor, and scale ML models on AWS (SageMaker, EKS, Lambda) or GCP (Vertex AI, GKE, Cloud Functions). Build and maintain CI/CD pipelines for ML workflows using GitHub Actions / Jenkins / cloud-native tools. Containerize and orchestrate workloads with...

  • ML/Dev Ops Engineer

    6 days ago


    Bengaluru, Karnataka, India Ubique Systems Full time ₹ 24 - ₹ 30 lakh per year

    Req 1 - ML/Dev Ops Engineer. Mandatory Skills: Databricks, MLflow, Seldon, AWS, Kubeflow, Tecton, Jenkins. Skills to Evaluate: Databricks, MLflow, Seldon, AWS, Kubeflow, Tecton, Jenkins, Grafana, Python. Experience: 5 to 10 years. Location: Bengaluru. Budget: 24 to 30 LPA. Notice: Immediate. Job Description: · Focus on ML model load testing and creation of E2E test...

  • ML Ops Engineer

    19 hours ago


    Bengaluru, Karnataka, India Aziro Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Key Responsibilities: Develop, deploy, and maintain ML services using LangServe for efficient model serving. Monitor model performance and manage observability using Langfuse. Containerize applications and services using Docker for consistent development and production environments. Orchestrate and manage containerized workloads using Kubernetes (EKS/GKE/AKS or...

  • ML Ops Engineer

    2 weeks ago


    Bengaluru, India Aurigo Software Technologies Full time

    Role Brief: We are seeking a skilled ML Ops Engineer to design, implement, and maintain scalable machine learning and large language model (LLM) pipelines in cloud environments, primarily using AWS services. This role is critical to ensuring the reliability, efficiency, and performance of ML systems in production. The ideal candidate will have hands-on...