sre - ai&ml

1 week ago


Bengaluru, Karnataka, India GrayMatter Software Services Pvt Ltd Full time

Site Reliability Engineer (SRE) - Machine Learning and AI Platform

Are you interested in playing a crucial role in designing and maintaining a top-notch infrastructure to support machine learning and artificial intelligence initiatives? Join our team as a Site Reliability Engineer (SRE) specializing in Machine Learning and AI Platform and work closely with data scientists, software engineers, and product managers to ensure the reliability, availability, and efficiency of our ML/AI platform.

Key Responsibilities:

  • Design and implement robust, scalable, and automated infrastructure solutions.
  • Identify and address performance bottlenecks, reliability issues, and security vulnerabilities.
  • Collaborate with AI engineering teams to define best practices for deploying and managing machine learning models.
  • Optimize infrastructure components for cost-effectiveness, scalability, and performance.
  • Ensure platform performance, security, and compliance standards are met.
  • Troubleshoot and resolve platform-related issues.
  • Provide technical guidance and mentorship.
  • Improve automation framework and scripts for company-wide solutions.
  • Support top priority initiatives and maintain technical standards.
  • Ensure delivered solutions align with enterprise standards and quality requirements.

Deliverables:

  • Architect and deploy highly available infrastructure for hosting machine learning models.
  • Implement automated deployment pipelines for ML/AI models.
  • Develop monitoring and alerting systems for platform health and performance.
  • Create documentation and provide training on best practices.
  • Contribute to internal tools and frameworks development.

Qualifications:

  • 5-10 years of experience.
  • Academic Degree in BE, BTech, MCA, or M.Sc.
  • Extensive experience in cloud-based infrastructure solutions.
  • Proficiency in Docker, Kubernetes, Python, and Terraform.
  • Experience with Prometheus, Grafana, and ELK stack.
  • Strong problem-solving and communication skills.


  • Bengaluru, Karnataka, India Quantzig Full time

    Company: Quantzig AnalyticsLocation: Kadubeesanahalli, Bangalore.Experience : 5+Years minimum is requiredNotice: preferably 0 to 30daysAs a Site Reliability Engineer (SRE) specializing in Machine Learning and AI Platform, you will play a critical role in designing, implementing, and maintaining a highly scalable, reliable, and performant infrastructure to...

  • SRE Devops Engineer

    1 week ago


    Bengaluru, Karnataka, India Concentrix Full time

    Description Experience range: 5 to 10 years. Location- Bengaluru This position requires the following technical skills: Working experience: · SRE · Cloud - Azure - IoT, Event Hub, Databricks, AKS · Troubleshooting application/ debugging containers · Monitoring Tools, ELK, APM, Building dashboards/ alerts · Automation/ Scripting -...


  • Bengaluru, Karnataka, India Concentrix Full time

    Concentrix is a technology-enabled global business services company specializing in customer engagement and business performance. With more than 4,00,000 staff, Concentrix is present across 40 countries and six continents. We are considered as a category leader in the CXM (Customer Experience Management) Services. We serve automotive; banking and financial...

  • ML Ops Engineer

    1 week ago


    Bengaluru, Karnataka, India Wipro Full time

    Job Description Build and deploy training and serving pipelines for ML models in GCP Take offline ML models developed by other ML scientists and turn them into a machine learning production system (both offline batch and realtime inference) Optimize inference pipeline latency for cloud deployments to enable real-time ML serving Work with upstream and...

  • ML Ops Engineer

    1 week ago


    Bengaluru, Karnataka, India Wipro Full time

    Job Description Build and deploy training and serving pipelines for ML models in GCP Take offline ML models developed by other ML scientists and turn them into a machine learning production system (both offline batch and realtime inference) Optimize inference pipeline latency for cloud deployments to enable real-time ML serving Work with upstream and...


  • Bengaluru, Karnataka, India Concentrix Full time

    Concentrix is a technology-enabled global business services company specializing in customer engagement and business performance. With more than 4,00,000 staff, Concentrix is present across 40 countries and six continents. We are considered as a category leader in the CXM (Customer Experience Management) Services. We serve automotive; banking and financial...

  • SRE Platform Engg

    1 week ago


    Bengaluru, Karnataka, India Jobs for Humanity Full time

    Job Description Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0% SRE Platform Engg (Devops + Production Support) Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most...

  • SRE Platform Engg

    3 weeks ago


    Bengaluru, Karnataka, India FIS Global Full time

    Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues...

  • Sre

    1 week ago


    Bengaluru, Karnataka, India IBM Full time

    IntroductionThe IBM AI Applications business unit is seeking talented and motivated SRE/DevOps professionals to work on the Maximo family of products. In this role you will closely collaborate with the broader SRE team to design and implement the cloud deployment of Maximo set of products on AWS. Support production customers in operational task such as...


  • Bengaluru, Karnataka, India FIS Global Full time

    Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Science Travel Percentage : 5 - 10%Senior SRE - Docker Kubernetes - Openshift Technologies Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and...


  • Bengaluru, Karnataka, India FIS Global Full time

    Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Science Travel Percentage : 5 - 10%Senior SRE - Docker Kubernetes - Openshift Technologies Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and...

  • SRE Platform Engg

    2 months ago


    Bengaluru, Karnataka, India FIS Global Full time

    Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues...

  • SRE Platform Engg

    1 week ago


    Bengaluru, Karnataka, India FIS Full time

    Position Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in...

  • SRE Platform Engg

    1 week ago


    Bengaluru, Karnataka, India Jobs for Humanity Full time

    Job DescriptionPosition Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant...

  • Associate Sre

    1 week ago


    Bengaluru, Karnataka, India Acceldata Full time

    Bengaluru, KarnatakaWork Type: Full Time About Acceldata Acceldata is an enterprise Data Observability organization that was first to the market, having coined the term 'Data Observability' in 2018. Founded by industry veterans who have spent decades in the AI, Analytics, and Data Monitoring space, Acceldata is a startup in the hypergrowth phase. Having...

  • Sre with Golang

    1 week ago


    Bengaluru, Karnataka, India Wipro Limited Full time

    Overview:Role : SRE Development EngineerLocation: Banglore.Constraint : Need to work from customer location (Banglore) at least 2-3 days in a week.MUST - Working exp required in Golang, graphQL, Python, Docker, Kubernetes.Good to have - Preffer if worked on DevOpes CI/CD pipeline.Role Purpose_ Required Skills:_**- _5+ yearsof experience with programming in...


  • Bengaluru, Karnataka, India NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology—and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots,...


  • Bengaluru, Karnataka, India CAST AI Full time

    Started out of a personal frustration with ever increasing cloud bills, the CAST AI team is led by serial entrepreneurs who've built successful global companies before. Why CAST AI? CAST AI is the leading Kubernetes cost optimization platform for AWS, GCP and Azure customers. The company is on a mission to deliver a fully automated Kubernetes experience....


  • Bengaluru, Karnataka, India GK HR Consulting India Pvt. Ltd. Full time

    Mandatory SkillsPython Programming, ML Model Deployment, GKE(Google Kubernet Engine)/AWS/Azure, ML Engineer.Preferred SkillsKubeFlow, Vetex AI, GCP/ Any Cloud, VM(Virtual Machine).Job Description * Build and deploy training and serving pipelines for ML models in GCP * Take offline ML models developed by other ML scientists and turn them into a machine...

  • Data Scientist

    1 week ago


    Bengaluru, Karnataka, India Wipro Limited Full time

    Bengaluru, India; Pune, India; Hyderabad, India Tech HiringJob Description: Job Description Build and deploy training and serving pipelines for ML models in GCP Take offline ML models developed by other ML scientists and turn them into a machine learning production system (both offline batch and realtime inference) Optimize inference pipeline latency for...