SRE & DevOps Engineer ()
2 days ago
Title: SRE & DevOps Engineer )
Job Functions:
You will be a member of our AI Platform Team, supporting the next generation AI architecture for various research and engineering teams within the organization.
You'll partner with vendors and the infrastructure engineering team for security and service availability
You'll fix production issues with engineering teams, researchers, data scientists, including performance and functional issues
Diagnose and solve customer technical problems Participate in training customers and prepare reports on customer issues
Be responsible for customer service improvements and recommend product improvements
Write support documentation
You'll design and implement zero-downtime to monitor and accomplish a highly available service %)
As a support engineer, find opportunities to automate as part of the problem management process, creating automation to avoid issues.
Define engineering excellence for operational maturity
You'll work together with AI platform developers to provide the CI/CD model to deploy and configure the production system automatically
Develop and follow operational standard processes for tools and automation development. Including: Style guides, versioning practices, source control, branching and merging patterns and advising other engineers on development standards
Deliver solutions that accelerate the activities, phenomenal engineers would perform through automation, deep domain expertise, and knowledge sharing
Required Skills:
Demonstrated ability in designing, building, refactoring and releasing software written in Python, C++.
Hands-on experience with , including workload management, cluster deployment, distributed task scheduling, and troubleshooting.
Ability to use Ray Dashboard and CLI tools for monitoring, resource tracking, debugging distributed jobs, and resolving production issues.
Having knowledge of Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data is a big plus.
Experience integrating Ray with tools such as Airflow, MLflow, Dask, DeepSpeed is a big plus.
Debugging and triaging skills.
Cloud technologies like Kubernetes, Docker and Linux fundamentals.
Familiar with DevOps practices and continuous testing.
- DevOps pipeline and automations: app deployment/configuration & performance monitoring.
Test automations, Jenkins CI/CD.
Excellent communication, presentation, and leadership skills to be able to work and collaborate with partners, customers and engineering teams.
Well organized and able to manage multiple projects in a fast paced and demanding environment.
Good oral/reading/writing English ability.
-
DevOps / SRE with Python
7 hours ago
Bengaluru, Karnataka, India Bahwan Cybertek Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearWe are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure.Responsibilities:- Develop and...
-
DevOps / SRE - Python
6 days ago
Bengaluru, Karnataka, India Bahwan CyberTek Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure.Responsibilities:Develop and...
-
DevOps Engineer/SRE
1 week ago
Bengaluru, Karnataka, India SuprSend Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout SuprSend:SuprSend is redefining notification infrastructure for businesses, enabling seamless communication at scale. Our platform ensures reliability, scalability, and efficiency in delivering notifications for the world's most demanding applications. We're looking for talented engineers passionate about building robust, high-performing systems to...
-
SRE Engineering Manager
2 weeks ago
Bengaluru, Karnataka, India hackajob Full time ₹ 20,00,000 - ₹ 25,00,000 per yearhackajob*is collaborating withOneAdvanced*to connect them with exceptional tech professionals for this role.Job Description: SRE Engineering Manager - DevOps & ReliabilityOverview: We are seeking a highly skilled and experienced SRE Engineering Manager to lead our Site Reliability Engineering (SRE) and DevOps teams. This leader will play a crucial role in...
-
DevOps & SRE Platform Engineers
1 week ago
Bengaluru, Karnataka, India Photon Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout The Role About The Role : As a DevOps & SRE Platform Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and tools necessary for the continuous delivery, deployment, and reliability of our software systems. You will work closely with development, operations, and security teams to streamline...
-
SRE/DevOps Engineer
2 weeks ago
Bengaluru, Karnataka, India Airties Full time ₹ 6,00,000 - ₹ 18,00,000 per yearAt Airties we are on a mission to empower broadband operators to deliver a better-connected home experience for their subscribers. We have an exciting story to tell and we want you to help us tell it.Airties is the most widely deployed provider of Wi-Fi Mesh solutions to operators around the globe. Airties designs and develops software and hardware that...
-
SRE & DevOps Engineer (ML/AI Platform)
3 weeks ago
Bengaluru, Karnataka, India, Karnataka Prospance Inc Full timeSRE & DevOps Engineer (ML/AI Platform)Contract Position | Global E-Commerce Leader | HybridAbout the OpportunityWe're partnering with a leading global e-commerce company to find an exceptional SRE & DevOps Engineer to join their AI Platform Team. This is your chance to shape the future of machine learning infrastructure that powers innovation for millions of...
-
SRE Engineer
2 weeks ago
Bengaluru, Karnataka, India Technology Next Full time ₹ 4,20,00,000 - ₹ 10,80,00,000 per yearSite Reliability Engineer (SRE) – 6+ Years | Immediate JoinersLocation: BangaloreContract: 6 months (extendable)About the Role:We are hiring an experienced Site Reliability Engineer (SRE) with 6+ years of experience to ensure system reliability, scalability, and performance across large-scale cloud environments.Key Responsibilities:Monitor, troubleshoot,...
-
SRE & Devops Engineer (NodeJS)
2 days ago
Bengaluru, Karnataka, India Amiseq Full time ₹ 12,00,000 - ₹ 36,00,000 per yearTitle: SRE & Devops Engineer (NodeJS)Job Functions: You will be a member of our AI Platform Team, supporting the next generation AI architecture for various research and engineering teams within the organization. You'll partner with vendors and the infrastructure engineering team for security and service availability You'll fix production issues with...
-
DevOps & SRE Platform Engineers | Offshore
2 hours ago
Bengaluru, Karnataka, India Photon Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearDescriptionJob Description:As a DevOps & SRE Platform Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and tools necessary for the continuous delivery, deployment, and reliability of our software systems. You will work closely with development, operations, and security teams to streamline processes, automate...