Site Reliability Engineer II
1 week ago
About the Role
We are looking for a Site Reliability Engineer (SRE) II to help ensure the stability, scalability, and reliability of critical services and infrastructure. The role focuses on building automation, maintaining observability, supporting incident response, and collaborating across engineering, product, and operations teams to embed reliability practices.

Key Responsibilities

Service Reliability & Operations
- Support availability and durability of production services.
- Monitor service health using SLIs, SLOs, and error budgets, escalating issues as needed.
- Participate in on-call rotations, incident response, and post-incident reviews.
- Follow ITIL/OSS processes: incident, change, problem, and capacity management.

Automation & Tooling
- Develop automation for operational tasks to reduce manual effort.
- Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, ELK).
- Work with CI/CD pipelines, configuration management, and infrastructure-as-code tools (Terraform, Ansible, Jenkins).
- Write scripts (Python, Bash, Go) to improve system reliability.

Collaboration & Continuous Improvement
- Partner with teams to support resilient system design and operations.
- Assist in capacity planning and disaster recovery exercises.
- Document systems, contribute to playbooks/runbooks, and propose improvements.
- Promote a reliability-focused culture within development and operations teams.

Qualifications

Education & Experience
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- 2–4 years of experience in site reliability, systems engineering, or operations.
- Exposure to large-scale, production-grade systems.

Technical Skills
- Linux systems administration and troubleshooting.
- Service reliability concepts: monitoring, alerting, incident response, root cause analysis.
- Proficiency in at least one scripting language: Python, Bash, or Go.
- Understanding of containers (Kubernetes, Docker) and microservices.
- Knowledge of incident response and operational best practices.

Preferred Attributes
- Experience in SaaS, service provider, or distributed systems environments.
- Familiarity with ITIL/OSS practices and SLO/SLA management.
- Experience with cloud platforms (AWS, GCP, Azure).
- Strong problem-solving skills and ownership mindset.

Pro5 is a global platform helping thousands of vetted professionals get hired by top employers. See what others say on our public Google Reviews and learn how we keep your data safe in our Trust Center.
-
Site Reliability Engineer II
1 week ago
Greater Bengaluru Area, India Pro5.ai Full time
About the Role: We are looking for a Site Reliability Engineer (SRE) II to help ensure the stability, scalability, and reliability of critical services and infrastructure. The role focuses on building automation, maintaining observability, supporting incident response, and collaborating across engineering, product, and operations teams to embed reliability...
-
Site Reliability Engineer II
1 week ago
Greater Bengaluru Area, India Pro5 Full time
About the Role: We are looking for a Site Reliability Engineer (SRE) II to help ensure the stability, scalability, and reliability of critical services and infrastructure. The role focuses on building automation, maintaining observability, supporting incident response, and collaborating across engineering, product, and operations teams to embed reliability...
-
Site Reliability Engineer II
3 weeks ago
Bengaluru, India JPMorgan Chase & Co. Full time
As a Site Reliability Engineer II at JPMorgan Chase within Corporate Technology, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively...
-
Site Reliability Engineer II
3 weeks ago
Bengaluru, India JP Morgan Chase & Co. Full time
Job Description: As a Site Reliability Engineer II at JPMorgan Chase within Corporate Technology, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and...
-
Site Reliability Engineer II
1 week ago
Bengaluru, India Pro5.ai Full time
About the Role: We are looking for a Site Reliability Engineer (SRE) II to help ensure the stability, scalability, and reliability of critical services and infrastructure. The role focuses on building automation, maintaining observability, supporting incident response, and collaborating across engineering, product, and operations teams to embed reliability...
-
Site Reliability Engineer II
1 week ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 8,00,000 - ₹ 24,00,000 per year
The Production Engineering and Artificial Intelligence (AI) Group, part of the Linux Systems Group within Microsoft, plays a critical role in powering Azure Cloud. This team ensures that Azure operates with the latest version of Linux software at the highest levels of quality and performance, serving as the gatekeeper for production software. The team...
-
Site Reliability Engineer II
1 week ago
Bengaluru, Karnataka, India JPMorgan Chase Full time
As a Site Reliability Engineer II at JPMorgan Chase within Corporate Technology, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively...
-
Site Reliability Engineer II
2 weeks ago
Bengaluru, Karnataka, India JPMorganChase Full time US$ 80,000 - US$ 1,20,000 per year
Description: Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology, Finance Last Mile Reporting team, you will use technology to solve business problems and leverage software engineering best practices as we strive...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India Whatjobs IN C2 Full time
Hiring Site Reliability Engineers. Experience: 2.5+ years (excluding internship). Location: Bangalore. The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry-standard, large-scale platforms to be utilised across FK that help to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer II
2 days ago
Bengaluru East, Karnataka, India Backblaze Full time
Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we're helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the...