Reliability Engineering Team Lead
2 weeks ago
About the Role
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We are seeking a highly skilled Reliability Engineering Team Lead to join our team and drive the stability and availability of our mission-critical production systems.
Key Responsibilities
- Lead reliability engineering projects and drive them to closure.
- Write code and perform code reviews for best practices and code quality.
- Contribute to the design/architecture of the system.
- Automate processes and find opportunities to improve observability and availability of the Platform and reduce toil.
- Supervise a team of SREs, ensuring production applications are stable, reliable, and well documented.
- Own end-to-end availability and performance of mission-critical services.
- Analyze and debug complex issues across tiers from frontend to mid-tier to infrastructure.
- Practice sustainable incident response and blameless postmortems.
Requirements
- 5 to 8 years of experience handling systems for large-scale production environments.
- A self-starter, able to build, drive, and advocate for SRE solutions.
- Effective cross-functional collaboration skills to develop tools for secured, scalable, and reliable systems.
- Solid understanding of SRE concepts like SLAs, SLOs, SLIs, error budgets, MTTR, MTTD, etc.
- Experience with a variety of tools that help manage, understand, and debug large, complex distributed systems.
- Good programming experience (Python/Go).
- Hands-on experience with Kubernetes and Docker.
- Working knowledge in any one of the cloud platforms (AWS, Azure, GCP).
- Experience with monitoring and logging tools (e.g. Datadog, ELK, Prometheus, Grafana).
- Good knowledge of Unix system, networking, web technologies, and databases.
- Expert with troubleshooting issues and bugs.
- Incident Management experience coupled with effective communication skills.
- Experience in the financial domain (desirable).
- Prior SRE/DevOps experience desirable.
Arcesium is an equal opportunities employer and welcomes applications from diverse candidates. We are committed to creating an inclusive environment that values diversity and promotes equal opportunities for all employees.
-
Reliability Engineering Lead
3 weeks ago
Hyderabad, Telangana, India Arcesium Full timeAbout the RoleArcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We are seeking a highly skilled Reliability Engineering Lead to join our team and drive the stability and availability of our mission-critical production systems.Key...
-
Reliability Engineering Lead
4 weeks ago
Hyderabad, Telangana, India Arcesium Full timeAbout ArcesiumArcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients...
-
Reliability Engineering Lead
3 weeks ago
Hyderabad, Telangana, India Arcesium Full timeAbout ArcesiumArcesium is a global financial technology firm that tackles complex data-driven challenges faced by leading financial institutions. We continuously innovate our platform and capabilities to address tomorrow's challenges, anticipate risks, and design advanced solutions to drive transformational business outcomes.As a high-growth industry,...
-
Lead Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India UnitedHealth Group Full timeOptum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by diversity and inclusion,...
-
Digitalization Team Lead
3 weeks ago
Hyderabad, Telangana, India A LARGE GLOBAL ENGINEERING COMPANY Full timeJob Title: Digitalization Team LeadWe are seeking a highly skilled and experienced Digitalization Team Lead to join our team at A LARGE GLOBAL ENGINEERING COMPANY. As a Digitalization Team Lead, you will be responsible for leading a team of experts in digitalization, IoT, and agile methodologies to deliver high-quality solutions to our customers.Key...
-
Lead Data Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India Zeta Services Inc. Full timeAbout ZetaZeta is a cutting-edge banking technology company that empowers banks and fintechs to launch innovative banking products for the future. Founded in 2015 by Ramki Gaddipati, Zeta's flagship processing platform, Zeta Tachyon, is a modern, cloud-native, and fully API-enabled stack that brings together issuance, processing, lending, core banking, fraud...
-
Cloud Reliability Engineer
1 week ago
Hyderabad, Telangana, India UnitedHealth Group Full timeCloud Reliability EngineerAt UnitedHealth Group, we're committed to delivering high-quality healthcare services to millions of people worldwide. As a Cloud Reliability Engineer, you'll play a crucial role in ensuring the performance, security, and reliability of our cloud infrastructure.Collaborate with cross-functional teams to design, implement, and...
-
Lead Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India FactSet Full timeResponsibilitiesAs a Site Reliability Engineer, you will collaborate with cross-functional teams to design, implement, and maintain highly available and scalable architectures for our applications and infrastructure.Develop and enhance automated tools and frameworks to optimize system monitoring, deployment, and recovery.Troubleshoot and resolve complex...
-
Lead Data Reliability Engineer
6 days ago
Hyderabad, Telangana, India Zeta Services Inc. Full timebody{font-family:Arial,sans-serif;}h1{font-size:24px;}About ZetaZeta is a next-generation banking technology company that empowers banks and fintechs to launch banking products for the future. It was founded by and Ramki Gaddipati in 2015. Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India UnitedHealth Group Full timeOptum Site Reliability EngineerAt UnitedHealth Group, we're committed to helping people live healthier lives and making the health system work better for everyone. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and performance of our systems, driving a culture of performance excellence and proactive issue...
-
Enterprise Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India DBS Bank Full timeJob SummaryDBS Bank is seeking an experienced Enterprise Reliability Engineer to lead our Site Reliability Engineering team. The successful candidate will be responsible for ensuring technical assurance in significant projects, delivering quality technical deliverables, and overseeing the SRE team to ensure they are involved in every step of the application...
-
Senior Software Engineering Team Lead
7 days ago
Hyderabad, Telangana, India Microsoft Full timeAbout the RoleWe are seeking a skilled Senior Software Engineering Lead to join our Cloud Operations and Innovation Engineering team at Microsoft. This role will involve leading a team of software engineers and collaborating with cross-functional teams to develop and deliver cutting-edge solutions for our data centers.ResponsibilitiesLead and manage a team...
-
Digitalization Team Lead
3 weeks ago
Hyderabad, Telangana, India A LARGE GLOBAL ENGINEERING COMPANY Full timeJob Title: Digitalization Team LeadCompany: A large global engineering companyJob Description:We are seeking a passionate people leader with technical experience to lead one of our digitalization teams. This role requires a deep understanding of IoT technologies, strong leadership skills, and the ability to work collaboratively with cross-functional teams to...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India FactSet Full timeJob Title: Lead Site Reliability EngineerAt FactSet, we're seeking a highly skilled Lead Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining highly available and scalable architectures for our applications and infrastructure.Key...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India Crox Consulting Inc Full timeSite Reliability EngineerJob Summary:Crox Consulting Inc is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based SaaS environment.Key Responsibilities:Design and implement automation and software solutions...
-
Site Reliability Engineering Manager
4 weeks ago
Hyderabad, Telangana, India Quest Diagnostics Full timeJob Title: Site Reliability Engineering ManagerWe are seeking a highly skilled Site Reliability Engineering Manager to join our team at Quest Diagnostics. As a Site Reliability Engineering Manager, you will be responsible for leading a team of Site Reliability Engineers in designing, implementing, and maintaining scalable and reliable systems.Key...
-
Lead Data Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India Zeta Services Inc. Full timeAbout Zeta Services Inc.Zeta Services Inc. is a pioneering Next-Gen Banking Tech company that empowers banks and fintechs to launch innovative banking products for the future. Founded in 2015 by Ramki Gaddipati, our flagship processing platform - Zeta Tachyon - is a modern, cloud-native, and fully API-enabled stack that brings together issuance, processing,...
-
Lead Data Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Zeta Services Inc. Full timeAbout Zeta Services Inc.Zeta Services Inc. is a cutting-edge Next-Gen Banking Tech company that empowers banks and fintechs to launch innovative banking products for the future. Founded in 2015 by Ramki Gaddipati, our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together...
-
Site Reliability Engineering Manager
3 weeks ago
Hyderabad, Telangana, India Quest Diagnostics Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineering Manager to join our team at Quest Diagnostics. As a Site Reliability Engineering Manager, you will be responsible for leading a team of Site Reliability Engineers in designing, implementing, and maintaining reliable and scalable systems.Key ResponsibilitiesLead and manage a team of Site...
-
Site Reliability Engineering Manager
3 weeks ago
Hyderabad, Telangana, India Quest Diagnostics Full timeJob Title: Site Reliability Engineering ManagerQuest Diagnostics is seeking a highly skilled Site Reliability Engineering Manager to lead our team of engineers in delivering high-quality, reliable, and scalable systems.Key Responsibilities:Lead and manage a team of Site Reliability Engineers, providing mentorship, guidance, and support to ensure the team's...