Site Reliability Engineering Lead
2 months ago
The SRE team is responsible for monitoring the stability and availability of mission critical production systems, managing incidents for quicker resolution, and establishing BAU. Team also building tools/infra which all development teams will use to help monitor and troubleshoot.
What you'll do:
- Lead reliability engineering projects and drive it to closure.
- Write code and perform code reviews for best practices and code quality.
- Contribute to the design/architecture of the system.
- Automate processes and find opportunities to improve observability and availability of the Platform and reduce toil.
- Supervise a team of SREs, ensuring production applications are stable, reliable, and well documented.
- Own end to end availability and performance of mission critical services.
- Analyze and debug complex issues across tiers from frontend to mid-tier to infrastructure.
- Practice sustainable incident response and blameless postmortems.
What you'll need:
- 5 to 9 years of experience handling systems for large scale production environments.
- A self-starter, able to build, drive and advocate for SRE solution.
- Effective cross-functional collaboration skills to develop tools for secured, scalable, and reliable systems.
- Solid understanding of SRE concepts like SLAs, SLOs, SLIs, error budgets, MTTR, MTTD, etc.
- Experience with variety of tools that help manage, understand, and debug large, complex distributed systems.
- Good programming experience (Python/Go).
- Hands-on experience with Kubernetes and Docker.
- Working knowledge in any one of the cloud platforms (AWS, Azure, GCP)
- Experience with monitoring and logging tools (e.g. Datadog, ELK, Prometheus, Grafana).
- Good knowledge of Unix system, networking, web technologies, and databases.
- Expert with troubleshooting issues and bugs.
- Incident Management experience coupled with effective communication skills.
- Experience in financial domain (desirable).
- Prior SRE/DevOps experience desirable.
Arcesium and its affiliates do not discriminate in employment matters on the basis of race, color, religion, gender, gender identity, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other category protected by law. Note that for us, this is more than just a legal boilerplate. We are genuinely committed to these principles, which form an important part of our corporate culture, and are eager to hear from extraordinarily well qualified individuals having a wide range of backgrounds and personal characteristics.
-
Site Reliability Engineering Lead
2 weeks ago
Bangalore, India OptOut Full timeJob DescriptionAt OptOut, we're seeking a highly skilled Site Reliability Engineer Lead to join our team. As a key member of our engineering organization, you will be responsible for leading our SRE & Observability teams and executing on the vision of providing an enterprise-based common Observability Platform leveraged by a global Engineering, Product, and...
-
Site Reliability Engineer
3 weeks ago
Bangalore, India Yogy HR Solutions Full timeSite Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Yogy HR Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our cloud-based systems.Key Responsibilities:Collaborate with development partners to design and implement scalable...
-
Site Reliability Engineer
4 weeks ago
Bangalore, India Yogy HR Solutions Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Yogy HR Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our cloud-based systems.Key Responsibilities:Collaborate with development partners to design and...
-
Lead Site Reliability Engineer
2 months ago
bangalore, India Tyson Foods India Full timeJob Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...
-
Lead Site Reliability Engineer
2 months ago
bangalore, India Tyson Foods India Full timeJob Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...
-
Lead Site Reliability Engineer
5 months ago
bangalore, India Tyson Foods India Full timeJob Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...
-
Lead Site Reliability Engineer
2 months ago
Bangalore City, India Tyson Foods India Full timeJob Description – Lead Site Reliability Engineer (Cloud Engineering) The role as Lead Site Reliability Engineer in the Data & Analytics organization, is to lead efforts in ensuring the reliability, scalability, and performance of our cloud-based systems in like GCP/AWS. The role will play a crucial part in designing and implementing robust, scalable...
-
Site Reliability Engineer
4 weeks ago
Bangalore, India Cyitechsearch Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Cyitechsearch. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our full-stack software applications.Key Responsibilities:Develop and provide operational...
-
Site Reliability Engineer
3 weeks ago
Bangalore, India Micoworks Full timeJob Title: Site Reliability EngineerAt Micoworks, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our cloud-based services.Key Responsibilities:Design, implement, and maintain scalable and reliable...
-
Site Reliability Engineer
3 weeks ago
Bangalore, India Squareroot Consulting Pvt Ltd. Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Squareroot Consulting Pvt Ltd. in Bangalore, India. As a Site Reliability Engineer, you will be responsible for designing and implementing secure and scalable infrastructure as a service, automating infrastructure provisioning, and building tools...
-
Site reliability engineer
3 weeks ago
Bangalore, India Integra Connect Full timeAbout Integra Connect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the Integra Cloud platform, the company’s core applications span population health including...
-
Site Reliability Engineer
2 months ago
bangalore, India Integra Connect Full timeAbout IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineer
3 weeks ago
Bangalore, India Integra Connect Full timeAbout IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineer
1 month ago
Bangalore, India Integra Connect Full timeAbout IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineer
4 hours ago
bangalore, India Tranzeal Incorporated Full timeHi Everyone,One of our Direct client is Hiring Site Reliability Engineer in Bengaluru, Karnataka, India. If anyone is interested, please share your resume.Job Title: Site Reliability EngineerLocation: Bengaluru, Karnataka, India - OnsiteJob DescriptionResponsible for maintaining and scaling production services and servers across multiple data centers for...
-
Site Reliability Engineer
4 weeks ago
Bangalore, India Wealthy Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Wealthy. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining reliable containerized applications using Kubernetes on GCP.Key Responsibilities:Develop and optimize SLIs, SLOs, and SLAs for critical...
-
Site Reliability Engineer
3 weeks ago
Bangalore, India Wealthy Full timeJob Title: Site Reliability EngineerWealthy is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining reliable containerized applications using Kubernetes on GCP.Key Responsibilities:Develop and optimize SLIs, SLOs, and SLAs for critical systems...
-
Site reliability engineer
4 days ago
Bangalore, India CSC Full timeRole: Site Reliability Engineer Location: Mumbai/ Bangalore Working Model: Hybrid Shift: 12-9 PM Intro: Do you want to be noticed for your work? Make a difference every day? Be Impactful? Work with cutting edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading...
-
Site Reliability Engineer
6 days ago
bangalore, India Integra Connect Full timeAbout IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineer
5 months ago
bangalore, India Integra Connect Full timeAbout IntegraConnectIntegra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...