Site Reliability Engineering Lead
3 months ago
The SRE team is responsible for monitoring the stability and availability of mission-critical production systems, managing incidents for quicker resolution, and establishing business-as-usual (BAU) operations. The team also builds tools and infrastructure that all development teams use to monitor and troubleshoot their systems.
What you'll do:
- Lead reliability engineering projects and drive them to closure.
- Write code and perform code reviews for best practices and code quality.
- Contribute to the design/architecture of the system.
- Automate processes, reduce toil, and find opportunities to improve the observability and availability of the platform.
- Supervise a team of SREs, ensuring production applications are stable, reliable, and well documented.
- Own end-to-end availability and performance of mission-critical services.
- Analyze and debug complex issues across tiers from frontend to mid-tier to infrastructure.
- Practice sustainable incident response and blameless postmortems.
What you'll need:
- 5 to 9 years of experience handling systems for large-scale production environments.
- A self-starter, able to build, drive, and advocate for SRE solutions.
- Effective cross-functional collaboration skills to develop tools for secure, scalable, and reliable systems.
- Solid understanding of SRE concepts like SLAs, SLOs, SLIs, error budgets, MTTR, MTTD, etc.
- Experience with a variety of tools that help manage, understand, and debug large, complex distributed systems.
- Good programming experience (Python/Go).
- Hands-on experience with Kubernetes and Docker.
- Working knowledge of any one of the cloud platforms (AWS, Azure, GCP).
- Experience with monitoring and logging tools (e.g. Datadog, ELK, Prometheus, Grafana).
- Good knowledge of Unix systems, networking, web technologies, and databases.
- Expertise in troubleshooting issues and bugs.
- Incident management experience coupled with effective communication skills.
- Experience in financial domain (desirable).
- Prior SRE/DevOps experience desirable.
Arcesium and its affiliates do not discriminate in employment matters on the basis of race, color, religion, gender, gender identity, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other category protected by law. Note that for us, this is more than just legal boilerplate. We are genuinely committed to these principles, which form an important part of our corporate culture, and are eager to hear from extraordinarily well-qualified individuals having a wide range of backgrounds and personal characteristics.
-
Site Reliability Engineering Lead
4 weeks ago
Bengaluru, Karnataka, India Flipkart Full time
About the Role: We are seeking a highly skilled Site Reliability Engineering Manager to lead our Reliability and Productivity Engineering team at Flipkart. As a key member of our SRE organization, you will be responsible for overseeing the end-to-end development process, from ideation to deployment, ensuring the delivery of high-quality, scalable, and...
-
Site Reliability Engineering Lead
2 weeks ago
Bengaluru, Karnataka, India Flipkart Full time
About the Role: As a Site Reliability Engineering Manager at Flipkart, you will be responsible for leading a team of skilled engineers in optimizing search functionalities and driving innovation. As a Site Reliability Engineering Manager, you will oversee the end-to-end development process, from ideation to deployment, ensuring the delivery of high-quality,...
-
Site Reliability Engineering Lead
4 weeks ago
Bengaluru, Karnataka, India Arcesium Full time
Job Title: Site Reliability Engineering Lead. The Site Reliability Engineering (SRE) team at Arcesium plays a critical role in ensuring the stability and availability of our mission-critical production systems. As a Site Reliability Engineering Lead, you will be responsible for leading reliability engineering projects, driving them to closure, and ensuring the...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Granicus Full time
About Granicus: Granicus is a leading provider of cloud-based solutions for government agencies and organizations. We are committed to helping our customers achieve their goals by providing innovative and effective technology solutions. The Role: We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Okta, Inc. Full time
About Okta: Okta is a leading identity and access management company that helps organizations securely manage access to their applications and resources. We're looking for a skilled Site Reliability Engineer to join our team and help us build and maintain our production infrastructure. Job Summary: We're seeking a highly motivated and experienced Site Reliability...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India TEKsystems Global Services in India Full time
Job Title: Site Reliability Engineer. At TEKsystems Global Services in India, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our systems and infrastructure. Key Responsibilities: Lead and manage incident response and blameless...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India ITC Infotech Full time
Job Title: Site Reliability Engineer. At ITC Infotech, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our systems and infrastructure. Key Responsibilities: Collaboration and Partnership: Partner with application developers and...
-
Site Reliability Engineer
7 hours ago
Bengaluru, India N Consulting Ltd Full time
Experience: 10+ years. Location: Bengaluru. Job Description: Site Reliability Engineering. Good communication and leadership skills. Experience in software release management or on the application side (code reviews). Should have strong knowledge of Java. Should have strong knowledge of Python. Should have strong knowledge of AWS. Should have lead experience. Site...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India TEKsystems Global Services in India Full time
Job Title: Site Reliability Engineer. At TEKsystems Global Services in India, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications. Key Responsibilities: Incident Management: Lead and...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India TEKsystems Global Services in India Full time
Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at TEKsystems Global Services in India. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud and on-premise infrastructure. Key Responsibilities: Design, implement, and...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India ITC Infotech Full time
Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at ITC Infotech. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and services. Key Responsibilities: Collaboration and Partnership: Partner with application...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India TEKsystems Global Services in India Full time
Job Title: Site Reliability Engineer. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at TEKsystems Global Services in India. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure. Key Responsibilities: Design and implement monitoring and...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India TEKsystems Global Services in India Full time
Job Title: Site Reliability Engineer. TEKsystems Global Services in India is seeking a highly skilled Site Reliability Engineer to join our team. About the Role: We are looking for a talented individual with a strong background in monitoring and observability tools, automation, and scripting languages. The ideal candidate will have a proven track record in...
-
Site Reliability Engineering Practice Lead
4 weeks ago
Bengaluru, Karnataka, India NexionPro Services Full time
Job Title: Site Reliability Engineering Practice Head. We are seeking an experienced Site Reliability Engineering (SRE) Practice Head to lead and manage our SRE function. Based in Pune and Bengaluru, this senior leadership role will involve building and scaling the SRE teams, ensuring operational excellence, and delivering reliable, scalable, and...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Accolite Full time
Site Reliability Engineering Leadership Opportunity. Accolite is seeking a seasoned Site Reliability Engineering (SRE) expert to serve as the SRE Lead / Architect. This role is based in Bangalore and requires 10+ years of experience. Key Responsibilities: Lead the SRE team and guide them towards excellence in SRE. Collaborate with stakeholders to define KPIs,...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Seven N Half Full time
Job Title: Site Reliability Engineer. We are seeking a skilled and experienced Site Reliability Engineer to join our dynamic team at Seven N Half. As a Site Reliability Engineer, you will play a crucial role in designing, implementing, and maintaining our cloud infrastructure and CI/CD pipelines. Key Responsibilities: Design High-Availability Systems to ensure...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Fidelity Investments Full time
The Role of a Site Reliability Engineer: Fidelity Investments is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. This includes designing, implementing, and maintaining systems that meet the needs of our business. Key...
-
Site Reliability Engineering Practice Lead
3 weeks ago
Bengaluru, Karnataka, India NexionPro Services Full time
Job Title: Site Reliability Engineering Practice Head. Location: Pune & Bengaluru. Experience: 20+ Years. Employment Type: Full-time. We are seeking an experienced Site Reliability Engineering (SRE) Practice Head to lead and manage our SRE function. Based in Pune and Bengaluru, this senior leadership role will involve building and scaling the SRE teams, ensuring...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Cisco Full time
Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Cisco. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and reliability of our cloud services. Key Responsibilities: Design and implement scalable and efficient cloud infrastructure solutions. Collaborate with cross-functional...
-
Site Reliability Engineering
5 months ago
Bengaluru, India Microsoft Full time
Overview: Looking to join an exciting industry and organization at the forefront of the next Tech industry transformation? Are you ready to join a team of the world’s best technical experts to enable the success of Microsoft solutions for our commercial & enterprise customers? We are seeking to build out the team of next generation Site Reliability...