
Lead - Site Reliability Engineer
3 weeks ago
We are looking for a Lead - Site Reliability Engineer with 8+ years for Experience into design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, Google Cloud Monitoring, and OpenTelemetry, along with exposure to SolarWinds. You should be comfortable working with metrics, logs, and traces, and be able to correlate telemetry data to proactively detect, diagnose, and resolve performance issues.
Key Responsibilities:
- Design and maintain observability pipelines using OpenTelemetry, Prometheus, and Grafana.
- Build dashboards and alerts to monitor system health, application performance, and business KPIs.
- Integrate observability solutions with Google Cloud Platform services and SolarWinds.
- Correlate logs, metrics, and traces to troubleshoot incidents and reduce MTTR.
- Collaborate with SREs, DevOps, and development teams to improve end-to-end system observability.
- Implement best practices for telemetry data collection, enrichment, storage, and visualization.
Requirements:
- Strong experience with Prometheus and Grafana for monitoring and alerting.
- Proficiency in OpenTelemetry for instrumenting distributed systems.
- Working knowledge of observability tools in Google Cloud (e.g., Cloud Monitoring, Logging, Trace).
- Exposure to SolarWinds for network and infrastructure monitoring.
- Solid understanding of telemetry data types: metrics, logs, and traces.
- Ability to correlate and analyze multi-source observability data.
- Scripting skills (Python, Bash) and familiarity with Infrastructure-as-Code is a plus.
Preferred Qualifications:
- Experience in Site Reliability Engineering or Platform Engineering roles.
- Knowledge of SLIs/SLOs and performance benchmarking.
- Experience with APM tools (e.g., Datadog, New Relic) is a plus.
-
Lead Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India JP Morgan Chase & Co. Full timeJob DescriptionAssume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking Team, you will take the lead in conducting resiliency design reviews, break...
-
Site Reliability Engineer
2 days ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
2 days ago
Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per yearImagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India IntraEdge Full timeSite Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:- Strong leadership and people management skills.- Exceptional technical proficiency in Pearson's technology stack.- Advanced project management capabilities.- Excellent communication and collaboration skills.- Adept at risk assessment and...
-
Senior Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India Options Executive Search Private Limited Full timeJob Title : SRE Lead Engineer. Location : Hyderabad, India. We are seeking a DevOps / SRE Lead Engineer to architect and scale our client's multi-tenant SaaS platform with AI/ML at the core. Our client, a fast-growing AI-powered SaaS company in the FinTech space, is looking for a Site Reliability Engineering (SRE) Lead Engineer to join their dynamic team....
-
Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India ServiceNow Full timeSite Reliability Engineer (SRE)Experience : 6+ YearsAbout the Role : We are seeking a seasoned SRE to ensure the reliability, availability, and performance of our critical services. You will combine software engineering with systems administration to create scalable and highly reliable software systems.Responsibilities : - Design, build, and maintain...
-
Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India IntraEdge Full timeSite Reliability Engineer Experience: 7+ Years Location: Hyderabad Hybrid 4-day office and 1 Day remote Skills for Principal: Strong leadership and people management skills. Exceptional technical proficiency in Pearson's technology stack. Advanced project management capabilities. Excellent communication and collaboration skills. Adept at risk assessment...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India IntraEdge Full timeSite Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...
-
SRE(Site Reliability Engineer)
2 days ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSRE (Site Reliability Engineer)Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India INDIGLOBE IT SOLUTIONS PRIVATE LIMITED Full timeJob Summary :We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). Youll be responsible for owning application support, maintaining our microservices...