
Site Reliability Engineer
2 weeks ago
Site Reliability Engineer (SRE)
You will work on system monitoring, incident response, and platform stability-while also improving observability, creating automation scripts, and collaborating with developers and DevOps teams. You wont just respond to alerts-youll help prevent them.
Work Mode : Permanent Night Shift
Note : This is a fixed night shift role. Candidates must have prior experience or explicitly confirm readiness for permanent US-time zone shifts.
Key Responsibilities :
- Lead incident triage for P1/P2 alerts, engage in war rooms, update tickets (JIRA/SNOW), and participate in post-incident RCA documentation.
- Create or enhance automation scripts (Bash/Python) for log ingestion, alert suppression, auto-recovery, and health checks.
- Analyze application runtime issues-such as JVM logs, memory usage, GC pauses, or thread deadlocks-to support root cause analysis.
- Participate in daily DevOps/SRE standups, collaborating closely with engineering teams to improve production reliability.
- Handle database performance alerts (Oracle/Postgres) and collaborate with DBAs or developers to resolve backend bottlenecks.
- Track and interpret SLO breaches, availability metrics, and system latencies to enforce production SLAs.
Core Skills & Expertise :
Must-Have Technical Skills :
- Experience with Grafana, Prometheus, ELK Stack, or Stackdriver. Able to define alerts, read logs, and correlate cross-system issues.
- Full ownership of P1/P2 incidents - including triage, ticketing, stakeholder communication, and RCA participation.- Proficient in Bash or Python scripting to automate routine SRE tasks and recovery workflows.
- Experience managing production workloads on GCP, AWS, or Azure, with ability to inspect cloud logs, VM status, networking, and storage configurations.
- Familiar with concepts like error budgets, latency thresholds, and SLO tracking. Capable of interpreting breaches and reporting anomalies.
- Able to spot symptoms of JVM issues like GC pauses, memory leaks, thread contention, and raise appropriate diagnostics.
- Identify backend delays or errors from logs and assist in pinpointing query or connection-related issues.
- Strong communication skills to work with distributed teams during escalations, code fixes, or configuration changes.
- Must be fully aligned to a permanent night shift (US time) and self-sufficient in a remote-first environment.
Nice-to-Have Skills :
- Experience monitoring CPU, memory, and traffic metrics to recommend infrastructure scale-up/down strategies.
- Exposure to embedding SRE gates, smoke tests, or health validations in CI pipelines like Jenkins or GitHub Actions.
- Basic understanding of tools like SLO Generator or Datadog for automated budget tracking and alerting.
- Can interpret Terraform code related to monitoring, infrastructure, or alert rules. Not required to author full modules.
- Holding a GCP Associate Cloud Engineer or similar certification is a plus but not mandatory.
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Programming Full time ₹ 10,00,000 - ₹ 25,00,000 per yearRole - Site Reliability Engineering.Location - BengaluruYears of Expereince - 4+ YearsProfessional & Technical Skills:Must To Have Skills: Proficiency in Site Reliability Engineering.Good To Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud.Strong understanding of CI/CD tools and practices.Experience with container...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India FOSS United Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAll JobsSite Reliability Engineer at ZEISS IndiaSite Reliability EngineerApplyPosted on September 11, 2025ZEISS IndiaKadubeesanahalli, BengaluruFull TImeJob DescriptionZEISS in IndiaZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India ViewSonic Full timeJob Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India ViewSonic Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India NatWest Group Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer, AVP Join us as a Site Reliability EngineerYou'll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) We'll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of...
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India NatWest Group Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSite Reliability Engineer Join us as a Site Reliability EngineerIn this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services You'll enjoy significant...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Funic Tech Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Title : Site Reliability Engineer (SRE)Experience Required : 7 YearsLocation : Bangalore / ChennaiEmployment Type : Full-TimeWork Mode : OnsiteRole Overview : We are seeking a highly skilled Site Reliability Engineer (SRE) with 7 years of experience to ensure the reliability, scalability, and performance of our systems. The ideal candidate will bring...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India PROGRESS SOFTWARE Full time ₹ 6,00,000 - ₹ 12,00,000 per yearJob Description Site Reliability Engineer Hybrid Hyderabad, IndiaBengaluru, India DevOps Apply nowJob Summary We are Progress (Nasdaq: PRGS) - the trusted provider of software that enables our customers to develop, deploy and manage responsible, AI-powered applications and experience with agility and ease. Were proud to have a diverse, global team...
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India NatWest Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer,VP Join us as a Site Reliability EngineerIn this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services You'll enjoy significant...
-
Site Reliability Engineer
5 days ago
Bengaluru, Karnataka, India Zetamicron Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title: Site Reliability Engineer (SRE)About the RoleWe are seeking a highly skilled and proactive Site Reliability Engineer (SRE)to ensure the stability, scalability, and reliability of our platform. The ideal candidate will have strong experience in managing production environments, automating operational processes, and enhancing system performance...