Site Reliability Engineer

1 week ago


Bengaluru, Karnataka, India GlobalLogic Full time ₹ 8,00,000 - ₹ 12,00,000 per year

Requirements
Site Reliability Engineer (SRE) – Platform Reliability & Operational Excellence

Overview
At Client, we're scaling a mission‑critical safety and automation platform as we evolve from a monolith into distributed, event‑driven and microservice-based systems. Reliability, latency, and operational efficiency are foundational—not afterthoughts.

We're seeking a Site Reliability Engineer (SRE) who blends software engineering discipline with systems, infrastructure, and observability expertise. You'll own availability, performance, scalability, and production readiness across services—driving automation, reducing toil, and enabling fast, safe delivery.

This role is for someone who wants to shape a modern reliability culture while protecting a platform that directly advances road safety through real-time data, analytics, and AI.

Key Responsibilities

  • Define, implement, and iterate SLIs/SLOs (latency, availability, errors, saturation); operationalize error budgets and trigger corrective action.
  • Engineer end‑to‑end observability (metrics, logs, traces, events) leveraging Datadog to accelerate detection and root cause analysis.
  • Automate infrastructure (Terraform), deployment workflows, self‑healing mechanisms, and progressive delivery (canary / blue‑green).
  • Lead incident lifecycle: detection, triage, mitigation, coordination, communication, and high-quality post‑incident reviews that drive systemic fixes.
  • Build and optimize CI/CD pipelines (GitHub Actions or equivalent) with reliability, rollback safety, and change quality controls.
  • Perform capacity & performance engineering: load modeling, autoscaling policies, cost / efficiency tuning.
  • Reduce toil via tooling, runbooks, proactive failure analysis, chaos / fault injection (AWS FIS or similar).
  • Partner with development teams on architectural reviews and production readiness (operability, resilience, security, observability).
  • Enforce least‑privilege, secrets management, and infrastructure security; integrate policy as code.
  • Improve alert quality (noise reduction, actionable context) to lower MTTR and fatigue.
  • Champion reliability patterns: backpressure, graceful degradation, and circuit breaking.
  • Support distributed systems debugging (timeouts, partial failures, consistency anomalies) with emphasis on AI.
  • Contribute to governance of change management, deployment health gates, and release safety.
  • Document playbooks, escalation paths, and evolving reliability standards.
  • Treat reliability as a product: roadmap, KPIs, stakeholder alignment, continuous improvement.
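
For candidates who want a concrete picture of the error-budget responsibility listed above, here is a minimal sketch in Python. It is an illustration only, not the client's actual tooling: the `SloWindow` type, the function names, and the freeze threshold are all hypothetical, and it assumes a simple request-based availability SLI.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SloWindow:
    """Request counts observed over one SLO window (hypothetical shape)."""
    total_requests: int
    failed_requests: int


def error_budget_remaining(window: SloWindow, slo_target: float) -> float:
    """Return the unspent fraction of the error budget (negative if overspent)."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if window.total_requests == 0:
        return 1.0  # no traffic in the window, so nothing has been spent
    # The budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * window.total_requests
    return 1.0 - window.failed_requests / allowed_failures


def should_freeze_releases(window: SloWindow, slo_target: float,
                           threshold: float = 0.0) -> bool:
    """Assumed policy: pause risky changes once the budget is at or below threshold."""
    return error_budget_remaining(window, slo_target) <= threshold
```

With a 99.9% target, 250 failures out of 1,000,000 requests leaves roughly three quarters of the budget unspent, while 1,200 failures overspends it and would trigger the assumed release freeze.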

Preferred Qualifications

  • 3+ years in SRE / Production Engineering / DevOps.
  • Proficient in one or more: Go, Python, or Ruby for automation, tooling, and services.
  • Strong Linux internals and networking fundamentals (DNS, TLS, HTTP, routing, load balancing).
  • Hands-on Infrastructure as Code (Terraform) and GitOps workflows.
  • Containers & orchestration (AWS ECS) including resource tuning & scaling strategies.
  • Production-grade observability: Prometheus, Grafana, OpenTelemetry, ELK, Datadog (preferred).
  • CI/CD design (pipelines, promotion strategies, automated verification, rollout / rollback).
  • Full incident management lifecycle & quantitative postmortem practices.
  • Experience with distributed systems failure modes (latency spikes, retry storms, thundering herds).
  • Chaos / fault injection frameworks (AWS FIS preferred).
  • Performance / load testing (k6, Locust, Gatling) and profiling for bottleneck isolation.
  • BS/MS in Computer Science, Engineering, or equivalent practical expertise.
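
The failure modes named above (retry storms, thundering herds) are commonly mitigated with capped exponential backoff plus jitter. The sketch below, in Python, shows one such approach under stated assumptions: the function names, the default delays, and the "full jitter" strategy are illustrative choices, not something this posting prescribes.

```python
import random
import time
from typing import Callable, List, Optional, TypeVar

T = TypeVar("T")


def backoff_delays(retries: int, base: float = 0.1, cap: float = 5.0,
                   rng: Optional[random.Random] = None) -> List[float]:
    """Full-jitter delays: each is uniform in [0, min(cap, base * 2**i)]."""
    if rng is None:
        rng = random.Random()
    return [rng.uniform(0.0, min(cap, base * 2 ** i)) for i in range(retries)]


def call_with_retries(operation: Callable[[], T], attempts: int = 5,
                      base: float = 0.1, cap: float = 5.0,
                      sleep: Callable[[float], None] = time.sleep,
                      rng: Optional[random.Random] = None) -> T:
    """Call operation, retrying on any exception with jittered backoff."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    delays = backoff_delays(attempts - 1, base, cap, rng)
    for attempt in range(attempts):
        try:
            return operation()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error to the caller
            # Randomized waits keep many clients from retrying in lockstep.
            sleep(delays[attempt])
    raise AssertionError("unreachable")
```

Randomizing each delay spreads retries from many clients over time, and the cap bounds how long any single caller waits. In production code the broad `except Exception` would normally be narrowed to errors known to be transient.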

Mindset & Behaviors

  • Bias for automation and measurable reliability outcomes.
  • Calm, clear communicator under pressure; drives clarity during ambiguity.
  • Sees reliability as a product with customers, SLAs, and iteration cycles.
  • Data-driven; prefers leading indicators over reactive firefighting.
  • Raises the bar for operational excellence and shared ownership.

Why Join Us

  • Shape quality & reliability strategy for a modern, mission-driven safety platform.
  • Direct impact: your work protects communities and improves public safety outcomes.
  • Work across observability, distributed systems, infrastructure automation, and high-velocity delivery.
  • Influence engineering culture: shift-left reliability, proactive resilience, sustainable on-call.
  • Collaborate with teams modernizing architecture (microservices, event streaming, serverless, edge).
  • Leverage advanced tooling (Datadog, Terraform, progressive delivery frameworks).
  • Join a culture focused on learning loops, autonomy, and meaningful impact.

What we offer

Culture of caring.
At GlobalLogic, we prioritize a culture of caring. Across every region and department, at every level, we consistently put people first. From day one, you'll experience an inclusive culture of acceptance and belonging, where you'll have the chance to build meaningful connections with collaborative teammates, supportive managers, and compassionate leaders.

Learning and development.
We are committed to your continuous learning and development. You'll learn and grow daily in an environment with many opportunities to try new things, sharpen your skills, and advance your career at GlobalLogic. With our Career Navigator tool as just one example, GlobalLogic offers a rich array of programs, training curricula, and hands-on opportunities to grow personally and professionally.

Interesting & meaningful work.
GlobalLogic is known for engineering impact for and with clients around the world. As part of our team, you'll have the chance to work on projects that matter. Each is a unique opportunity to engage your curiosity and creative problem-solving skills as you help clients reimagine what's possible and bring new solutions to market. In the process, you'll have the privilege of working on some of the most cutting-edge and impactful solutions shaping the world today.

Balance and flexibility.
We believe in the importance of balance and flexibility. With many functional career areas, roles, and work arrangements, you can explore ways of achieving the perfect balance between your work and life. Your life extends beyond the office, and we always do our best to help you integrate and balance the best of work and life, having fun along the way.

High-trust organization.
We are a high-trust organization where integrity is key. By joining GlobalLogic, you're placing your trust in a safe, reliable, and ethical global company. Integrity and trust are a cornerstone of our value proposition to our employees and clients. You will find truthfulness, candor, and integrity in everything we do.

About GlobalLogic
GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world's largest and most forward-thinking companies. Since 2000, we've been at the forefront of the digital revolution – helping create some of the most innovative and widely used digital products and experiences. Today we continue to collaborate with clients in transforming businesses and redefining industries through intelligent products, platforms, and services.


