
Senior Site Reliability Engineer
3 weeks ago
e're looking for a hands-on Site Reliability / DevOps Engineer to be our first hire in this function, responsible for owning and scaling the reliability, observability, and infrastructure of our platform running entirely on Microsoft Azure.
You'll be critical in shaping DevOps culture, architecting fault-tolerant systems, and deploying automation to improve uptime, performance, and cost efficiency.
This is a hybrid role combining SRE and DevOps principles - ideal for builders comfortable working in fast-paced, product-driven environments.
What You'll Own :
Cloud Infrastructure (Microsoft Azure Must Have) :
- Architect, deploy, and maintain services across Azure App Services, Azure Container Apps, Cosmos DB, Event Hubs, Azure Monitor, Azure VMs, and Azure Kubernetes Service (AKS).
- Design and manage networking (VNets, Subnets, NSGs) and identity/access controls (PIM, Managed Identities, Enterprise Applications, Role-based Access Control).
- Own infrastructure provisioning using Terraform / Bicep.
- Implement cost-effective, scalable, and secure cloud environments across development, staging, and production.
Monitoring, Observability & Incident Response :
- Set up end-to-end observability using Prometheus, Grafana, Azure Monitor, ELK Stack, and Sentry.
- Define and enforce standards for logging, metrics, traces, SLIs/SLOs, and error budgets.
- Build proactive alerting systems for APIs, RabbitMQ, Databricks pipelines, and external integrations.
- Establish on-call rotations, incident response runbooks, and lead RCAs to minimize MTTR.
CI/CD, Automation & Tooling :
- Automate deployments and infrastructure lifecycle using GitHub Actions, Terraform modules, and CLI tools.
- Improve CI/CD for faster, safer releases across containerized and VM-based workloads.
- Build internal tools for diagnostics, rollback safety, and release automation.
- Integrate resilience patterns : retries, circuit breakers, backoff strategies, failovers.
DevOps & System Reliability :
- Optimize system performance, memory usage, and availability for core services like RabbitMQ, APIs, analytics pipelines on Databricks.
- Implement zero-downtime deployments, self-healing systems, and infrastructure audits.
- Perform regular cost analysis, right-sizing, and tag-based budget enforcement.
Security & Compliance Collaboration :
- Work with security teams to maintain infrastructure and data flow diagrams, support ISO 27001, GDPR, PDPA readiness.
- Participate in threat modeling, define trust boundaries, and implement audit-ready infrastructure practices.
Tech Stack You'll Work With :
- Cloud : Microsoft Azure (App Services, Container Apps, AKS, Cosmos DB, Event Hubs, Monitor, VMs).
- IaC : Terraform, Bicep.
- CI/CD : Azure Devops,GitHub Actions.
- Monitoring & Logs : Prometheus, Grafana, Azure Monitor, ELK, Sentry.
- Queueing : RabbitMQ, Kafka.
- Languages : Node.js, Python (mostly for debugging
-
Cloud Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 15,00,000 - ₹ 25,00,000 per yearBe at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...
-
Senior Site Reliability Engineer
2 days ago
Chennai, Tamil Nadu, India, Tamil Nadu Tata Consultancy Services Full timeDear Candidates,Greetings from TCS!!!TCS is looking for Senior Site Reliability Engineer – AWSExperience: 8-12 yearsLocation: ChennaiMust have skills: Design, implement, and maintain scalable, secure, and highly available infrastructure on AWSDevelop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, HarnessOwn and implement...
-
Site Reliability Engineer
7 days ago
Chennai, Tamil Nadu, India NatWest Group Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer Join us as a Site Reliability EngineerIn this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services You'll enjoy significant...
-
Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India Elgebra Full time ₹ 6,00,000 - ₹ 18,00,000 per yearHiring: Site Reliability Engineer – 7+ YearsLocation: Bangalore / Chennai Payroll: Elgebra Client: Qincline Joining: Immediate to 15 DaysRole Overview:We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and the...
-
Senior Engineer, Site Reliability
1 week ago
Chennai, Tamil Nadu, India SES Full time ₹ 80,00,000 - ₹ 2,00,00,000 per yearJob Description Senior Engineer, Site Reliability India - Chennai The Senior Engineer, Site Reliability is directly responsible for the Development, Monitoring, Operation and Support of the global SES Cloud and on premise install base, with a strong focus on Systems. They act as the main backup of the Senior Manager IT Systems and in support of the IT...
-
Site Reliability Engineer
3 days ago
Chennai, Tamil Nadu, India Trimble Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSite Reliability Engineer Job Summary We are seeking a motivated Site Reliability Engineer (SRE) Level 1 to enhance the infrastructure and operational reliability of our ERP product, specifically within Azure and Windows environments. The ideal candidate will utilize SRE principles to ensure high system availability, stability, and performance while...
-
Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India Concord Full timeSRE Sr. Engineers (Individual Contributors)Key Attributes:Strong SRE (Site Reliability Engineering) experienceDevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc.Excellent troubleshooting and debugging skills (infrastructure + application level)Perseverance – must push through complex/challenging issues without giving upAble to...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Elgebra Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole Overview : We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our client, Qincline. The ideal candidate will have 7 or more years of dedicated experience in Site Reliability Engineering or a closely related discipline. This pivotal role requires a strong focus on ensuring the...
-
Senior Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India Miratech Full timeCompany Description Miratech helps visionaries change the world We are a global IT services and consulting company that brings together enterprise and start-up innovation Today we support digital transformation for some of the world s largest enterprises By partnering with both large and small players we stay at the leading edge of technology remain...
-
Site Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India Grootan Technologies Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4 to 5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...