
Senior Site Reliability Engineer
4 days ago
Role and Responsibilities
Manage and support production environments on cloud platforms, with a strong preference for Microsoft Azure. Apply expertise in observability tools such as Dynatrace, Splunk, Datadog, Grafana, and New Relic to monitor system health. Implement modern observability practices including end-to-end (E2E) instrumentation, telemetry, and unified dashboard creation. Drive organizational change by influencing senior leadership and improving SRE practices company-wide. Write automation scripts using Python (strongly preferred) to streamline operations and eliminate manual effort. Deploy cloud infrastructure using tools like Ansible, Terraform, and Azure DevOps. Work confidently with Continuous Integration/Continuous Deployment (CI/CD) tools such as GitLab, Jenkins, Bamboo, Travis CI, and CircleCI. Operate and orchestrate containerized environments using Kubernetes and Docker. Troubleshoot complex issues and provide reliable, scalable solutions. Embrace continuous learning and demonstrate a strong passion for automation and process improvement. Use logging stacks like ELK (Elasticsearch, Logstash, and Kibana), Loki, and Splunk to maintain visibility and traceability. Influence organizational adoption of Infrastructure as Code (IaC) and CI/CD methodologies. Define and monitor Service Level Objectives (SLOs) and Service Level Agreements (SLAs). Lead incident response efforts and perform Root Cause Analysis (RCA) to minimize recurrence.Skills and Experience
Bachelor’s degree in Computer Science, Information Science, Engineering, or a related discipline. 6+ years of experience in Site Reliability Engineering (SRE) or DevOps roles, with a focus on cloud-based production systems. Ensure the availability, low latency, performance, and cost efficiency of global e-commerce platforms. Design and maintain full-stack observability solutions, including dashboards and standardized instrumentation. Implement advanced monitoring and alerting systems tailored for both internal engineering teams and external stakeholders. Advocate for SRE best practices and promote operational excellence across teams and departments. Collaborate with engineering, product, and operations teams to increase reliability and accelerate delivery timelines. Build automation tools that support incident response, system recovery, and software delivery pipelines. Track and maintain error budgets, achieve defined SLOs, and guarantee high uptime for mission-critical services. Identify system bottlenecks and anomalies proactively, ensuring optimal performance under peak loads. Automate infrastructure management to reduce costs and scale efficiently during traffic surges. Lead strategic, cross-functional initiatives that enhance overall system architecture and reliability.-
Senior Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India JA Consulting Full timeAbout the job : Role : Senior Site Reliability Engineer SaaS Real Estate Platform About the Client : We are hiring on behalf of our reputed SaaS product-based client based in Hyderabad. They are a global leader in real estate software development.The Role : Were seeking a Senior Site Reliability Engineer (SRE) with a strong Software Engineering background...
-
Senior Site Reliability Engineer
2 weeks ago
Hyderabad, India CloudHire Full timeJob Summary The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...
-
Senior Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India CloudHire Full time ₹ 7,00,000 - ₹ 12,00,000 per yearJob SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...
-
Site Reliability Engineer
1 day ago
Hyderabad, India SID Global Solutions Full timeJob Role: Site Reliability Engineer (SRE) – GCPExperience: 3+ yearsLocation: HyderabadAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...
-
Site reliability engineer
2 days ago
Hyderabad, India SID Global Solutions Full timeJob Role: Site Reliability Engineer (SRE) – GCP Experience: 3+ years Location: Hyderabad About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API...
-
Senior Site Reliability Engineer
1 day ago
Hyderabad, Telangana, India Goldman Sachs Services Pvt Ltd Full time ₹ 10,00,000 - ₹ 25,00,000 per yearEngineering-L2-Hyderabad-Vice President-Software Engineering-Bengaluru/Hyderabad Site Reliability Engineer - Vice President Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run scalable, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for...
-
Site Reliability Engineer
2 days ago
Hyderabad, India SID Global Solutions Full timeJob Role: Site Reliability Engineer (SRE) – GCPExperience: 3+ yearsLocation: HyderabadAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India Jigya Software Services Full timeJob Title:Senior Site Reliability Engineer (SRE) - AWS/Kubernetes Location:Hyderabad - Onsite Job Type:Full-Time About the Role: We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance,...
-
Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India Jigya Software Services Full time ₹ 1,50,000 - ₹ 28,00,000 per yearJob Title:Senior Site Reliability Engineer (SRE) - AWS/KubernetesLocation:Hyderabad - OnsiteJob Type:Full-TimeAbout the Role:We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance, and...
-
Senior Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India Microsoft Full timeThe Windows Cloud division is looking for a Senior Site Reliability Engineer that will help us take the Windows Cloud platform as well as the Windows 365 Cloud PC and Azure Virtual Desktop business to the next level Windows 365 Cloud PC W365 and Azure Virtual Desktop AVD have recently been recognized as leaders in the Gartner Magic Quadrant TM for...