
Site Reliability Engineer
3 weeks ago
Hiring for SRE -
Exp- 6+ Years
Notice Period - Immediate - 15 days
About the Role
We are seeking a skilled and passionate Observability Engineer (SRE) to join our team and drive reliability, performance, and visibility across our infrastructure and applications. You will play a key role in designing and implementing observability solutions, improving system uptime, and enabling proactive incident response.
Key Responsibilities
- Design, implement, and maintain observability platforms using tools like Dynatrace, LogicMonitor, and Splunk
- Develop and maintain monitoring dashboards, alerts, and automated responses
- Collaborate with development and infrastructure teams to define SLIs/SLOs/SLAs
- Automate operational tasks using Python and Bash
- Manage and optimize containerized workloads on EKS (Amazon Elastic Kubernetes Service)
- Ensure high availability and performance of services hosted on AWS, with exposure to Azure
- Conduct root cause analysis and post-incident reviews to improve system reliability
- Advocate for SRE best practices including chaos engineering, capacity planning, and incident management
Required Skills & Qualifications
- Hands-on experience with APM tools such as Dynatrace, LogicMonitor
- Proficiency in Python and Bash scripting
- Strong experience with Splunk for log analysis and visualization
- Solid understanding of Kubernetes, especially EKS
- Experience with AWS services; Azure exposure is a plus
- Deep understanding of SRE principles and practices
- Excellent troubleshooting and problem-solving skills
- Strong communication and collaboration abilities
-
Senior Site Reliability Engineer
3 weeks ago
Gurugram, India Freecharge Full timeJob Title: Site Reliability Engineer (SRE)3 Years Experience About the Role: We are looking for a Site Reliability Engineer (SRE) with 3 years of experience to join our team. You will be responsible for ensuring the reliability, scalability, and efficiency of our production systems. This role requires a balance of software engineering, system administration,...
-
Urgent! Site Reliability Engineer
2 weeks ago
Gurugram, Pune, India Prerna Malhotra (Proprietor Of Praxis Hr Solutions) Full timeJob Description Description We are seeking a skilled Site Reliability Engineer (SRE) to join our dynamic team in India. The SRE will be responsible for ensuring the reliability, availability, and performance of our applications and services. This role requires a combination of software engineering and systems engineering to build and maintain scalable and...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India Gemini Solutions Pvt Ltd Full timePosition Summary In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing robust Site Reliability Engineering practices. Your contribution will be pivotal in ensuring the availability, scalability, and performance of our systems and applications. Leveraging your strong technical skills and...
-
Site Reliability Engineer
5 days ago
Gurugram, Gurugram, India Impronics Technologies Full timeJob Description We are seeking a seasoned Site Reliability Engineer (SRE) with a solid background in payment systems and high-availability architectures. The ideal candidate will have hands-on experience managing large-scale, distributed systems in production, with a deep understanding of reliability, scalability, and performance tuning in the financial...
-
Site Reliability Engineer
1 week ago
Gurugram, Hyderabad, India Talent Hired-the Job Store Full time ₹ 15,00,000 - ₹ 25,00,000 per year9+ years of experience in a Site Reliability Engineering or DevOps role.Hands-on experience with Dynatrace and Splunk for monitoring, logging, and alerting.Strong proficiency in Terraform for infrastructure provisioning (AWS, Azure, or GCP).Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI, Azure DevOps).Deep understanding of...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India Leapwork Full timeAt Leapwork, our vision is to break down the barriers between humans and computers through the world's most accessible automation platform. We are the leading global AI-powered visual test automation solution, enabling some of the world's largest enterprises to adopt, scale, and maintain automation – in under 30 days. In today's environment, where...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India S&P Global Market Intelligence Full timeAbout the Role: OSTTRA India The RoleSite Reliability Engineer The TeamSRE is a global team that provides technical support across the suite of OSTTRA products. The SRE team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our...
-
Site Reliability Engineer-2
5 days ago
Gurugram, Haryana, India Realign LLC Full time**Job Type: Full Time**: **Job Category: IT**: Job Title: Site Reliability Engineer Job Summary: Responsibilities and Duties: - Implement and maintain automated monitoring and alerting systems to proactively identify and mitigate issues - Collaborate with development teams to design and implement scalable and reliable services - Troubleshoot and resolve...
-
Site Reliability Engineer,VP
2 weeks ago
Bengaluru, Chennai, Gurugram, India Natwest Digitalx Full time ₹ 1,04,000 - ₹ 1,30,878 per yearJoin us as a Site Reliability EngineerIn this key role, youll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and servicesYoull enjoy significant stakeholder interaction, working in...
-
Gurugram, India TheThreeAcross Full timeJob Description : Role : SRE/ Devops Support Engineer TradeExperience : 4-9 YearsLocation : GurugramShift Timings : 9 to 5 and 12 : 00PM to 8 : 00PMJob Description : As a key team member, you will combine the responsibilities of an Application Support Engineer and Site Reliability Engineer (SRE) to ensure the stability, reliability, and performance of...