Hiring for Site Reliability Engineer
1 week ago
Primary Responsibilities
Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SREs ensure that managed service offerings and customer deployments have reliability and uptime appropriate to users' needs and a fast rate of improvement, while monitoring and validating capacity and performance. The role focuses on reliability, scalability, and the development of automation to manage repetitive tasks at scale.
Knowledge & Skills
- In-depth knowledge of SRE practices and concepts such as SLA, SLO, SLI, error budgets, toil elimination, and post-mortems.
- Experience with monitoring and observability tools such as Grafana, Splunk, Dynatrace, or the GCP operations suite.
- Understanding of APM tools such as AppDynamics or Datadog, preferably AppDynamics.
- Experience with IaC tools such as Terraform or Ansible.
- Experience managing cloud-native applications effectively in GCP or Azure.
- Experience creating CI/CD pipelines with tools such as GitHub Actions, Azure DevOps, or Jenkins.
- Knowledge of version control tools such as Git or Bitbucket.
- Knowledge of at least one scripting language such as PowerShell, Python, or Bash.
- Coding infrastructure automation across the CI/CD pipeline.
- Responsible for ensuring the availability, performance, and scalability of a website or application.
- Knowledge of containerization and orchestration tools such as Docker, Kubernetes, or Cloud Run (GCP).
- Involved in capacity planning and performance tuning to ensure that the site can handle increased traffic without issue.
- Work closely with developers to identify and fix potential issues before they cause problems for users.
- Deep understanding of how distributed systems work, in order to troubleshoot and optimize them.
- Deep understanding of how different types of databases work, in order to troubleshoot effectively any issues that arise.
- Ability to communicate clearly and concisely about system alerts or outages to other members of your team.
- Please note: beyond the JD, the customer is looking for a candidate who can mature their SRE practice across the division, someone who is comfortable being a champion and leader in the SRE space.
Looking for candidates from across India (PAN India) who are open to working in a hybrid model.
Kindly provide the details below over email to
Total experience
Relevant experience in SRE
Current CTC
Expected CTC
PAN number
Date of Birth
Notice Period
Note: Please apply if your notice period is less than 45 days or if you are currently serving your notice period.