
Site Reliability Engineer
1 week ago
Job Title: SRE Lead (Engineering & Reliability)
Job Summary:
We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to
oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead,
you will play a pivotal role in establishing and implementing SRE practices, leading a team
of engineers, and driving automation, monitoring, and incident response strategies. This
position combines software engineering and systems engineering expertise to build and
maintain high-performing, reliable systems.
Experience: 7+ years
Key Responsibilities:
Reliability & Performance:
- Lead efforts to maintain high availability and reliability of critical services.
- Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met.
- Proactively identify and resolve performance bottlenecks and system inefficiencies.
Incident Management & Response:
- Establish and improve incident management processes and on-call rotations.
- Lead incident response and root cause analysis for high-priority outages.
- Drive post-incident reviews and ensure actionable insights are implemented.
Automation & Tooling:
- Develop and implement automated solutions to reduce manual operational tasks.
- Enhance system observability through metrics, logging, and distributed tracing tools
(e.g., Prometheus, Grafana, Elastic APM).
- Optimize CI/CD pipelines for seamless deployments.
Collaboration:
- Partner with software engineering teams to improve the reliability of applications and
infrastructure.
- Work closely with product/engineering teams to design scalable and robust systems.
- Ensure seamless integration of monitoring and alerting systems across teams.
Leadership & Team Building:
- Manage, mentor, and grow a team of SREs.
- Promote SRE best practices and foster a culture of reliability and performance across
the organization.
- Drive performance reviews, skills development, and career progression for team
members.
Capacity Planning & Cost Optimization:
- Perform capacity planning and implement autoscaling solutions to handle traffic
spikes.
- Optimize infrastructure and cloud costs while maintaining reliability and
performance.
Skills & Qualifications:
Required Skills:
• Technical Expertise:
- Experience with cloud platforms (AWS / Azure / GCP) and Kubernetes.
- Hands-on knowledge of infrastructure-as-code tools like Terraform / Helm /
Ansible.
- Proficiency in Java.
- Expertise in distributed systems, databases, and load balancing.
• Monitoring & Observability:
- Proficient with tools like Prometheus, Grafana, Elastic APM, or New Relic.
- Understanding of metrics-driven approaches for system monitoring and
alerting.
• Automation & CI/CD:
- Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines).
- Skilled in automation frameworks and tools for infrastructure and application
deployments.
• Incident Management:
- Proven track record in handling incidents, post-mortems, and implementing
solutions to prevent recurrence.
Leadership & Communication Skills:
• Strong people management and leadership skills with the ability to inspire and
motivate teams.
• Excellent problem-solving and decision-making skills.
• Clear and concise communication, with the ability to translate technical concepts for
non-technical stakeholders.
Preferred Qualifications:
• Experience with database optimization, Kafka, or other messaging systems.
• Knowledge of autoscaling techniques.
• Previous experience in an SRE, DevOps, or infrastructure engineering leadership role.
• Understanding of compliance and security best practices in distributed systems.
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India Programming Full time ₹ 1,04,000 - ₹ 1,30,878 per year
Role - Site Reliability Engineering. Location - Bengaluru. Years of Experience - 4+ Years. Professional & Technical Skills: Must-Have Skills: Proficiency in Site Reliability Engineering. Good-to-Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud. Strong understanding of CI/CD tools and practices. Experience with container...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Enterprise Minds, Inc Full time
We're Hiring | Site Reliability Engineer | 8-10 years
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India FOSS United Full time ₹ 1,04,000 - ₹ 1,30,878 per year
Site Reliability Engineer at ZEISS India. Posted on September 11, 2025. ZEISS India, Kadubeesanahalli, Bengaluru. Full Time. Job Description: ZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India TRUGlobal Full time ₹ 9,00,000 - ₹ 12,00,000 per year
Job Title: Site Reliability Engineer (SRE) with Python Development Expertise. Position Overview: We are seeking a skilled Site Reliability Engineer (SRE) with strong Python development experience to join our team. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our services across both on-premises and...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Randstad Full time
Role: Site Reliability Engineer. Summary: The Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India CorroHealth Full time
We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India ViewSonic Full time ₹ 1,04,000 - ₹ 1,30,878 per year
Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform Engineering...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India ViewSonic Full time
Job Requirements: 1. Bachelor's degree in Computer Science, Engineering, or a related field. 2. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. 3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. 4. Interest and understanding of Platform...