Site Reliability Engineer
2 days ago
Job Title: Site Reliability Engineer (SRE)
About the Role
We are seeking a highly skilled and proactive Site Reliability Engineer (SRE)
to ensure the stability, scalability, and reliability of our platform. The ideal candidate will have strong experience in managing production environments, automating operational processes, and enhancing system performance through continuous improvement and innovation.
Key Responsibilities
1. Platform Stability and Reliability
- Ensure the platform consistently meets defined performance, availability, and reliability SLAs.
- Identify and resolve performance bottlenecks and potential production risks proactively.
- Maintain and enhance monitoring, logging, and alerting systems to prevent downtime and incidents.
2. Incident Management
- Serve as the primary responder during critical incidents, ensuring rapid resolution and minimal impact.
- Conduct post-incident analysis and implement preventive measures.
- Develop and maintain detailed runbooks and playbooks to improve operational readiness.
3. Automation and Efficiency
- Build and maintain automation tools for deployment, scaling, and failover.
- Enhance CI/CD pipeline performance for faster and more reliable releases.
- Implement and manage
Infrastructure as Code (IaC)
using tools like
Terraform
or
Pulumi
.
4. Collaboration and Mentorship
- Collaborate closely with SRE, CI/CD, Developer Experience, and Templates teams to improve platform reliability.
- Mentor junior engineers and promote best practices in SRE and system operations.
- Partner with development teams to integrate observability and reliability into the application lifecycle.
5. Observability and Metrics
- Implement and optimize observability tools such as
Dynatrace
,
Prometheus
, or
Grafana
. - Define and maintain key performance metrics and dashboards for system health monitoring.
- Continuously analyze operational data to identify areas for optimization and improvement.
Qualifications
Required:
- Minimum
5 years of experience
in Site Reliability Engineering, Software Engineering, or related domains. - At least
3 years of experience
managing
AWS
cloud environments. - Strong programming proficiency in
Python
,
Java
,
, or
TypeScript
. - Hands-on experience with
Kubernetes
and
Docker
. - Proficiency in CI/CD tools like
GitLab
,
Jenkins
, or similar. - Experience with monitoring and alerting tools (preferably
Dynatrace
).
Preferred:
- Advanced expertise in
Kubernetes (K8s)
for container orchestration and deployment. - Familiarity with observability stacks like
Prometheus
and
Grafana
. - Exposure to
Agile
development environments. - Experience with additional cloud platforms (
Azure
or
Google Cloud
) is a plus.
Why Join Us
- Opportunity to work on
cutting-edge cloud and DevOps technologies
. - Collaborative, growth-oriented, and learning-driven work culture.
- Competitive compensation and clear career progression.
-
Site Reliability Engineering
7 days ago
Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per yearCompany DescriptionThakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...
-
Site Reliability Engineering
5 days ago
Bengaluru, Karnataka, India Viraaj HR Solutions Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)About The OpportunityA fast-growing organization in the Enterprise Cloud Infrastructure & SaaS sector delivering highly available, mission-critical services to enterprise customers. We are hiring an on-site Site Reliability Engineer in India to own reliability, automation, and operational excellence across cloud-native...
-
Site Reliability Engineer
3 hours ago
Bengaluru, Karnataka, India super Full time ₹ 12,00,000 - ₹ 24,00,000 per yearSite Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per yearThis posting is for Site Reliability Engineer in the Oracle Analytics Warehouse product development organization. Fully handled Cloud service that provides customers a turn-key enterprise warehouse on the cloud for Fusion Applications. The service is being built on a sophisticated technology stack demonstrating a brand-new data integration platform and the...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India Chevron Full time ₹ 20,00,000 - ₹ 25,00,000 per yearTotal Number of Openings2About the position:Come join our Subsurface Digital Platform where we are driving continuous innovations to improve reliability, scalability and sustainability of Chevron business via Chevron's Digital Transformation. We are seeking a T-shaped dynamic Senior Site Reliability Engineer to lead and provide end-to-end solution support...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Infogrowth Full time ₹ 15,00,000 - ₹ 25,00,000 per yearRole : SRE Engineer (Site Reliability Engineer) Location : Marathali Bangalore. Work Mode : Hybrid Mode (Weekly 3 days) Exp : 6 – 10 Years Required Candidate profileSkills :Python, AWS (EC2, IAM, Lambda, API Gateway, SNS, SQS & etc.), GITHUB Actions, Service Management, Incident Management etc. & CAPAs.Share resume on or
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Empower Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOur vision for the future is based on the idea that transforming financial lives starts by giving our people the freedom to transform their own. We have a flexible work environment, and fluid career paths. We not only encourage but celebrate internal mobility. We also recognize the importance of purpose, well-being, and work-life balance. Within Empower and...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India d416f97b-2589-437a-8e64-3348cfe4008b Full time ₹ 12,00,000 - ₹ 36,00,000 per yearHiring Site Reliability EngineersExp : 2.5 +years [Excluding internship]Location : BangaloreApply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Progress Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWe are Progress (Nasdaq: PRGS) - the trusted provider of software that enables our customers to develop, deploy and manage responsible, AI-powered applications and experience with agility and ease.We're proud to have a diverse, global team where we value the individual and enrich our culture by considering varied perspectives because we believe people power...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Aerospike Full time US$ 10,000 - US$ 60,000 per yearAbout AerospikeAerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.Global leaders, including Adobe, Airtel,...