Current jobs related to Site Reliability Engineer - Bengaluru - Xebia


  • Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Role Description: This is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....


  • Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role: Site Reliability Engineer (SRE) with deep expertise in Mainframe technologies like COBOL, JCL, etc. to support and enhance our Card Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...


  • Bengaluru, India Whatjobs IN C2 Full time

    SRE – Site Reliability Engineer. Experience: 6+ years. Location: Bangalore. Mode of work: Hybrid. Job Description: The Resy Site Reliability Engineering group’s goal is to ensure Resy customers can always use the service reliably. We're looking for engineers to be part of an empowered, self-organizing group, with the opportunity to use modern languages and...


  • Bengaluru, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and...


  • Bengaluru, India HDFC Limited Full time

    Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore Location. Experience: 8 - 14 Years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...


Site Reliability Engineer

4 weeks ago


Bengaluru, India Xebia Full time
We are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident response practices to ensure high availability, performance, and resilience of business-critical systems.
Key Responsibilities
- Cloud Infrastructure (AWS):
  - Design, implement, and manage scalable, resilient, and cost-optimized cloud infrastructure using AWS services (EC2, EKS, Lambda, RDS, S3, CloudFront, IAM, VPC, etc.).
  - Implement Infrastructure as Code (IaC) using tools like Terraform / CloudFormation.
- DevOps & Automation:
  - Build and maintain CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, or AWS CodePipeline) for automated deployments.
  - Automate repetitive tasks to improve development velocity and operational efficiency.
- Observability & Monitoring:
  - Define and implement observability strategy covering monitoring, logging, tracing, and alerting.
  - Work with tools like Prometheus, Grafana, ELK/EFK stack, AWS CloudWatch, Datadog, New Relic, Splunk, or Dynatrace.
  - Establish SLIs, SLOs, and SLAs to measure and improve system reliability.
- Site Reliability Engineering (SRE):
  - Drive incident management processes – detection, alerting, root cause analysis, and postmortems.
  - Apply chaos engineering principles to validate resilience and recovery.
  - Optimize reliability, latency, scalability, and system efficiency.
- Security & Compliance:
  - Implement best practices for cloud security, identity & access management, and compliance frameworks (ISO, SOC2, GDPR, etc.).
  - Ensure observability and monitoring meet security and audit requirements.
- Collaboration & Leadership:
  - Partner with development, QA, and product teams to ensure seamless deployments.
  - Mentor junior engineers and promote a culture of reliability, automation, and continuous improvement.
Required Skills & Qualifications
- 7+ years of professional experience in DevOps, Cloud Infrastructure, or SRE roles.
- Strong expertise in AWS Cloud (certification preferred: AWS Certified DevOps Engineer, Solutions Architect, or SysOps).
- Proficiency in IaC tools (Terraform, CloudFormation).
- Solid experience in CI/CD pipeline tools (Jenkins, GitHub Actions, GitLab CI/CD, AWS CodePipeline).
- Hands-on with observability tools: Prometheus, Grafana, CloudWatch, ELK, Datadog, New Relic, Splunk, or similar.
- Deep understanding of SRE principles: SLIs/SLOs, error budgets, incident response, chaos testing.
- Strong scripting/coding experience (Python, Bash, Go, or similar).
- Knowledge of containers & orchestration (Docker, Kubernetes, EKS).
- Familiarity with security best practices in cloud-native environments.
Preferred Skills
- Experience with multi-cloud or hybrid-cloud environments.
- Exposure to resiliency testing & chaos engineering tools (Gremlin, Litmus, Chaos Mesh).
- Knowledge of cost-optimization and FinOps in AWS.
- Excellent communication and stakeholder management skills.
What We Offer
- Opportunity to work on cutting-edge cloud-native architectures.
- A culture focused on automation, reliability, and innovation.
- Growth opportunities with certifications, training, and leadership exposure.