Site Reliability Engineer

2 days ago


Bengaluru, Karnataka, India Xebia Full time ₹ 15,00,000 - ₹ 20,00,000 per year

We are seeking an experienced
AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE)
to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident response practices to ensure high availability, performance, and resilience of business-critical systems.

Key Responsibilities

  • Cloud Infrastructure (AWS):
  • Design, implement, and manage scalable, resilient, and cost-optimized cloud infrastructure using AWS services (EC2, EKS, Lambda, RDS, S3, CloudFront, IAM, VPC, etc.).
  • Implement Infrastructure as Code (IaC) using tools like
    Terraform / CloudFormation
    .
  • DevOps & Automation:
  • Build and maintain
    CI/CD pipelines
    (Jenkins, GitHub Actions, GitLab CI, or AWS CodePipeline) for automated deployments.
  • Automate repetitive tasks to improve development velocity and operational efficiency.
  • Observability & Monitoring:
  • Define and implement
    observability strategy
    covering monitoring, logging, tracing, and alerting.
  • Work with tools like
    Prometheus, Grafana, ELK/EFK stack, AWS CloudWatch, Datadog, New Relic, Splunk, or Dynatrace
    .
  • Establish
    SLIs, SLOs, and SLAs
    to measure and improve system reliability.
  • Site Reliability Engineering (SRE):
  • Drive incident management processes – detection, alerting, root cause analysis, and postmortems.
  • Apply
    chaos engineering
    principles to validate resilience and recovery.
  • Optimize reliability, latency, scalability, and system efficiency.
  • Security & Compliance:
  • Implement best practices for cloud security, identity & access management, and compliance frameworks (ISO, SOC2, GDPR, etc.).
  • Ensure observability and monitoring meet security and audit requirements.
  • Collaboration & Leadership:
  • Partner with development, QA, and product teams to ensure seamless deployments.
  • Mentor junior engineers and promote a culture of
    reliability, automation, and continuous improvement
    .

Required Skills & Qualifications

  • 7+ years
    of professional experience in DevOps, Cloud Infrastructure, or SRE roles.
  • Strong expertise in AWS Cloud
    (certification preferred: AWS Certified DevOps Engineer, Solutions Architect, or SysOps).
  • Proficiency in
    IaC tools
    (Terraform, CloudFormation).
  • Solid experience in
    CI/CD pipeline tools
    (Jenkins, GitHub Actions, GitLab CI/CD, AWS CodePipeline).
  • Hands-on with
    observability tools
    : Prometheus, Grafana, CloudWatch, ELK, Datadog, New Relic, Splunk, or similar.
  • Deep understanding of
    SRE principles
    : SLIs/SLOs, error budgets, incident response, chaos testing.
  • Strong scripting/coding experience (Python, Bash, Go, or similar).
  • Knowledge of
    containers & orchestration
    (Docker, Kubernetes, EKS).
  • Familiarity with
    security best practices
    in cloud-native environments.

Preferred Skills

  • Experience with
    multi-cloud or hybrid-cloud environments
    .
  • Exposure to
    resiliency testing & chaos engineering tools
    (Gremlin, Litmus, Chaos Mesh).
  • Knowledge of cost-optimization and FinOps in AWS.
  • Excellent communication and stakeholder management skills.

What We Offer

  • Opportunity to work on cutting-edge cloud-native architectures.
  • A culture focused on
    automation, reliability, and innovation
    .
  • Growth opportunities with certifications, training, and leadership exposure.


  • Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

    We're Hiring | Site Reliability Engineer | 8-10 years


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India Randstad Full time

    Role: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India TRUGlobal Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job Title: Site Reliability Engineer (SRE) with Python Development ExpertisePosition Overview: We are seeking a skilled Site Reliability Engineer (SRE) with strong Python development experience to join our team. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our services across both on-premises and...


  • Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Role OverviewAs a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.


  • Bengaluru, Karnataka, India IDESLABS PRIVATE LIMITED Full time US$ 90,000 - US$ 1,20,000 per year

    Experience: 5+ YearsSkill:Site reliability engineerLocation: BangaloreNotice Period:Immediate.Employment Type: ContractWorking Mode: HybridJob DescriptionSite Reliability Engineer Tech StackPrimaryAWSTerraformAnsibleDockerSecondaryPythonBashGithubJenkins


  • Bengaluru, Karnataka, India Success Pact Consulting Pvt Ltd Full time

    Position : Site Reliability EngineerExperience : 5 - 9 YearsLocation : Bangalore, IndiaJob Summary : We are seeking an experienced Site Reliability Engineer (SRE) with 5-9 years of experience to join our Platform Engineering team. This role is crucial for ensuring the high availability, performance, and scalability of our AI-powered code review platform....


  • Bengaluru, Karnataka, India Coforge Full time

    Job Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...


  • Bengaluru, Karnataka, India Infrasoft Technologies Limited Full time

    Job DescriptionJob Title: DeveloperWork Location: Bangalore, KarnatakaExperience Range: 68 YearsJob Description:We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...