Site reliability engineer

14 hours ago


Bangalore, India Xebia Full time

We are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident response practices to ensure high availability, performance, and resilience of business-critical systems.

Key Responsibilities

Cloud Infrastructure (AWS): Design, implement, and manage scalable, resilient, and cost-optimized cloud infrastructure using AWS services (EC2, EKS, Lambda, RDS, S3, CloudFront, IAM, VPC, etc.). Implement Infrastructure as Code (IaC) using tools like Terraform / CloudFormation.

DevOps & Automation: Build and maintain CI/CD pipelines (Jenkins, GitHub Actions, GitLab CI, or AWS CodePipeline) for automated deployments. Automate repetitive tasks to improve development velocity and operational efficiency.

Observability & Monitoring: Define and implement observability strategy covering monitoring, logging, tracing, and alerting. Work with tools like Prometheus, Grafana, ELK/EFK stack, AWS CloudWatch, Datadog, New Relic, Splunk, or Dynatrace. Establish SLIs, SLOs, and SLAs to measure and improve system reliability.

Site Reliability Engineering (SRE): Drive incident management processes: detection, alerting, root cause analysis, and postmortems. Apply chaos engineering principles to validate resilience and recovery. Optimize reliability, latency, scalability, and system efficiency.

Security & Compliance: Implement best practices for cloud security, identity & access management, and compliance frameworks (ISO, SOC2, GDPR, etc.). Ensure observability and monitoring meet security and audit requirements.

Collaboration & Leadership: Partner with development, QA, and product teams to ensure seamless deployments. Mentor junior engineers and promote a culture of reliability, automation, and continuous improvement.

Required Skills & Qualifications

  • 7+ years of professional experience in DevOps, Cloud Infrastructure, or SRE roles.
  • Strong expertise in AWS Cloud (certification preferred: AWS Certified DevOps Engineer, Solutions Architect, or SysOps).
  • Proficiency in IaC tools (Terraform, CloudFormation).
  • Solid experience in CI/CD pipeline tools (Jenkins, GitHub Actions, GitLab CI/CD, AWS CodePipeline).
  • Hands-on with observability tools: Prometheus, Grafana, CloudWatch, ELK, Datadog, New Relic, Splunk, or similar.
  • Deep understanding of SRE principles: SLIs/SLOs, error budgets, incident response, chaos testing.
  • Strong scripting/coding experience (Python, Bash, Go, or similar).
  • Knowledge of containers & orchestration (Docker, Kubernetes, EKS).
  • Familiarity with security best practices in cloud-native environments.

Preferred Skills

  • Experience with multi-cloud or hybrid-cloud environments.
  • Exposure to resiliency testing & chaos engineering tools (Gremlin, Litmus, Chaos Mesh).
  • Knowledge of cost-optimization and FinOps in AWS.
  • Excellent communication and stakeholder management skills.

What We Offer

  • Opportunity to work on cutting-edge cloud-native architectures.
  • A culture focused on automation, reliability, and innovation.
  • Growth opportunities with certifications, training, and leadership exposure.



  • Bangalore, India ViewSonic Full time

    Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform...


  • Bangalore, India ViewSonic Full time

    Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of...


  • Bangalore, India WhiteLotus Talent Partners Full time

    We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bangalore, India HDFC Limited Full time

    Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore locations. Experience: 8-14 years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...


  • Bangalore, India Synechron Full time

    We have an immediate opportunity for an SRE (Senior Site Reliability Engineer) with 5 to 9 years of experience. Synechron – Bangalore. Job Role: SRE (Senior Site Reliability Engineer). Job Location: Bangalore. Notice Period: Within 30 days. About Synechron: We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our...


  • Bangalore, India Tavant Full time

    About Tavant: With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tech-enabled transformation across a wide range of industries such as Consumer Lending, Manufacturing, Agtech, Media & Entertainment, and Retail in...


  • Bangalore, India Employ Full time

    Role: Site Reliability Engineer (SRE) / Platform Engineering / DevOps Engineering roles. Location: Fully Remote. Type: 6-month contract. Work experience: 5+ years. We’re working with an AI product company that’s building the next generation of GenAI-powered developer platforms. We’re looking for an experienced Site Reliability Engineer to join their Platform...

