AWS Site Reliability Engineer
4 weeks ago
Description:

Role Overview:
As an AWS SRE, you'll leverage DevOps and SRE best practices to build, automate, and maintain scalable, reliable cloud infrastructure. Your focus will be on elevating system performance, observability, and incident response while fostering operational excellence.

Key Responsibilities:
- Define, monitor, and uphold Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets to guide reliability efforts.
- Build and maintain infrastructure resilience through automation (IaC with Terraform, CloudFormation), on-call tooling, and self-healing practices.
- Monitor system health using tools like Prometheus, Grafana, Datadog, CloudWatch, and ELK Stack; establish proactive alerts to detect issues before they escalate.
- Lead incident response, including detection, troubleshooting, mitigation, and conducting blameless postmortems.
- Execute capacity planning and performance optimization to accommodate growth and improve efficiency.
- Collaborate with development and operations teams to embed reliability in the software lifecycle and deployments.
- Optimize costs and performance while maintaining operational effectiveness through AWS-native solutions and observability.
- Support disaster recovery planning and fault tolerance, and ensure compliance with reliability standards.

Required Skills and Qualifications:
- Bachelor's degree in Computer Science, IT, or a related field.
- 6-8 years of experience in SRE, DevOps, or infrastructure engineering, with strong exposure to AWS environments.
- Expert in infrastructure automation (e.g., Terraform, CloudFormation), containerization, and orchestration platforms.
- Proficient in one or more programming/scripting languages (e.g., Python, Go, Bash).
- Hands-on experience with monitoring, observability, and incident management tools (e.g., Prometheus, Grafana, CloudWatch, ELK, Datadog).
- Strong understanding of system design, distributed systems, networking, and performance tuning.
- Proven track record of managing production systems, incident response, and performing blameless postmortems.
- Adept at capacity planning, performance benchmarking, and cost optimization.

Preferred Qualifications:
- AWS certifications such as AWS Certified DevOps Engineer or AWS Certified Solutions Architect.
- Familiarity with container orchestration such as EKS/Kubernetes.
- Experience with on-call practices, runbook development, and SRE methodologies (SLIs/SLOs, error budgets).
- Exposure to chaos engineering or resilience testing frameworks.

(ref:hirist.tech)
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability...
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India YOMA TECHNOLOGIES PRIVATE LIMITED Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Description: Job Title: Site Reliability Engineer (SRE) - DataDog / AWS Lambda / DynamoDB / Serverless. Location: Bangalore / Pune / Hyderabad. Experience: 5-10 Years. About the Role: We are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in DataDog integration, AWS Lambda, DynamoDB, and Serverless architectures. The ideal...
-
Aquera - Site Reliability Engineer - AWS
3 weeks ago
Bengaluru, India Aquera Full time
Job Summary: We are looking for an experienced AWS Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of AWS services, infrastructure automation, and the ability to collaborate effectively with development teams to ensure high availability, performance, and scalability of our cloud-based systems. Key...
-
AWS Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India AKSHAYA BUSINESS IT SOLUTIONS PRIVATE LIMITED Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Description: Role Overview: As an AWS SRE, you'll leverage DevOps and SRE best practices to build, automate, and maintain scalable, reliable cloud infrastructure. Your focus will be on elevating system performance, observability, and incident response while fostering operational excellence. Key Responsibilities: - Define, monitor, and uphold...
-
Site Reliability Engineer
6 days ago
Bengaluru, India ViewSonic Full time
Job Requirements: 1. Bachelor's degree in Computer Science, Engineering, or a related field. 2. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. 3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. 4. Interest and understanding of...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability...