
Cloud Reliability and Chaos Engineering Expert
3 days ago
We are seeking a highly skilled Cloud Engineer to design, build, and validate resilient cloud-native environments.
The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency of critical systems.
Cloud Engineering (AWS):Architect, implement, and manage secure, scalable, and cost-efficient AWS infrastructure.
Automate infrastructure provisioning and configuration using Terraform / CloudFormation and AWS SDKs.
Manage containerized workloads (Docker, Kubernetes).
Python Development:
Build automation scripts, deployment utilities, and infrastructure tooling using Python (Boto3, Flask, FastAPI).
Develop custom monitoring/alerting integrations with APIs, SDKs, and third-party observability platforms.
Chaos Engineering & Resiliency:
Use tools like Gremlin, Litmus, Chaos Mesh, or AWS Fault Injection Simulator.
DevOps & Observability:
Build and maintain CI/CD pipelines for automated deployments (Jenkins, GitHub Actions, GitLab CI, AWS CodePipeline).
Integrate observability frameworks (Prometheus, Grafana, ELK/EFK, CloudWatch, Datadog) for monitoring and tracing.
Apply AWS security best practices for IAM, networking, and data protection.
Ensure compliance with internal and external regulatory frameworks (SOC2, ISO, GDPR).
6–10 years of experience in Cloud, DevOps, or SRE roles.
~ Strong hands-on expertise in AWS Cloud (certifications preferred: AWS DevOps Engineer / Solutions Architect).
~ Advanced Python development skills for automation and tooling (Boto3 a must).
~ Experience designing and running chaos experiments (Gremlin, AWS FIS, Litmus, Chaos Mesh, or custom Python-based fault injection).
~ Proficiency in containers & orchestration (Docker, Kubernetes).
~ Strong background in monitoring, observability, and incident management.
~ Familiarity with DevOps toolchain (CI/CD, Git, Jenkins, GitLab, CodePipeline).
~ Knowledge of Go / Shell scripting in addition to Python.
Experience with chaos testing in production-like environments.
Exposure to multi-cloud or hybrid-cloud environments.
Opportunity to lead cloud reliability & chaos engineering initiatives.
Growth opportunities through certifications, R&D projects, and leadership roles.
-
Associate Principal Engineer, Chaos
3 days ago
India Nagarro Full time ₹ 15,00,000 - ₹ 20,00,000 per yearWe're Nagarro.We are a Digital Product Engineering company that is scaling in a big way We build products, services, and experiences that inspire, excite, and delight. We work at scale across all devices and digital mediums, and our people exist everywhere in the world experts across 39 countries, to be exact). Our work culture is dynamic and...
-
Site Reliability Engineer
3 days ago
India Xebia Full timeWe are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency...
-
Site Reliability Engineer
2 weeks ago
India Xebia Full timeWe are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency...
-
Site Reliability Engineer
1 week ago
India Xebia Full timeWe are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency...
-
Senior Cloud Reliability Engineer
1 week ago
India beBeeCloudReliability Full time ₹ 1,80,00,000 - ₹ 2,20,00,000Key Responsibilities:Design and implement scalable, secure, and cost-efficient cloud infrastructure.Automate infrastructure provisioning using Terraform / CloudFormation and AWS SDKs.Manage containerized workloads (Docker, Kubernetes) and automate deployment utilities.Develop automation scripts and tooling using Python (Boto3).Implement monitoring/alerting...
-
AWS Cloud Architect
2 weeks ago
India beBeeReliability Full time US$ 90,000 - US$ 1,20,000We are seeking a highly skilled professional to fill a Cloud Reliability Engineer position. This individual will be responsible for designing, building, and validating resilient cloud-native environments using AWS Cloud. **Key Responsibilities**: ">Architect and implement secure, scalable, and cost-efficient AWS infrastructure, including EC2, Lambda, EKS,...
-
Cloud Reliability Engineer
1 week ago
India beBeeCloudReliability Full time ₹ 90,00,000 - ₹ 1,50,00,000We are obsessed with delivering exceptional customer experiences by driving growth and innovation in the cloud.Our MissionWe strive to become the world's safest and most reliable cloud service provider through our relentless pursuit of excellence in quality, security, and reliability.Azure Reliability Team OverviewWe are a multidisciplinary team of engineers...
-
VP, Site Reliability Engineering
2 days ago
India Natobotics Full timeWith our client, you will be exposed to the latest technologies and work with some of the brightest minds in the industry. Our client is leading Banking company so you will be playing a key role as a VP – Site Reliability Engineering (SRE) , who can assist with the below: The successful candidate will work with a small number of SREs in the platform...
-
Senior Cloud Reliability Engineer
1 week ago
India beBeeExpertise Full time ₹ 24,56,888 - ₹ 32,13,625Job Title: Senior Cloud Reliability EngineerAbout the RoleWe are seeking a highly skilled and experienced Senior Cloud Reliability Engineer to join our team. As a key member of our Platform Engineering Practice, you will design, manage, and scale large-scale observability infrastructure.Your primary focus will be on ensuring the high availability and...
-
Reliable System Engineer Needed
1 day ago
India beBeeSiteReliabilityEngineer Full time ₹ 9,00,000 - ₹ 12,00,000Job Description">We are seeking an experienced Site Reliability Engineer (SRE) to join our platform engineering and operations teams. As an SRE, you will play a key role in ensuring the reliability and efficiency of our infrastructure and services.">As a member of our team, you will work closely with our development teams to identify and resolve issues...