
Leading Cloud Reliability and DevOps Expert
1 day ago
We are seeking an experienced professional to spearhead cloud reliability and chaos engineering initiatives, focusing on designing, building, and validating scalable and automated cloud-native environments.
**Key Responsibilities:**
- Cloud Engineering:
  - Design, implement, and manage secure, scalable, and cost-efficient AWS infrastructure using EC2, Lambda, EKS, S3, RDS, IAM, CloudFront, etc.
  - Automate infrastructure provisioning and configuration using Terraform/CloudFormation and AWS SDKs.
  - Manage containerized workloads using Docker, Kubernetes, and EKS.
- Automation and Tooling:
  - Build automation scripts, deployment utilities, and infrastructure tooling in Python, using Boto3, Flask, FastAPI, etc.
  - Develop custom monitoring/alerting integrations with APIs, SDKs, and third-party observability platforms.
  - Implement self-healing and resilience-focused automation scripts.
- Chaos Engineering & Resiliency:
  - Design and execute chaos experiments that inject faults such as latency, outages, and resource failures to validate system resilience.
  - Use tools like Gremlin, Litmus, Chaos Mesh, or AWS Fault Injection Simulator.
  - Partner with SRE and development teams to define SLIs, SLOs, and error budgets.
  - Document learnings from chaos tests and improve incident response & recovery playbooks.
- DevOps & Observability:
  - Build and maintain CI/CD pipelines for automated deployments using Jenkins, GitHub Actions, GitLab CI, or AWS CodePipeline.
  - Integrate observability tools such as Prometheus, Grafana, ELK/EFK, CloudWatch, and Datadog for monitoring and tracing.
  - Ensure proactive alerting and real-time visibility into system health.
- Security and Compliance:
  - Apply AWS security best practices for IAM, networking, and data protection.
  - Ensure compliance with internal and external regulatory frameworks such as SOC 2, ISO, and GDPR.
**Requirements and Qualifications:**
- 6-10 years of experience in Cloud, DevOps, or SRE roles.
- Strong hands-on expertise in AWS Cloud (certifications preferred: AWS DevOps Engineer/Solutions Architect).
- Advanced Python development skills for automation and tooling (Boto3 is a must).
- Experience designing and running chaos experiments (Gremlin, AWS FIS, Litmus, Chaos Mesh, or custom Python-based fault injection).
- Solid knowledge of IaC (Terraform/CloudFormation).
- Proficiency in containers & orchestration (Docker, Kubernetes, EKS).
- Strong background in monitoring, observability, and incident management.
- Familiarity with the DevOps toolchain (CI/CD, Git, Jenkins, GitLab, CodePipeline).
- Good understanding of resilient architectures, reliability principles, and disaster recovery.
**Growth Opportunities:**
- Lead cloud reliability and chaos engineering initiatives.
- Work in a culture focused on automation, resilience, and continuous improvement.
- Growth opportunities through certifications, R&D projects, and leadership roles.
-
Reliable Cloud Infrastructure Expert
5 days ago
Gandhinagar, Gujarat, India beBeeAzure Full time ₹ 1,80,00,000 - ₹ 2,50,00,000
Site Reliability Engineer
About People Prime Worldwide: Transforming businesses through strategic partnerships, industry expertise and digital innovation.
A Site Reliability Engineer (SRE) plays a pivotal role in ensuring the reliability and scalability of our cloud infrastructure. This entails designing, implementing and maintaining systems that underpin our...
-
Expert System Reliability Specialist
1 day ago
Gandhinagar, Gujarat, India beBeeSystemReliability Full time ₹ 90,00,000 - ₹ 1,20,00,000
Senior System Reliability Expert
We're seeking a seasoned Senior Site Reliability Engineer to guarantee the dependability, scalability, and performance of mission-critical systems. As part of our cutting-edge software development team, you'll work closely with development and operations teams to craft and maintain infrastructure, enhance system reliability,...
-
Expert Cloud Engineer
3 days ago
Gandhinagar, Gujarat, India beBeeCloud Full time ₹ 18,75,000 - ₹ 21,25,000
Cloud Infrastructure Expert
We are seeking a highly skilled Cloud Infrastructure Expert to contribute to the management, operation, and optimization of our cloud infrastructure.
About the Role: This is an exciting opportunity to drive strategic decision-making for our cloud infrastructure on GCP, AWS, and Azure. As a key member of our team, you will play a...
-
Cloud Reliability Engineer
4 days ago
Gandhinagar, Gujarat, India beBeeReliability Full time ₹ 15,00,000 - ₹ 25,00,000
Job Description
We are seeking a skilled Cloud Architect to join our team. As a Cloud Architect, you will design and build secure, scalable, and cost-efficient cloud-native environments using AWS.
Key Responsibilities:
Cloud Engineering (AWS): Architect, implement, and manage secure, scalable, and cost-efficient AWS infrastructure (EC2, Lambda, EKS, S3, RDS,...
-
Technical Lead
5 days ago
Gandhinagar, Gujarat, India beBeeDevOps Full time ₹ 29,54,600 - ₹ 35,17,395
Job Title: Technical Lead
Description: We are seeking a seasoned Technical Lead to spearhead our DevOps initiatives. This role will drive the implementation, integration, and refinement of DevOps methodologies across all applications. The ideal candidate will have a proven track record of fostering collaboration between development, operations, and quality...
-
Site Reliability Lead
11 hours ago
Gandhinagar, Gujarat, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000
Job Title: Site Reliability Engineering Manager
As a leader in our reliability engineering function, you will blend technical leadership with team mentorship and cross-functional coordination.
Develop and implement organizational reliability strategies, aligning SLAs, SLOs, and Error Budgets with business goals and customer expectations.
Establish incident...
-
DevOps Leader for Build and Deployment Efforts
17 hours ago
Gandhinagar, Gujarat, India beBeeDevOps Full time ₹ 1,20,00,000 - ₹ 2,00,00,000
Job Description
About the Role
The primary responsibility of this position is to lead a team of DevOps engineers in building and supporting build and deployment efforts, including debugging issues and supporting engineering deliveries.
Key Responsibilities
Lead a team of DevOps engineers to develop and implement efficient build and deployment...
-
Senior Cloud Solution Designer
9 hours ago
Gandhinagar, Gujarat, India beBeeExpert Full time ₹ 50,00,000 - ₹ 70,00,000
Cloud Architect Position Overview
This role involves leading the development of a cloud strategy, driving collaboration between engineering, security, product management, and business stakeholders to create a roadmap that empowers organizational growth.
The ideal candidate will have experience architecting and deploying large-scale cloud solutions, with a...
-
Cloud Architect Specialist
3 days ago
Gandhinagar, Gujarat, India beBeeCloud Full time ₹ 1,20,00,000 - ₹ 2,00,00,000
Cloud Infrastructure Specialist
About
Our client is a leading cloud solutions provider with a strong global presence, specializing in helping enterprises optimize cloud costs, improve cloud reliability, and adopt next-generation technologies like AI on Azure.
The ideal candidate brings deep Azure expertise, cost optimization skills, and a passion for...
-
Reliable Systems Engineer Lead
4 days ago
Gandhinagar, Gujarat, India beBeeSite Full time ₹ 1,80,00,000 - ₹ 2,50,00,000
About the Role
We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response...