Cloud Native System Reliability Specialist

3 days ago


Hyderabad, Telangana, India beBeeReliability Full time ₹ 18,00,000 - ₹ 25,00,000
Reliability Engineer

We are seeking a skilled, results-driven Reliability Engineer to drive reliability, availability, and performance across our global digital product ecosystem. This role is central to ensuring a seamless and resilient experience for our users by combining deep technical expertise with operational excellence.

You will be part of a global team supporting 260+ modern cloud-native applications across consumer, commercial, supply chain, and enablement functions. Your mission: prevent incidents before they occur, ensure rapid recovery when they do, and build scalable systems that evolve with our growing business.

Key Responsibilities:

* Champion reliability and observability across mission-critical applications.
* Define and maintain service-level indicators (SLIs), service-level objectives (SLOs), and error budgets to measure system reliability and performance.
* Implement automated monitoring, alerting, and recovery mechanisms to reduce manual intervention and improve response times.
* Collaborate closely with engineering, platform, and operations teams to embed best practices across the development lifecycle.
* Lead and participate in incident response, root cause analysis, and postmortem reviews to drive long-term improvements.
* Continuously improve resiliency design, capacity planning, and release management in production systems.
* Influence engineering teams with best practices on cloud-native architecture and deployment strategies.
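
As an illustration of the SLI, SLO, and error-budget responsibility listed above, here is a minimal Python sketch of the arithmetic involved. It assumes a request-based availability SLI. The function name `error_budget_report`, the 99.9% target, and the request counts are hypothetical and are not taken from this role's actual tooling.

```python
def error_budget_report(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarize error-budget consumption for a request-based availability SLO.

    All inputs are illustrative assumptions, not values from the posting.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0 or not 0 <= failed_requests <= total_requests:
        raise ValueError("request counts are inconsistent")

    # SLI: fraction of requests that succeeded in the measurement window.
    sli = (total_requests - failed_requests) / total_requests

    # Error budget: number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests

    # Fraction of the budget already spent (values above 1.0 mean it is exhausted).
    budget_consumed = failed_requests / allowed_failures

    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "budget_consumed": budget_consumed,
        "slo_met": sli >= slo_target,
    }


# Hypothetical window: 1,000,000 requests, 400 failures, 99.9% availability target.
report = error_budget_report(0.999, 1_000_000, 400)
print(report)
```

In this hypothetical window the SLO tolerates roughly 1,000 failed requests, so 400 failures use about 40% of the budget. A team might slow releases or prioritize reliability work as that fraction approaches 100%.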

Requirements:

* 5+ years of experience in production engineering, DevOps, or SRE roles.
* Strong foundation in Linux systems, networking, and cloud platforms (Azure, AWS, or GCP).
* Hands-on experience with observability tools (e.g., AppDynamics, Prometheus, Grafana, ELK, FullStory).
* Proficiency in scripting or programming (e.g., Python, Bash, Go) and automation frameworks (e.g., Ansible, Terraform).
* Deep understanding of CI/CD pipelines, release strategies, and deployment automation.
* Experience in managing high-scale, distributed systems in cloud-native environments.
* Strong analytical skills and passion for continuous improvement.

Preferred Skills:

* Familiarity with microservices, Kubernetes, containers, and service mesh architecture.
* Exposure to incident and problem management frameworks (e.g., ITIL, RCA practices).
* Experience working in global teams supporting mission-critical applications.



  • Hyderabad, Telangana, India beBeeResilience Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Reliability Engineering Lead
    Drive the development of resilient systems and processes that deliver high-quality experiences for users. As a key member of our global SRE practice, you will support 260+ cloud-native applications across diverse functions.
    Prevent incidents before they occur, ensure rapid recovery when they do, and build scalable systems that...


  • Hyderabad, Telangana, India beBeeBackendDeveloper Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Title: Cloud-Native Backend Developer
    We are seeking a skilled Cloud-Native Backend Developer to join our team. The ideal candidate will have experience in designing and developing scalable, reliable backend services and cloud-native applications.
    Key Responsibilities: Design and develop scalable, reliable backend services and cloud-native...


  • Hyderabad, Telangana, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    We're looking for a skilled professional to join our team as a Reliability Engineer. This role is responsible for designing, writing, and delivering software to improve the availability, latency, and efficiency of our products.
    Key Responsibilities: Design and develop scalable, cloud-native architectures that support business growth. Implement self-healing...


  • Hyderabad, Telangana, India beBeeOperations Full time ₹ 1,80,00,000 - ₹ 2,00,00,000

    Site Reliability Engineer
    We are looking for a skilled Systems Operations Specialist with extensive experience, responsible for ensuring the reliability, availability, and performance of critical systems.
    Key Responsibilities: Implement scalable, secure services in cloud environments (AWS) adhering to SRE principles. Develop and manage Continuous...


  • Hyderabad, Telangana, India beBeeReliability Full time ₹ 30,00,000 - ₹ 40,00,000

    We are seeking a skilled System Reliability Specialist to join our team. As a System Reliability Specialist, you will play a critical role in ensuring the performance and reliability of our systems.
    - Design and implement Service Level Agreements (SLAs), Service Level Indicators (SLIs), and error budgets to improve system reliability.
    - Monitor and optimize...


  • Hyderabad, Telangana, India LUCIDSPIRE PRIVATE LIMITED Full time

    Job Description: We are seeking a skilled Cloud Native Developer with hands-on expertise in Azure to help drive the development and deployment of scalable, high-performance applications. This role primarily focuses on Azure-based cloud-native technologies, including containers, microservices, and serverless computing, with the possibility of future...


  • Hyderabad, Telangana, India beBeeSoftwareEngineering Full time ₹ 15,00,000 - ₹ 18,00,000

    Azure Cloud Software Engineering Specialist
    As a cloud software engineering specialist, you will design and develop cloud-native applications on Microsoft Azure. You will work collaboratively across development and architecture teams to deliver scalable, secure, and high-performance solutions.
    Key Responsibilities: Design and build cloud-native applications...


  • Hyderabad, Telangana, India beBeeSiteReliabilityEngineer Full time ₹ 10,50,000 - ₹ 18,50,000

    Job Description
    We are seeking a skilled Associate Manager SRE to join our team. As an Associate Manager Site Reliability Engineer (SRE), you will be part of a global SRE practice supporting a portfolio of 260+ modern cloud-native applications across consumer, commercial, supply chain, and enablement functions.
    Your mission is to prevent incidents before they...


  • Hyderabad, Telangana, India Careernet Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Key Skills: Cloud, Kubernetes, Python, Jenkins, OpenTelemetry, AppDynamics, Site Reliability Engineer.
    Roles & Responsibilities: Design, implement, and manage cloud infrastructure to ensure high availability and reliability. Utilize Kubernetes for container orchestration and management. Develop and maintain monitoring solutions using OpenTelemetry and...


  • Hyderabad, Telangana, India beBeeSpecialist Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Opening: We are currently seeking a seasoned professional to fill the role of Cloud Native Solutions Architect. The ideal candidate will have extensive experience in designing and implementing scalable, cloud-based applications for enterprise systems. In this position, you will leverage your expertise in microservices architecture, containerization, and...