Reliability Systems Architect

18 hours ago


Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 15,00,000 - ₹ 20,00,000
Job Overview

We are seeking a highly skilled Reliability Engineer to join our team. The ideal candidate will have expertise in designing and implementing reliable systems, as well as experience with Kubernetes, Containers, Cloud, and Database.

The Reliability Engineer will be responsible for ensuring the availability, reliability, and performance of our production environment. This includes managing processes for incidents, changes, releases, and deployments, as well as developing automation tools to enhance efficiency.

Key Responsibilities:
  • Collaborate with Platform, Production engineering and application SREs to manage and resolve complex production issues.
  • Improve Platform performance, availability, and reliability.
  • Implement observability solutions for proactive issue identification and optimization.
  • Manage processes for incidents, changes, releases, and deployments.
  • Develop automation tools (IaC, alerts as code, dashboard as code) to enhance efficiency.
  • Conduct POCs to implement tools to improve performance, scaling, reliability and availability.
  • Analyse trends in incidents, problems, and alerts to drive operational improvements.
  • Document SOPs, critical systems information, and best practices for current and future use.
  • Provide technical guidance to necessary stakeholders.
  • Stay updated on advancements in Software Engineering with extended focus on Reliability Engineering.
Required Skills and Qualifications:
  • Programming Languages: Proficient in one or more of the following languages (Java, Python and Go) with full SDLC experience.
  • Reliability Engineering principles: Expertise in anomaly detection, root cause analysis, and predictive maintenance.
  • SLIs, SLOs, and error budgets: Knowledge in defining SLIs, SLOs, and error budgets.
  • Kubernetes, Containers, Cloud, and Database: Hands-on experience with Kubernetes, Containers, Cloud, and Database.
  • Observability Tools and Open Telemetry: Strong knowledge in Observability Tools and Open Telemetry.
  • DevOps methodologies: Familiarity with DevOps methodologies, tools, and automating (e.g. Azure Pipelines, Terraform, Helm etc.,)
  • AWS and Azure: Experience with public/private cloud platforms including AWS and Azure.
  • Operations team management: Experience in leading an operations team in application Production Environments.
Preferred Skills:
  • Messaging Platforms: Experience in Messaging Platforms (e.g. MQ/Solace/Kafka), API Gateways and Service Mesh.
  • Generative AI and Responsible AI: Knowledge in Generative AI and Responsible AI.


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,01,00,000

    Job Title: Cloud Reliability Architect We are seeking an experienced Cloud Reliability Architect to lead our cloud reliability and resilience initiatives. The ideal candidate will have a strong background in cloud engineering, DevOps, and resilience with expertise in designing, building, and validating scalable, automated cloud-native environments. Key...


  • Chennai, Tamil Nadu, India beBeeReliability Full time

    System Reliability ExpertWe are seeking a talented and proactive system reliability expert to join our infrastructure team. The ideal candidate will combine software engineering expertise with systems engineering skills to build scalable, reliable, and efficient systems.Key Responsibilities:Design, implement, and manage scalable, resilient, and secure...


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 8,00,000 - ₹ 12,00,000

    Job Title: System Reliability Specialist Experience:7 to 12 YearsQualification:Diploma/BE (Mech./Instru.) Locations:VadodaraChennaiResponsibility:We are seeking an experienced System Reliability Specialist to join our team. The successful candidate will be responsible for maintaining and troubleshooting complex systems, including instruments, valves,...


  • Chennai, Tamil Nadu, India beBeeObservability Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Description:">We are seeking a skilled Site Reliability Engineer to join our team. The ideal candidate will have experience with Dynatrace, observability, and cloud computing platforms. They will be responsible for designing and implementing reliable systems that can handle production traffic efficiently.


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 1,20,00,000 - ₹ 2,50,00,000

    Reliability Engineering SpecialistThis is a technical leadership role that involves collaborating with development teams to enhance system reliability.In this position, you will be responsible for designing and building highly available and scalable production services. You will also define and implement Service Level Objectives (SLOs) and Service Level...


  • Chennai, Tamil Nadu, India beBeeEngineer Full time US$ 1,04,000 - US$ 1,30,878

    Job TitleSenior Site Reliability EngineerWe are looking for a Senior Site Reliability Engineer to join our team. This is an exciting opportunity to work with us and contribute to the success of our organization.As a Senior Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and services. You will...


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Reliable Systems Engineer PositionA Reliability Engineer leads the development and operation of high-reliability systems, making key technical decisions and collaborating with teams.The reliability mission is to design, build, and maintain highly reliable systems that support business growth. By quantitatively measuring and managing system reliability,...


  • Chennai, Tamil Nadu, India beBeeTechnical Full time US$ 1,20,000 - US$ 1,40,000

    We are seeking a skilled technical expert to enhance our automated risk detection systems. This role involves supporting and maintaining rule-based logic, data analysis processes, and scalable tools.In this position, you will play a vital part in system reliability, operational support, and technical troubleshooting, contributing to the protection of...

  • System Architect

    6 days ago


    Chennai, Tamil Nadu, India beBeeCloud Full time ₹ 1,04,000 - ₹ 1,30,878

    Microservices Architect PositionWe are seeking a skilled Microservices Architect to join our organization.About the RoleThe successful candidate will be responsible for designing and implementing microservices-based systems, ensuring they are scalable, resilient, and secure. They will also be involved in selecting appropriate technologies, patterns, and best...


  • Chennai, Tamil Nadu, India beBeesite Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Our team is seeking a skilled Site Reliability Engineer to join our dynamic group. As a key member of the team, you will play a crucial role in ensuring the health and performance of our complex web-scale systems.You will be responsible for working closely with development teams to design platforms that are 'operable' by nature. This will involve gaining...