Distributed Systems Reliability Expert

2 days ago


Bengaluru Bangalore Gurgaon Gurugram, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878
Senior Reliability Engineer

Key Responsibilities:

  • Develop and implement a comprehensive reliability strategy aligned with the company's goals and objectives. Lead a team of reliability professionals to drive system reliability, performance, and scalability.
  • Establish real-time monitoring practices to ensure insights into system performance and customer experience. Implement metrics and dashboards to proactively identify and address potential issues.
  • Lead all aspects of the end-to-end production support process, including incident management, problem resolution, and service-level agreement (SLA) compliance. Drive continuous improvement initiatives to enhance operational effectiveness and reduce mean time to resolution (MTTR).
  • Collaborate with multi-functional teams to enhance customer journeys through seamless and reliable technology experiences.
  • Promote and implement standard methodologies for reliability engineering, including error budgeting, chaos engineering, and disaster recovery planning. Cultivate a culture of resilience and reliability within technology.
  • Champion automation initiatives to streamline operational workflows, deployment processes, and incident response tasks. Leverage automation tools and orchestration to improve reliability and reduce manual intervention.

Qualifications:

  • 3-8 years of experience in Computer Science, Information Technology, or related field. Advanced certifications in reliability engineering or related are a plus.
  • Deep understanding of observability tools and methodologies, including experience with logging, monitoring, tracing, and performance analysis platforms.
  • Strong leadership and people management skills, with the ability to inspire and empower successful reliability teams.

Required Skills:

  • Hands-on coding and system design of highly available distributed systems.
  • Java/Golang/Javascript, Kubernetes, Docker.
  • Knowledge on modern observability stack – splunk, elastic search, Prometheus, Grafana.
  • Knowledge of cloud-based reliability practices and experience with public cloud platforms such as AWS, Azure, or Google Cloud.
  • Familiarity with containerization technologies (e.g., Kubernetes, Docker) and microservices architecture.
  • Demonstrated expertise in driving culture change, DevOps practices, and continuous improvement in reliability and production support functions.


  • Bengaluru, Karnataka, India beBeeInfrastructure Full time ₹ 1,80,00,000 - ₹ 2,10,00,000

    Site Reliability Engineer Job DescriptionJob Overview:We are seeking a Site Reliability Engineer to join our team, responsible for ensuring the reliability, performance, and efficiency of our distributed systems and containerized deployments.Key Responsibilities:Improve the reliability and performance of next-generation distributed systems and containerized...


  • Bengaluru, Karnataka, India beBeeInfrastructureManager Full time ₹ 1,20,00,000 - ₹ 2,40,00,000

    Job OverviewA System Operations Expert is needed to lead the management of distributed systems, ensuring optimal performance and high availability. This role will focus on troubleshooting issues across hardware, software, and network layers, optimizing system operations using Python and other scripting tools.Key ResponsibilitiesManage proxy infrastructure...


  • Bengaluru, Karnataka, India beBeeEngineering Full time US$ 1,25,000 - US$ 1,75,000

    About Our Software Engineering RoleWe are seeking a highly skilled Software Engineer II to play a key role in designing and implementing high-throughput, low-latency systems that operate reliably in production, even as data volumes scale to billions of events per day.This position requires a strong understanding of computer science fundamentals, data...


  • Gurgaon, Haryana, India beBeeElectrical Full time ₹ 1,04,000 - ₹ 1,30,878

    Electrical Distribution Systems SpecialistWe are seeking a highly skilled and experienced Electrical Distribution Systems Specialist to join our team.Responsibilities:Manage customer relationships with All India DISCOMs and EPCs, ensuring timely submission of offers that meet customer specifications and targets.Drive the preparation of reviews and...

  • Data Engineer

    7 days ago


    Bengaluru, Karnataka, India beBeeDataEngineer Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Job Title: Data Engineer - Distributed Systems ExpertJob DescriptionWe are seeking a highly skilled Data Engineer to join our organization. The ideal candidate will have extensive experience with Big Data technologies and distributed data processing frameworks.This individual will be responsible for designing, developing, and maintaining large-scale data...


  • Bengaluru, Karnataka, India beBeeInfrastructure Full time ₹ 20,00,000 - ₹ 30,00,000

    Cloud Infrastructure Engineer OpportunityWe're looking for highly skilled engineers with expertise in solving complex problems in distributed systems, virtualized infrastructure, and highly available services.Key ResponsibilitiesDesign, develop, and deploy software to enhance the availability, scalability, and efficiency of cloud products and services.Design...


  • Bengaluru, Karnataka, India beBeeSupport Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    About This RoleWe are seeking a highly skilled Support Engineer to join our team.As a Support Engineer, you will play a critical role in ensuring the reliability and integrity of our systems.You will be responsible for owning system resolution, engaging with engineering teams, and leading efforts to drive operational excellence.Key ResponsibilitiesOwning...


  • Bengaluru, Karnataka, India beBeeEnterprise Full time US$ 1,80,000 - US$ 2,50,000

    Key Roles in Distributed SystemsAt our company, we are revolutionizing how complex systems are developed and integrated. Our Platform team is building a Customer Sandbox environment from the ground up.As a Senior Engineer on this team, you will be responsible for taking it from 0 to 1 to 10. This is a foundational initiative that will redefine the way we...


  • Bengaluru, Karnataka, India beBeeDistributed Full time ₹ 16,24,900 - ₹ 24,92,300

    Backend Software Engineer PositionWe are seeking an accomplished Backend Software Engineer to contribute to the development of scalable and fault-tolerant distributed systems.Main Responsibilities:Design and implement high-quality software components with a focus on reliability, scalability, and performance.Develop well-defined abstractions and software...


  • Gurgaon / Gurugram, Bengaluru / Bangalore, India beBeeReliability Full time US$ 1,50,000 - US$ 2,00,000

    Job Title: Reliability and Performance Lead">The primary function of this position is to lead a team in the development and implementation of strategies for enhancing system reliability, performance, and scalability. The ideal candidate will have extensive knowledge of observability tools and methodologies, as well as strong leadership skills.">Key...