Current jobs related to System Reliability Expert - Kolkata Delhi Mumbai - beBeeSoftwareReliability


  • Mumbai, Maharashtra, India beBeeTechnical Full time ₹ 18,00,000 - ₹ 25,00,000

    Software Stability SpecialistThis is a role that combines technical expertise with operational efficiency, ensuring the reliability and performance of digital platforms.The ideal candidate will have experience with monitoring tools, cloud platforms, and automation systems, including Grafana, Splunk, Docker, and Kubernetes. They will also be proficient in...


  • Kolkata, Delhi, Mumbai, India beBeeDevops Full time ₹ 5,00,000 - ₹ 8,00,000

    Transform Your Career with a Leading Role in System ReliabilityJob DescriptionAs a Principal Site Reliability Engineer, you will play a pivotal role in shaping the company's technological landscape by designing and implementing sophisticated systems and software aligned with customer needs. You will collaborate closely with cross-functional teams, including...


  • Delhi, Mumbai, Kolkata, India beBeeSite Full time ₹ 1,04,000 - ₹ 13,08,780

    Job Description:Reliability Engineering ExpertWe are seeking an experienced reliability engineering expert to join our team. As a Principal Site Reliability Engineer, you will be responsible for developing sophisticated systems and software that meet our customer's business goals.You will work with product management, engineering teams, customer success, and...


  • Kolkata, West Bengal, India beBeeCloudEngineer Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    About the PositionWe are seeking a seasoned professional to fill the role of Site Reliability Engineer with strong technical background, DevOps expertise, and hands-on experience with cloud platforms and operations.Key Responsibilities:Provide technical leadership and mentoring through knowledge sharing, code reviews, and solution designDesign and implement...


  • Delhi, Delhi, India beBeeSite Full time ₹ 18,00,000 - ₹ 25,50,000

    Job SummaryWe are seeking an experienced Site Reliability Engineer to drive the reliability and performance of our systems.About the Role:The ideal candidate will have a strong understanding of distributed systems, cloud platforms (AWS, Azure or GCP), and microservices architecture.They will be responsible for ensuring scalability and availability of...


  • Mumbai, Maharashtra, India beBeeSystemReliability Full time ₹ 15,00,000 - ₹ 20,00,000

    System reliability is crucial to the success of complex systems.We are seeking a highly skilled System Reliability Engineer to take on this critical role. The ideal candidate will have a minimum of 2-4 years of relevant experience in ensuring the reliability, scalability, and performance of large-scale systems.Implement monitoring tools like Grafana,...


  • Mumbai, Maharashtra, India beBeeAnalyst Full time ₹ 80,00,000 - ₹ 1,50,00,000

    Technical Support Specialist RoleThis position focuses on ensuring system reliability by resolving incidents and implementing production stability.Monitor Murex applications for optimal performance and availability.Resolve production issues within the agreed timeframe.Conduct log analysis, identify root causes, and implement permanent fixes.Collaborate with...


  • Kolkata, Delhi, Mumbai, India beBeeReliability Full time ₹ 7,00,000 - ₹ 12,00,000

    Job Title: Site Reliability EngineerAbout the Role:This role is for a highly skilled and experienced Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing, building, and maintaining sophisticated systems and software based on customer business goals, needs, and general...


  • Kolkata, Delhi, Mumbai, India beBeeMessaging Full time ₹ 1,04,000 - ₹ 1,30,878

    Our team is seeking a Senior Lead Messaging Systems Expert to oversee the development and deployment of high-performance messaging systems.This position requires a strong understanding of software engineering concepts, with 8+ years of applied experience in messaging systems such as Kafka, ActiveMQ, and other similar technologies.The ideal candidate will...

  • Reliability Engineer

    2 weeks ago


    Mumbai, Maharashtra, India beBeeReliability Full time ₹ 25,16,802 - ₹ 30,91,323

    About This RoleWe are seeking a skilled Reliability Engineer to join our team. As a Reliability Engineer, you will be responsible for designing and implementing systems that ensure high availability, scalability, and performance.Job DescriptionThis is an exciting opportunity to work on critical infrastructure projects, collaborating with cross-functional...

System Reliability Expert

3 weeks ago


Kolkata Delhi Mumbai, India beBeeSoftwareReliability Full time US$ 90,000 - US$ 1,20,000
Job Title: Software Reliability Specialist

We are seeking a highly skilled Software Reliability Specialist to maintain the stability of our software product throughout its entire development lifecycle. You will be responsible for measuring and monitoring the system's general state, analyzing incident data, automating monitoring processes, and developing frameworks and scripts to enhance product stability and reliability.

Responsibilities:

  • Ensure the stability of the software product throughout the entire software development process.
  • Measure and monitor the system's overall health on multiple mediums using tools like DataDog, GCP Matrix/Platform, and Grafana.
  • Collect and analyze incident data and post-mortem reports to identify root causes and implement preventive measures.
  • Utilize various tools to automate the monitoring of the software system.
  • Identify new instruments and technologies to develop and streamline product stability.
  • Develop monitoring and testing frameworks, solutions, or scripts in various programming languages.
  • Maintain and run tests to ensure product reliability and stability.
  • Apply knowledge of Kubernetes for managing containerized applications and deployments.
  • Contribute to defining and achieving Service Level Objectives (SLOs) and managing Error Budgeting.
  • Optimize deployment processes for efficiency and reliability.

Requirements:

  • Proficiency with monitoring tools such as DataDog, GCP Matrix/Platform, and Grafana.
  • Experience with Kubernetes.
  • Strong understanding of deployment processes.
  • Knowledge of SLOs, Error Budgeting, and related tools.
  • Ability to measure and monitor system state effectively.
  • Experience in collecting and analyzing incident data and post-mortems.
  • Proficiency in automating software system monitoring.
  • Capability to identify and implement new instruments for product stability.
  • Experience in developing monitoring and testing frameworks, solutions, or scripts in various programming languages.