Reliability Engineering Leader

6 days ago


Vizag, Andhra Pradesh, India beBeeReliability Full time ₹ 25,00,000 - ₹ 40,00,000

Job Title: Reliability Engineering Lead


Job Summary:

We are seeking a seasoned Reliability Engineering Lead to oversee the reliability, scalability, and performance of our critical systems. As a key member of our team, you will play a pivotal role in establishing and implementing reliability best practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies.

  • Lead efforts to maintain high availability and reliability of critical services.
  • Define and monitor service level indicators (SLIs), service level objectives (SLOs), and service level agreements (SLAs) to ensure business requirements are met.
  • Proactively identify and resolve performance bottlenecks and system inefficiencies.

Key Responsibilities:

Reliability & Performance:

  • Develop and implement automated solutions to reduce manual operational tasks.
  • Enhance system observability through metrics, logging, and distributed tracing tools.
  • Optimize CI/CD pipelines for seamless deployments.

Incident Management & Response:

  • Establish and improve incident management processes and on-call rotations.
  • Lead incident response and root cause analysis for high-priority outages.
  • Drive post-incident reviews and ensure actionable insights are implemented.

Automation & Tooling:

  • Partner with software engineering teams to improve the reliability of applications and infrastructure.
  • Work closely with product/engineering teams to design scalable and robust systems.
  • Ensure seamless integration of monitoring and alerting systems across teams.

Leadership & Team Building:

  • Manage, mentor, and grow a team of reliability engineers.
  • Promote reliability best practices and foster a culture of reliability and performance across the organization.
  • Drive performance reviews, skills development, and career progression for team members.

Capacity Planning & Cost Optimization:

  • Perform capacity planning and implement autoscaling solutions to handle traffic spikes.
  • Optimize infrastructure and cloud costs while maintaining reliability and performance.

Required Skills & Qualifications:

  • Technical Expertise:
    • Experience with cloud platforms and Kubernetes.
    • Hands-on knowledge of infrastructure-as-code tools like Terraform/Helm/Ansible.
    • Proficiency in Java.
    • Expertise in distributed systems, databases, and load balancing.
  • Monitoring & Observability:
    • Proficient with tools like Prometheus/Grafana/Elastic APM or New Relic.
    • Understanding of metrics-driven approaches for system monitoring and alerting.
  • Automation & CI/CD:
    • Hands-on experience with CI/CD pipelines (e.g., Jenkins/Azure Pipelines).
    • Skilled in automation frameworks and tools for infrastructure and application deployments.
  • Incident Management:
    • Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence.

Leadership & Communication Skills:

  • Strong people management and leadership skills with the ability to inspire and motivate teams.
  • Excellent problem-solving and decision-making skills.
  • Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.

Why Join Us?

Be a key driver in building and scaling reliable systems in a fast-paced environment.


Work with cutting-edge technologies and influence the evolution of the infrastructure.


Lead a high-impact team and foster a culture of reliability and innovation.


  • Vizag, Andhra Pradesh, India beBeeResponsibility Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Description: Reliability Engineering Opportunity. Our organization seeks a skilled reliability engineer to lead cloud reliability initiatives. The ideal candidate will have expertise in designing and implementing resilient, scalable systems. About the Role: Design and implement secure, scalable, and cost-efficient cloud infrastructure using AWS Cloud (EC2,...


  • Vizag, Andhra Pradesh, India beBeeCloudLeader Full time ₹ 19,30,000 - ₹ 25,15,000

    Job Summary: The SRE Manager will lead the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This hybrid role blends technical leadership with team mentorship and cross-functional coordination. Key Responsibilities: Establish organizational reliability strategies, aligning SLAs, SLOs, and Error Budgets...


  • Vizag, Andhra Pradesh, India beBeeReliability Full time ₹ 20,00,000 - ₹ 25,00,000

    Job Title: SRE Lead. As a high-achieving Site Reliability Engineering (SRE) Lead, you will be responsible for overseeing the reliability, scalability, and performance of our critical systems. This role requires expertise in cloud platforms, infrastructure-as-code tools, and distributed systems. This position combines software engineering and systems engineering...


  • Vizag, Andhra Pradesh, India beBeeStrategy Full time ₹ 1,50,40,000 - ₹ 2,51,12,000

    Senior Reliability Engineer Lead. This is an exciting opportunity to lead a senior team and drive strategic initiatives that improve operational efficiency, enhance service quality/SLA, and optimize delivery. Develop Strategic Roadmap: Help define and implement the SRE strategy to promote an 'Automate-first' culture in operating services. Process Optimization:...


  • Vizag, Andhra Pradesh, India beBeeEngineering Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Title: Academic Leader in Engineering Education. We are seeking a visionary academic leader to spearhead the development of our School of Engineering. This is an exceptional opportunity to join a forward-thinking institution and shape the future of engineering education in India. About the Role: Lead Academic Programs: Provide strategic direction and...


  • Vizag, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 18,00,000 - ₹ 24,00,000

    Job Title: Infrastructure Performance Specialist. The key objective of this role is to guarantee the operational excellence and scalability of complex systems. About the Job: We are seeking a skilled Site Reliability Engineer to join our team. The ideal candidate will be responsible for ensuring the high reliability, scalability, and performance of...


  • Vizag, Andhra Pradesh, India beBeeReliability Full time ₹ 23,00,000 - ₹ 25,00,000

    Senior SRE Role Summary: We are seeking an experienced Senior Site Reliability Engineer to enhance our system resilience and reduce production outages. As part of our digital transformation journey, we are investing heavily in automation and reliability engineering. Key Responsibilities: Investigate and resolve high-impact production issues across infrastructure...


  • Vizag, Andhra Pradesh, India beBeeEngineering Full time ₹ 1,80,00,000 - ₹ 2,40,00,000

    Job Opportunity: Drive engineering excellence across the organization, focusing on quality, reliability, and efficiency. Develop and maintain tools, frameworks, and dashboards to enhance software development lifecycle (SDLC) performance. Collaborate with engineering teams to identify areas for improvement and implement best practices. Lead initiatives to improve...


  • Vizag, Andhra Pradesh, India beBeeCloudEngineering Full time ₹ 20,00,000 - ₹ 25,00,000

    Job Title: Cloud Data Engineering Leader. As a seasoned Cloud Data Engineering Leader, you will be responsible for managing and leading the migration of an on-premises Enterprise Data Warehouse to a modern cloud-based data platform using Azure Cloud data tools and Snowflake. Key Responsibilities: Lead the migration of the on-premises SQL Server Enterprise Data...


  • Vizag, Andhra Pradesh, India beBeeSystemReliability Full time ₹ 18,00,000 - ₹ 20,00,000

    System Reliability Specialist. Job Overview: We are seeking an experienced System Reliability Specialist to join our team. The successful candidate will play a critical role in monitoring system health, managing incident communications, and ensuring high reliability of globally deployed web applications. Key Responsibilities: Monitor Grafana dashboards and...