Reliability Engineering Specialist

5 days ago


Chennai, Tamil Nadu, India beBeeReliability Full time US$ 17,825
Job Summary

This is a challenging role that requires the ability to work with complex production systems and resolve issues proactively. As a Senior Software Engineer in Reliability Centre of Excellence, you will play a pivotal role in ensuring the availability, reliability, and performance of our critical Software Engineering Platforms.

Key Responsibilities:
  • Collaborate with Platform, Production engineering, and application SREs to manage and resolve complex production issues
  • Develop strategies to improve Platform performance, availability, and reliability
  • Implement observability solutions for proactive issue identification and optimization
  • Manage processes for incidents, changes, releases, and deployments
  • Create automation tools, IaC, alerts as code, dashboard as code, to enhance efficiency
  • Conduct POCs to implement tools to improve performance, scaling, reliability, and availability
  • Analyse trends in incidents, problems, and alerts to drive operational improvements
  • Document SOPs, critical systems information, and best practices for current and future use
  • Provide technical guidance to stakeholders
  • Stay updated on advancements in Software Engineering with an extended focus on Reliability Engineering
Skills and Experience:
  • Programming Languages: Linux, VM, Containers, and Kubernetes
  • AWS and Azure Database Observability
  • Mandatory Skills:
    • Proficient in one or more of the following languages: Java, Python, and Go, with full SDLC experience
    • Expertise in Reliability Engineering principles: Anomaly detection, root cause analysis, and predictive maintenance
    • Knowledge in defining SLIs, SLOs, and error budgets
    • Hands-on experience with Kubernetes, Containers, Cloud, and Database
    • Strong knowledge in Observability Tools and Open Telemetry
    • Familiarity with DevOps methodologies, tools, and automating: e.g., Azure Pipelines, Terraform, Helm, etc.
  • Experience with public private cloud platforms, including AWS and Azure
  • Experience in leading an operations team in application Production Environments
  • Preferred Skills:
    • Experience in Messaging Platforms: e.g., MQ, Solace, Kafka
    • API Gateways and Service Mesh
    • Knowledge in Generative AI and Responsible AI
About Us:

We're an international bank, nimble enough to act, big enough for impact. For over 170 years, we've worked to make a positive difference for our clients, communities, and each other. We question the status quo, love a challenge, and enjoy finding new opportunities to grow and do better than before.

Our purpose - to drive commerce and prosperity through our unique diversity, together with our brand promise - to be here for good - are achieved by how we each live our valued behaviours. When you work with us, you'll see how we value difference and advocate inclusion. Together, we:

  • Do the right thing and are assertive, challenging one another, and living with integrity while putting the client at the heart of what we do.
  • Never settle, continuously striving to improve and innovate, keeping things simple, and learning from doing well, and not so well.
  • Are better together, we can be ourselves, be inclusive, see more good in others, and work collectively to build for the long term.
What We Offer:

We offer a competitive salary and benefits to support your mental, physical, financial, and social wellbeing. Time-off including annual leave, parental, maternity (20 weeks), sabbatical (12 months maximum), and volunteering leave (3 days). Flexible working options based around home and office locations, with flexible working patterns. Proactive wellbeing support through Unmind, a market-leading digital wellbeing platform, development courses for resilience and other human skills, global Employee Assistance Programme, sick leave, mental health first-aiders, and all sorts of self-help toolkits.

A continuous learning culture to support your growth, with opportunities to reskill and upskill, and access to physical, virtual, and digital learning. Being part of an inclusive and values-driven organisation, one that embraces and celebrates our unique diversity, across our teams, business functions, and geographies - everyone feels respected, and can realise their full potential.



  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 8,00,000 - ₹ 12,00,000

    Job Title: Equipment Reliability SpecialistAbout the Role:We are seeking a skilled professional to serve as an Equipment Reliability Specialist. In this role, you will be responsible for monitoring equipment performance and providing recommendations for improvement.Key Responsibilities:Collect and analyze data from various sources to identify trends and...


  • Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 8,00,000 - ₹ 12,00,000

    Job Title: System Reliability Specialist Experience:7 to 12 YearsQualification:Diploma/BE (Mech./Instru.) Locations:VadodaraChennaiResponsibility:We are seeking an experienced System Reliability Specialist to join our team. The successful candidate will be responsible for maintaining and troubleshooting complex systems, including instruments, valves,...

  • Reliability Engineer

    4 weeks ago


    Chennai, Tamil Nadu, India Supply Chain Resources Group, Inc. Full time

    Responsibilities- Translate product management reliability goals into appropriate testable goals.- Perform statistical data analysis, Accelerated Life Testing (ALT) and modeling, and risk assessment.- Develop reliability performance metrics and lead management reviews to review progress against those metrics.- Drive the failure analysis process for all...


  • Chennai, Tamil Nadu, India Supply Chain Resources Group, Inc. Full time

    Responsibilities Translate product management reliability goals into appropriate testable goals. Perform statistical data analysis, Accelerated Life Testing (ALT) and modeling, and risk assessment. Develop reliability performance metrics and lead management reviews to review progress against those metrics. Drive the failure analysis process for all failures...


  • Chennai, Tamil Nadu, India Supply Chain Resources Group, Inc. Full time

    ResponsibilitiesTranslate product management reliability goals into appropriate testable goals.Perform statistical data analysis, Accelerated Life Testing (ALT) and modeling, and risk assessment.Develop reliability performance metrics and lead management reviews to review progress against those metrics.Drive the failure analysis process for all failures...


  • Chennai, Tamil Nadu, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE)Experience: 4 – 10 YearsLocation: Chennai (Hybrid – 2 days in office)Role Overview:We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services.Key Responsibilities- Design,...


  • Chennai, Tamil Nadu, India beBeeReliability Full time

    System Reliability ExpertWe are seeking a talented and proactive system reliability expert to join our infrastructure team. The ideal candidate will combine software engineering expertise with systems engineering skills to build scalable, reliable, and efficient systems.Key Responsibilities:Design, implement, and manage scalable, resilient, and secure...


  • Chennai, Tamil Nadu, India beBeeReliabilityEngineer Full time ₹ 12,00,000 - ₹ 15,00,000

    Ensuring product reliability is a top priority in today's fast-paced market. As a Reliability Engineer, you will play a critical role in identifying and mitigating potential risks to deliver high-quality products that meet customer expectations.About the RoleWe are seeking a skilled and experienced Reliability Engineer to join our team. The ideal candidate...


  • Chennai, Tamil Nadu, India FTC Solar, Inc Full time

    Position Overview: We are seeking a highly skilled and motivated Solar Tracker Reliability Engineer to join our team. The Solar Tracker Reliability Engineer will play a key role in ensuring the reliability and performance of our solar tracker systems. This position offers an exciting opportunity to work at the forefront of renewable energy technology and...


  • Chennai, Tamil Nadu, India FTC Solar, Inc Full time

    Position Overview: We are seeking a highly skilled and motivated Solar Tracker Reliability Engineer to join our team.The Solar Tracker Reliability Engineer will play a key role in ensuring the reliability and performance of our solar tracker systems.This position offers an exciting opportunity to work at the forefront of renewable energy technology and...