
Advanced Production Reliability Specialist
1 week ago
We are seeking a highly skilled Reliable System Engineer to join our team.
This is a hands-on leadership role where you will take full end-to-end ownership of production environments, ensuring scalability, reliability, security, and cost efficiency.
As a system engineer, you will work with large-scale Kubernetes (GKE) clusters, developing Infrastructure-as-Code (Terraform, HashiCorp tools) to orchestrate production and development environments.
You will lead incident management, plan and execute high-pressure production maintenance tasks, perform root cause analysis, and implement solutions to drive MTTR reduction.
Additionally, you will build automation and self-healing tools for proactive remediation of known issues, CI/CD pipelines, and deployment processes.
Solid experience in cloud platforms, containerized and virtualized environments, CI/CD and release automation, programming skills, and knowledge of databases are required.
Excellent communication, prioritization, and multitasking skills are also essential.
We offer a collaborative environment, allowing you to work effectively with internal and external stakeholders.
- Take full end-to-end ownership of SaaS production environments deployed on GCP, ensuring scalability, reliability, security, and cost efficiency.
- Work with large-scale Kubernetes (GKE) clusters, while developing Infrastructure-as-Code (Terraform, HashiCorp tools) to orchestrate production and development environments.
- Lead incident management (L3 on-call) — plan and execute high-pressure production maintenance tasks, perform root cause analysis, and implement solutions to drive MTTR reduction.
- Build automation and self-healing tools for proactive remediation of known issues, CI/CD pipelines, and deployment processes.
- Secure production environments by integrating best-in-class security tools and practices.
- Collaborate with development and research teams to enhance architecture, improve service reliability, and optimize performance.
- 10+ years as a DevOps/SRE Engineer with a strong passion for technology, reliability, and service excellence.
- Hands-on expertise in cloud platforms, with proven ability to manage large-scale, multi-project, multi-cluster environments.
- Strong background in infrastructure automation and orchestration using Terraform, HashiCorp tools, and Infrastructure-as-Code practices.
- Proficiency in containerized and virtualized environments (Kubernetes, Docker), with advanced scaling strategies.
- Solid experience in CI/CD and release automation (GitLab, ArgoCD, Jenkins), including configuration management and deployment pipelines.
- Programming skills in Python or Go with high proficiency in Linux systems.
- Knowledge of databases such as Cassandra, ScyllaDB, MemSQL, or MySQL.
- Strong expertise in incident response, RCA, and post-mortem analysis, driving MTTR reduction and operational resilience.
- Demonstrated ability to collaborate with cross-functional research, security, and development teams.
Our company offers a competitive salary and benefits package.
Moreover, we provide opportunities for professional growth and development.
-
Reliability Specialist
3 days ago
Bengaluru, Karnataka, India beBeeQuality Full time ₹ 9,00,000 - ₹ 12,00,000Job Title:Reliability SpecialistAbout the Role:We are seeking a highly skilled Reliability Specialist to join our team. As a key member of our quality assurance department, you will be responsible for developing and implementing quality assurance processes and procedures to ensure product compliance with industry standards and customer requirements.Key...
-
System Reliability Specialist
2 weeks ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878System Reliability SpecialistWhat to ExpectA System Reliability Specialist is responsible for maintaining the reliability of a system. This involves ensuring that automated processes are efficient, streamlined, and effective in responding to errors.Key ResponsibilitiesThe System Reliability Specialist spends a significant amount of time identifying and...
-
Advanced Reliability Specialist
1 week ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 15,00,000 - ₹ 30,00,000Job Overview:We are seeking a skilled RAMS Engineer to join our team. The ideal candidate will have a strong background in reliability and safety assessments, with experience in executing RAMS analyses and documentation.The key responsibilities of this role include:Executing RAMS analyses and documentation in French and English per project...
-
Product Reliability Expert
7 days ago
Bengaluru, Karnataka, India beBeeTest Full time ₹ 12,50,000 - ₹ 17,50,000Quality Assurance SpecialistWe are seeking an experienced Quality Assurance Specialist to lead our end-to-end quality assurance, post-launch monitoring, and drive data-driven improvements for customer satisfaction.Key Responsibilities:Develop and execute comprehensive test strategies for TV platforms (Android TV, Fire TV, Roku) and backend services to ensure...
-
Senior DFT Specialist
2 weeks ago
Bengaluru, Karnataka, India beBeeDigitalFaultTolerance Full time ₹ 20,00,000 - ₹ 25,00,000Job OverviewA senior-level Digital Fault Tolerance (DFT) specialist is sought to spearhead the integration of DFT into our products.Main Responsibilities:Design and implement DFT components, including EDT, SSN, and MBIST insertion, to enhance product reliability.Develop a profound understanding of DFT architecture and streamlining through RTL files.Utilize...
-
Product Trainer
3 days ago
Bengaluru, Karnataka, India Publicis Production Full time ₹ 1,04,000 - ₹ 1,30,878 per yearKey Responsibilities:1. Training Delivery & ExecutionConduct technical and functional training sessions for employees, clients, and partners on Tech products.Deliver both online and in-person training sessions.Provide hands-on demonstrations and real-world use cases to enhance learning.Ensure all trainees understand product features, integrations,...
-
Reliability Specialist
2 weeks ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 1,80,00,000 - ₹ 2,50,00,000About This RoleWe are seeking a skilled engineer to join our team as a Reliability Specialist.Key Responsibilities:Design, develop and implement systems software that improves the stability, scalability, availability and latency of our products.Take ownership of one or more services and have the freedom to make decisions that benefit our business and...
-
Reliable Systems Specialist
5 days ago
Bengaluru, Karnataka, India beBeeSpecialist Full time ₹ 1,80,00,000 - ₹ 2,10,00,000Job OverviewWe are seeking a highly skilled Reliable Systems Specialist to join our team at a world leader in payments and technology. As a key member of our organization, you will play a critical role in ensuring the security, reliability, and performance of our systems and applications.We process over 259 billion transactions annually across more than 200...
-
Reliability Engineer Specialist
2 weeks ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878Reliability Engineering Specialist">We are seeking a skilled Reliability Engineer to join our team and drive system reliability forward.An ideal candidate will be responsible for ensuring system performance, availability, reliability, efficiency, change management, monitoring, and emergency response.The role requires collaboration with diverse teams to...
-
Product Reliability Strategist
7 days ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 30,00,000 - ₹ 50,00,000Key Leadership RolesDevelop and execute comprehensive product reliability strategies that span design, testing, manufacturing, and field performance for electrolysers.Collaborate closely with design engineering, R&D, and systems engineering teams to integrate reliability into early design stages (FMEA, DfR, HALT, etc.).Partner with operations and quality...