High Availability System Engineer

2 days ago


Bengaluru, Karnataka, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
Senior SRE Position Overview

About the Role:

This senior position involves leading infrastructure management, cloud-native system design, and production operations. The ideal candidate will have extensive experience in managing Kubernetes clusters at scale, data platforms like Kafka and ClickHouse, and a strong programming background.

Key Responsibilities:

  • Design and Deploy SaaS Infrastructure - Lead the development of scalable and highly available systems to support growing user demands.
  • Kubernetes Cluster Management - Build, operate, and scale EKS, GKE, AKS, or OpenShift clusters in production environments.
  • Data Platform Operations - Manage infrastructure for Kafka, ClickHouse, and event-driven systems to ensure seamless data flow.
  • Cloud-Native Design Patterns - Implement efficient, scalable architectures on AWS and Azure using cloud-native design patterns.
  • Automation and Orchestration - Automate infrastructure provisioning with Terraform, Helm, ArgoCD, CloudFormation, and other IaC tools.
  • Maintenance and Monitoring - Ensure high service standards by maintaining monitoring, logging, and observability systems using Prometheus, Grafana, Datadog, CloudWatch, ELK/Opensearch, and OTel/Jaeger.
  • Leadership and Mentorship - Guide SRE and DevOps engineers to improve team efficiency and adherence to best practices.

Required Skills:

  • SRE Experience - 13+ years in equivalent roles.
  • SaaS Infrastructure Management - 8+ years in cloud-native system design and production operations.
  • Kubernetes Expertise - 5+ years managing clusters at scale in production environments.
  • Data Platforms - Hands-on experience with Kafka, ClickHouse, or similar.
  • Programming Background - Strong skills in Python, Go, Bash, or equivalent languages.
  • Automation Tools - Proficiency in Terraform, Helm, ArgoCD, CloudFormation.
  • CICD Pipelines - Experience with GitOps and deployment automation.
  • Observability - Expertise in monitoring, logging, tracing.
  • Disaster Recovery - Strong understanding of principles and high availability architectures.
  • Security Operations - Knowledge of IAM, encryption, cloud security best practices.
  • Leadership - Proven leadership and mentoring experience in SRE/DevOps teams.


  • Bengaluru, Karnataka, India beBeeReliability Full time ₹ 25,00,000 - ₹ 35,00,000

    Site Reliability Engineering RoleOur organization seeks a skilled Site Reliability Engineer (SRE) to guarantee the dependability, scalability, and performance of our critical systems. The SRE will collaborate closely with development and operations teams to design and maintain high availability services, automate operational tasks, and monitor system...


  • Bengaluru, Karnataka, India beBeeLinux Full time ₹ 20,00,000 - ₹ 25,00,000

    System Operations SpecialistWe are seeking an experienced System Operations Specialist to join our global team. The ideal candidate will have a strong background in server infrastructure, virtualization, and containerization.Key Responsibilities:Provide high-level support for Linux Server platforms, including on-call coverage and collaboration with...


  • Bengaluru, Karnataka, India beBeeReliability Full time US$ 1,00,000 - US$ 1,60,000

    Job Description We are seeking a seasoned system reliability expert to lead the design, implementation, and maintenance of our mission-critical systems. The successful candidate will work closely with cross-functional teams to ensure the scalability, reliability, and performance of our infrastructure. About This Role Infrastructure Management: Develop and...


  • Bengaluru, Karnataka, India beBeeReliability Full time US$ 90,000 - US$ 1,20,000

    Cloud Engineer - High Availability SpecialistWe are seeking a highly skilled Cloud Engineer to join our team and help us design, build, and manage scalable and reliable cloud environments. The ideal candidate will have hands-on experience with AWS services and a deep understanding of Site Reliability Engineering (SRE) principles.Job Description:The...


  • Bengaluru, Karnataka, India beBeeEngineering Full time ₹ 1,04,000 - ₹ 1,30,878

    Job DescriptionArchitect and build high-performance, scalable distributed systems and services. Develop and maintain highly available, low-latency platforms with stringent service level agreements (SLAs).Design modular APIs and scalable storage solutions for transactional and analytical systems.Mentor engineers and drive best practices across the platform...


  • Bengaluru, Karnataka, India beBeeInfrastructure Full time ₹ 9,00,000 - ₹ 12,00,000

    Job DescriptionTroubleshoot and resolve full-stack issues across hardware, software, application, and network layers to ensure seamless system performance.Key Deliverables:Design, build, and maintain scalable and high-availability core infrastructure solutions.Automate infrastructure tasks using Python and monitor distributed systems performance for optimal...


  • Bengaluru, Karnataka, India beBeeScalability Full time ₹ 20,00,000 - ₹ 30,00,000

    Job Title: Performance and Scalability ExpertRole Overview:We are seeking a highly skilled and motivated performance and scalability expert to ensure the reliability, efficiency, and performance of our systems and applications.Key Responsibilities:Performance Monitoring and Optimization:Use monitoring tools to optimize system performance and availability....


  • Bengaluru, Karnataka, India beBeehydraulic engineer Full time ₹ 12,00,000 - ₹ 20,00,000

    Job Title:">Senior Hydraulic Systems Design Expert"> ">Job Summary:"]}We are seeking an experienced Senior Hydraulic Systems Design Expert to join our team. As a key member of the engineering department, you will be responsible for designing, developing, and improving hydraulic systems, components, and handling ECNs and ECRs for power equipment and...


  • Bengaluru, Karnataka, India beBeeSoftware Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Senior Fullstack Software Developer RoleWe are seeking an experienced fullstack developer to join our engineering team. As a key member, you will design and build high-performance systems ensuring that solutions created are both robust and efficient.Key Responsibilities:Take ownership of complex projects from concept to deployment.Drive project delivery...


  • Bengaluru, Karnataka, India beBeeEngineer Full time ₹ 20,00,000 - ₹ 25,00,000

    Site Reliability EngineerIn today's fast-paced digital landscape, ensuring the reliability and performance of large-scale systems is crucial for business success.We're seeking an experienced professional to fill this critical role, collaborating with cross-functional teams to design and implement scalable, fault-tolerant, and highly available systems.Develop...