Senior Site Reliability Engineer

3 days ago


Bengaluru, Karnataka, India SolarWinds Full time

About the Role:

As a Senior Staff Site Reliability Engineer (SRE) at SolarWinds, you will drive the reliability, scalability, and performance of our Observability Platform. This role focuses on managing SaaS infrastructure at scale, improving system reliability through cloud-native architecture, advanced data platform operations, and automation. You will collaborate with engineering, security, and product teams to ensure operational excellence and lead a team of SREs to maintain high service standards.

Key Responsibilities:

  • Lead the design, deployment, and operation of SaaS infrastructure ensuring high availability and reliability.
  • Build, operate, and scale Kubernetes clusters (EKS, GKE, AKS, OpenShift) in production.
  • Design and manage data platform infrastructure including Kafka, ClickHouse, and event-driven systems.
  • Implement cloud-native design patterns and scalable architectures across AWS and Azure.
  • Automate infrastructure provisioning and deployment using Terraform, Helm, ArgoCD, CloudFormation, and other Infrastructure as Code (IaC) tools.
  • Maintain monitoring, logging, and observability systems using Prometheus, Grafana, Datadog, CloudWatch, ELK/Opensearch, and OTel/Jaeger.
  • Develop and maintain disaster recovery plans and high availability strategies.
  • Lead incident response, conduct blameless postmortems, and implement preventive measures.
  • Mentor and guide SRE and DevOps engineers to improve team efficiency and adherence to best practices.
  • Collaborate cross-functionally with engineering, product, and security teams to optimize system performance, reliability, and cost efficiency.

MUST HAVE:

  • 13+ years in SRE, DevOps, Platform Engineering, or equivalent roles.
  • 8+ years in SaaS infrastructure management, cloud-native system design, and production operations.
  • 5+ years managing Kubernetes clusters at scale in production environments.
  • Hands-on experience with data platforms: Kafka, ClickHouse, or similar.
  • Strong programming/scripting in Python, Go, Bash, or equivalent.
  • Infrastructure automation using Terraform, Helm, ArgoCD, CloudFormation.
  • Experience with CI/CD pipelines, GitOps, and deployment automation.
  • Expertise in observability: monitoring, logging, tracing (Prometheus, Grafana, Datadog, CloudWatch, ELK/Opensearch, OTel/Jaeger).
  • Strong understanding of disaster recovery principles and high availability architectures.
  • Security operations knowledge: IAM, encryption, cloud security best practices.
  • Proven leadership and mentoring experience in SRE/DevOps teams.

Preferred Qualifications:

  • Experience with Karpenter or KEDA for Kubernetes autoscaling.
  • Experience managing distributed SaaS services across multiple regions.
  • Familiarity with FinOps or cloud cost optimization.
  • Experience with protocol buffers (Buf), event-driven system optimizations, or cloud-native databases.
  • Knowledge of container orchestration patterns (ECS, EKS, GKE, AKS, OpenShift).


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India Josys Full time US$ 1,50,000 - US$ 2,00,000 per year

    Senior Site Reliability Engineer (SRE)About JOSYSJosys, a dynamic B2B SaaS platform startup, has embarked on a mission to revolutionize IT operations globally, following an exceptional launch in Japan and securing $125 million in Series A and B funding. Our platform enables businesses to conquer the complexities of work-from-anywhere setups, rapid digital...


  • Bengaluru, Karnataka, India HireAlpha Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    We're Hiring | Senior Site Reliability Engineer (SRE)Bangalore | HybridPermanent RoleAre you ready to help shape the future of cloud contact centers? we're building scalable, reliable, and cutting-edge infrastructure for world-class customer experiences — and we're looking for aSenior SREto join our teamWhat you'll do:Lead efforts in building a seamless ...


  • Bengaluru, Karnataka, India Aerospike Full time

    Job DescriptionAbout AerospikeAt Aerospike, we dream big. Our focus is helping companies tackle seemingly insurmountable problems and doing whats never been done before. That is why we developed the world&aposs leading real-time data platform that powers mission-critical applications at the world&aposs most innovative, category-disrupting companies....


  • Bengaluru, Karnataka, India Aerospike Full time US$ 1,50,000 - US$ 2,00,000 per year

    About AerospikeAt Aerospike, we dream big. Our focus is helping companies tackle seemingly insurmountable problems and doing what's never been done before. That is why we developed the world's leading real-time data platform that powers mission-critical applications at the world's most innovative, category-disrupting companies. Aerospike companies have...


  • Bengaluru, Karnataka, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years.Synechron – BangaloreJob Role: - SRE (Senior Site Reliability Engineer)Job Location: - BangaloreNotice Period: Within 30daysAbout SynechronWe began life in 2001 as a small, self-funded team of technology specialists. Since then, we've grown our organization to 14,500+...


  • Bengaluru, Karnataka, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we've grown our organization to...


  • Bengaluru, Karnataka, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we've grown our organization to...


  • Bengaluru, Karnataka, India Aerospike Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    About Aerospike Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases. Global leaders, including Adobe, Airtel, Barclays,...