Senior Site Reliability Engineer

13 hours ago


Bengaluru, Karnataka, India WSO2 Full time ₹ 20,00,000 - ₹ 25,00,000 per year

About WSO2

Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 countries. WSO2's products and platforms—including our next-gen internal developer platform, Choreo—empower organizations to leverage the full potential of artificial intelligence and APIs for securely delivering the next generation of AI-enabled digital services and applications. Our open-source, AI-driven, API-first approach frees developers and architects from vendor lock-in and enables rapid digital product creation. Recognized as leaders by industry analysts, WSO2 has more than 800 employees worldwide with offices in Australia, Brazil, Germany, India, Singapore, Spain, Sri Lanka, the UAE, the UK, and the US, with over USD100M in annual recurring revenue. Visit to learn more. Follow WSO2 on LinkedIn and X (formerly Twitter).

Role Overview:

As a Senior Site Reliability Engineer at WSO2, you'll be instrumental in both supporting our existing customers with their managed or private cloud deployments and initiating new deployments across leading cloud platforms such as Azure, AWS, and GCP. Your mission will include ensuring the seamless operation, scalability, and security of WSO2 cloud services, alongside automating processes to boost both efficiency and reliability.

Responsibilities:

Deployment Setup and Management:

  • Lead the design and implementation of new cloud deployments, tailoring solutions to meet stakeholder requirements on platforms like Azure, AWS, GCP, and Kubernetes.
  • Optimize cloud architectures for scalability and cost-effectiveness, adhering to best practices for networking, security, and access controls.
  • Gain and maintain deep knowledge of cloud infrastructure providers to create robust solutions.
  • Proactively introduce continuous improvements and cost-optimized solutions to enhance infrastructure adaptability and streamline deployment processes.

Automation and CI/CD:

  • Craft and manage automation scripts and infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation.
  • Deploy CI/CD pipelines to streamline software delivery, testing, and deployment processes, ensuring efficient version control and configuration management.

Managed Cloud Support:

  • Ensure the availability of services by configuring system monitors and alerts and attending to critical alerts in a timely manner.
  • Offer continuous support and maintenance for existing deployments, monitoring system performance and swiftly resolving issues to maintain high availability and reliability.
  • Implement strategies for performance optimization and failure prevention, conducting thorough root cause analyses to avoid future issues.
  • Demonstrate strong ownership during critical incident scenarios, ensuring smooth operations under pressure by delivering timely resolutions. Implement effective workarounds and conduct thorough root cause analysis (RCA).

Monitoring and Security:

  • Establish comprehensive monitoring and alerting systems to oversee customer deployments, setting thresholds for incident response.
  • Conduct regular security assessments and stay abreast of the latest threats and trends to fortify cloud environments against risks.

Collaboration and Knowledge Sharing:

  • Foster a collaborative environment with product developers, operations, and QA teams to enhance workflows and product quality.
  • Share knowledge and best practices, contributing to the team's collective expertise through documentation, training, and mentorship.

Skills and Experience:

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • 2+ years of hands-on experience as a Site Reliability Engineer, managing and improving production systems at scale.
  • Strong collaboration and leadership skills, with a proven ability to lead teams, drive cross-functional initiatives, deliver results, and align efforts toward organizational goals.
  • Expertise in cloud platforms such as Azure, AWS, and GCP.
  • Expertise in Linux and virtualization and containerization technologies such as Docker and Kubernetes.
  • A solid understanding of networking, security principles, and compliance frameworks.
  • Proficiency in IaC tools (Terraform, CloudFormation), configuration management (Puppet, Chef, Helm), and scripting languages (Python, Bash, PowerShell).
  • Experience with CI/CD tools (Github Actions, Jenkins) and monitoring/logging tools (Prometheus, ELK stack, Splunk).
  • Exceptional problem-solving, analytical, and troubleshooting skills, coupled with a proactive, customer-centric mindset.
  • Strong communication skills and the ability to collaborate effectively in a team environment.

What WSO2 Offers:

  • A culture that values hard work and flexibility, with a sensible vacation/leave plan.
  • Comprehensive health insurance for you and your family.
  • Competitive compensation package and opportunities for professional growth.

Diversity Drives Innovation

We've built our business on a commitment to diversity and inclusion. We believe it's important to foster an environment that values and respects each individual's strengths, perspectives, and ideas. Doing so not only drives innovation; it also ensures that we can create superior experiences for our customers, partners, and employees worldwide. We value the diversity of our team regardless of race, ethnicity, religion, gender, age, national origin, disability, sexual orientation, or veteran or marital status, and we do not tolerate any form of discrimination.



  • Bengaluru, Karnataka, India Akamai Full time

    Job Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Job Role: - SRE (Senior Site Reliability Engineer)We began life in 2001 as a small, self-funded team of technology specialists. Innovative tech solutions for business We're now a leading global digital consulting firm, providing innovative technology solutions for...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    We are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes. In this role, you will focus onmonitoring,basic troubleshooting, andincident response, helping to maintain high system availability,...


  • Bengaluru, Karnataka, India LanceSoft, Inc. Full time ₹ 6,00,000 - ₹ 8,00,000 per year

    Role DescriptionThis is a full-time on-site role for a Senior Site Reliability Engineer based in Bangalore/Chennai/Pune. The Senior Site Reliability Engineer will be responsible for maintaining and enhancing the reliability and performance of the company's IT infrastructure & Development. Daily tasks include troubleshooting system issues, ensuring system...


  • Bengaluru, Karnataka, India CloudHire Full time

    Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...


  • Bengaluru, Karnataka, India beBeeSiteReliability Full time ₹ 20,00,000 - ₹ 30,00,000

    As a senior site reliability engineer, you will play a critical role in ensuring the stability and scalability of financial platforms.Key Responsibilities:Ensure defined SLAs, SLOs, and SLIs are met for performance, reliability, and uptime.Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and...


  • Bengaluru, Karnataka, India Procore Full time ₹ 5,00,000 - ₹ 8,00,000 per year

    Job DescriptionWe're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...


  • Bengaluru, Karnataka, India Procore Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Job Description We're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...