Sre - Cloud Security and Observability

7 days ago


Bengaluru Karnataka, India Rapid Circle Full time

Making a difference and driving positive change is what we do every day at Rapid Circle. Our Cloud Pioneers help our clients in their digital transformation. Are you someone who goes for constant, positive change? Then this vacancy is for you

As a Cloud Pioneer at Rapid Circle, you will work with our customers on different projects. For example, making impact in the healthcare sector, by making research data safely available. But also, awesome projects in the manufacturing or energy market make this job very challenging.

At Rapid Circle we are curious and are constantly improving our expertise to help customers find their way in a rapidly changing world. We share our knowledge and discover new ways to learn.

Rapid Circle is growing rapidly and are therefore looking for the right person for the role. You will be given lots of freedom to develop personally. We also have a lot of in-house knowledge (MVPs) within the Netherlands, Australia, and India. By working closely with your (international) colleagues, you can continue to challenge yourself and create your own growth path. Freedom, entrepreneurship, and development are key at Rapid Circle, so also in the role of a Site Reliability Engineer.
- Develop, manage, and optimize Terraform modules and deployments across multiple environments.
- Handle SRE operational duties including responding to pull requests and ensuring smooth continuous integration and delivery processes.
- Explore and experiment with new technologies through Proof-of-Concepts to enhance existing functionalities or discover new opportunities.
- Automate deployment, configuration, and operational processes to improve efficiency and accuracy.
- Collaborate with development teams to guide system architecture and design, focusing on reliability, efficiency, and scalability.
- Implement and manage observability tools such as Grafana, Prometheus, and New Relic to ensure all critical services are monitored effectively.
- Develop custom reliability tools and frameworks for use by engineering teams.
- Participate in an on-call rotation for critical systems, lead incident responses, and conduct thorough post-mortem analyses.
- Drive system and process efficiencies including capacity planning, configuration management, performance tuning, monitoring, and root cause analysis.
- Act as a consultant within the organization for best practices in infrastructure management and assist teams in effective infrastructure utilization.
- Play a key role in capacity planning to help teams prepare for scaling and growth.

**Must have**:

- In-depth knowledge of cloud service providers like Azure or AWS, with a professional or specialty level certification (security certification is a plus).
- Strong understanding of REST and/or Graph APIs.
- Experience with state machines such as AWS Step Functions or Azure Logic Apps.
- Deep knowledge in telemetry and observability; experience with Prometheus, OpenTelemetry, or DynaTrace is highly desirable.
- Proficiency in Kubernetes with CKA/CKAD certification being advantageous.
- Expertise in Terraform, with experience in setting up pipelines for multi-environment deployments.
- Good programming skills in high-level languages, with a preference for Python. Go, or any other compiled languages is an advantage.
- Familiarity with Observability tools like Grafana, Prometheus, and New Relic.
- Strong project management and organizational skills.
- An open mindset with the ability to quickly adapt to new technologies and learning practices.

**About Cloud Native Engineering**

The Cloud Native Engineering Practice is an organization of engineers who work with our production services throughout their entire life cycle, from design and architecture, through implementation, deployment, and sustaining operation. SRE’s delivers important system properties: reliability, performance, efficiency, and scalability, for the products and platforms that our customers use every day.

SREs work in high-performance squads with expertise on large scale system reliability and in-depth understanding of critical business components architecture, as well as dedicated engineering teams building comprehensive tools, platform and infrastructure.

We need your skills and passion to help make it happen


  • SRE Cloud Security

    1 week ago


    Bengaluru, Karnataka, India Xebia It Architects Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    SRE Cloud Security & ObservabilityLocation: Bangalore (Hybrid 3 days office per week)We are looking for a Cloud Site Reliability Engineer (SRE) with strong expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms.ResponsibilitiesArchitect and optimize Terraform modules for multi-environment deployments.Drive...


  • Bengaluru, Karnataka, India Rubrik Security Cloud Full time

    **About the team**: The Information Security organization advances the overall state of security at Rubrik through purposeful initiatives and coordination of large security projects. Information Security builds technologies, tools, and processes to better enable teams at Rubrik to develop secure software and protect data and systems with appropriate security...

  • Cloud Sre

    1 week ago


    Bengaluru, Karnataka, India Tata Elxsi Full time

    **Cloud SRE Network**: **What you'll do**: - Use SRE practices to improve the reliability, performance and efficiency of the infrastructure. - Support Sky’s Production and Lab Telco Cloud environments, including incident resolution, changes and upgrades. - Create and maintain internal design documentation. - Use Python scripting knowledge to automate...


  • Bengaluru, Karnataka, India Kotak Mahindra Bank Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Principal Manager-Dev Ops Engineering-SUPPORT SERVICES-Applications-CTB Title : Observability Platforms and SRE Engg. The Company : World of Kotak product suite encompasses a powerful suite of cross banking assets, all-in-one stop banking services, securities, and investment banking; insights across a wide spectrum of the major financial and banking...


  • Bengaluru, India Virtusa Full time

    SRE Observability Architect - Description Experience: • Minimum 10 years of relevant work experience with monitoring setup using any product (Dynatrace, Datadog, ELK stack, Splunk, Grafana/Prometheus, etc.) set up in critical production environments. • Minimum 5-6 years of work experience in end-to-end observability covering technical, user experience...


  • Bengaluru, Karnataka, India Xebia It Architects Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    SRE Cloud Native ConnectivityLocation: Bangalore (Hybrid 3 days office per week)Join our Cloud Native Engineering team as a Site Reliability Engineer (SRE) Connectivity and help build next-gen digital platforms with focus on networking, automation, and security.ResponsibilitiesDesign and review system architectures for reliability, scalability, and...


  • Bangalore, Karnataka, India Toast Inc Full time

    The Observability System Administrator role at Toast fits within the Observability Enablement Administration team which is part of Site Reliability Engineering responsible for overseeing Toast production services with a commitment to quality reliability and low latency The Observability Enablement Administration team is responsible for setting the overall...


  • Bengaluru, Karnataka, India Populace World Solutions Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Position- Java/Python/GO/Terraform -with SRE Observability DEVELOPERExperience- 5+ yearsLocation- BangaloreRequired Skills & Qualifications:Solid Development experience of at least 5 years is a must.• Required Technical skills: 6+yrs (Terraform Primary skill& Automation is Primary role)• Development Experience in any one of the programming languages:...


  • Bengaluru, India Toast Full time

    Job Description The Observability System Administrator role at Toast fits within the Observability Enablement & Administration team, which is part of Site Reliability Engineering, responsible for overseeing Toast production services, with a commitment to quality, reliability, and low latency. The Observability Enablement & Administration team is responsible...


  • Bengaluru, India Toast Full time

    The Observability System Administrator role at Toast fits within the Observability Enablement & Administration team, which is part of Site Reliability Engineering, responsible for overseeing Toast production services, with a commitment to quality, reliability, and low latency. The Observability Enablement & Administration team is responsible for setting the...