SRE - Cloud Security and Observability

2 days ago

Bengaluru, Karnataka, India Rapid Circle Full time

Making a difference and driving positive change is what we do every day at Rapid Circle. Our Cloud Pioneers help our clients in their digital transformation. Are you someone who goes for constant, positive change? Then this vacancy is for you.

As a Cloud Pioneer at Rapid Circle, you will work with our customers on a variety of projects, for example making an impact in the healthcare sector by making research data safely available. Equally exciting projects in the manufacturing or energy market make this job very challenging.

At Rapid Circle we are curious and are constantly improving our expertise to help customers find their way in a rapidly changing world. We share our knowledge and discover new ways to learn.

Rapid Circle is growing rapidly and is therefore looking for the right person for this role. You will be given lots of freedom to develop personally. We also have a lot of in-house knowledge (MVPs) within the Netherlands, Australia, and India. By working closely with your (international) colleagues, you can continue to challenge yourself and create your own growth path. Freedom, entrepreneurship, and development are key at Rapid Circle, and that holds for the role of Site Reliability Engineer too.
- Develop, manage, and optimize Terraform modules and deployments across multiple environments.
- Handle SRE operational duties including responding to pull requests and ensuring smooth continuous integration and delivery processes.
- Explore and experiment with new technologies through Proof-of-Concepts to enhance existing functionalities or discover new opportunities.
- Automate deployment, configuration, and operational processes to improve efficiency and accuracy.
- Collaborate with development teams to guide system architecture and design, focusing on reliability, efficiency, and scalability.
- Implement and manage observability tools such as Grafana, Prometheus, and New Relic to ensure all critical services are monitored effectively.
- Develop custom reliability tools and frameworks for use by engineering teams.
- Participate in an on-call rotation for critical systems, lead incident responses, and conduct thorough post-mortem analyses.
- Drive system and process efficiencies including capacity planning, configuration management, performance tuning, monitoring, and root cause analysis.
- Act as a consultant within the organization for best practices in infrastructure management and assist teams in effective infrastructure utilization.
- Play a key role in capacity planning to help teams prepare for scaling and growth.

**Must have**:

- In-depth knowledge of cloud service providers like Azure or AWS, with a professional or specialty level certification (security certification is a plus).
- Strong understanding of REST and/or Graph APIs.
- Experience with state machines such as AWS Step Functions or Azure Logic Apps.
- Deep knowledge of telemetry and observability; experience with Prometheus, OpenTelemetry, or Dynatrace is highly desirable.
- Proficiency in Kubernetes with CKA/CKAD certification being advantageous.
- Expertise in Terraform, with experience in setting up pipelines for multi-environment deployments.
- Good programming skills in a high-level language, preferably Python; Go or another compiled language is an advantage.
- Familiarity with Observability tools like Grafana, Prometheus, and New Relic.
- Strong project management and organizational skills.
- An open mindset with the ability to quickly adapt to new technologies and learning practices.

**About Cloud Native Engineering**

The Cloud Native Engineering Practice is an organization of engineers who work with our production services throughout their entire life cycle, from design and architecture, through implementation and deployment, to sustained operation. SREs deliver important system properties (reliability, performance, efficiency, and scalability) for the products and platforms that our customers use every day.

SREs work in high-performance squads with expertise in large-scale system reliability and an in-depth understanding of the architecture of critical business components, alongside dedicated engineering teams building comprehensive tools, platforms, and infrastructure.

We need your skills and passion to help make it happen.

SRE - DevOps and Observability

3 weeks ago

Bangalore, Karnataka, India EMBARKGCC SERVICES PRIVATE LIMITED Full time

Key Responsibilities - Own and manage AKS-based Kubernetes clusters (multi-tenant, namespace isolation). - Implement and maintain GitOps workflows using FluxCD and Helm. - Manage infrastructure as code with Terraform. - Build and operate observability stack (Prometheus, Grafana, Loki, Tempo) and integrate with external tools (Datadog, Dynatrace, Grafana Cloud). - Implement...
SRE - DevOps and Observability

1 week ago

Bengaluru, India EMBARKGCC SERVICES PRIVATE LIMITED Full time

Key Responsibilities - Own and manage AKS-based Kubernetes clusters (multi-tenant, namespace isolation). - Implement and maintain GitOps workflows using FluxCD and Helm. - Manage infrastructure as code with Terraform. - Build and operate observability stack (Prometheus, Grafana, Loki, Tempo) and integrate with external tools (Datadog, Dynatrace, Grafana...
GCP Observability Engineer

6 days ago

Bengaluru, India Whatjobs IN C2 Full time

Role Description We are seeking an experienced and motivated engineer to join the Observability fleet which focuses on delivering tools in private and public cloud environments. The role focuses on developing and modernizing Observability platforms for cloud-native and hybrid applications, with a primary focus on Google Cloud Platform (GCP). This role...
Cloud SRE

4 days ago

Bengaluru, Karnataka, India Tata Elxsi Full time

**Cloud SRE Network**: **What you'll do**: - Use SRE practices to improve the reliability, performance and efficiency of the infrastructure. - Support Sky’s Production and Lab Telco Cloud environments, including incident resolution, changes and upgrades. - Create and maintain internal design documentation. - Use Python scripting knowledge to automate...
SRE Observability Architect

1 week ago

Bengaluru, India Virtusa Full time

SRE Observability Architect - Description Experience: • Minimum 10 years of relevant work experience with monitoring setup using any product (Dynatrace, Datadog, ELK stack, Splunk, Grafana/Prometheus, etc.) set up in critical production environments. • Minimum 5-6 years of work experience in end-to-end observability covering technical, user experience...
DevOps / SRE / Cloud Platform & Observability

2 weeks ago

Bengaluru, Karnataka, India SAP Full time

**We help the world run better** At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...
DEVOPS SRE

2 weeks ago

Bengaluru, India RARR Technologies Full time

Job Description Key Responsibilities: SRE & DevOps Strategy: - Design and develop a robust SRE ecosystem following industry best practices. - Formulate SRE strategies based on emerging trends and organizational needs. - Implement best practices into local functional teams for consistent adoption. Platform & Automation: - Develop scaffolding libraries for...
Cloud Engineer-Observability

3 days ago

Bengaluru, India Smarsh Full time

About the team: The Observability team builds and manages the single telemetry and observability service used by all product teams on the Smarsh platform. It provides "as a service" telemetry, monitoring, and visualization capabilities that enable our product teams to operate, support, and triage the applications and services under their product portfolio. We...
