Site Reliability Engineer

1 week ago


Delhi, India Exoscale Full time
Job Description

Exoscale is the leading Swiss/European cloud service provider.

With services covering the full cloud infrastructure spectrum, from fast-deploying virtual machines to S3-compatible object storage, Exoscale provides a simple and scalable experience that lets its clients focus on their core business.

As part of its ongoing efforts to grow its infrastructure footprint, Exoscale is hiring a Site Reliability Engineer.

The site reliability engineer plays a critical role in ensuring constant availability of the Exoscale platform. The engineering team at Exoscale works on all aspects of its products, from design and development to operation and support.

With an expanding customer base and new products to further advance Exoscale's product portfolio, site reliability engineers build and maintain a wide range of technologies. As users of Exoscale themselves, site reliability engineers also take an active part in improving products.

This position focuses on database persistence and visibility stacks. It covers a range of topics: platform development and maintenance, tooling development, automation, self-service infrastructure delivery, and more.

Some of the challenges you will be working on:

- Design and maintain key platforms such as:

  - Our database systems, consisting of MySQL, FoundationDB and Apache Cassandra.

  - Our data stream processing platform, based on Apache Kafka.

  - Our visibility stack, based on Prometheus-compatible components.

  - Our logging platform, based on the Elastic ecosystem.

- Automate our database provisioning and maintenance operations.

- Help design our next tracing service.

- Help improve the developer experience (DX) through the delivery of self-service systems and pipelines.

- Contribute to the overall design and the architecture of the Exoscale platform systems.

- Contribute to internal tooling development.

- Improve our systems and processes to be scalable and highly available, helping achieve outstanding SLAs.

- Participate in code and change reviews.

- Take part in the on-call rotation after a training period.

Ideal candidates:

- Have solid experience dealing with Linux on a daily basis.

- Have a good knowledge of Apache Kafka.

- Are familiar with transactional database systems like MySQL and PostgreSQL.

- Are used to working with Prometheus monitoring and its ecosystem, such as Grafana and Mimir.

- Are familiar with log management platforms such as the Elastic ecosystem.

- Have experience with containerization; Kubernetes is a plus.

- Have good experience with Golang; Clojure and Python are a plus.

- Have experience with configuration management solutions and large-scale infrastructure.

- Love to automate anything that can be automated.

- Are curious and autonomous, and embrace learning new things every day.

- Are team players and are comfortable working in a distributed team.

- Have good English communication skills, written and spoken.

What we offer:

- Flexible working hours and working from home.

- Autonomous working conditions with a lot of freedom to create.

- Modern working atmosphere and centrally located office with great public transport connections.

- Team events as well as training and further education.

Candidates who are not familiar with all the topics above but are willing to learn are encouraged to apply.

We look forward to receiving your application.


