SRE (Site Reliability Engineer)

21 hours ago


India VXI Global Solutions Full time

Job Description

It's fun to work in a company where people truly BELIEVE in what they are doing

We're committed to bringing passion and customer focus to the business.

Job Summary:
We are seeking a skilled SRE Engineer to design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, Cloud Monitoring, and OpenTelemetry, along with exposure to SolarWinds. You should be comfortable working with metrics, logs, and traces, and be able to correlate telemetry data to proactively detect, diagnose, and resolve performance issues.

Key Responsibilities:

  • Design and maintain observability pipelines using OpenTelemetry, Prometheus, and Grafana.
  • Build dashboards and alerts to monitor system health, application performance, and business KPIs.
  • Integrate observability solutions with Google Cloud Platform services and SolarWinds.
  • Correlate logs, metrics, and traces to troubleshoot incidents and reduce MTTR.
  • Collaborate with SREs, DevOps, and development teams to improve end-to-end system observability.
  • Implement best practices for telemetry data collection, enrichment, storage, and visualization.

Requirements:

  • Strong experience with Prometheus and Grafana for monitoring and alerting.
  • Proficiency in OpenTelemetry for instrumenting distributed systems.
  • Working knowledge of observability tools in Google Cloud (e.g., Cloud Monitoring, Logging, Trace).
  • Exposure to SolarWinds for network and infrastructure monitoring.
  • Solid understanding of telemetry data types: metrics, logs, and traces.
  • Ability to correlate and analyze multi-source observability data.
  • Scripting skills (Python, Bash) and familiarity with Infrastructure-as-Code is a plus.

Preferred Qualifications:

  • Experience in Site Reliability Engineering or Platform Engineering roles.
  • Knowledge of SLIs/SLOs and performance benchmarking.
  • Experience with APM tools (e.g., Datadog, New Relic) is a plus.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us

Original Title: SRE (Site Reliability Engineer)

Req Id: R25_04397

Posted At: Tue Sep :00:00 GMT+0000 (Coordinated Universal Time)

Information Systems

Full Time

India,



  • India VXI Global Solutions Full time US$ 90,000 - US$ 1,20,000 per year

    Job DescriptionIt's fun to work in a company where people truly BELIEVE in what they are doing We're committed to bringing passion and customer focus to the business.Job Summary:We are seeking a skilled SRE Engineer to design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes: - Strong SRE (Site Reliability Engineering) experience - DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. - Excellent troubleshooting and debugging skills (infrastructure + application level) - Perseverance – must push through complex/challenging issues without...


  • Mumbai, India Natobotics Full time

    Job Description Were on an exciting journey with our client and we want you to join us. With our client, you will be exposed to the latest technologies and work with some of the brightest minds in the industry. Our client is leading Banking company so you will be playing a key role as a VP Site Reliability Engineering (SRE), who can assist with the...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes : Strong SRE (Site Reliability Engineering) experience DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. Excellent troubleshooting and debugging skills (infrastructure + application level) Perseverance – must push through complex/challenging issues without giving up...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes : Strong SRE (Site Reliability Engineering) experience DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. Excellent troubleshooting and debugging skills (infrastructure + application level) Perseverance – must push through complex/challenging issues...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors)Key Attributes:Strong SRE (Site Reliability Engineering) experienceDevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc.Excellent troubleshooting and debugging skills (infrastructure + application level)Perseverance – must push through complex/challenging issues without giving upAble to...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes : Strong SRE (Site Reliability Engineering) experience DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. Excellent troubleshooting and debugging skills (infrastructure + application level) Perseverance – must push through complex/challenging issues...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Bangalore/ RemoteType - ContractWork Ex - 4-6 yrsWe're working with a AI product company that's building the next generation of GenAI powered developer platforms.We're looking for an experienced Site Reliability Engineer to join their Platform Engineering...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Bangalore/ Remote Type - Contract Work Ex - 4-6 yrs We're working with a AI product company that's building the next generation of GenAI powered developer platforms . We're looking for an experienced Site Reliability Engineer to join their Platform...


  • Bengaluru, India VidPro Consultancy Services Full time

    Job Description Experience: 2.55 Years Location: Bangalore (On-site) Work Mode: 5 Days WFO Mandatory Skills: Site Reliability engineer or SRE ,Linux, System architecture, TCP/IP. HTTP,DNS ,Grafana, Prometheus and Loki Troubleshooting ,Root cause, complex systems ,Ci/CD, Docker, Kubernetes Experience : 2-4 years of relevant experience Key Skills...