Site Reliability Engineer

3 days ago


Hyderabad, Telangana, India VXI Global Solutions Full time

We are looking for a Site Reliability Engineer with 3+ years of experience to design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, Google Cloud Monitoring, and OpenTelemetry, along with exposure to SolarWinds. You should be comfortable working with metrics, logs, and traces, and be able to correlate telemetry data to proactively detect, diagnose, and resolve performance issues.

Key Responsibilities:

  • Design and maintain observability pipelines using OpenTelemetry, Prometheus, and Grafana.
  • Build dashboards and alerts to monitor system health, application performance, and business KPIs.
  • Integrate observability solutions with Google Cloud Platform services and SolarWinds.
  • Correlate logs, metrics, and traces to troubleshoot incidents and reduce MTTR.
  • Collaborate with SREs, DevOps, and development teams to improve end-to-end system observability.
  • Implement best practices for telemetry data collection, enrichment, storage, and visualization.

Requirements:

  • Strong experience with Prometheus and Grafana for monitoring and alerting.
  • Proficiency in OpenTelemetry for instrumenting distributed systems.
  • Working knowledge of observability tools in Google Cloud (e.g., Cloud Monitoring, Logging, Trace).
  • Exposure to SolarWinds for network and infrastructure monitoring.
  • Solid understanding of telemetry data types: metrics, logs, and traces.
  • Ability to correlate and analyze multi-source observability data.
  • Scripting skills (Python, Bash) and familiarity with Infrastructure-as-Code are a plus.

Preferred Qualifications:

  • Experience in Site Reliability Engineering or Platform Engineering roles.
  • Knowledge of SLIs/SLOs and performance benchmarking.
  • Experience with APM tools (e.g., Datadog, New Relic) is a plus.


  • Hyderabad, Telangana, India Oracle Financial Services Software Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Principal Site Reliability Engineer. Oracle is seeking a motivated Principal Site Reliability Engineer who thrives in a fast-paced, rapidly evolving technology environment. This position requires broad knowledge of Mainframe zLinux, DB2, zVM, and AIX. The Site Reliability Engineer is expected to work with multiple service and product development teams,...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Site Reliability Engineer (SRE). At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, Telangana, India Technology Next Full time ₹ 20,00,000 - ₹ 30,00,000 per year

    Urgently hiring for Site Reliability Engineer (SRE) / Chaos Engineer. Location: Hyderabad. Job Type: Full-time, Permanent. Job Description: We are looking for an experienced Site Reliability Engineer (SRE) with strong Python automation skills (Boto3 required) and hands-on experience in chaos engineering to improve system reliability and resilience. The ideal...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, Telangana, India BYLD Group Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job Title: Site Reliability Engineer (SRE) - DataDog / AWS Lambda / DynamoDB / Serverless. Location: Bangalore / Pune / Hyderabad. Experience: 5-10 Years. About The Role: We are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in DataDog integration, AWS Lambda, DynamoDB, and Serverless architectures. The ideal candidate will...


  • Hyderabad, Telangana, India Evalify-IQ Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Skills Required: AWS, Azure, Terraform, CloudFormation, Pulumi, CI/CD, GitHub Actions, GitLab CI, Jenkins, ArgoCD, Prometheus, Splunk, Grafana, CloudWatch, Datadog, SRE, Site Reliability, Python, PowerShell, Shell, Go, Kubernetes, Docker, Performance Tuning, Performance Enhancements, Performance. Experience Range: 2 - 5...


  • Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Oracle is seeking a motivated Principal Site Reliability Engineer who thrives in a fast-paced, rapidly evolving technology environment. This position requires broad knowledge of Mainframe zLinux, DB2, zVM, and AIX. The Site Reliability Engineer is expected to work with multiple service and product development teams, identifying cross-team issues that...


  • Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Oracle is seeking a motivated Principal Site Reliability Engineer who thrives in a fast-paced, rapidly evolving technology environment. This position requires broad knowledge of Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and monitoring to provide the stability,...


  • Hyderabad, Telangana, India Jade Global Software Pvt Ltd Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability. Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog. Location: Hyderabad...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    SRE (Site Reliability Engineer). Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...