Observability SRE

2 days ago


Hyderabad, India Ifintalent Global Private Limited Full time

Job Description Key Responsibilities: - Design, build, and maintain observability platforms including monitoring, logging, tracing, and alerting systems. - Implement and optimize metrics collection using tools like Prometheus, Grafana, OpenTelemetry, or similar. - Develop and maintain centralized logging infrastructure (e.g., Data Dog, Open Telemetry, Splunk, or Google Cloud Logging). - Implement distributed tracing solutions using tools such as Jaeger, Zip kin, AppDynamics, or OpenTelemetry. - Collaborate with engineering teams to define SLIs, SLOs, and alerting thresholds. - Automate observability workflows and integrate observability into CI/CD pipelines. - Analyze and interpret telemetry data to proactively identify system issues and performance bottlenecks. - Provide training and documentation to teams on best practices in observability. - Continuously evaluate and adopt new observability technologies and practices. Tools & Technologies: - Skilled in AppDynamics, Splunk, Thousand Eyes, ITRS for instrumentation, monitoring, alerting, and incident response. - Deep hands-on knowledge of Terraform, Kubernetes (GKE), GitLab CI/CD. - Familiar with modern observability practices like Open Telemetry, Grafana, Datadog - Strong knowledge of data platforms: Big Query, Cassandra, Kafka, PostgreSQL, MySQL. - Experience with AI/ML-based operations tools for automation, anomaly detection, and predictive alerting. Qualifications: - Bachelor's degree in Computer Science, Engineering, or related fieldor equivalent experience. - Proven experience as an SRE or DevOps engineer, particularly in Google Cloud Platform (GCP). - Expertise in designing and managing observability platforms and tools. - Hands-on experience with monitoring systems like Prometheus, Grafana, Datadog, New Relic, etc. - Proficient in logging solutions such as ELK, Splunk, Fluentd, or Google Cloud Logging. - Familiarity with distributed tracing tools like Open Telemetry, Jaeger, or Zip kin. - Strong scripting and automation skills using Python, Go, Bash, or similar. - Experience with cloud platforms (AWS, GCP, Azure) and their observability services. - Solid understanding of Kubernetes and observability in containerized environments. - Deep knowledge of networking, application performance, and distributed systems. - Exposure to AI/ML-based observability or anomaly detection tools. - Excellent troubleshooting, debugging, and analytical capabilities. - Strong communication and cross-team collaboration skills.



  • Hyderabad, India Virtusa Full time

    SRE Observability Platform Architect - Description Observability Platform Architect Experience: · Minimum 10 years of relevant work experience with monitoring setup using any product (Dynatrace, Datadog, ELK stack, Splunk, Grafana/Prometheus, etc.) set up in critical production environments. · Minimum 5-6 years of work experience in end-to-end...

  • SRE

    1 day ago


    Hyderabad, India Virtusa Full time

    SRE - CREQ Description Bi Tools, API & Batch monitoring Support Responsibilities 1. Troubleshoot Recurring failures & participate in incident triages 2. Troubleshoot issues, both from a production as well as a performance standpoint 3. on-call to be able to respond during App failures 4. Monitor critical applications and services to minimize downtime and...


  • Hyderabad, Telangana, India algoleap Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of implementing...

  • sre

    1 week ago


    Gurugram, Hyderabad, Noida, India Zensar Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Short Description for Internal CandidatesBachelors degree in Computer Science, IT, or equivalent. - 3–6 years in SRE, Observability, Application Monitoring, or Performance Engineering roles. - Hands-on exposure to Glassbox and Sumo Logic strongly preferred.*Description for CandidatesWe are seeking a Site Reliability Engineer (SRE) with a strong focus on...

  • Sre Implementation

    1 week ago


    Hyderabad, India Alignity Solutions Full time

    Do you love a career where you Experience, Grow & Contribute at the same time, while earning at least 10% above the market? If so, we are excited to have bumped onto you. Learn how we are redefining the meaning of work, and be a part of the team raved by Clients, Job-seekers and Employees. Jobseeker Video Testimonials Employee Glassdoor Reviews If you...


  • Hyderabad, India Tata Consultancy Services Full time

    Job Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole OverviewWe are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for deifning...


  • Hyderabad, India Tata Consultancy Services Full time

    Job Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole OverviewWe are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for deifning...


  • Hyderabad, India Tata Consultancy Services Full time

    Job Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole OverviewWe are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for deifning...


  • hyderabad, India Tata Consultancy Services Full time

    Job Role: Observability Architect Location: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, Indore Role Overview We are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for...


  • Hyderabad, India Tata Consultancy Services Full time

    Job Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole Overview We are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for...