Observability SRE
2 days ago
Job Description Key Responsibilities: - Design, build, and maintain observability platforms including monitoring, logging, tracing, and alerting systems. - Implement and optimize metrics collection using tools like Prometheus, Grafana, OpenTelemetry, or similar. - Develop and maintain centralized logging infrastructure (e.g., Data Dog, Open Telemetry, Splunk, or Google Cloud Logging). - Implement distributed tracing solutions using tools such as Jaeger, Zip kin, AppDynamics, or OpenTelemetry. - Collaborate with engineering teams to define SLIs, SLOs, and alerting thresholds. - Automate observability workflows and integrate observability into CI/CD pipelines. - Analyze and interpret telemetry data to proactively identify system issues and performance bottlenecks. - Provide training and documentation to teams on best practices in observability. - Continuously evaluate and adopt new observability technologies and practices. Tools & Technologies: - Skilled in AppDynamics, Splunk, Thousand Eyes, ITRS for instrumentation, monitoring, alerting, and incident response. - Deep hands-on knowledge of Terraform, Kubernetes (GKE), GitLab CI/CD. - Familiar with modern observability practices like Open Telemetry, Grafana, Datadog - Strong knowledge of data platforms: Big Query, Cassandra, Kafka, PostgreSQL, MySQL. - Experience with AI/ML-based operations tools for automation, anomaly detection, and predictive alerting. Qualifications: - Bachelor's degree in Computer Science, Engineering, or related fieldor equivalent experience. - Proven experience as an SRE or DevOps engineer, particularly in Google Cloud Platform (GCP). - Expertise in designing and managing observability platforms and tools. - Hands-on experience with monitoring systems like Prometheus, Grafana, Datadog, New Relic, etc. - Proficient in logging solutions such as ELK, Splunk, Fluentd, or Google Cloud Logging. - Familiarity with distributed tracing tools like Open Telemetry, Jaeger, or Zip kin. - Strong scripting and automation skills using Python, Go, Bash, or similar. - Experience with cloud platforms (AWS, GCP, Azure) and their observability services. - Solid understanding of Kubernetes and observability in containerized environments. - Deep knowledge of networking, application performance, and distributed systems. - Exposure to AI/ML-based observability or anomaly detection tools. - Excellent troubleshooting, debugging, and analytical capabilities. - Strong communication and cross-team collaboration skills.
-
SRE Observability Platform Architect
1 day ago
Hyderabad, India Virtusa Full timeSRE Observability Platform Architect - Description Observability Platform Architect Experience: · Minimum 10 years of relevant work experience with monitoring setup using any product (Dynatrace, Datadog, ELK stack, Splunk, Grafana/Prometheus, etc.) set up in critical production environments. · Minimum 5-6 years of work experience in end-to-end...
-
SRE
1 day ago
Hyderabad, India Virtusa Full timeSRE - CREQ Description Bi Tools, API & Batch monitoring Support Responsibilities 1. Troubleshoot Recurring failures & participate in incident triages 2. Troubleshoot issues, both from a production as well as a performance standpoint 3. on-call to be able to respond during App failures 4. Monitor critical applications and services to minimize downtime and...
-
Observability Engineer
5 days ago
Hyderabad, Telangana, India algoleap Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole: Observability EngineerJob Description:Senior Platform EngineerWe are seeking a highly experienced and driven Senior Observability Engineer to lead the design, development, and maintenance of observability solutions across our infrastructure, applications, and services. As a Senior Observability Engineer, you will be at the forefront of implementing...
-
sre
1 week ago
Gurugram, Hyderabad, Noida, India Zensar Full time ₹ 15,00,000 - ₹ 25,00,000 per yearShort Description for Internal CandidatesBachelors degree in Computer Science, IT, or equivalent. - 3–6 years in SRE, Observability, Application Monitoring, or Performance Engineering roles. - Hands-on exposure to Glassbox and Sumo Logic strongly preferred.*Description for CandidatesWe are seeking a Site Reliability Engineer (SRE) with a strong focus on...
-
Sre Implementation
1 week ago
Hyderabad, India Alignity Solutions Full timeDo you love a career where you Experience, Grow & Contribute at the same time, while earning at least 10% above the market? If so, we are excited to have bumped onto you. Learn how we are redefining the meaning of work, and be a part of the team raved by Clients, Job-seekers and Employees. Jobseeker Video Testimonials Employee Glassdoor Reviews If you...
-
Observability Architect
2 weeks ago
Hyderabad, India Tata Consultancy Services Full timeJob Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole OverviewWe are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for deifning...
-
Observability Architect
1 week ago
Hyderabad, India Tata Consultancy Services Full timeJob Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole OverviewWe are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for deifning...
-
Observability Architect
1 week ago
Hyderabad, India Tata Consultancy Services Full timeJob Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole OverviewWe are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for deifning...
-
Observability Architect
2 weeks ago
hyderabad, India Tata Consultancy Services Full timeJob Role: Observability Architect Location: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, Indore Role Overview We are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for...
-
Observability Architect
1 week ago
Hyderabad, India Tata Consultancy Services Full timeJob Role: Observability ArchitectLocation: Hyderabad, Chennai, Bangalore, Pune, Mumbai, Delhi, Kolkata, Noida, IndoreRole Overview We are seeking an experienced observability architect with deep expertise in AppDynamics and end to end observability design across enterprise applications, infra and cloud platforms. The architect will be responsible for...