Site Reliability Engineer
4 days ago
JOB DESCRIPTION Req ID: We are currently seeking a Site Reliability Engineer to join our team in Noida, Uttar Pradesh (IN-UP), India (IN). Job Description – Site Reliability Engineer (5–8 Years Experience) Role Overview We are seeking an experienced Site Reliability Engineer (SRE) with 5–8 years of expertise in ensuring the reliability, scalability, and performance of critical systems. The ideal candidate must have strong hands-on experience in observability, monitoring, alerting, Splunk, and telemetry, along with solid understanding of cloud-native infrastructure and automation. Key Responsibilities Implement and maintain observability across metrics, logs, traces, and events. Build and optimize monitoring dashboards and service health indicators using Splunk or similar tools. Configure, fine-tune, and maintain proactive alerts with high signal-to-noise ratio. Lead incident response, conduct root cause analysis (RCA), and drive long-term corrective measures. Define, measure, and enhance SLIs, SLOs, reliability KPIs, and error budgets. Improve system performance, scalability, and availability across environments. Automate monitoring, alerting, and operational workflows to reduce manual toil. Standardize and maintain telemetry instrumentation across services. Own and optimize logging pipelines, ingestion, parsing, indexing, and retention. Collaborate with engineering teams to integrate reliability best practices into application development. Participate in on-call rotations and ensure timely incident resolution. Partner with cloud/platform teams to enhance deployment readiness and operational stability. Required Skills & Experience 5–8 years of experience in SRE, DevOps, or system reliability roles. Strong hands-on experience with Splunk (queries, dashboards, alerts, ingestion). Solid understanding of observability tools (Splunk, Prometheus, Grafana, Datadog, OpenTelemetry, etc.). Strong knowledge of Linux, networking fundamentals, and distributed systems. Experience with cloud platforms (AWS / Azure / GCP) and container technologies (Docker, Kubernetes). Proficiency in scripting (Python, Shell, or similar). Experience with production on-call environments and incident management. Familiarity with SLIs/SLOs, capacity planning, and reliability engineering concepts. Experience with OpenTelemetry–based instrumentation. Exposure to APM tools (Dynatrace, AppDynamics, New Relic). Knowledge of IaC tools like Terraform or Ansible. Understanding of microservices architecture and CI/CD pipelines. About NTT DATA NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. our consulting and Industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D.
-
Site Reliability Engineer
5 hours ago
Noida, Uttar Pradesh, India CorroHealth Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
-
Site Reliability Engineer
6 days ago
Noida, India NTT Data Full timeJob Description NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Site Reliability Engineer to join our team in Noida, Uttar Pradesh (IN-UP), India (IN). Job Description - Site...
-
Site Reliability Engineer
3 days ago
Noida, India NTT DATA North America Full timeJob Description Req ID: 350360 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Site Reliability Engineer to join our team in Noida, Uttar Pradesh (IN-UP), India (IN). Role Overview...
-
Site Reliability Engineer
5 days ago
Noida, Uttar Pradesh, India Microsoft Full timeOverview Do you want to work on a product that is used by millions of people around the world daily, and growing rapidly? Do you care deeply about how software is designed with a focus on supporting global scale? Do you want to be part of a world-class team that continuously pushes the boundary of service and engineering excellence? The Web...
-
Site Reliability Engineer
5 days ago
Noida, India Microsoft Full timeJob Description Overview Do you want to work on a product that is used by millions of people around the world daily, and growing rapidly Do you care deeply about how software is designed with a focus on supporting global scale Do you want to be part of a world-class team that continuously pushes the boundary of service and engineering excellence TheWeb...
-
Site Reliability Engineer
2 days ago
Noida, Uttar Pradesh, India MyOperator Full timeAbout Us:MyOperator is a Business AI Operator, a category leader that unifies WhatsApp, Calls, and AI-powered chat & voice bots into one intelligent business communication platform. Unlike fragmented communication tools, MyOperator combines automation, intelligence, and workflow integration to help businesses run WhatsApp campaigns, manage calls, deploy AI...
-
Site Reliability Engineer
3 weeks ago
Greater Noida, India TRH Consultancy Services Full timeDescription : We are seeking a Site Reliability Engineer with expertise in OpenTelemetry to join our team in India. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our systems while implementing best practices for observability and monitoring.Responsibilities : - Design, implement, and maintain reliable...
-
Site Reliability Engineer
6 days ago
Noida Berger Tower, India Thales Full timeLocation: Noida, IndiaThales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become smarter and much more....
-
Site Reliability Engineer
5 days ago
Noida, Uttar Pradesh, India NTT DATA Full timeImplement and maintain observability across metrics, logs, traces, and events. Build and optimize monitoring dashboards and service health indicators using Splunk or similar tools. Configure, fine-tune, and maintain proactive alerts with high signal-to-noise ratio. Lead incident response, conduct root cause analysis (RCA), and drive long-term corrective...
-
Associate Site Reliability Engineer
1 week ago
Noida, Uttar Pradesh, India TSYS|Total System Services Full timeEvery day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing results....