Observability and SRE Architect

4 weeks ago


Noida, India NTT Full time
JOB DESCRIPTION

Req ID:    

We are currently seeking a Observability and SRE Architect to join our team in Noida, Uttar Pradesh (IN-UP), India (IN).

Job Description: Technical Architect – Observability & SRE Frameworks

Position Title: Technical Architect – Observability & Site Reliability Engineering (SRE)

Location: Noida, India

Experience: 15+ years (including 5+ years in observability/SRE architecture)

Employment Type: Full-time

Role Overview

We are looking for a highly experienced Technical Architect to lead the design, strategy, and implementation of Observability and SRE frameworks for enterprise-scale, microservices-based applications. The ideal candidate will bring deep technical knowledge of both Splunk Observability Stack and Open Source tools (like OpenTelemetry, Prometheus, Grafana, Jaeger), and be capable of defining and executing architecture strategies for complex distributed systems.

This role requires hands-on ability to create architecture blueprints , lead technical teams, and work directly with stakeholders and platform owners to embed observability and reliability practices across the SDLC.

Key Responsibilities

Architecture & Blueprinting Design and deliver end-to-end observability architecture (Metrics, Logs, Traces, Events) for cloud-native and hybrid environments. Create technical architecture diagrams , data flow maps, and integration blueprints using tools like Lucidchart, Draw.io, or Visio. Lead the definition of SLIs, SLOs, and Error Budgets aligned with business KPIs and DORA metrics. Toolchain Strategy & Implementation Architect telemetry pipelines using OpenTelemetry Collector and Splunk Observability Cloud (SignalFx, APM, RUM, Log Observer). Define tool adoption strategy and integration roadmap for OSS tools (Prometheus, Loki, Grafana, Jaeger) and Splunk-based stacks. Guide teams on instrumentation approaches (auto/manual) across languages like Java, Go, Python, .NET, etc. Reliability Engineering Enablement Lead adoption of SRE principles including incident management frameworks, resiliency testing, and runbook automation. Collaborate with DevOps to integrate observability into CI/CD pipelines (e.g., Jenkins, ArgoCD, GitHub Actions). Define health checks, golden signals, and SPoG (Single Pane of Glass) dashboards. Exposure to AIOps , ML-based anomaly detection, or business observability. Stakeholder Management & Governance Serve as a technical liaison between client leadership, SREs, developers, and infrastructure teams. Run workshops, assessments, and evangelize observability-first culture across teams. Provide guidance on data retention, access control, cost optimization, and compliance (especially with Splunk ingestion policies). Performance & Optimization Continuously monitor and fine-tune observability data flows to prevent alert fatigue and ensure actionability. Implement root cause analysis practices using telemetry correlation across metrics, logs, and traces. Lead efforts to build self-healing systems using automated playbooks and AIOps integrations (where applicable).

Required Skills & Qualifications

15+ years in IT, with 5 years in Observability/SRE architecture roles Proven experience designing architecture for microservices, containers (Docker, Kubernetes), and distributed systems Strong hands-on expertise with: Splunk Observability Cloud (SignalFx, Log Observer, APM) OpenTelemetry (SDKs + Collector) Prometheus + Grafana Jaeger / Zipkin for distributed tracing CI/CD tools : Jenkins, GitHub Actions, ArgoCD Ability to build and present clear architecture diagrams and solution roadmaps Working knowledge of cloud environments (AWS, Azure, GCP) and container orchestration (K8s/OpenShift) Familiarity with SRE and DevOps best practices (error budgets, release engineering, chaos testing)

Nice to Have

Splunk certifications: Core Consultant, Observability Specialist, Admin Knowledge of ITIL and modern incident management frameworks (PagerDuty, OpsGenie) Experience in banking or regulated enterprise environments

Soft Skills

Strong leadership and cross-functional collaboration Ability to work in ambiguous, fast-paced environments Excellent documentation and communication skills Passion for mentoring teams and building best practices at scale

Why This Role Matters

The client is on a journey to mature its Observability and SRE ecosystem , and this role will be critical in:

Unifying legacy and modern telemetry stacks Driving reliability-first mindset and tooling Establishing a scalable blueprint for production excellence

About NTT DATA

NTT DATA is a $30 billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long term success. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure and connectivity. We are one of the leading providers of digital and AI infrastructure in the world. NTT DATA is a part of NTT Group, which invests over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. Visit us at 



  • Noida, India NTT Full time

    JOB DESCRIPTION Req ID:     We are currently seeking a Observability and SRE Architect to join our team in Noida, Uttar Pradesh (IN-UP), India (IN). Job Description: Technical Architect – Observability & SRE Frameworks Position Title: Technical Architect – Observability & Site Reliability Engineering (SRE) Location: Noida, India Experience: 15+...


  • Noida, India NTT Data Full time

    Job Description NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Observability and SRE Architect to join our team in Noida, Uttar Pradesh (IN-UP), India (IN). Job Description:...


  • Noida, Uttar Pradesh, India NTT DATA North America Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Req ID:340251NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a Observability and SRE Architect to join our team in Noida, Uttar Pradesh (IN-UP), India (IN).Job Description: Technical...

  • SRE Architect

    3 weeks ago


    Noida, India Coforge Full time

    Job Description Role: SRE Architect Experience: 15- 22 years Location: Greater Noida/Pune/Hyderabad Core Skills: SRE, Observability, Cloud, Pre-sales Work Mode: WFO We at Coforge are looking for SRE Architects with following skill set. We are looking for a highly skilled and client-facing SRE Architect to join our dynamic team. This role is pivotal in...

  • sre

    2 weeks ago


    Gurugram, Hyderabad, Noida, India Zensar Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Short Description for Internal CandidatesBachelors degree in Computer Science, IT, or equivalent. - 3–6 years in SRE, Observability, Application Monitoring, or Performance Engineering roles. - Hands-on exposure to Glassbox and Sumo Logic strongly preferred.*Description for CandidatesWe are seeking a Site Reliability Engineer (SRE) with a strong focus on...

  • Computer Scientist

    4 weeks ago


    Noida, India Adobe Full time

    Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies...

  • Computer Scientist

    4 weeks ago


    Noida, India Adobe Full time

    Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies...

  • Computer Scientist

    2 weeks ago


    Noida, Uttar Pradesh, India Adobe Full time

    Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies...

  • Architect

    1 week ago


    Bengaluru, Noida, India Emids Technologies Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    The Solution Architect will be responsible for defining the overall architecture and ensuring seamless integration of the C&P platform with the new department.This includes designing scalable solutions using Java Spring Boot, Kafka, APIs, and Databricks, optimizing system performance, ensuring security and compliance, and addressing technical challenges.The...

  • Devops Architect

    2 weeks ago


    Greater Noida, Uttar Pradesh, India crescendo global Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job Description DevOps Architect Position: DevOps ArchitectExperience: 15 yearsLocation: NoidaDiscipline: Technical ArchitectJob Type: Permanent Contact Name: Suleka K.Contact Email: Job Reference: 78202Published: About 12 hours agoSummary An exciting opportunity for senior DevOps leaders to design and implement enterprise-grade DevOps ecosystems for a...