Observability Engineer – SRE

1 week ago


Gurugram, India GSPANN Full time

Description

GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE). The role focuses on leveraging SRE principles, automation, and AI-driven observability to enhance reliability and scalability across cloud and ERP environments.

Role and Responsibilities

  • Leverage Application Performance Management (APM) tools such as Dynatrace and Prometheus to monitor, analyze, and enhance system performance.
  • Write and maintain scripts using Python or Java to automate monitoring tasks and streamline alerting mechanisms.
  • Deploy and manage Splunk to handle log analysis, system monitoring, and troubleshooting of production issues.
  • Analyze user behavior and application performance using Real User Monitoring (RUM) tools such as Quantum Metrics to drive user experience improvements.
  • Ensure the reliability and efficiency of Enterprise Resource Planning (ERP) applications through proactive monitoring and support.
  • Incorporate Site Reliability Engineering (SRE) principles to improve system uptime, scalability, and fault tolerance.
  • Respond to incidents swiftly, resolving them to minimize business disruptions and ensure service continuity.
  • Optimize system performance proactively, using data-driven monitoring insights and continuous analysis.
  • Collaborate with development and operations teams to integrate observability tools seamlessly and align monitoring with deployment workflows.

Skills and Experience

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Bring 12-15 years of experience in observability engineering or a similar technical role.
  • Hold certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent.
  • Have experience working on cloud platforms like Amazon Web Services (AWS) and Microsoft Azure.
  • Understand and apply performance optimization frameworks and related best practices in production environments.
  • Demonstrate proficiency with APM tools (e.g., Dynatrace, Prometheus), scripting languages (Python, Java), and Splunk.
  • Possess hands-on experience with RUM tools like Quantum Metrics and the monitoring of ERP applications.
  • Show a strong grasp of SRE principles and practices applied in real-world systems.
  • Exhibit excellent problem-solving abilities and communication skills.
  • Adapt easily to fast-paced and dynamic environments.



  • Gurugram, India Success Pact Consulting Pvt Ltd Full time

    Description: Position: Infrastructure Engineer. Experience: 7+ Years (Principal or Staff Level). Job Type: Full-time. Job Summary: We are seeking a highly experienced Infrastructure Engineer at the Principal or Staff level, with 7+ years of specialized experience in cloud infrastructure, DevOps, or Site Reliability Engineering (SRE). This critical role...

  • SRE Lead

    5 hours ago


    Gurugram, Hyderabad, India GSPANN Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role & responsibilities: Building unified dashboards. Dashboards to track overall health of the production support (derived from SNOW). Dashboards to track overall health of the applications (using observability tools like AppDynamics, Dynatrace). Identify automation opportunities and implement them. Self-healing using scripting languages such as Java and Python. Integration...

  • Devops/SRE Engineer

    7 days ago


    Bengaluru, Gurugram, Hyderabad, India Xebia It Architects Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    As a Cloud Site Reliability Engineer at our company, you will play a critical role in ensuring the robustness, performance, and security of our cloud-based systems. Your focus will be on maintaining and improving our cloud infrastructure with a special emphasis on cloud security and observability. You will work closely with development teams to architect,...


  • Gurugram, India AHEAD Full time

    We are looking for talented, creative, and proactive individuals who are passionate about solving complex business problems and contributing to the next generation of modern applications. Our goal is to help our customers understand the connections between application performance, user experience, and business outcomes, thereby creating exceptional customer...


  • Gurugram, India AHEAD Full time

    As an Observability Engineer, you will utilize your extensive Information Technology knowledge and experience to support/streamline AHEAD’s Managed Services platforms and services. You will work with a collaborative team ensuring development efforts are well documented and delivered with quality along with maintaining the tooling architecture at the...


  • Gurugram, India Apps Associates Full time

    Skills Senior Operations Engineer I (SRE) Total years of experience – Min to Cloud Platforms: AWS (Extensive Hands-on experience). Good understanding of key services like CloudFormation, KMS, S3, EC2, CloudWatch, IAM, CodeCommit. Secrets management service from AWS and ability to understand other secrets management systems. Ability to analyze logs and...


  • Gurugram, India Nexthire Full time

    Designation/Role: Cloud and Observability Engineer. Experience: 3-6 Years+. Location: Gurugram. About the Job: Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique...


  • Gurugram, India Epam Full time

    Description Join our organization as a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications. The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies, and...


  • Gurugram, India Barycenter Technologies Pvt.Ltd. Full time

    About the Role : We are seeking an experienced Java Production Support Engineer with strong expertise in IT operations, system reliability, and application support. The ideal candidate will have hands-on experience in Java-based systems, site reliability engineering (SRE) practices, and cloud-native environments. This role requires a proactive approach to...

  • SRE AI Ops Engineer

    4 days ago


    Gurugram, Hyderabad, India GSPANN Full time ₹ 5,00,000 - ₹ 12,00,000 per year

    Looking for an SRE AI Ops Engineer who can join immediately. Work Mode: 5 days from office. Shift Timing: 12:30 PM to 9:30 PM (cab facility will be provided). Location: Hyderabad or Gurugram. Job Description: Concentrate on AI Ops and auto identification of issues. Analysis of issues using AI Ops tools. Self-healing of issues using either AI Ops tools or scripting...