Observability Engineer – SRE

6 days ago


Gurugram, India GSPANN Full time
Description GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE) The role focuses on leveraging SRE principles, automation, and AI-driven observability to enhance reliability and scalability across cloud and ERP environments.

Role and Responsibilities

Leverage Application Performance Management (APM) tools such as Dynatrace and Prometheus to monitor, analyze, and enhance system performance. Write and maintain scripts using Python or Java to automate monitoring tasks and streamline alerting mechanisms. Deploy and manage Splunk to handle log analysis, system monitoring, and troubleshooting of production issues. Analyze user behavior and application performance using Real User Monitoring (RUM) tools such as Quantum Metrics to drive user experience improvements. Ensure the reliability and efficiency of Enterprise Resource Planning (ERP) applications through proactive monitoring and support. Incorporate Site Reliability Engineering (SRE) principles to improve system uptime, scalability, and fault tolerance. Respond to incidents swiftly, resolving them to minimize business disruptions and ensure service continuity. Optimize system performance proactively, using data-driven monitoring insights and continuous analysis. Collaborate with development and operations teams to integrate observability tools seamlessly and align monitoring with deployment workflows.

Skills and Experience

Bachelor’s degree in Computer Science, Information Technology, or a related field. Bring 12-15 years of experience in observability engineering or a similar technical role. Hold certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent. Have experience working on cloud platforms like Amazon Web Services (AWS) and Microsoft Azure. Understand and apply performance optimization frameworks and related best practices in production environments. Demonstrate proficiency with APM tools (e.g., Dynatrace, Prometheus), scripting languages (Python, Java), and Splunk. Possess hands-on experience with RUM tools like Quantum Metrics and the monitoring of ERP applications. Show a strong grasp of SRE principles and practices applied in real-world systems. Exhibit excellent problem-solving abilities and communication skills. Adapt easily to fast-paced and dynamic environments.

  • Gurugram, India GSPANN Full time

    Description GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE) The role focuses on leveraging SRE principles, automation, and AI-driven observability to enhance reliability and scalability across cloud and ERP environments.Role and Responsibilities Leverage Application Performance Management (APM) tools such as...


  • Gurugram, India GSPANN Full time

    Description GSPANN is hiring an experienced Observability Engineer (AI Ops) with 12-15 years of expertise in monitoring, automation, and AI-driven operations. The role involves enhancing system reliability and performance through APM tools, cloud observability, scripting, and Site Reliability Engineering (SRE) practices.Role and Responsibilities Use...


  • Gurugram, India GSPANN Full time

    Description GSPANN is hiring an experienced Observability Engineer (AI Ops) with 12-15 years of expertise in monitoring, automation, and AI-driven operations. The role involves enhancing system reliability and performance through APM tools, cloud observability, scripting, and Site Reliability Engineering (SRE) practices.Role and Responsibilities Use...


  • Gurugram, India Success Pact Consulting Pvt Ltd Full time

    Description : Position : Infrastructure EngineerExperience : 7+ Years (Principal or Staff Level)Job Type : Full-timeJob Summary : We are seeking a highly experienced Infrastructure Engineer at the Principal or Staff level, with 7+ years of specialized experience in cloud infrastructure, DevOps, or Site Reliability Engineering (SRE). This critical role...


  • Gurugram, Haryana, India Wipro Limited Full time

    Gurugram, India; Noida, India - DOP SLH - 3045461 **Job Description**: **Responsibilities**: - System Reliability- Lead efforts to enhance the reliability, availability, and performance of critical systems - Perform in-depth analysis of system behavior, identifying areas for improvement and implementing solutions Automation Frameworks- Design, implement,...


  • Gurugram, Gurugram, India Impronics Technologies Full time

    Job Description We are seeking a seasoned Site Reliability Engineer (SRE) with a solid background in payment systems and high-availability architectures. The ideal candidate will have hands-on experience managing large-scale, distributed systems in production, with a deep understanding of reliability, scalability, and performance tuning in the financial...


  • Gurugram, India Ahead Full time

    AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. AtAHEAD, we prioritize creating a culture of belonging,where all perspectives and voices are represented, valued, respected, and heard. We...


  • Gurugram, India GSPANN Full time

    Hiring for SRE - Exp- 6+ Years Notice Period - Immediate - 15 days About the Role We are seeking a skilled and passionate Observability Engineer (SRE) to join our team and drive reliability, performance, and visibility across our infrastructure and applications. You will play a key role in designing and implementing observability solutions, improving system...


  • Gurugram, India AHEAD Full time

    AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation.At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We...


  • Gurugram, India AHEAD Full time

    AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation.At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We...