
Observability Engineer – SRE
6 days ago
Role and Responsibilities
Leverage Application Performance Management (APM) tools such as Dynatrace and Prometheus to monitor, analyze, and enhance system performance.
Write and maintain scripts using Python or Java to automate monitoring tasks and streamline alerting mechanisms.
Deploy and manage Splunk to handle log analysis, system monitoring, and troubleshooting of production issues.
Analyze user behavior and application performance using Real User Monitoring (RUM) tools such as Quantum Metrics to drive user experience improvements.
Ensure the reliability and efficiency of Enterprise Resource Planning (ERP) applications through proactive monitoring and support.
Incorporate Site Reliability Engineering (SRE) principles to improve system uptime, scalability, and fault tolerance.
Respond to incidents swiftly, resolving them to minimize business disruptions and ensure service continuity.
Optimize system performance proactively, using data-driven monitoring insights and continuous analysis.
Collaborate with development and operations teams to integrate observability tools seamlessly and align monitoring with deployment workflows.
Skills and Experience
Bachelor’s degree in Computer Science, Information Technology, or a related field.
Bring 12-15 years of experience in observability engineering or a similar technical role.
Hold certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent.
Have experience working on cloud platforms like Amazon Web Services (AWS) and Microsoft Azure.
Understand and apply performance optimization frameworks and related best practices in production environments.
Demonstrate proficiency with APM tools (e.g., Dynatrace, Prometheus), scripting languages (Python, Java), and Splunk.
Possess hands-on experience with RUM tools like Quantum Metrics and the monitoring of ERP applications.
Show a strong grasp of SRE principles and practices applied in real-world systems.
Exhibit excellent problem-solving abilities and communication skills.
Adapt easily to fast-paced and dynamic environments.
-
Observability Engineer – SRE
6 days ago
Gurugram, India GSPANN Full time
Description
GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE). The role focuses on leveraging SRE principles, automation, and AI-driven observability to enhance reliability and scalability across cloud and ERP environments.
Role and Responsibilities
Leverage Application Performance Management (APM) tools such as...
-
Observability Engineer
6 days ago
Gurugram, India GSPANN Full time
Description
GSPANN is hiring an experienced Observability Engineer (AI Ops) with 12-15 years of expertise in monitoring, automation, and AI-driven operations. The role involves enhancing system reliability and performance through APM tools, cloud observability, scripting, and Site Reliability Engineering (SRE) practices.
Role and Responsibilities
Use...
-
Cloud Infrastructure Engineer
2 days ago
Gurugram, India Success Pact Consulting Pvt Ltd Full time
Description
Position: Infrastructure Engineer
Experience: 7+ Years (Principal or Staff Level)
Job Type: Full-time
Job Summary: We are seeking a highly experienced Infrastructure Engineer at the Principal or Staff level, with 7+ years of specialized experience in cloud infrastructure, DevOps, or Site Reliability Engineering (SRE). This critical role...
-
SRE Production Architect
3 days ago
Gurugram, Haryana, India Wipro Limited Full time
Gurugram, India; Noida, India - DOP SLH - 3045461
**Job Description**:
**Responsibilities**:
System Reliability
- Lead efforts to enhance the reliability, availability, and performance of critical systems
- Perform in-depth analysis of system behavior, identifying areas for improvement and implementing solutions
Automation Frameworks
- Design, implement,...
-
Site Reliability Engineer
1 hour ago
Gurugram, Gurugram, India Impronics Technologies Full time
Job Description
We are seeking a seasoned Site Reliability Engineer (SRE) with a solid background in payment systems and high-availability architectures. The ideal candidate will have hands-on experience managing large-scale, distributed systems in production, with a deep understanding of reliability, scalability, and performance tuning in the financial...
-
Observability Engineer
3 weeks ago
Gurugram, India Ahead Full time
AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India GSPANN Full time
Hiring for SRE
Experience: 6+ Years
Notice Period: Immediate - 15 days
About the Role
We are seeking a skilled and passionate Observability Engineer (SRE) to join our team and drive reliability, performance, and visibility across our infrastructure and applications. You will play a key role in designing and implementing observability solutions, improving system...
-
Observability Engineer
6 days ago
Gurugram, India AHEAD Full time
AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We...