Observability Engineer – SRE
1 week ago
Description GSPANN is hiring an Observability Engineer with expertise in Site Reliability Engineering (SRE) The role focuses on leveraging SRE principles, automation, and AI-driven observability to enhance reliability and scalability across cloud and ERP environments.Role and Responsibilities Leverage Application Performance Management (APM) tools such as Dynatrace and Prometheus to monitor, analyze, and enhance system performance. Write and maintain scripts using Python or Java to automate monitoring tasks and streamline alerting mechanisms. Deploy and manage Splunk to handle log analysis, system monitoring, and troubleshooting of production issues. Analyze user behavior and application performance using Real User Monitoring (RUM) tools such as Quantum Metrics to drive user experience improvements. Ensure the reliability and efficiency of Enterprise Resource Planning (ERP) applications through proactive monitoring and support. Incorporate Site Reliability Engineering (SRE) principles to improve system uptime, scalability, and fault tolerance. Respond to incidents swiftly, resolving them to minimize business disruptions and ensure service continuity. Optimize system performance proactively, using data-driven monitoring insights and continuous analysis. Collaborate with development and operations teams to integrate observability tools seamlessly and align monitoring with deployment workflows. Skills and Experience Bachelor’s degree in Computer Science, Information Technology, or a related field. Bring 12-15 years of experience in observability engineering or a similar technical role. Hold certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or equivalent. Have experience working on cloud platforms like Amazon Web Services (AWS) and Microsoft Azure. Understand and apply performance optimization frameworks and related best practices in production environments. Demonstrate proficiency with APM tools (e.g., Dynatrace, Prometheus), scripting languages (Python, Java), and Splunk. Possess hands-on experience with RUM tools like Quantum Metrics and the monitoring of ERP applications. Show a strong grasp of SRE principles and practices applied in real-world systems. Exhibit excellent problem-solving abilities and communication skills. Adapt easily to fast-paced and dynamic environments.
-
Cloud Infrastructure Engineer
2 weeks ago
Gurugram, India Success Pact Consulting Pvt Ltd Full timeDescription : Position : Infrastructure EngineerExperience : 7+ Years (Principal or Staff Level)Job Type : Full-timeJob Summary : We are seeking a highly experienced Infrastructure Engineer at the Principal or Staff level, with 7+ years of specialized experience in cloud infrastructure, DevOps, or Site Reliability Engineering (SRE). This critical role...
-
SRE Lead
5 hours ago
Gurugram, Hyderabad, India GSPANN Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole & responsibilitiesBuilding unified dashboardsDashboards to track overall health of the production support (Derived from SNOW)Dashboards to track overall health of the applications (Using Observability tools like AppDynamics, DynatraceIdentify automation opportunities and implement the sameSelf healing using scripting languages of Java, PythonIntegration...
-
Devops/SRE Engineer
7 days ago
Bengaluru, Gurugram, Hyderabad, India Xebia It Architects Full time ₹ 8,00,000 - ₹ 24,00,000 per yearAs a Cloud Site Reliability Engineer at our company, you will play a critical role in ensuring the robustness, performance, and security of our cloud-based systems. Your focus will be on maintaining and improving our cloud infrastructure with a special emphasis on cloud security and observability. You will work closely with development teams to architect,...
-
Senior Technical Consultant, Observability
1 week ago
Gurugram, India AHEAD Full timeWe are looking for talented, creative, and proactive individuals who are passionate about solving complex business problems and contributing to the next generation of modern applications. Our goal is to help our customers understand the connections between application performance, user experience, and business outcomes, thereby creating exceptional customer...
-
Observability Engineer
1 week ago
Gurugram, India AHEAD Full timeAs an Observability Engineer, you will utilize your extensive Information Technology knowledge and experience to support/streamline AHEAD’s Managed Services platforms and services. You will work with a collaborative team ensuring development efforts are well documented and delivered with quality along with maintaining the tooling architecture at the...
-
SRE Operations Engineer
4 days ago
Gurugram, India Apps Associates Full timeSkills Senior Operations Engineer I (SRE) Total years of experience – Min to Cloud Platforms: AWS (Extensive Hands-on experience) Good understanding of key services like CloudFormation, KMS, S, EC, CloudWatch, IAM, Code Commit Secrets management service from AWS and ability to understand other secrets management systems Ability to analyze logs and...
-
Coralogix- Cloud and Observability Engineer
1 week ago
Gurugram, India Nexthire Full timeDesignation/Role : Cloud and Observability Engineer Role : Cloud and Observability Engineer Experience : 3-6 Years+ Location : Gurugram About the Job Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. Our unique...
-
Lead Systems Engineer
1 week ago
Gurugram, India Epam Full timeDescription Join our organization as a Lead Systems Engineer (DevOps & SRE) and play a crucial role in ensuring the reliability, scalability, capacity planning, and performance of our infrastructure and applications. The ideal candidate will have a strong background in software engineering, system administration, containerization, and cloud technologies, and...
-
BaryTech - Java Production Support Engineer
3 weeks ago
Gurugram, India Barycenter Technologies Pvt.Ltd. Full timeAbout the Role : We are seeking an experienced Java Production Support Engineer with strong expertise in IT operations, system reliability, and application support. The ideal candidate will have hands-on experience in Java-based systems, site reliability engineering (SRE) practices, and cloud-native environments. This role requires a proactive approach to...
-
SRE AI Ops Engineer
4 days ago
Gurugram, Hyderabad, India GSPANN Full time ₹ 5,00,000 - ₹ 12,00,000 per yearLooking for SRE AI Ops Engineer who can join immediatelyWork Mode : 5 days from officeShift Timing : 12:30PM to 9:30PM - Cab facility will be providedLocation : Hyderabad OR GurugramJob DescriptionConcentrate of AI Ops, auto identification of issuesAnalysis of issues using AI Ops toolsSelf healing of issues using either AI Ops tools or scripting...