
System Reliability Expert
4 days ago
This is a key role in ensuring the end-to-end visibility and reliability of SAP ERP, middleware, and business-critical applications.
The successful candidate will be responsible for designing, implementing, and governing a comprehensive monitoring, observability, automation, and job management architecture across SAP (SAP S/4HANA, SAP BTP, SAP EWM, ATTP, GRC, etc.), middleware, and hyper-specialized business systems.
Key Responsibilities:
- Design, implement, and govern a comprehensive monitoring, observability, automation, and job management architecture across SAP (SAP S/4HANA, SAP BTP, SAP EWM, ATTP, GRC, etc.), middleware, and hyper-specialized business systems.
- Execute the automation and observability roadmap in alignment with IT strategy, business SLAs, and customer needs.
- Standardize and scale monitoring patterns using SAP Focused Run and/or strategic enterprise tools such as SAP Cloud ALM and Grafana.
- Define and manage service level indicators (SLIs), service level objectives (SLOs), and error budgets, and establish a reliability engineering culture across SAP operations.
- Lead the continuous improvement of observability maturity through tooling, telemetry coverage, documentation, and team enablement.
- Conduct thorough root cause analysis and introduce operational best practices for proactive incident prevention.
- Integrate AI-driven monitoring, anomaly detection, and predictive analytics for faster incident detection and auto-resolution.
- Build event-driven automation pipelines for common incident scenarios using OCC guided procedures or external orchestration tools.
- Enhance root cause analysis using automated correlation of system metrics, exceptions, and transaction traces.
- Lead the setup of business process monitoring for critical flows (e.g., intercompany supply chain, order-to-cash, procure-to-pay) to ensure performance and SLA visibility.
- Define and operationalize business KPIs with dashboards and alerting tied to user experience and transaction health.
- Actively engage business and technical stakeholders to gather feedback, identify pain points, and co-develop enhancements to observability capabilities.
- Regularly present monitoring performance and roadmap updates to leadership and service teams.
Requirements:
- 8+ years in SAP system architecture or Basis, monitoring automation design, or SRE roles.
- 3+ years of experience with SAP OCC technologies (Focused Run, Cloud ALM, or Solution Manager).
- Understanding of SAP S/4HANA, BTP, and middleware (e.g., MuleSoft).
- Proven track record in designing and scaling observability platforms and automation frameworks.
- Proficiency in integration with IT service management (ITSM) and one or more enterprise-wide observability tools (e.g., ServiceNow, Grafana, Splunk, Dynatrace, Prometheus).
- Formal certification in SAP Solution Manager, SAP Focused Run, or SAP Cloud ALM operations is an added advantage.
- Strong stakeholder management, including customer-facing experience.
- Excellent communication and cross-functional collaboration skills.
- Passion for reliability, automation, and measurable improvement.
-
Site Reliability Engineering Expert
3 weeks ago
Pune, Maharashtra, India Fiserv Full time Site Reliability Engineering Expert (Architect) Exp. Range: 15 to 19 Years Location: Pune Job Description: What does a successful Site Reliability Engineer (SRE) Expert do at Fiserv? The Site Reliability Engineer blends the principles of software engineering with the discipline of operations to create high-performing and reliable software systems....
-
Site Reliability Engineering Expert
2 weeks ago
Pune, Maharashtra, India Fiserv Full time Site Reliability Engineering Expert (Architect) Exp. Range: 9 to 12 Years Location: Pune Job Description: What does a successful Site Reliability Engineer (SRE) Expert do at Fiserv? The Site Reliability Engineer blends the principles of software engineering with the discipline of operations to create high-performing and reliable software systems....
-
NPI and Product Reliability Expert
2 days ago
Pune, Maharashtra, India beBeeNpi Full time Job Title: NPI & Reliability Expert. Join us as a key member of our NPI and reliability team, working closely with cross-functional teams to drive successful product launches.
-
System Operations Expert
3 days ago
Pune, Maharashtra, India beBeeExpert Full time ₹ 8,00,000 - ₹ 12,00,000 Job Title: System Operations Expert Job Description: We are seeking a skilled System Operations Expert to lead our team in delivering high-quality solutions. As a key member of our operations team, you will be responsible for managing and deploying Windows OS Servers, ensuring the smooth operation of our production environment. With expertise in managing virtual...
-
System Reliability Engineer
3 days ago
Pune, Maharashtra, India beBeeSoftwareSupport Full time ₹ 12,00,000 - ₹ 20,00,000 About This Role: We are seeking a highly skilled Software Support Specialist to join our team. In this role, you will play a critical part in ensuring the smooth operation of our business systems. You will work closely with various teams to identify and resolve software-related challenges, analyzing system performance and implementing solutions to enhance...
-
Expert Hydraulic Systems Designer
22 hours ago
Pune, Maharashtra, India beBeeHydraulic Full time ₹ 1,00,00,000 - ₹ 1,50,00,000 Hydraulic Systems Design Expert: We are seeking a skilled mechanical engineer with expertise in designing hydraulic systems to enhance our product offerings. Key Responsibilities: Design and develop high-performance hydraulic systems for optimal efficiency and reliability. Select and analyze hydraulic components and systems, including valves, pumps, actuators,...
-
Reliability Systems Specialist
3 days ago
Pune, Maharashtra, India beBeeDevops Full time ₹ 80,00,000 - ₹ 1,20,00,000 We are seeking a highly skilled Site Reliability Engineer to support our IT operations. The ideal candidate will have extensive experience in application support or production support, with expertise in monitoring tools like Grafana, Prometheus, Splunk, or Dynatrace. Key Responsibilities: Hands-on experience with incident management processes and...
-
Highly Skilled Reliability Engineer
1 day ago
Pune, Maharashtra, India beBeeReliability Full time ₹ 15,00,000 - ₹ 25,00,000 Reliability Expert: We are looking for an experienced Senior Reliability Specialist to ensure the optimal operation of our systems. The successful candidate will be responsible for guaranteeing the reliability and efficiency of our technology infrastructure. Key Responsibilities: Design and implement automated software build and deployment processes to minimize...
-
Senior IT System Reliability Engineer
2 days ago
Pune, Maharashtra, India beBeeReliability Full time Job Description: Primary Responsibility: To apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of systems, platforms, and technology. Purpose of the Role: Ensure high availability, performance, and scalability of systems and services through proactive monitoring,...
-
Expert Systems Engineer
3 days ago
Pune, Maharashtra, India Mindpool Technologies Full time ₹ 9,00,000 - ₹ 12,00,000 per year Allscripts is hiring for Expert Systems Engineer - Linux/Unix, Azure, Kubernetes. Expert Systems Engineer, Pune Responsibilities: Provide production support for the Linux/Unix infrastructure. 24x7 operational support for the Linux/Unix environment. Linux administration, monitoring, and troubleshooting. Linux user management and file permissions. Responsible to create...