System Reliability Specialist
4 weeks ago
Job Title: System Reliability Specialist
Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services from ideation to implementation. These services apply across a variety of industries, including banking & financial services, insurance, retail, higher education, food, health care, and manufacturing.
The Role
- Provide L2 support to production systems such as applications, databases, middleware components, infrastructure, and network components.
- Manage production incidents end-to-end within defined SLAs, with a focus on resolution rather than on assigning blame.
- Interact with various stakeholders such as release managers, program leads, service managers, development, and test leads.
- Review operational readiness requirements such as monitoring and alerting, log rotation, and resilience of the components and report the gaps.
- Provide pre-implementation support with activities such as release notes review and implementation dry runs.
- Protect production components by running health checks and monitoring latency and memory utilization.
- Automate day-to-day activities and propose changes that improve reliability.
- Participate in CAB and provide feedback on change requests.
- Support the DevOps team in testing the promote pipelines and suggest automation of configuration items.
- Follow incident management best practices and perform root cause analysis (RCA).
- Participate in disaster recovery tests and operational acceptance tests.
- Analyze the technology stack that makes up the product and optimize the recovery time objective (RTO).
- Work with team members spread across time zones.
- Share knowledge, document improvements, and mentor junior team members.
Responsibility Matrix
- Deployments MTF/Prod
- Maintenance items (including stop/start, Disaster Recovery-related activities, etc.)
- Monitoring
- Support TRTs
- Incident creation
- CR for changes in MTF/Prod
Tools
- Log monitoring tool - Splunk
- Application monitoring tool - Dynatrace
- Reporting tool - Domo, which provides direct, simplified, real-time access to business data for decision makers across the company with minimal IT involvement
- Ticketing and incident/problem management tool - Remedy
- DevOps basics - CI/CD basics, overview of Git, Bitbucket, SonarQube, Fortify, CI (Jenkins), ARA, SaltStack, Chef, Artifactory, MC DevOps toolchain
-
System Reliability Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
About the Role: Fulcrum Digital seeks a skilled System Reliability Specialist to join our team. The ideal candidate will have experience with Linux, Shell Scripting, and ITIL/ITSM. Key Responsibilities: Plan, manage, and oversee all aspects of a Production Environment. Define strategies for Application Performance Monitoring and Optimization. Respond to Incidents...
-
Reliability Systems Specialist
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Job Role: Fulcrum Digital, an agile and next-generation digital accelerating company, is seeking a Reliability Systems Specialist to provide L2 support to production systems. Key Responsibilities: Manage production incidents end-to-end within defined SLAs. Interact with stakeholders such as release managers, program leads, and service managers. Review operational...
-
System Reliability Specialist
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Job Overview: Fulcrum Digital is seeking a skilled System Reliability Specialist to join our team. As a key member of our operations team, you will be responsible for ensuring the smooth operation of our Java-based applications. This includes planning, managing, and overseeing the production environment, as well as defining strategies for application...
-
System Reliability Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Job Overview: Fulcrum Digital seeks a System Reliability Specialist to oversee all aspects of our Production Environment. As a key member of our team, you will define strategies for Application Performance Monitoring and Optimization, ensuring timely and accurate batch production scheduling and processes. Responsibilities: Plan, manage, and oversee all...
-
System Reliability Automation Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Fulcrum Digital is a digital transformation company that accelerates business growth and innovation. As a System Reliability Automation Specialist, you will be responsible for planning, managing, and overseeing all aspects of a Production Environment. Your strategies for Application Performance Monitoring Optimization will ensure the platform's reliability...
-
System Reliability Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Job Title: System Reliability Specialist. About the Role: Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher...
-
Pune, Maharashtra, India Fulcrum Digital Full time
Job Title: System Reliability Engineer Automation Specialist. About the Role: At Fulcrum Digital, we are seeking a highly skilled System Reliability Engineer Automation Specialist to join our team. As a key member of our digital transformation and technology services team, you will play a crucial role in planning, managing, and overseeing all aspects of our...
-
Digital Systems Reliability Specialist
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Fulcrum Digital, a leading digital transformation and technology services company, is seeking a talented Digital Systems Reliability Specialist to join their team. In this role, the successful candidate will be responsible for planning, managing, and overseeing all aspects of a Production Environment. Key Responsibilities: Plan and manage Production...
-
Reliable Infrastructure Specialist
2 days ago
Pune, Maharashtra, India Camlin Group Full time
At Camlin Group, we're seeking a highly skilled Reliable Infrastructure Specialist to join our dynamic team. This role requires a unique blend of software engineering and operations expertise to build and maintain large-scale, distributed, fault-tolerant systems. The successful candidate will be responsible for ensuring the reliability, performance, and...
-
Reliability Systems Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
System Reliability Engineer Role: Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services. The Role Responsibilities: Provide L2 support to production systems including applications, databases, middleware components, infrastructure, and network components. Manage production incidents...
-
System Reliability Specialist
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
About the Role: We are seeking an experienced Cloud Infrastructure Engineer to join our team at Fulcrum Digital. As a key member of our infrastructure team, you will be responsible for planning, managing, and overseeing all aspects of our production environment. Key Responsibilities: Define strategies for application performance monitoring and optimization in our...
-
Nutanix Systems Reliability Specialist
2 weeks ago
Pune, Maharashtra, India Nutanix Full time
The Opportunity: Are you a top-tier Systems Reliability Engineer with expertise in networking, virtualization, OS, Cloud, DevOps, and Storage domains? We seek a dedicated professional to join our team of leading technology specialists and contribute to cutting-edge projects that impact customer satisfaction and business success. About the Team: Meet Ikram Khan,...
-
Reliability Engineering Specialist
2 weeks ago
Pune, Maharashtra, India F337 Deutsche India Private Limited, Pune Branch Full time
Job Overview: We are seeking a highly skilled Reliability Engineering Specialist to join our team at F337 Deutsche India Private Limited, Pune Branch. As a key member of our infrastructure operations team, you will play a pivotal role in ensuring the reliability, scalability, and performance of our systems. As an SRE at Deutsche Bank, you will collaborate...
-
Senior Cloud Reliability Specialist
2 weeks ago
Pune, Maharashtra, India U-SET Full time
Job Title: Senior Cloud Reliability Specialist. Description: We are seeking a highly skilled Senior Cloud Reliability Specialist to join our team at U-SET. As a key member of our operations team, you will be responsible for ensuring the reliability and efficiency of our cloud-based applications. The ideal candidate will have a deep understanding of SRE...
-
Payment Systems Specialist
2 weeks ago
Pune, Maharashtra, India Coforge Full time
Job Description: We are seeking a skilled Payment Systems Specialist to join our team at Coforge. As a Payment Systems Specialist, you will be responsible for testing and ensuring the reliability and security of our payment systems. Responsibilities: Develop and execute test cases for payment systems, focusing on GPP Classic and ISO. Collaborate with development...
-
Reliability Engineering Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
About the Role: Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services. As a Reliability Engineering Specialist, you will oversee all aspects of a Production Environment, defining strategies for Application Performance Monitoring, Optimization in Prod environment, and responding to...
-
System Reliability Engineering Specialist
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full time
Fulfilling the Next Era of Digital Transformation. About the Role: We are seeking a skilled Sr System Reliability Engineer to join our agile team at Fulcrum Digital. As a key member of our digital accelerating company, you will be responsible for defining strategies for Application Performance Monitoring, Optimization in Prod environment. Your Key...
-
Payment Systems Specialist
2 weeks ago
Pune, Maharashtra, India Coforge Full time
Job Description: We are seeking an experienced Payment Systems Specialist to join our team. The ideal candidate will have a strong background in payment processing and be responsible for testing and ensuring the reliability and security of our payment systems. Key responsibilities include: Develop and execute test cases for payment systems,...
-
Cloud Infrastructure Reliability Specialist
4 weeks ago
Pune, Maharashtra, India Arista Networks Full time
Job Title: Cloud Infrastructure Reliability Specialist. Job Description: At Arista Networks, we are seeking a highly skilled Cloud Infrastructure Reliability Specialist to join our team. As a key member of our engineering team, you will be responsible for ensuring the scalability, performance, and resilience of our suite of products. Responsibilities: Develop and...
-
Reliability Systems Architect
2 weeks ago
Pune, Maharashtra, India Hansen Technologies Full time
About The Role: In the esteemed role of Site Reliability Engineer at Hansen Technologies, you will be at the forefront of ensuring the reliability, performance, and scalability of our systems. As a seasoned professional, you will possess an exceptional blend of technical expertise and creative problem-solving skills, with a passion for automating tasks...