Sr System Reliability Engineer
2 weeks ago
Location : Pune/Hybrid
Permanent Role
Imp Skills : Linux, SQL, Any Monitoring tool (Splunk/Dynatrace), Java App Troubleshooting, Basic Knowledge of MQ, ITIL/ITSM.
Job Description:
Plan, manage, and oversee all aspects of a Production Environment
Define strategies for Application Performance Monitoring, Optimization in Prod environment
Respond to Incidents and improvise platform based on feedback and measure the reduction of incidents over time.
Support deployment of code into multiple lower environments. Supporting current processes with an emphasis on automating everything as soon as possible.
Design, develop and standardize Monitoring and Alerting mechanism for the supported applications.
Take a holistic approach to problem solving, by connecting the dots during a production event through the various technology stack that makes up the platform, to optimize meantime to recover.
Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement.
Analyse ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns.
Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead in DevOps automation and best practices.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Scale systems sustainably through mechanisms like automation and evolving systems by pushing for changes that improve reliability and velocity.
Work with a global team spread across tech hubs in multiple geographies and time zones.
Ability to share knowledge and explain processes and procedures to others.
Share knowledge and mentor junior resources
Able to perform on-call duties on a rotational basis.
Occasional off hours work required.
Requirements
Must Have
Linux
Shell Scripting - good to have
ITIL / ITSM
SQL
Java Application Troubleshooting
Experience in REST and WEB API Support
Experience in Cloud based apps Support
Any Monitoring tool (Splunk/Dynatrace)
Knowledge on messaging platforms
Event-Driven Architectures
MQ or NATS broker or similar messaging solutions.
Kafka
Good To Have –
Jenkins - CI/CD - Basic / Good to have
Groovy Scripting/Yaml - Good to have
Git basic/bit bucket - Good to have
Ansible/Chef - Good to have
Even Framework architecture
-
Sr System Reliability Engineer
1 week ago
Pune, India Fulcrum Digital Inc Full timeSr System Reliability Engineer (Application Support with MQ knowledge)Location : Pune/HybridPermanent RoleImp Skills : Linux, SQL, Any Monitoring tool (Splunk/Dynatrace), Java App Troubleshooting, Basic Knowledge of MQ, ITIL/ITSM.Job Description:Plan, manage, and oversee all aspects of a Production EnvironmentDefine strategies for Application Performance...
-
Sr System Reliability Engineer
2 weeks ago
Pune, India Fulcrum Digital Inc Full timeSr System Reliability Engineer (Application Support with MQ knowledge) Location : Pune/Hybrid Permanent Role Imp Skills : Linux, SQL, Any Monitoring tool (Splunk/Dynatrace), Java App Troubleshooting, Basic Knowledge of MQ, ITIL/ITSM. Job Description: Plan, manage, and oversee all aspects of a Production Environment Define strategies for Application...
-
Sr System Reliability Engineer
2 weeks ago
Pune, India Fulcrum Digital Inc Full timeSr System Reliability Engineer (Application Support with MQ knowledge)Location : Pune/HybridPermanent RoleImp Skills : Linux, SQL, Any Monitoring tool (Splunk/Dynatrace), Java App Troubleshooting, Basic Knowledge of MQ, ITIL/ITSM.Job Description:Plan, manage, and oversee all aspects of a Production EnvironmentDefine strategies for Application Performance...
-
System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Reveille Technologies Full timeSenior System Reliability Engineer (Application Support + Automation). The Senior System Reliability Engineer will be responsible for day-to-day tasks including reliability engineering, analytical skills, failure analysis, reliability centered maintenance, and troubleshooting. The role will also involve developing and implementing automation solutions to...
-
Pune, India Fulcrum Digital Inc Full timeSr System Reliability Engineer (Application Support with MQ knowledge)Location : Pune/HybridPermanent RoleImp Skills : Linux, SQL, Any Monitoring tool (Splunk/Dynatrace), Java App Troubleshooting, Basic Knowledge of MQ, ITIL/ITSM.Job Description:- Plan, manage, and oversee all aspects of a Production Environment- Define strategies for Application Performance...
-
Systems Reliability Engineer II
4 weeks ago
Pune, India Nutanix Full timeThe Opportunity Are you a top-tier Systems Reliability Engineer with a passion for customer success and expertise in networking, virtualization, OS, Cloud, DevOps, and Storage domains? If so, you would thrive in our team of leading technology specialists and have the opportunity to work on cutting-edge projects that directly impact customer satisfaction...
-
Systems Reliability Engineer II
2 weeks ago
Pune, Maharashtra, India Nutanix Full timeThe Opportunity Are you a top-tier Systems Reliability Engineer with a passion for customer success and expertise in networking, virtualization, OS, Cloud, DevOps, and Storage domains? If so, you would thrive in our team of leading technology specialists and have the opportunity to work on cutting-edge projects that directly impact customer satisfaction...
-
Engineer - Reliability
4 weeks ago
Pune, India Seagate Full timeThe Reliability Engineering Team (part of the Product Assurance & Customer Advocacy organization) is accountable and responsible for reliability prediction, modeling, Design for Reliability (DFR), and Design Quality Assurance (DQA) of NPI products in Seagate’s Systems, SSD, and HDD portfolios. Each team member has a role in making Seagate’s products...
-
Engineer - Reliability
2 weeks ago
Pune, Maharashtra, India Seagate Full timeThe Reliability Engineering Team (part of the Product Assurance & Customer Advocacy organization) is accountable and responsible for reliability prediction, modeling, Design for Reliability (DFR), and Design Quality Assurance (DQA) of NPI products in Seagate's Systems, SSD, and HDD portfolios. Each team member has a role in making Seagate's products...
-
Systems Reliability Engineer II
2 weeks ago
pune, India Nutanix Full timeThe Opportunity Are you a top-tier Systems Reliability Engineer with a passion for customer success and expertise in networking, virtualization, OS, Cloud, DevOps, and Storage domains? If so, you would thrive in our team of leading technology specialists and have the opportunity to work on cutting-edge projects that directly impact customer...
-
Systems Reliability Engineer II
1 week ago
Pune, India Nutanix Full timeThe Opportunity Are you a top-tier Systems Reliability Engineer with a passion for customer success and expertise in networking, virtualization, OS, Cloud, DevOps, and Storage domains? If so, you would thrive in our team of leading technology specialists and have the opportunity to work on cutting-edge projects that directly impact customer satisfaction...
-
Sr System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeWho are weFulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and...
-
Sr System Reliability Engineer
4 days ago
Pune, India Fulcrum Digital Full timeJob DescriptionWho are weFulcrumDigital is an agile and next-generation digital accelerating company providingdigital transformation and technology services right from ideation toimplementation. These services have applicability across a variety ofindustries, including banking & financial services, insurance, retail,higher education, food, healthcare, and...
-
Sr System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeWho are we? Fulcrum Digital is a dynamic company focused on digital transformation and technology services for various industries such as banking & financial services, insurance, retail, higher education, food, health care, and manufacturing. The Role: Plan, manage, and oversee all aspects of a Production Environment for Big Data Platforms. Define...
-
Engineer - Reliability
4 weeks ago
Pune, India Seagate Full timeThe Reliability Engineering Team (part of the Product Assurance & Customer Advocacy organization) is accountable and responsible for reliability prediction, modeling, Design for Reliability (DFR), and Design Quality Assurance (DQA) of NPI products in Seagate’s Systems, SSD, and HDD portfolios. Each team member has a role in making Seagate’s products...
-
Engineer - Reliability
2 weeks ago
Pune, Maharashtra, India Seagate Full timeThe Reliability Engineering Team (part of the Product Assurance & Customer Advocacy organization) is accountable and responsible for reliability prediction, modeling, Design for Reliability (DFR), and Design Quality Assurance (DQA) of NPI products in Seagate's Systems, SSD, and HDD portfolios. Each team member has a role in making Seagate's products...
-
Sr System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeWho are we Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and...
-
Sr System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob Description Who are we Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food,...
-
Sr System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeWho are we Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and...
-
Sr System Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeWho are we Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and...