Senior System Reliability Engineer
3 weeks ago
The ideal candidate will have a solid understanding of Production Environment Java J2EE Spring Boot applications, as well as strategies for Application Performance Monitoring Optimization. They will also be responsible for responding to Incidents, improving the platform based on feedback, and measuring the reduction of incidents over time.
Key Responsibilities:
- Plan, manage, and oversee all aspects of a Production Environment.
- Define strategies for Application Performance Monitoring Optimization.
- Respond to Incidents and improvise platform based on feedback and measure the reduction of incidents overtime.
- Ensure batch production scheduling and process are accurate and timely.
- Create and execute queries to big data platforms and relational data tables to identify process issues or perform mass updates.
- Perform ad hoc requests from users such as data research, file manipulation/transfer, research of process issues, etc.
- Take a holistic approach to problem solving by connecting the dots during a production event through the various technology stack that makes up the platform to optimize meantime to recover.
- Engage in and improve the whole lifecycle of services from inception and design through deployment, operation, and refinement.
- Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns.
- Support services before they go live through activities such as system design consulting, capacity planning, and launch reviews.
- Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating and lead in DevOps automation and best practices.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
Requirements
Skills:
Must Have:
- Linux
- Kubernetes
- ITIL/ITSM
- Application Troubleshooting
- Any Monitoring tool (Preferred Splunk/Dynatrace)
- Jenkins CI/CD
Good To Have:
- Even Framework architecture
- Git basic/bitbucket
- Ansible/Chef Basic
- Shell Scripting Basic
- SQL
- Groovy Scripting/Yaml
Benefits
-
Senior System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Red Hat India Private Limited Full timeRed Hat is seeking a skilled Senior System Reliability Engineer to develop, scale, and operate our OpenShift managed cloud services. Our ideal candidate will possess experience in large-scale distributed system design, cloud infrastructure management, and enterprise software development.
-
Reliability Systems Engineer
12 hours ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout Fulcrum Digital: We are a dynamic company seeking an experienced Senior Reliability Engineer to join our team. As a key contributor, you will play a pivotal role in ensuring the reliability, scalability, and performance of our critical systems. Our company culture emphasizes collaboration and innovation. You will work closely with development,...
-
AWS System Reliability Engineer
2 days ago
Pune, Maharashtra, India ScaleneWorks Full time**Company Overview**ScaleneWorks is a cutting-edge technology firm that values innovation and collaboration. We are committed to delivering exceptional results and fostering a culture of growth and development. Salary: $120,000 - $180,000 per year **Job Description**As an AWS System Reliability Engineer at ScaleneWorks, you will play a crucial role in...
-
Reliability Systems Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout the RoleFulcrum Digital is seeking a skilled System Reliability Engineer to join our team. As a System Reliability Engineer, you will be responsible for designing, implementing, and enhancing our deployment automation based on Chef. Your goal will be to develop a reliable and efficient release and deployment process.Key ResponsibilitiesDesign and...
-
System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout the RoleFulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation.Our team is looking for a highly skilled System Reliability Engineer to plan, manage, and oversee all aspects of a Production Environment for Big Data Platforms.
-
Senior Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Procore Technologies Full timeSenior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Reliability Engineer with strong backend software engineering skills to join our team at Procore Technologies. As a Senior Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring the smooth operation of...
-
Senior Systems Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Nutanix Full timeThe OpportunityWe are seeking a highly skilled Senior Systems Reliability Engineer to join our team of leading technology specialists. As a key member of our team, you will have the opportunity to work on cutting-edge projects that directly impact customer satisfaction and business success.About the TeamMeet Ikram Khan, Sr. Manager, Worldwide Support: With a...
-
Reliability Systems Engineer
1 month ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob Title: Sr System Reliability EngineerAbout the Role:Fulcrum Digital is a leading digital transformation company that provides innovative technology services to various industries. We are seeking a highly skilled Sr System Reliability Engineer to join our team.Key Responsibilities:Design and implement a robust deployment automation process using...
-
Senior Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Global Payments Asia-Pacific India Private Limited Full timeAbout This RoleAt Global Payments Asia-Pacific India Private Limited, we're shaping the future of payments technology. As a Senior Site Reliability Engineer, you'll play a key role in ensuring our systems are highly available, resilient, and performant.What You'll DoDesign and implement chaos experiments to test our systems' reliability and resilience.Push...
-
System Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob Title: System Reliability EngineerAbout the Role:Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher...
-
System Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout the Role:Fulcrum Digital is a digital transformation company that provides technology services across various industries. We are seeking a skilled System Reliability Engineer to join our team.Key Responsibilities:Plan, manage, and oversee all aspects of a Production Environment for Java J2EE Spring Boot applications.Define strategies for Application...
-
System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout the Role:We are seeking a skilled System Reliability Engineer to join our team at Fulcrum Digital. As a key member of our infrastructure team, you will play a crucial role in ensuring the stability and performance of our production environment.Key Responsibilities:Plan, manage, and oversee all aspects of a Production EnvironmentDefine strategies for...
-
System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob OverviewFulcrum Digital is seeking a skilled System Reliability Engineer to join our team. As a key member of our IT department, you will be responsible for ensuring the smooth operation of our production environment.Key ResponsibilitiesPlan, manage, and oversee all aspects of a Production EnvironmentDefine strategies for Application Performance...
-
Reliable Systems Engineer
1 week ago
Pune, Maharashtra, India One2N Full timeWe're seeking a meticulous engineer to oversee the stability and scalability of our software systems. The ideal candidate will primarily collaborate with clients on One-to-N kind problems, focusing on Proof of Concept development, system maintainability, and reliability.About YouAt least 2 years of experience in DevOps/SRE rolesFamiliarity with Linux systems...
-
System Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob Title: Sr System Reliability EngineerAbout the Role:Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher...
-
Sr System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob Title: Sr System Reliability EngineerJob Summary:Fulcrum Digital seeks a talented Sr System Reliability Engineer to join our team. As a Sr System Reliability Engineer, you will be responsible for planning, managing, and overseeing all aspects of a Production Environment.Key Responsibilities:Plan, manage, and oversee all aspects of a Production...
-
Senior Site Reliability Engineer
1 month ago
Pune, Maharashtra, India RED HAT Full timeJob DescriptionRed Hat is seeking a highly skilled Senior Site Reliability Engineer to join our team and contribute to the development, scaling, and operation of our OpenShift managed cloud services. As a key member of our SRE team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud services.Key...
-
Senior Systems Engineer
3 weeks ago
Pune, Maharashtra, India PubMatic Full timeJob Title: Senior Systems EngineerDescription:At PubMatic, we're seeking a highly skilled Senior Systems Engineer to join our team in the L4 support capacity. This role requires extensive experience in providing technical support and troubleshooting for CentOS, Ubuntu, and Rocky Linux environments.Key Responsibilities:* Provide L4 support for CentOS, Ubuntu,...
-
Sr System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob Title: Sr System Reliability EngineerAbout the RoleFulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services. We are seeking a Sr System Reliability Engineer to plan, manage, and oversee all aspects of a Production Environment.Key ResponsibilitiesDefine strategies for Application...
-
System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeOur Company, Fulcrum Digital, is looking for a skilled professional to fill the role of a System Reliability Engineer.Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries...