System Reliability Engineer Kubernetes Expert
3 weeks ago
Company Overview:
Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries including banking & financial services, insurance, retail, higher education, food, healthcare, and manufacturing.
The Role:
- Plan, manage, and oversee all aspects of a Production Environment for Java J2EE Spring Boot applications.
- Define strategies for Application Performance Monitoring Optimization in the Production environment.
- Respond to Incidents and improvise platforms based on feedback and measure the reduction of incidents over time.
- Ensure that batch production scheduling and processes are accurate and timely.
- Able to create and execute queries to big data platforms and relational data tables to identify process issues or to perform mass updates preferred.
- Perform ad hoc requests from users, such as data research, file manipulation/transfer, research of process issues, etc.
- Take a holistic approach to problem-solving by connecting the dots during a production event through the various technology stack that makes up the platform to optimize mean time to recover.
- Engage in and improve the whole lifecycle of services from inception and design through deployment, operation, and refinement.
- Analyze ITSM activities of the platform and provide feedback loops to development teams on operational gaps or resiliency concerns.
- Support services before they go live through activities such as system design consulting, capacity planning, and launch reviews.
- Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating and lead in DevOps automation and best practices.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
- Work with a global team spread across tech hubs in multiple geographies and time zones.
- Ability to share knowledge and explain processes and procedures to others.
Requirements
Skills:
Must Have:
- Linux
- Kubernetes
- ITIL/ITSM
- Application Troubleshooting
- Any Monitoring tool (Preferred Splunk/Dynatrace)
- Jenkins CI/CD
Good To Have:
- Even Framework architecture
- Git basic/bitbucket
- Ansible/Chef Basic
- Shell Scripting Basic
- SQL
- Groovy Scripting/Yaml
Benefits
System Reliability Engineer Expert
-
System Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout the Role:Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services from ideation to implementation. These services have applicability across various industries, including banking, financial services, insurance, retail, higher education, food, healthcare, and manufacturing.Key...
-
Senior System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob DescriptionThe ideal candidate will have a solid understanding of Production Environment Java J2EE Spring Boot applications, as well as strategies for Application Performance Monitoring Optimization. They will also be responsible for responding to Incidents, improving the platform based on feedback, and measuring the reduction of incidents over time.Key...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Siemens Industry Software (India) Private Limited Full timeJob Title: Site Reliability Engineer - Cloud ExpertJob Summary:Siemens Digital Industries Software is a leading provider of solutions for the design, simulation, and manufacture of products across many different industries. As a Site Reliability Engineer - Cloud Expert, you will be responsible for ensuring the availability, reliability, and performance of...
-
Cloud Infrastructure Engineer
2 weeks ago
Pune, Maharashtra, India Procore Technologies Full time**Job Title:** Cloud Infrastructure Engineer - Reliability Expert**Estimated Salary:** $140,000 - $170,000 per yearWe are seeking a highly skilled Senior Reliability Engineer with strong backend software engineering skills to join our team at Procore Technologies. This is an exciting opportunity to design, implement, and maintain cloud infrastructure that...
-
Reliability Systems Expert
3 weeks ago
Pune, Maharashtra, India Nutanix Full timeThe OpportunityAs a top-tier Systems Reliability Engineer at Nutanix, you will have the opportunity to work on cutting-edge projects that directly impact customer satisfaction and business success.About the TeamMeet Ikram Khan, Sr. Manager, Worldwide Support:With a career built around customer support, I lead a team of technical experts who provide...
-
Reliable Systems Engineer
1 week ago
Pune, Maharashtra, India One2N Full timeWe're seeking a meticulous engineer to oversee the stability and scalability of our software systems. The ideal candidate will primarily collaborate with clients on One-to-N kind problems, focusing on Proof of Concept development, system maintainability, and reliability.About YouAt least 2 years of experience in DevOps/SRE rolesFamiliarity with Linux systems...
-
Reliability Systems Expert
2 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeFulcrum Digital seeks a skilled Reliability Systems Expert to oversee production environments and implement strategies for application performance monitoring and optimization.Key responsibilities include planning, managing and overseeing all aspects of production environments, responding to incidents and improving platforms based on feedback and...
-
System Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeAbout the Role:Fulcrum Digital is a digital transformation company that provides technology services across various industries. We are seeking a skilled System Reliability Engineer to join our team.Key Responsibilities:Plan, manage, and oversee all aspects of a Production Environment for Java J2EE Spring Boot applications.Define strategies for Application...
-
DevOps Engineer
4 weeks ago
Pune, Maharashtra, India BMC Software, Inc. Full timeJob SummaryWe are seeking a highly skilled DevOps Engineer with a focus on Site Reliability Engineering to join our SaaS Ops department. As an SRE Specialist, you will be responsible for ensuring the reliability and resilience of our cloud-based systems.Key Responsibilities:Design and implement automation to auto-remediate/self-heal issues in...
-
Reliability Expert
5 days ago
Pune, Maharashtra, India LTIMindtree Full timeJob DescriptionThe LTIMindtree organization is seeking a skilled Cloud Infrastructure Specialist to join their team. As a key member of our infrastructure engineering team, you will be responsible for designing, building, and maintaining scalable, secure, and efficient cloud-based systems. This includes ensuring the reliability and performance of our...
-
System Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeJob DescriptionAt Fulcrum Digital, we are seeking a skilled System Reliability Engineer to join our team and help us deliver high-quality digital transformation and technology services. As a System Reliability Engineer, you will play a critical role in ensuring the smooth operation of our production environment. You will define strategies for Application...
-
Reliability Systems Architect
3 weeks ago
Pune, Maharashtra, India Hansen Technologies Full timeAbout The RoleIn the esteemed role of Site Reliability Engineer at Hansen Technologies, you will be at the forefront of ensuring the reliability, performance, and scalability of our systems. As a seasoned professional, you will possess an exceptional blend of technical expertise and creative problem-solving skills, with a passion for automating tasks...
-
Pune, Maharashtra, India LTIMindtree Full timeAbout Us: LTIMindtree is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will play a vital role in ensuring the availability, scalability, and reliability of our cloud-native applications.Key Responsibilities: • Collaborate with cross-functional teams to design, implement, and maintain...
-
Site Reliability Engineer
1 month ago
Pune, Maharashtra, India Tata Consultancy Services Full timeJob Title: Site Reliability Engineer - Java ExpertAbout the Role:We are seeking a highly skilled Site Reliability Engineer with expertise in Java to join our team at Tata Consultancy Services. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our Java-based systems.Key Responsibilities:Design and...
-
Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Red Hat India Private Limited Full timeAbout the Role:We are seeking a skilled Site Reliability Engineer to join our Cloud Operations team in India. As a key member of our team, you will contribute to the development, scaling, and operation of our Red Hat OpenShift Managed Cloud platform. Your expertise in cloud providers and technologies, including kubernetes, will be crucial in enabling...
-
Senior Site Reliability Engineering Leader
3 weeks ago
Pune, Maharashtra, India Ensono Full timeAbout EnsonoEnsono is a leading expert in technology advisory and managed services. We empower clients to accelerate their digital transformation, driving lasting business outcomes. Our dedicated team optimizes today's systems across any hybrid environment, offering services in consulting, mainframe and application modernization, public cloud migration, and...
-
Cloud Engineer
4 weeks ago
Pune, Maharashtra, India Vodafone Full timeJob Title: Cloud Engineer - Kubernetes, DevOps, and AWS ExpertJob Summary:We are seeking a highly skilled Cloud Engineer with expertise in Kubernetes, DevOps, and AWS to join our team. The ideal candidate will have a strong understanding of cloud computing, containerization, and automation.Key Responsibilities:Design and implement scalable and secure cloud...
-
Senior Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Procore Technologies Full timeSenior Site Reliability EngineerAbout the RoleWe are seeking a highly skilled Senior Reliability Engineer with strong backend software engineering skills to join our team at Procore Technologies. As a Senior Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring the smooth operation of...
-
Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Roche Full timeAbout the PositionJob SummaryAt Roche, we are seeking a skilled Site Reliability Engineer - Cloud Expert to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our production systems.Key ResponsibilitiesDesign, implement, and maintain site reliability engineering practices that ensure...
-
System Reliability Engineering Specialist
3 weeks ago
Pune, Maharashtra, India Fulcrum Digital Full timeFulfilling the Next Era of Digital TransformationAbout the Role:We are seeking a skilled Sr System Reliability Engineer to join our agile team at Fulcrum Digital. As a key member of our digital accelerating company, you will be responsible for defining strategies for Application Performance Monitoring, Optimization in Prod environment.Your Key...