IT Resiliency Engineer
2 weeks ago
Tasks
End-to-End Engineering Leadership:
Oversee the design and implementation of resilient engineering across the technology domains.
Cloud and On-Premises Infrastructure Expertise:
Design and review resilient solutions in both cloud-based and on-premises environments.
Chaos Engineering Infrastructure Initiatives:
Lead chaos engineering efforts to proactively identify and mitigate potential system weaknesses.
Standards for Monitoring and Alerting:
Collaborate with Teams to evolve existing standards for system monitoring and alerting to ensure rapid detection and response.
Resiliency Architecture Reviews:
Represent the IT Resiliency Office during the Architectural Review Board.
Enterprise-wide Collaboration and stakeholder management
: Collaborate with various teams across the organization to align and prioritize resiliency and recovery efforts.
Automation:
Expertise with IaC and Tools such as Ansible.
Incident Response and Recovery:
Integrate with post mortem process, from a major incident, to identify areas of opportunity for enhancing resiliency.
Development:
Evangelize standards and practices among the Technology organization to enrich our resiliency posture.
Reporting and Documentation:
Develop standardized regular reporting on resilience activities, risks, and improvements to the Leadership team.
Requirements
Qualifications:
Bachelor's degree or equivalent experience.
5-10 years experience with platform engineering with a focus on IaC, DevOps practices, and orchestration tools.
Preferred but not required experience as a Team lead or a hands on Technical Manager role that can engage and deliver projects to completion
A track record of successfully architecting and deploying enterprise-level solutions that prioritize system uptime and data integrity across various operational scenarios.
Demonstrated ability to design and implement systems that ensure high availability, support massive transaction volumes, and facilitate seamless disaster recovery processes.
Infrastructure and service architecture & engineering experience, including functional and technical requirements gathering, and solution development.
Strong dedication to customer needs, with excellent communication and the ability to build lasting relationships, alongside the capability to articulate complex resilience strategies in a clear and impactful manner.
Deep insight into the complexities of multi-AZ and multi-Region cloud platforms, with a keen understanding of how these impact system resilience and disaster recovery planning.
Proven experience in the ongoing management of mission-critical systems that require constant uptime, including out-of-hours support and rapid response to incidents.
Knowledgeable in evaluating and deciding on trade-offs between consistency, availability, and partition tolerance, especially in the context of system failures and recovery strategies.
Well-versed in various cloud service models such as SaaS, PaaS, and IaaS, with hands-on experience in designing resilient services on leading public cloud platforms.
Proficient in Chaos Engineering principles and practices, with experience in designing and conducting experiments to validate the system's capability to withstand turbulent conditions.
Skilled in implementing observability solutions that provide real-time insights into the performance and health of systems, aiding in proactive issue detection and resolution.
Practical experience operating in an Agile development environment.
-
Sr Principal Engineer, IT Resiliency Office
7 days ago
Pune, Maharashtra, India DigitalXNode Full timeAbout The RoleAs a Sr Principal Engineer in our IT Resiliency Office, you will play a pivotal role in ensuring the reliability and resilience of our technology infrastructure. You will lead the charge in designing, implementing, and maintaining robust systems that can withstand disruptions and recover swiftly.Key ResponsibilitiesResiliency Architecture:...
-
Manager, Technology Regulatory Resilience
1 week ago
Pune, Maharashtra, India Mastercard Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title:Manager, Technology Regulatory Resilience (Product Management - Technical) Overview:OverviewAs a Technology Regulatory Resilience Manager within Mastercard's Technology Regulatory Execution (T-Rex) team, you will help deliver Mastercard's South Asia regulatory resilience agenda. This role is pivotal in aligning Mastercard's infrastructure,...
-
Manager, Technology Regulatory Resilience
1 week ago
Pune, Maharashtra, India Mastercard Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Analyst, Business Resiliency
1 week ago
Pune, Maharashtra, India NielsenIQ Full time ₹ 6,00,000 - ₹ 18,00,000 per yearJob Description Day-to-day coordination and governance of the Business Continuity Management (BCM) Program, including Disaster Recovery, Business Continuity, and Crisis Management planning, execution, and documentation.Job ResponsibilitiesDesign, implement, and maintain an annual tabletop exercise program for NIQ HUBs and offices.Maintain an up-to-date...
-
Analyst, Business Resiliency
7 days ago
Pune, Maharashtra, India NielsenIQ Full timeJob DescriptionDay-to-day coordination and governance of the Business Continuity Management (BCM) Program, including Disaster Recovery, Business Continuity, and Crisis Management planning, execution, and documentation.Job ResponsibilitiesDesign, implement, and maintain an annual tabletop exercise program for NIQ HUBs and offices.Maintain an up-to-date...
-
Manager, Site Reliability Engineering
1 week ago
Pune, Maharashtra, India Veeam Software Full time US$ 12,00,000 - US$ 24,00,000 per yearVeeam, the #1 global market leader in data resilience, believes businesses should control all their data whenever and wherever they need it. Veeam provides data resilience through data backup, data recovery, data portability, data security, and data intelligence. Based in Seattle, Veeam protects over 550,000 customers worldwide who trust Veeam to keep their...
-
Senior Software Engineer, Reliability
5 days ago
Pune, Maharashtra, India Veeam Software Full timeVeeam, the #1 global market leader in data resilience, believes businesses should control all their data whenever and wherever they need it. Veeam provides data resilience through data backup, data recovery, data portability, data security, and data intelligence. Based in Seattle, Veeam protects over 550,000 customers worldwide who trust Veeam to keep their...
-
Senior Software Engineer, Reliability
5 days ago
Pune, Maharashtra, India Veeam Software Full timeVeeam, the #1 global market leader in data resilience, believes businesses should control all their data whenever and wherever they need it. Veeam provides data resilience through data backup, data recovery, data portability, data security, and data intelligence. Based in Seattle, Veeam protects over 550,000 customers worldwide who trust Veeam to keep...
-
Software Engineer
1 day ago
Pune, Maharashtra, India Vibe It Solutions Full timeHiring Software Engineer & ) in Pune. 34 yrs experience in , , PostgreSQL & SQL. Build scalable web apps, REST APIs, optimize DBs, and work in a collaborative engineering team. Full-time role.Provident fund
-
Engineer AMPS
18 hours ago
Pune, Maharashtra, India Rehlko Full timeWhy Work at RehlkoWe have met today's energy needs while planning for tomorrow's for over 100 years. Beginning with the first modern generator, the Rehlko Automatic Power & Light, launched in 1920, Rehlko has been an innovative leader in energy resilience.Our product range includes engines, generators, power conversion, UPS systems, EV components and...