
Technical Lead, Infrastructure Resilience
4 days ago
Job Title: Technical Lead, Infrastructure Resilience
Overview:
The company is seeking a skilled and experienced technical lead to play a critical role in ensuring the stability, scalability, and operational excellence of accounting and finance platforms.
This position owns the operational health of our platforms, ensuring that we deliver highly reliable financial applications and data services that support business operations and meet demanding requirements for accuracy, compliance, and availability.
The successful candidate will build automation, implement monitoring, improve incident response, and champion DevOps practices that enable finance and accounting systems to operate consistently and reliably. They will also coach and mentor junior engineers to raise the team's overall operational excellence.
Key Responsibilities:
- Operational Oversight: Own day-to-day operations for accounting and finance applications and data platforms, ensuring they run smoothly and meet business expectations.
- Reliability & Availability: Ensure accounting and finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.
- Automation & Efficiency: Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk.
- Observability & Monitoring: Implement and maintain comprehensive monitoring, alerting, and logging for accounting applications and data pipelines (e.g., Snowflake, dbt workflows, ERP integrations).
- Incident Response: Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues.
- Operational Excellence: Establish and enforce best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems.
- Collaboration with Engineering & Finance: Partner with software engineers, data engineers, and finance/accounting teams to ensure operational needs are met from development through production.
- Team Coordination: Manage workload, priorities, and escalations for operations staff and partner teams, ensuring alignment with SLAs and compliance requirements.
- Security & Compliance: Ensure financial applications and data pipelines meet audit, compliance, and security requirements.
- Continuous Improvement: Drive post-incident reviews, implement lessons learned, and proactively identify opportunities to improve system resilience.
- Audit & Compliance Support: Ensure operational practices meet internal controls, audit requirements, and financial compliance standards.
Requirements:
- Bachelor's degree in computer science, engineering, information technology, or a related field (or equivalent experience).
- 12-15 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications.
- Strong experience with monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent).
- Hands-on expertise with CI/CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.).
- Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective.
- Strong scripting/programming experience (Python, Bash, Go, or similar) for automation and tooling.
- Proven ability to manage incident response and conduct blameless postmortems.
- Experience ensuring compliance, security, and audit-readiness in enterprise applications.
Nice To Have:
- Experience supporting financial applications (ERP, revenue recognition systems, accounting platforms).
- Exposure to FinOps practices for optimizing cloud spend in finance-related platforms.
- Familiarity with containers and orchestration (Docker, Kubernetes).
- Experience building resilience into data pipelines and ensuring auditability for accounting data.
- Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.
-
Chief Resilience Architect
4 days ago
Salem, Tamil Nadu, India beBeeResilience Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
About this role: A Chief Resilience Architect plays a critical role in ensuring the reliability and resilience of our systems. This includes identifying and eliminating Single Points of Failure (SPOFs), conducting Failure Mode and Effects Analysis (FMEA), and developing mitigation strategies to enhance system resiliency. Key Responsibilities: Identify and...
-
Enterprise Resilience Professional
1 hour ago
Salem, Tamil Nadu, India beBeeResiliency Full time ₹ 15,00,000 - ₹ 25,00,000
Job Description
Key Responsibilities: Develop and implement resilient testing strategies for large-scale applications and infrastructure. Analyze system behavior during disruption scenarios, highlighting performance bottlenecks and proposing enhancements to improve resilience. Maintain a strong understanding of system architecture, networking concepts, and...
-
Cloud Infrastructure Technical Lead
3 days ago
Salem, Tamil Nadu, India beBeeITLeader Full time ₹ 15,00,000 - ₹ 25,00,000
Job Title: A technical leader with strong organizational and IT Service Management skills is required to oversee the platform and infrastructure team.
About This Role: This leadership position requires a proven track record of escalating management experience, a solid understanding of operational best practices and business processes. The ideal candidate will be...
-
Technical Lead
6 days ago
Salem, Tamil Nadu, India beBeeTechnical Full time US$ 1,50,000 - US$ 2,00,000
Job Title: Backend Engineering Director
Overview: We are seeking a seasoned Backend Engineering Director to drive the development and implementation of our backend systems. As a key member of our engineering team, you will be responsible for leading the design, development, and deployment of high-quality software solutions that meet the needs of our...
-
Infrastructure Engineering Director
1 week ago
Salem, Tamil Nadu, India beBeeLeadership Full time ₹ 1,80,00,000 - ₹ 2,00,00,000
Lead Infrastructure Engineering Expert
As a seasoned leader in infrastructure engineering, you will play a pivotal role in collaborating closely with cross-functional teams to deliver exceptional technology services. This key leadership position drives efficiency, optimization, and service delivery while managing a significant portion of our global...
-
Enterprise Resiliency Specialist
2 days ago
Salem, Tamil Nadu, India beBeeResilience Full time ₹ 1,20,00,000 - ₹ 1,70,00,000
Job Opportunity: Resiliency Expert
Key Responsibilities: Design and implement robust testing strategies for complex applications and infrastructure. Develop, execute, and maintain comprehensive test plans, scripts, and scenarios that validate system recovery and high availability. Perform rigorous stress, load, chaos, and failure testing to simulate real-world...
-
AWS Platform Engineer
6 days ago
Salem, Tamil Nadu, India beBeeResilient Full time ₹ 15,00,000 - ₹ 20,10,000
System Engineer for Financial Systems
Join our team as a system engineer with expertise in AWS Platform Engineering. Your primary responsibility will be to oversee the reliability and scalability of mission-critical financial systems. This is an opportunity for technical leaders who want to own platforms end-to-end, applying SRE principles to financial...
-
EDP Technical Business Analyst
1 week ago
Salem, Tamil Nadu, India beBeeBusiness Full time ₹ 90,00,000 - ₹ 1,10,00,000
Job Title: Technical Business Analyst - EDP Platform
Overview: The Technical Business Analyst supports critical operations across process improvement, data platform engineering, and automation while contributing to the customer's strategic vision of expanding EDP data entitlement.
Core Responsibilities: Improve EDP workflows and operational processes by identifying...
-
Software Infrastructure Lead
6 days ago
Salem, Tamil Nadu, India beBeeEngineering Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
We are seeking an exceptional individual to fill the role of Software Infrastructure Lead. As a key member of our team, you will be responsible for overseeing the development and maintenance of critical platforms that underpin our operations.
Key Responsibilities: Lead the design, implementation, and scaling of platform infrastructure to support Large Language...
-
Resilience Architect
3 days ago
Salem, Tamil Nadu, India beBeeCompliance Full time ₹ 13,50,000 - ₹ 2,02,50,000
Unlock Compliance Expertise
We're seeking a seasoned compliance professional to lead our efforts in embedding security, privacy, and governance into every aspect of our operations. As our Associate Compliance Manager, you'll develop and implement structured playbooks, automation, and mentorship support for ISO, SOC2, GDPR, and DPDP practices. Cross-Foundation...