
Lead Site Reliability Engineer
1 day ago
Talent500 is seeking a highly skilled individual to fill the role of Principal Engineer, Site Reliability Engineering. This position will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms.
This key role will focus on leading the operational health of our platforms, ensuring we are delivering highly reliable financial applications and data services that meet demanding requirements for accuracy, compliance, and availability.
Responsibilities
- Operational Oversight: Ensure day-to-day operations for Accounting and Finance applications and data platforms run smoothly and meet business expectations.
- Reliability & Availability: Guarantee Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.
- Automation & Efficiency: Develop automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk.
- Observability & Monitoring: Implement comprehensive monitoring, alerting, and logging for accounting applications and data pipelines.
- Incident Response: Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues.
- Operational Excellence: Establish best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems.
- Collaboration with Engineering & Finance: Partner with software engineers, data engineers, and Finance/Accounting teams to ensure operational needs are met from development through production.
- Team Coordination: Manage workload, priorities, and escalations for operations staff and partner teams.
- Security & Compliance: Ensure financial applications and data pipelines meet audit, compliance, and security requirements.
- Continuous Improvement: Drive post-incident reviews and proactively identify opportunities to improve system resilience.
- Audit & Compliance Support: Ensure operational practices meet internal controls, audit requirements, and financial compliance standards.
Requirements
- Bachelor's in computer science, engineering, information technology, or related field (or equivalent experience).
- 12-15 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications.
- Strong experience with monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent).
- Hands-on expertise with CI/CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.).
- Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective.
- Strong scripting/programming experience (Python, Bash, Go, or similar) for automation and tooling.
- Proven ability to manage incident response and conduct blameless postmortems.
- Experience ensuring compliance, security, and audit-readiness in enterprise applications.
Mandatory Skills
- Automation
- Operational Excellence
- Snowflake
- SQL
- Data Engineering
- Financial Systems
Nice to Have
- Experience supporting financial applications.
- Exposure to FinOps practices for optimizing cloud spend in finance-related platforms.
- Familiarity with containers and orchestration.
- Experience building resilience into data pipelines and ensuring auditability for accounting data.
- Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.
-
Site Reliability Manager
7 days ago
Solapur, Maharashtra, India beBeeMaintenance Full time ₹ 1,20,00,000 - ₹ 1,80,00,000Job Title: Site Reliability ManagerAbout the Role:The Site Reliability Manager is responsible for ensuring the overall reliability and efficient operation of all plant utilities, mechanical, and electrical systems across the facility.Main Responsibilities:Oversee regular maintenance, repairs, and improvements to equipment and systems.Implement a preventive...
-
Chief Reliability Engineer
7 days ago
Solapur, Maharashtra, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Title: Chief Reliability OfficerAs a visionary leader in Site Reliability Engineering (SRE), you will oversee the reliability, scalability, and performance of our mission-critical systems.You will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response...
-
Site Reliability Engineer
2 days ago
Solapur, Maharashtra, India 5paisa Full timeAbout us:5paisa Capital Ltd. is one of India's fastest-growing fintech companies, revolutionizing how retail investors and traders engage with financial markets. We provide a robust digital platform offering a wide suite of products, including Stocks, Futures & Options, Mutual Funds, and IPOs—accessible seamlessly through our user-friendly mobile and web...
-
Highly Skilled Site Reliability Engineer
1 week ago
Solapur, Maharashtra, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Site Reliability EngineerWe're pushing the boundaries of what's possible in software development. Our Mass Customization Platform is a modular, multi-tenant service that enables our businesses to choose the solutions that work for them or assemble any custom combination they need. This makes it easier and faster to introduce new products, reach customers,...
-
Site Reliability Network Architect
1 day ago
Solapur, Maharashtra, India beBeeReliability Full time ₹ 13,80,000 - ₹ 21,60,000Network Reliability EngineerAbout the RoleThis role focuses on designing, deploying, automating, and monitoring traditional and cloud network infrastructure securely.Key Responsibilities:Secure Network Design & Deployment: Lead the design, installation, and configuration of critical infrastructure at new and existing sites to ensure high reliability...
-
Reliable Engineering Leader
16 hours ago
Solapur, Maharashtra, India beBeeReliability Full time US$ 15,00,000 - US$ 20,00,000As a Senior Site Reliability Engineer (SRE II) at Zafin, you will be responsible for owning the availability, latency, performance, and efficiency of our SaaS on Azure. You will define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale.You will have the opportunity to make a significant impact on our...
-
Reliable Systems Engineer
1 day ago
Solapur, Maharashtra, India beBeeSre Full time ₹ 15,00,000 - ₹ 25,00,000Job OpportunityWe're embarking on an exciting project with cutting-edge technologies and a talented team. By joining us, you'll be exposed to the latest innovations and collaborate with experts in the industry.As a VP – Site Reliability Engineering, you will play a key role in shaping our SRE function within the organization. You will work closely with a...
-
Remote Technical Manager
1 day ago
Solapur, Maharashtra, India beBeeReliability Full time ₹ 90,00,000 - ₹ 1,20,00,000Job Opportunity:The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of skilled professionals, ensuring operational excellence and fostering a high-performing team culture. Reporting to senior leadership, this role is responsible for overseeing day-to-day operations, technical mentorship, and strategic alignment with the...
-
Senior Systems Reliability Engineer
1 week ago
Solapur, Maharashtra, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job Title: SRE Lead (Engineering & Reliability)We are seeking a highly skilled and experienced Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems.This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems that meet...
-
Product Reliability Lead
1 day ago
Solapur, Maharashtra, India beBeeReliability Full time ₹ 1,00,00,000 - ₹ 1,50,00,000Job OverviewWe are seeking an experienced professional to lead our end-to-end product reliability strategy for electrolyzer systems.Key Responsibilities:Develop and implement a comprehensive product reliability strategy, encompassing design, testing, manufacturing, and field performance.Collaborate closely with cross-functional teams to embed reliability...