Lead Site Reliability Engineer

1 day ago


Solapur, Maharashtra, India beBeeEngineering Full time ₹ 20,00,000 - ₹ 25,00,000
Job Opportunity

Talent500 is seeking a highly skilled individual to fill the role of Principal Engineer, Site Reliability Engineering. This position will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms.

This key role will focus on leading the operational health of our platforms, ensuring we are delivering highly reliable financial applications and data services that meet demanding requirements for accuracy, compliance, and availability.

Responsibilities

  • Operational Oversight: Ensure day-to-day operations for Accounting and Finance applications and data platforms run smoothly and meet business expectations.
  • Reliability & Availability: Guarantee Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.
  • Automation & Efficiency: Develop automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk.
  • Observability & Monitoring: Implement comprehensive monitoring, alerting, and logging for accounting applications and data pipelines.
  • Incident Response: Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues.
  • Operational Excellence: Establish best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems.
  • Collaboration with Engineering & Finance: Partner with software engineers, data engineers, and Finance/Accounting teams to ensure operational needs are met from development through production.
  • Team Coordination: Manage workload, priorities, and escalations for operations staff and partner teams.
  • Security & Compliance: Ensure financial applications and data pipelines meet audit, compliance, and security requirements.
  • Continuous Improvement: Drive post-incident reviews and proactively identify opportunities to improve system resilience.
  • Audit & Compliance Support: Ensure operational practices meet internal controls, audit requirements, and financial compliance standards.

Requirements

  • Bachelor's in computer science, engineering, information technology, or related field (or equivalent experience).
  • 12-15 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications.
  • Strong experience with monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent).
  • Hands-on expertise with CI/CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.).
  • Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective.
  • Strong scripting/programming experience (Python, Bash, Go, or similar) for automation and tooling.
  • Proven ability to manage incident response and conduct blameless postmortems.
  • Experience ensuring compliance, security, and audit-readiness in enterprise applications.

Mandatory Skills

  • Automation
  • Operational Excellence
  • Snowflake
  • SQL
  • Data Engineering
  • Financial Systems

Nice to Have

  • Experience supporting financial applications.
  • Exposure to FinOps practices for optimizing cloud spend in finance-related platforms.
  • Familiarity with containers and orchestration.
  • Experience building resilience into data pipelines and ensuring auditability for accounting data.
  • Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.


  • Solapur, Maharashtra, India beBeeMaintenance Full time ₹ 1,20,00,000 - ₹ 1,80,00,000

    Job Title: Site Reliability ManagerAbout the Role:The Site Reliability Manager is responsible for ensuring the overall reliability and efficient operation of all plant utilities, mechanical, and electrical systems across the facility.Main Responsibilities:Oversee regular maintenance, repairs, and improvements to equipment and systems.Implement a preventive...


  • Solapur, Maharashtra, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Title: Chief Reliability OfficerAs a visionary leader in Site Reliability Engineering (SRE), you will oversee the reliability, scalability, and performance of our mission-critical systems.You will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response...


  • Solapur, Maharashtra, India 5paisa Full time

    About us:5paisa Capital Ltd. is one of India's fastest-growing fintech companies, revolutionizing how retail investors and traders engage with financial markets. We provide a robust digital platform offering a wide suite of products, including Stocks, Futures & Options, Mutual Funds, and IPOs—accessible seamlessly through our user-friendly mobile and web...


  • Solapur, Maharashtra, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Site Reliability EngineerWe're pushing the boundaries of what's possible in software development. Our Mass Customization Platform is a modular, multi-tenant service that enables our businesses to choose the solutions that work for them or assemble any custom combination they need. This makes it easier and faster to introduce new products, reach customers,...


  • Solapur, Maharashtra, India beBeeReliability Full time ₹ 13,80,000 - ₹ 21,60,000

    Network Reliability EngineerAbout the RoleThis role focuses on designing, deploying, automating, and monitoring traditional and cloud network infrastructure securely.Key Responsibilities:Secure Network Design & Deployment: Lead the design, installation, and configuration of critical infrastructure at new and existing sites to ensure high reliability...


  • Solapur, Maharashtra, India beBeeReliability Full time US$ 15,00,000 - US$ 20,00,000

    As a Senior Site Reliability Engineer (SRE II) at Zafin, you will be responsible for owning the availability, latency, performance, and efficiency of our SaaS on Azure. You will define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale.You will have the opportunity to make a significant impact on our...


  • Solapur, Maharashtra, India beBeeSre Full time ₹ 15,00,000 - ₹ 25,00,000

    Job OpportunityWe're embarking on an exciting project with cutting-edge technologies and a talented team. By joining us, you'll be exposed to the latest innovations and collaborate with experts in the industry.As a VP – Site Reliability Engineering, you will play a key role in shaping our SRE function within the organization. You will work closely with a...


  • Solapur, Maharashtra, India beBeeReliability Full time ₹ 90,00,000 - ₹ 1,20,00,000

    Job Opportunity:The Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of skilled professionals, ensuring operational excellence and fostering a high-performing team culture. Reporting to senior leadership, this role is responsible for overseeing day-to-day operations, technical mentorship, and strategic alignment with the...


  • Solapur, Maharashtra, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job Title: SRE Lead (Engineering & Reliability)We are seeking a highly skilled and experienced Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems.This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems that meet...


  • Solapur, Maharashtra, India beBeeReliability Full time ₹ 1,00,00,000 - ₹ 1,50,00,000

    Job OverviewWe are seeking an experienced professional to lead our end-to-end product reliability strategy for electrolyzer systems.Key Responsibilities:Develop and implement a comprehensive product reliability strategy, encompassing design, testing, manufacturing, and field performance.Collaborate closely with cross-functional teams to embed reliability...