Principal Engineer Site Reliability Specialist

1 day ago


Vijayawada, Andhra Pradesh, India beBeeSite Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Role Overview:

The Principal Engineer, SRE will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms.

This role focuses on leading the operational health of our platforms, delivering highly reliable financial applications and data services that meet accuracy, compliance, and availability requirements supporting business operations.

As an Principle SRE, you will build automation, implement monitoring, improve incident response, and champion DevOps practices to enable Finance and Accounting systems to operate with consistency and trustworthiness. Additionally, you will coach and mentor junior SREs to ensure overall Operational Excellence.

Responsibilities:
  • Operational Oversight: Own day-to-day operations for Accounting and Finance applications and data platforms, ensuring they run smoothly and meet business expectations.
  • Reliability & Availability: Ensure Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.
  • Automation & Efficiency: Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk.
  • Observability & Monitoring: Implement and maintain comprehensive monitoring, alerting, and logging for accounting applications and data pipelines (e.g., Snowflake, dbt workflows, ERP integrations).
  • Incident Response: Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues.
  • Operational Excellence: Establish and enforce best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems.
  • Collaboration: Partner with software engineers, data engineers, and Finance/Accounting teams to ensure operational needs are met from development through production.
  • Team Coordination: Manage workload, priorities, and escalations for operations staff and partner teams, ensuring alignment with SLAs and compliance requirements.
  • Security & Compliance: Ensure financial applications and data pipelines meet audit, compliance, and security requirements.
  • Continuous Improvement: Drive post-incident reviews, implement lessons learned, and proactively identify opportunities to improve system resilience.
Requirements:
  • Education: Bachelor's in computer science, Engineering, Information Technology, or related field (or equivalent experience).
  • Experience: 12-15 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications.
  • Monitoring Tools: Strong experience with monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent).
  • Automation Frameworks: Hands-on expertise with CI/CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.).
  • Financial Systems: Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective.
  • Scripting Languages: Strong scripting/programming experience (Python, Bash, Go, or similar) for automation and tooling.
  • Incident Response: Proven ability to manage incident response and conduct blameless postmortems.
  • Compliance: Experience ensuring compliance, security, and audit-readiness in enterprise applications.
Must-Have Skills:
  • Automation,
  • Operational Excellence,
  • Snowflake,
  • SQL,
  • Data Engineering,
  • Financial Systems
Nice-To-Have Skills:
  • Supporting Financial Applications: Experience supporting financial applications (ERP, revenue recognition systems, accounting platforms).
  • FinOps Practices: Exposure to FinOps practices for optimizing cloud spend in finance-related platforms.
  • Containerization: Familiarity with containers and orchestration (Docker, Kubernetes).
  • Resilience and Auditability: Experience building resilience into data pipelines and ensuring auditability for accounting data.
  • Communication: Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.


  • Vijayawada, Andhra Pradesh, India beBeeSre Full time ₹ 1,50,00,000 - ₹ 2,02,50,000

    Site Reliability Engineering Leadership OpportunityWe are seeking a seasoned SRE professional to drive the implementation of our site reliability strategy, promote automation, and develop methodologies for process efficiency and reliability.The ideal candidate will have hands-on experience with defining and implementing Service Level Objectives, operating to...


  • Vijayawada, Andhra Pradesh, India beBeeEngineering Full time ₹ 70,00,000 - ₹ 1,00,00,000

    Reliability Engineering SpecialistWe are seeking a highly skilled engineer to design, implement, and maintain scalable monitoring, alerting, and logging solutions.The ideal candidate will have a basic understanding of cloud-based solutions and be familiar with monitoring and observability tools such as Prometheus and Grafana.Main Responsibilities:Design and...


  • Vijayawada, Andhra Pradesh, India beBeeObservability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Observability Engineer PositionThe ideal candidate will be responsible for designing and developing the platform components for the Observability product.This role involves working closely with various teams, including performance, data ingestion, platform DevOps, and data visualization, to ensure seamless integration and delivery of the Observability...


  • Vijayawada, Andhra Pradesh, India beBeeElk Full time ₹ 1,00,00,000 - ₹ 1,50,00,000

    Job OverviewWe are seeking a highly skilled Senior Site Reliability Engineer with expertise in ELK to design, manage, and scale ELK clusters.This is a high-impact engineering opportunity focused on performance, observability, and operational excellence at scale. The ideal candidate will have 7+ years of experience in SRE, DevOps, or Cloud Engineering, with...


  • Vijayawada, Andhra Pradesh, India beBeeReliability Full time ₹ 17,50,000 - ₹ 24,67,500

    As a Site Reliability Engineer, you will play a vital role in ensuring the reliability and efficiency of our systems. Your primary responsibility will be to identify potential system issues early on, implement preventive measures, and boost system resilience.To achieve this goal, you will work closely with cross-functional teams to build tools, pipelines,...


  • Vijayawada, Andhra Pradesh, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Description:As a senior site reliability engineer, you will be part of a dynamic team working closely with product development teams to ensure the effective observability and improve reliability of products.Using your background in SRE or development, you will take on challenging problems both technical and process related in a highly distributed...


  • Vijayawada, Andhra Pradesh, India beBeesite Full time ₹ 1,80,00,000 - ₹ 2,40,00,000

    Job DescriptionWe are seeking a skilled Site Reliability Engineer to join our team and help ensure the reliability, scalability, and performance of our critical systems.As an SRE, you will collaborate closely with development and operations teams to build and maintain highly available services, automate operational tasks, and monitor system health.Key...


  • Vijayawada, Andhra Pradesh, India beBeeDevops Full time ₹ 20,00,000 - ₹ 25,00,000

    We're seeking a highly skilled Site Reliability Engineer to join our organization.The ideal candidate will possess a strong background in DevOps, SRE, or related fields with experience in on-premise DevOps migration and installation, as well as GitHub, Team City, Jenkins, Jira, Confluence.Key responsibilities include:Setting up and managing DevOps tools such...


  • Vijayawada, Andhra Pradesh, India beBeeSystem Full time US$ 1,20,000 - US$ 1,40,000

    About the Position:We are seeking a skilled Reliable System Engineer to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems.Design and develop resiliency in application code, troubleshoot incidents, engage with teams to address failure patterns, and participate in incident management.Monitor,...


  • Vijayawada, Andhra Pradesh, India beBeeReliability Full time ₹ 20,00,000 - ₹ 30,00,000

    Job Overview:This role involves ensuring the reliability and efficiency of our application suite. The ideal candidate will have a strong background in site reliability engineering, with experience in building and supporting reliable applications.Main Responsibilities:Executing on incident, change management, and problem management processesBuilding and...