
Highly Available System Reliability Expert
3 days ago
Job Description
We are seeking a Senior Site Reliability Engineer to enhance system performance and reliability, automate manual processes, and collaborate with globally dispersed teams.
The ideal candidate will provide technical leadership, design and implement solutions to improve platform reliability, build and maintain monitoring systems, and conduct Root Cause Analyses.
Key Responsibilities:
- Provide technical guidance and mentoring through knowledge sharing, code reviews, and solution design
- Design and implement solutions to improve platform reliability with mitigation strategies and operational playbooks
- Build and maintain monitoring, alerting, and logging systems to ensure proactive incident response
- Conduct Root Cause Analyses (RCAs) and blameless post-mortems
- Participate in on-call rotations to support system reliability
- Automate infrastructure provisioning and management using Infrastructure as Code
- Monitor and optimize databases for high availability and performance
- Partner with product engineering teams to deliver observable, resilient software
- Contribute to defining Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs)
Required Skills and Qualifications
- Bachelor's degree in Computer Science or related field
- 2+ years' experience in Platform Engineering, SRE, or Software Engineering roles
- 1+ years' experience working on a SaaS platform
- Proficient in PHP/Laravel development
- Hands-on experience with Kubernetes and public cloud platforms (Azure, AWS, or GCP)
- Expertise in Infrastructure as Code (Terraform preferred; Ansible or CloudFormation also acceptable)
- Scripting/automation skills in Bash, PowerShell, or Python
- Experience with observability tools (Datadog, Prometheus, Grafana)
- Proven track record in maintaining high-availability, high-performance production environments
Why Join?
- Work with a leading SaaS-based real estate product company
- Be part of a global, high-performing engineering team
- Opportunity to own platform reliability and scalability
- Competitive compensation and benefits
-
Highly Available System Specialist
1 week ago
Ellore, Andhra Pradesh, India beBeeReliability Full time ₹ 15,00,000 - ₹ 25,00,000
Reliable Systems Engineer
About the Role:
We are seeking a skilled System Reliability Engineer to ensure the reliability, scalability, and performance of critical systems. As an SRE, you will collaborate closely with development and operations teams to build and maintain highly available services, automate operational tasks, and monitor system health.
Key...
-
Highly Scalable Systems Architect
6 days ago
Ellore, Andhra Pradesh, India beBeeSystemReliability Full time ₹ 16,57,100 - ₹ 25,54,300
Job Description
A highly skilled System Reliability Expert is sought to work with a leading financial services organization. The role involves managing the end-to-end application and system stack, ensuring high reliability, scalability, and performance of distributed systems.
Key Responsibilities:
Engage in and improve the software development lifecycle –...
-
System Reliability Specialist
1 week ago
Ellore, Andhra Pradesh, India beBeeSRE Full time ₹ 90,00,000 - ₹ 1,20,00,000
High Availability and Scalability Expert
We are seeking an experienced Senior SRE Engineer to deliver insights from large-scale data in real-time. The ideal candidate will design robust cloud infrastructure for high availability and scalability, monitor system health and optimize performance for improved user experience, and collaborate with product teams and...
-
Reliable Systems Engineer
1 week ago
Ellore, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 15,00,000 - ₹ 25,00,000
Job Description
We are seeking a highly skilled infrastructure professional to join our team. As a Site Reliability Engineer, you will play a key role in building, managing, and scaling modern infrastructure systems for high-availability applications.
Key Responsibilities
Drive initiatives to improve platform scalability and operational efficiency.
Lead...
-
Financial Systems Expert
4 days ago
Ellore, Andhra Pradesh, India beBeeResilience Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Job Overview:
The Senior Engineer, Site Reliability (SRE) plays a pivotal role in guaranteeing the robustness, scalability, and operational excellence of accounting and finance systems. This position focuses on delivering highly reliable financial applications and data services that meet demanding requirements for accuracy, compliance, and availability to...
-
Reliable Software Expert
4 days ago
Ellore, Andhra Pradesh, India beBeeSite Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
About This Role
We are seeking a skilled Site Reliability Engineer to join our global team. As a key member of our technical organization, you will be responsible for designing and developing resiliency in application code, troubleshooting incidents, engaging with squads to address failure patterns, and participating in incident management.
Key...
-
Site Reliability Expert
3 days ago
Ellore, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 20,00,000 - ₹ 25,00,000
Site Reliability Expert
We are seeking a skilled Site Reliability Engineer to fill this key role.
Main Responsibilities:
- To design and implement scalable infrastructure solutions using DevOps practices and CI/CD pipelines.
- To develop and maintain monitoring tools ensuring the reliability of our systems.
- To collaborate with cross-functional teams identifying and...
-
High Availability Infrastructure Specialist
2 weeks ago
Ellore, Andhra Pradesh, India beBeeInfrastructure Full time US$ 1,50,000 - US$ 2,00,000
Ensuring the reliability and performance of critical systems is crucial in today's fast-paced digital landscape. As a National Pen brand, Pens.com provides customized marketing solutions to 22 countries worldwide. The Mass Customization Platform is a modular, multi-tenant service that enables businesses to choose the solutions they need or assemble custom...
-
Highly Skilled Software Engineer
4 days ago
Ellore, Andhra Pradesh, India beBeeEngineering Full time ₹ 1,50,00,000 - ₹ 2,50,00,000
We are seeking a highly skilled and experienced engineer to join our team. The ideal candidate will have 10+ years of experience in designing and developing large-scale distributed applications architected for scale, with the ability to support multiple tenants seamlessly and integrate with various external payment processors and intermediaries.
Key...
-
Cloud Reliability Specialist
3 days ago
Ellore, Andhra Pradesh, India beBeeEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, latency, performance, and efficiency of our cloud-based platform. You will work closely with cross-functional teams to define and enforce reliability standards, lead high-impact...