
Reliable Systems Engineer
3 days ago
About the Role
This position offers a unique opportunity to contribute to impactful projects that enhance reliability and reduce manual work through automation.
As a key member of our technical team, you will leverage your experience across a range of SRE practices to maintain resilient, distributed systems and automate processes to protect critical services.
You will participate in on-call rotations and offer guidance and support to your colleagues. Your insights and process improvements will help shape our inclusive culture of technical excellence and continuous learning.
Responsibilities:
- Automate Manual Tasks: Lead efforts to automate manual and repetitive tasks, contributing to resilient and reliable systems.
- Develop Self-Healing Infrastructure Solutions: Develop and implement self-healing infrastructure solutions to drive operational efficiency and reduce incidents.
- Create Automation Tools: Create and maintain automation and tools to promote system performance and uptime.
- Post-Release Validation: Support post-release validation and operational readiness for new deployments.
- On-Call Support: Provide occasional support outside of standard hours as needed for major releases or critical changes, with consideration for work-life balance.
- Design Scalable Infrastructure: Design infrastructure following best practices for scalability, fault tolerance, and security.
- Define Service Level Indicators: Define and manage Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to partner with teams in ensuring reliable services.
- Collaborate with Engineering Teams: Collaborate with engineering teams to enhance deployment pipelines and make recommendations for improved architecture, release speed, and productivity.
Requirements:
- Professional Experience: Professional experience in a Site Reliability Engineering, DevOps, or related technical role (all relevant pathways and learning experiences welcomed).
- Cloud Platform Familiarity: Cloud Platform Familiarity Especially with AWS services such as EC2, Lambda, DynamoDB, Aurora RDS PostgreSQL, and AWS OpenSearch. Experience with similar platforms is also valued.
- Infrastructure as Code: Hands-on experience (preferably 2 or more years) with tools like Terraform, or similar, to automate and manage cloud resources.
- Containerization: Experience with containerization, using Docker, with Kubernetes skills considered a plus.
- Configuration Management: Familiarity with configuration management tools such as Puppet, Ansible, or comparable systems.
- Monitoring and Observability: Experience with monitoring, alerting, and observability tools (e.g., Elastic Search, Grafana, Open Telemetry, GitHub Actions, Azure DevOps, TeamCity, Jenkins).
- Certifications: Relevant certifications in AWS, Kubernetes, or related areas are appreciated but not required.
-
System Reliability Engineer
5 days ago
Gurgaon, Haryana, India beBeeReliability Full time ₹ 15,00,000 - ₹ 20,00,000Job Title: System Reliability EngineerWe are seeking a highly skilled System Reliability Engineer to lead capacity management, operational support, and incident resolution for our platforms. This role requires a professional with a background in both SRE and application support, who can collaborate with development and infrastructure teams to ensure the...
-
Reliable Systems Engineer
6 days ago
Chennai, Tamil Nadu, India beBeeReliability Full timeSystem Reliability ExpertWe are seeking a talented and proactive system reliability expert to join our infrastructure team. The ideal candidate will combine software engineering expertise with systems engineering skills to build scalable, reliable, and efficient systems.Key Responsibilities:Design, implement, and manage scalable, resilient, and secure...
-
Reliability Systems Engineer
22 hours ago
Gurgaon, Haryana, India beBeeReliability Full time ₹ 18,00,000 - ₹ 22,00,000About the Role :We are seeking an experienced Reliability Systems Engineer to join our high-performance infrastructure team.Your Key Responsibilities :Incident & Alert Management : Monitor production systems and handle alerts to ensure minimal service disruption.Act as the first point of escalation for production incidents and critical system issues.Drive...
-
Site Reliability Engineer
3 weeks ago
Ahmedabad, Gurugram, Chennai, India Robotics Technologies Full timeJob DescriptionDescriptionWe are seeking a skilled Site Reliability Engineer (SRE) to join our team in India. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our production systems. You will work closely with development teams to build and maintain scalable applications while implementing automation...
-
Reliable System Architect
21 hours ago
Gurgaon, Haryana, India beBeeSystem Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Reliable System ArchitectJob Overview:This is a challenging position that requires software engineering and operations expertise to design, build, and maintain systems capable of handling high production traffic efficiently.Experience with Observability Tools: A solid understanding of Dynatrace, including on-premises and SaaS solutions, is essential for...
-
Distributed Systems Reliability Expert
20 hours ago
Bengaluru / Bangalore, Gurgaon / Gurugram, India beBeeReliability Full time ₹ 1,04,000 - ₹ 1,30,878Senior Reliability EngineerKey Responsibilities:Develop and implement a comprehensive reliability strategy aligned with the company's goals and objectives. Lead a team of reliability professionals to drive system reliability, performance, and scalability.Establish real-time monitoring practices to ensure insights into system performance and customer...
-
System Reliability Engineer
4 days ago
Chennai, Tamil Nadu, India beBeeTechnical Full time US$ 1,20,000 - US$ 1,40,000We are seeking a skilled technical expert to enhance our automated risk detection systems. This role involves supporting and maintaining rule-based logic, data analysis processes, and scalable tools.In this position, you will play a vital part in system reliability, operational support, and technical troubleshooting, contributing to the protection of...
-
Site Reliability Engineer
3 hours ago
Gurgaon, Haryana, India ElevenX Capital Full time US$ 1,50,000 - US$ 2,00,000 per yearAbout the Role:We are looking for a skilled Site Reliability Engineer (SRE) to join our team and help us ensure the reliability, scalability, and performance of our critical systems. As an SRE, you will work closely with development and operations teams to build and maintain highly available services, automate operational tasks, and monitor system health.Key...
-
Distinguished System Reliability Engineer
19 hours ago
Mumbai, Maharashtra, India beBeeSystemReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Title: System Reliability Leader">We are seeking a seasoned system reliability engineer to lead our team in designing and implementing robust, scalable, and fault-tolerant systems. This role involves mentoring junior engineers, providing technical guidance, and fostering a culture of collaboration and continuous learning.The ideal candidate will have a...
-
Reliable System Architect
3 days ago
Gurgaon, Haryana, India beBeeEngineering Full time ₹ 1,04,000 - ₹ 1,30,878System Reliability RoleEnsure continuous system availability by designing and implementing scalable global systems.Develop and maintain robust system architecture, deploying and scaling applications to meet business needs.Lead incident response efforts, analyzing system failures and implementing solutions to prevent recurrence.Implement automation processes...