
Principal System Reliability Engineer
2 weeks ago
Job Summary:
A system reliability engineer is a key player in ensuring the overall performance and stability of large-scale distributed systems. We are seeking a highly skilled professional to manage our infrastructure and collaborate with developers to establish quality and performance benchmarks.
Responsibilities:
- Oversee and maintain the infrastructure that supports ad exchange applications, including load balancers, data stores, CI/CD pipelines, and monitoring stacks.
- Continuously improve infrastructure resilience, scalability, and efficiency to meet the demands of massive request volume and stringent latency requirements.
- Develop policies and procedures that improve platform stability, and participate in a shared on-call schedule.
- Work closely with developers to establish and uphold quality and performance benchmarks, ensuring applications meet necessary criteria before they are deployed to production.
- Participate in design reviews and provide feedback on infrastructure-related aspects to improve system performance and reliability.
- Develop tools to simplify and enhance infrastructure management, automate processes, and improve operational efficiency. These tools may address areas such as monitoring, alerting, deployment automation, and failure detection and recovery, all critical to minimizing latency and maintaining uptime.
- Focus on reducing latency and maximizing efficiency across all components, from request handling in load balancers to database optimization.
- Implement best practices and tools for performance monitoring, including real-time analysis and response mechanisms.
Qualifications:
- Bachelor's or master's degree in Computer Science, Information Technology, or a related field.
- 2-4 years of experience managing services in large-scale distributed systems.
- Strong understanding of networking concepts (e.g., TCP/IP, routing, SDN) and modern software architectures.
- Proficiency in programming and scripting languages such as Python, Go, or Ruby, with a focus on automation.
- Experience with container orchestration tools like Kubernetes and cloud platforms (preferably GCP).
- Ability to independently own problem statements, manage priorities, and drive solutions.
- Infrastructure as Code: Experience with Terraform.
- Configuration Management: Experience with tools like Nix or Ansible.
- Monitoring and Logging Tools: Expertise with Prometheus, Grafana, or ELK stack.
- OLAP Databases: ClickHouse and Apache Druid.
- CI/CD Pipelines: Hands-on experience with Jenkins or ArgoCD.
- Databases: Proficiency in MySQL (relational) or Redis (NoSQL).
- Load Balancers: Familiarity with HAProxy or Nginx.
- Strong knowledge of operating systems and networking fundamentals.
- Experience with version control systems such as Git.
-
Principal System Reliability Specialist
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeReliability Full time ₹ 20,00,000 - ₹ 25,00,000
Key to this role is the ability to drive system reliability and performance.
Responsibilities:
- Design automated infrastructure deployment using Terraform for cross-cloud environments
- Develop monitoring systems integrating cloud services, leveraging Python and Dynatrace skills
- Implement resilient design patterns ensuring disaster recovery capabilities in...
-
Reliable Systems Engineer
6 days ago
Ghaziabad, Uttar Pradesh, India beBeeSoftware Full time ₹ 15,00,000 - ₹ 25,00,000
We are seeking a skilled and experienced Software Reliability Engineer to join our team.
-
Site Reliability Engineering Manager
2 weeks ago
Ghaziabad, Uttar Pradesh, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
Job Title: Reliability Engineering Lead
About the Role:
We are seeking an experienced Reliability Engineering Lead to join our team. As a key member of our engineering organization, you will be responsible for leading the reliability engineering function, ensuring infrastructure resiliency and optimal operational performance. This role requires a strong...
-
Reliable Systems Specialist
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeSenior Full time US$ 10,00,000 - US$ 15,00,000
Job Overview:
We are seeking a highly skilled Senior Engineer to join our team. The ideal candidate will be responsible for ensuring the stability, scalability, and operational excellence of accounting and finance platforms.
Key Responsibilities:
- Ensure accounting and finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and...
-
Highly Reliable Systems Specialist
2 weeks ago
Ghaziabad, Uttar Pradesh, India beBeeSiteReliabilityEngineer Full time ₹ 17,74,000 - ₹ 23,16,000
About the Role:
We are seeking a skilled Senior Site Reliability Engineer to join our team. As a key member of our organization, you will play a vital role in ensuring the reliability and performance of our applications. The successful candidate will have a strong background in system administration, scripting languages, and IT service management. They will be...
-
Site Reliability Engineering Lead
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeEngineer Full time ₹ 1,60,00,000 - ₹ 2,80,00,000
At ANSR, we're seeking a seasoned Site Reliability Engineering Lead to ensure the stability and scalability of our financial platforms. This high-profile role requires leading operational health, delivering reliable financial applications, and coaching junior engineers.
Key Responsibilities:
- Operational Oversight: Ensure day-to-day operations for Accounting...
-
Site Reliability Leader
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeAutomation Full time US$ 1,50,000 - US$ 1,75,000
Job Summary:
Achieving operational excellence as a Principal SRE requires leadership skills to oversee the health and reliability of finance and accounting platforms.
About the Role:
The ideal candidate will leverage automation, monitoring tools, and DevOps practices to ensure consistency and trustworthiness in financial systems. This involves building...
-
Principal Engineer
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeEmbeddedSystems Full time ₹ 1,80,00,000 - ₹ 2,52,00,000
Job Opportunity:
Are you a skilled engineer looking to drive innovation and excellence in your career? We are seeking a highly experienced Engineering professional to join our team as an Assistant Vice President, Practice Lead - Embedded Systems. The ideal candidate will have extensive knowledge of mechanical engineering principles and practices, AI...
-
Reliability Engineer for Financial Platforms
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeReliability Full time ₹ 20,00,000 - ₹ 30,00,000
About the Role:
This reliability engineer will play a key role in ensuring the stability, scalability, and operational excellence of financial platforms.
Main Responsibilities:
- Ensure financial platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.
- Build automation for deployments, monitoring, scaling, and...
-
Senior Systems Architect
1 week ago
Ghaziabad, Uttar Pradesh, India beBeeDesigner Full time ₹ 1,20,00,000 - ₹ 1,50,00,000
Job Overview:
The Principal Design Engineer will play a key role in developing high-speed communication protocol-based solutions. This individual will be responsible for creating and implementing IP designs, as well as integrating existing IPs into the overall system design.