
Senior Engineering Reliability Manager
1 day ago
Description:
We seek a seasoned professional to oversee the reliability, scalability, and performance of our critical systems.
You will play a pivotal role in establishing and implementing engineering reliability best practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies.
This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.
Responsibilities:
Main Responsibilities
- Maintain high availability and reliability of critical services.
- Define and monitor Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs).
- Proactively identify and resolve performance bottlenecks and system inefficiencies.
Incident Management & Response
- Establish and improve incident management processes and on-call rotations.
- Lead incident response and root cause analysis for high-priority outages.
- Drive post-incident reviews and ensure actionable insights are implemented.
Automation & Tooling
- Develop and implement automated solutions to reduce manual operational tasks.
- Enhance system observability through metrics, logging, and distributed tracing tools.
- Optimize CI/CD pipelines for seamless deployments.
Leadership & Team Building
- Manage, mentor, and grow a team of SREs.
- Promote engineering reliability best practices and foster a culture of reliability and performance across the organization.
- Drive performance reviews, skills development, and career progression for team members.
Capacity Planning & Cost Optimization
- Perform capacity planning and implement autoscaling solutions to handle traffic spikes.
- Optimize infrastructure and cloud costs while maintaining reliability and performance.
Skills & Qualifications
Required Skills
- Technical Expertise:
  - Experience with cloud platforms (AWS / Azure / GCP) and Kubernetes.
  - Hands-on knowledge of infrastructure-as-code tools like Terraform / Helm / Ansible.
  - Proficiency in Java.
  - Expertise in distributed systems, databases, and load balancing.
- Monitoring & Observability:
  - Proficient with tools like Prometheus, Grafana, Elastic APM, or New Relic.
  - Understanding of metrics-driven approaches for system monitoring and alerting.
- Automation & CI/CD:
  - Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines).
  - Skilled in automation frameworks and tools for infrastructure and application deployments.
- Incident Management:
  - Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence.
Leadership & Communication Skills
- Strong people management and leadership skills with the ability to inspire and motivate teams.
- Excellent problem-solving and decision-making skills.
- Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.
Benefits
- Be a key driver in building and scaling reliable systems in a fast-paced environment.
- Work with cutting-edge technologies and influence the evolution of the infrastructure.
- Lead a high-impact team and foster a culture of reliability and innovation.