System Reliability Leader

2 weeks ago


Kozhikode, Kerala, India beBeeReliability Full time ₹ 18,00,000 - ₹ 24,00,000

 

Job Title:
  • Reliability Engineering Manager

 

We are seeking an experienced and dynamic Site Reliability Engineer to oversee the reliability, scalability, and performance of our critical systems.

 

This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.

 

Responsibilities:

  • Reliability & Performance:
    • Maintain high availability and reliability of critical services
    • Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met
    • Proactively identify and resolve performance bottlenecks and system inefficiencies
  • Incident Management & Response:
    • Establish and improve incident management processes and on-call rotations
    • Lead incident response and root cause analysis for high-priority outages
    • Drive post-incident reviews and ensure actionable insights are implemented
  • Automation & Tooling:
    • Develop and implement automated solutions to reduce manual operational tasks
    • Enhance system observability through metrics, logging, and distributed tracing tools (e.g., Prometheus, Grafana)
    • Optimize CI/CD pipelines for seamless deployments
  • Collaboration:
    • Partner with software engineering teams to improve the reliability of applications and infrastructure
    • Work closely with product/engineering teams to design scalable and robust systems
  • Leadership & Team Building:
    • Manage, mentor, and grow a team of SREs
    • Promote SRE best practices and foster a culture of reliability and performance across the organization
  • Capacity Planning & Cost Optimization:
    • Perform capacity planning and implement autoscaling solutions to handle traffic spikes
    • Optimize infrastructure and cloud costs while maintaining reliability and performance

 

Requirements:

  • Technical Expertise:
    • Experience with cloud platforms (AWS / Azure) and Kubernetes
    • Hands-on knowledge of infrastructure-as-code tools like Terraform/Helm
    • Proficiency in Java
    • Expertise in distributed systems, databases, and load balancing
  • Monitoring & Observability:
    • Proficient with tools like Prometheus, Grafana, or New Relic
    • Understanding of metrics-driven approaches for system monitoring and alerting
  • Automation & CI/CD:
    • Hands-on experience with CI/CD pipelines (e.g., Jenkins)
    • Skilled in automation frameworks and tools for infrastructure and application deployments
  • Incident Management:
    • Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence

 

Benefits:

  • Be a key driver in building and scaling reliable systems in a fast-paced environment
  • Work with cutting-edge technologies and influence the evolution of the infrastructure
  • Lead a high-impact team and foster a culture of reliability and innovation

 

Why Work with Us?

  • Enjoy a collaborative and dynamic work environment
  • Take ownership of projects and drive results-oriented initiatives
  • Grow your skills and career in a supportive and inclusive organization

  • Kozhikode, Kerala, India beBeeSiteReliabilityEngineer Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Unlock the future of site reliability engineering as a key player in our exciting journey with clients. Key Responsibilities: Develop and implement the SRE strategy to drive business efficiency and quality. Promote an automate-first culture by reducing operational overhead through automation. Establish methodologies for identifying and eliminating toil-heavy...


  • Kozhikode, Kerala, India beBeeSystemReliability Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Summary: The successful candidate will have 2-4 years of experience in a related field. Key Skills and Qualifications: Extensive knowledge of Linux and Windows systems is required. Proficiency in setting up alerts, dashboards, and analyzing metrics/logs for system performance and reliability is necessary. Familiarity with system architecture and...


  • Kozhikode, Kerala, India beBeeTroubleshooting Full time ₹ 20,00,000 - ₹ 25,00,000

    Ensuring Application Reliability and Resilience: Delta Tech Hub is at the forefront of innovation, delivering niche solutions that enhance customer experience. As a Senior SRE, you will be instrumental in building and maintaining our reliable application suite, providing consultation and direct technical support in life cycle planning, problem management,...


  • Kozhikode, Kerala, India beBeeReliability Full time ₹ 25,00,000 - ₹ 40,00,000

    Job Title: System Reliability Engineering Lead. The Role: Our company is seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving...


  • Kozhikode, Kerala, India beBeeSenior Full time ₹ 1,80,00,000 - ₹ 2,00,00,000

    Job Title: Senior Reliability Engineer. We're looking for an experienced Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and implementing solutions to improve platform reliability, automating manual processes, and collaborating with globally dispersed SRE and Platform...


  • Kozhikode, Kerala, India beBeeLeader Full time ₹ 20,00,000 - ₹ 35,00,000

    Job Title: Advanced Control Systems Leader. Description: We are seeking a highly skilled and experienced leader to oversee the design, implementation, and execution of advanced control systems for large-scale projects. The ideal candidate will possess strong leadership skills, excellent communication abilities, and a deep understanding of control system...


  • Kozhikode, Kerala, India beBeeReliability Full time ₹ 12,00,000 - ₹ 18,00,000

    As a Reliability Operations Expert, you will be responsible for ensuring the efficiency and reliability of our systems. This includes designing and implementing monitoring solutions to provide real-time insights into system performance. Main Responsibilities: Configure and set up dashboards for monitoring system performance using Grafana/Dynatrace. Develop...


  • Kozhikode, Kerala, India beBeeReliability Full time ₹ 2,50,00,000 - ₹ 3,50,00,000

    Job Overview: The role of Site Reliability Engineer focuses on ensuring the reliability and scalability of financial systems. Sustainability Management: Responsible for day-to-day operations of accounting and finance applications and data platforms to ensure they meet business expectations. Availability and Reliability: Ensure that accounting and finance...


  • Kozhikode, Kerala, India beBeeSystem Full time ₹ 20,00,000 - ₹ 30,00,000

    Job Summary: The role of a Site Reliability Engineer is to provide technical expertise to ensure the reliability and scalability of our systems. Key Responsibilities: Infrastructure Deployment: Design, build, and deploy scalable infrastructure to support our growing business needs. Reliability Engineering: Work on identifying and resolving issues that impact...


  • Kozhikode, Kerala, India beBeeSiteReliability Full time ₹ 18,00,000 - ₹ 21,60,000

    We're looking for a skilled and experienced professional to join our team as a Site Reliability Engineer. Key Responsibilities: Assist in designing, implementing, and maintaining scalable monitoring, alerting, and logging solutions to ensure backend services' availability and performance. Support the development and implementation of observability tools and...