Senior Site Reliability Engineer

1 week ago


Thiruvananthapuram, Kerala, India Zafin Full time

Job Summary

Zafin is seeking a Cloud Site Reliability Engineer II (CSRE II) to lead strategic initiatives in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This advanced role requires mastery in cloud technologies, strategic planning, and incident management to drive innovative solutions and operational excellence.

As a CSRE II, you will influence the direction of cloud reliability strategies, mentor junior engineers, and lead significant projects that have a broad organizational impact. This position reports directly to the VP of Cloud Services and requires a proactive, collaborative mindset to achieve operational and strategic objectives.

Key Responsibilities

  • Lead and manage the resolution of complex technical issues involving Zafin's products and Azure cloud environment.
  • Design and implement strategic operational enhancements to improve resiliency and system reliability.
  • Conduct in-depth Root Cause Analysis (RCA) for high-severity incidents and drive initiatives to reduce error recurrence.
  • Represent the organization in external client escalation calls, providing expert guidance and solutions.
  • Architect and optimize cloud infrastructure for high performance, scalability, and cost-effectiveness.
  • Provide thought leadership in managing and scaling container orchestration platforms such as AKS and OpenShift.
  • Oversee the implementation of advanced monitoring solutions and integrate predictive analytics for proactive issue resolution.
  • Develop and execute automation strategies to streamline operational workflows and incident responses.
  • Create and maintain comprehensive documentation of cloud architectures, processes, and incident management strategies.
  • Mentor and coach junior engineers, fostering a culture of continuous learning and innovation.
  • Drive strategic initiatives, collaborating with cross-functional teams to achieve organizational objectives.

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field (Master's degree preferred).
  • 7- 12 years of experience in cloud support, operations, or a related role.
  • Advanced expertise in Microsoft Azure (preferred) or equivalent cloud platforms.
  • Demonstrated experience in designing and scaling container orchestration systems like AKS or OpenShift.
  • Proven leadership in managing automated deployment pipelines, including Azure DevOps.
  • Mastery in enterprise monitoring platforms (e.g., Azure Insights, Grafana) and predictive analytics tools.
  • Advanced scripting skills with PowerShell, Python, or similar languages.
  • Extensive experience in incident management and defining SLAs for global production environments.
  • In-depth knowledge of database management, particularly Postgres.

Preferred Qualifications

  • Advanced certifications in cloud platforms (e.g., Azure Solutions Architect Expert).
  • Experience with ITSM tools and processes (e.g., ServiceNow).
  • Comprehensive understanding of security and compliance in cloud environments.

Soft Skills

  • Exceptional analytical and problem-solving abilities.
  • Strong leadership and mentoring skills.
  • Advanced communication and collaboration capabilities.
  • Visionary approach to operational innovation and strategic planning.


  • Thiruvananthapuram, Kerala, India Zafin Full time

    Job SummaryZafin is seeking a Cloud Site Reliability Engineer II (CSRE II) to lead strategic initiatives in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This advanced role requires mastery in cloud technologies, strategic planning, and incident management to drive innovative solutions and operational...


  • Thiruvananthapuram, Kerala, India beBeeTechnical Full time ₹ 20,00,000 - ₹ 25,00,000

    Job TitleSite Reliability Engineer - Technical Leader and Problem SolverKey Responsibilities:Investigate and resolve high-impact production issues across infrastructure and applications.Collaborate with development teams to improve performance, reliability, and architecture of systems.Participate in incident response efforts as a technical expert.Develop...


  • Thiruvananthapuram, Kerala, India CareStack™ - Dental Practice Management Full time

    Job Location - TrivandrumRotational ShiftsResponsibilities:1. Manage and maintain day-to-day BAU operations, including monitoring systemperformance, troubleshooting issues, and ensuring high availability.2. Build infrastructure as code (IAC) patterns that meet security and engineeringstandards.3. Build CI/CD pipelines using Octopus, GitLab-CI and...

  • Junior Site Engineer

    4 weeks ago


    Thiruvananthapuram, Kerala, India ALL-iN Engineering Management Services Full time

    Company Description ALL-iN EMC is a comprehensive engineering consultancy firm. We specialize in design, structural, and management services across various sectors, including residential, renovation, commercial, industrial, and assembly projects. Our multidisciplinary team delivers innovative and practical solutions, ensuring project success from conception...


  • Thiruvananthapuram, Kerala, India beBeeEngineer Full time ₹ 10,00,000 - ₹ 15,12,500

    Site Engineer Job Description We are looking for a skilled site engineer with extensive experience to join our team. Inspect facilities and analyze operational data to identify areas of improvement. Maintain compliance with safety and regulatory standards, ensuring a secure working environment. Compile estimates for technical and material requirements for...


  • Thiruvananthapuram, Kerala, India beBeeDevelopment Full time ₹ 8,00,000 - ₹ 12,00,000

    Job Overview:We are seeking a highly skilled Site Development Manager to oversee construction activities at project sites.Responsibilities:Supervise and monitor construction progress to ensure compliance with approved drawings, specifications, and quality standards.Collaborate with architects, consultants, and subcontractors to coordinate site...


  • Thiruvananthapuram, Kerala, India Terumo Blood and Cell Technologies Full time

    JOB SUMMARY We are looking for a highly skilled and experienced Senior Embedded Systems Engineer to join our dynamic team. In this role, he/ she will: Be responsible for Designing, developing, and maintaining embedded systems and software for medical devices. Work closely with cross-functional teams to ensure the successful integration of hardware and...


  • Thiruvananthapuram, Kerala, India ICON plc Full time US$ 90,000 - US$ 1,20,000 per year

    Senior Software Engineer- Chennai / TrivandrumICON plc is a world-leading healthcare intelligence and clinical research organization. We're proud to foster an inclusive environment driving innovation and excellence, and we welcome you to join us on our mission to shape the future of clinical developmentWe are currently seeking a Senior Software Engineer to...


  • Thiruvananthapuram, Kerala, India beBeeReliability Full time ₹ 16,00,000 - ₹ 22,00,000

    Job Posting:We are seeking a highly skilled and experienced IT professional to fill the role of Reliability Engineer. As part of our dynamic engineering team, you will be responsible for managing and maintaining day-to-day operations, including monitoring system performance, troubleshooting issues, and ensuring high availability.Key Responsibilities:Oversee...


  • Thiruvananthapuram, Kerala, India iVedha Inc. Full time

    Job Title: Senior LogicMonitor DeveloperJob Summary:iVedha's Platform Engineering Practice is looking for a Senior LogicMonitor Developer with deep expertise in API-based integration and automation. In this role, you will lead the design and implementation of monitoring solutions across hybrid environments, working closely with platform and operations teams...