Site Reliability Engineer

2 weeks ago


Delhi, India Pro5.ai Full time

Our client is seeking a Site Reliability Engineer I to join their growing technology operations team. This role is ideal for someone passionate about system reliability, incident response, and cross-team collaboration in a large-scale cloud environment. What You’ll Do Act as the first point of contact for all customer-affecting issues. Drive and manage the resolution of technical incidents. Ensure proper incident management processes and completion of post-mortems. Provide consistent and clear communication to management. Respond to Zabbix alerts and perform regular monitoring, taking direct action or escalating as needed. Ensure smooth handoff of escalations. Maintain pod health across all sites, including defining pod alerts in Zabbix. Perform daily filesystem checks for pods. Troubleshoot advanced technical issues for DC Technicians (pods, deployments, migrations, Ansible playbooks). Identify and escalate potential network issues. Handle Vault pre-deployment configuration, testing, and migration monitoring. Document and automate daily operational tasks. Provide network IP documentation for upcoming deployments. Monitor server farm releases/updates and escalate issues when necessary. Participate in on-call rotation. Support TechOps team members with tasks as needed. Recommend improvements to enhance productivity. Work outside normal business hours as required (weekends, holidays, evenings). Requirements Must be located in Bangalore . 2–4 years of relevant experience. Strong sysadmin and Linux skills. Willingness to learn and grow technical capabilities. Strong analytical and problem-solving skills. Excellent communication and teamwork skills. Knowledge of network cabling, network classification, and network topology. Pro5 is a global platform helping thousands of vetted professionals get hired by top employers. See what others say on our public Google Reviews and learn how we keep your data safe in our Trust Center .



  • delhi, India AQUMEN LABS Full time

    Company Description AQUMEN Labs is a trusted Quality Engineering, DevOps, and AI Consulting partner to several industry leaders and high-growth startup companies across India and global markets. We actively engage with the startup ecosystem to help teams build resilient, scalable, and production-ready platforms by applying lessons learned from working with...


  • Delhi, India AQUMEN LABS Full time

    Company DescriptionAQUMEN Labs is a trusted Quality Engineering, DevOps, and AI Consulting partner to several industry leaders and high-growth startup companies across India and global markets. We actively engage with the startup ecosystem to help teams build resilient, scalable, and production-ready platforms by applying lessons learned from working with...


  • Delhi, India Crescent Techservices Full time

    Role : Site Reliability Engineer Type : Contract to Hire Location : Hyderabad, Telangana Experience : 10+ Years Requirements: 10+ years of experience Site Reliability engineer (SRE) to take care of L1/L2 Support of Digital Asset Need to be proficient with DevOps and be able to build CI/CD pipeline. Should be familiar with Java and Spring Boot, Prometheus and...


  • Delhi, India Lorven Technologies Inc. Full time

    Job Title:SRELocation:Hyderabad (Hybrid/On-site)Experience:7+ YearsJob Description:We are looking for a skilled SRE with strong expertise inGrafana, Dynatrace, Datadog, Splunk and Team Handlingto help build and maintain high-performance data pipelines for both real-time and batch processing. This role involves working in a collaborative environment to...


  • delhi, India Lorven Technologies Inc. Full time

    Job Title:SRE Location:Hyderabad (Hybrid/On-site) Experience:7+ YearsJob Description: We are looking for a skilled SRE with strong expertise inGrafana, Dynatrace, Datadog, Splunk and Team Handlingto help build and maintain high-performance data pipelines for both real-time and batch processing. This role involves working in a collaborative environment to...


  • New Delhi, India AQUMEN LABS Full time

    Company Description AQUMEN Labs is a trusted Quality Engineering, DevOps, and AI Consulting partner to several industry leaders and high-growth startup companies across India and global markets. We actively engage with the startup ecosystem to help teams build resilient, scalable, and production-ready platforms by applying lessons learned from working with...


  • New Delhi, India AQUMEN LABS Full time

    Company DescriptionAQUMEN Labs is a trusted Quality Engineering, DevOps, and AI Consulting partner to several industry leaders and high-growth startup companies across India and global markets. We actively engage with the startup ecosystem to help teams build resilient, scalable, and production-ready platforms by applying lessons learned from working with...


  • Delhi, India DevRev Full time

    About the Role: We are seeking an experienced Site Reliability Engineer / Platform Engineer to join our team and help build and maintain a resilient, scalable infrastructure supporting our applications across multiple cloud providers. In this role, you will design and implement infrastructure solutions, automate operational processes, and work closely with...


  • New Delhi, India NAZZTEC Full time

    Hiring: Site Reliability Engineer (SRE) | Gurugram (Cyber City)Location: GurgaonExperience: Minimum 5 YearsEducation: 15 years of full-time education requiredRole SummaryWe are looking for a skilled Site Reliability Engineer (SRE) to join our custom software engineering team. In this role, you will be responsible for ensuring the reliability, scalability,...


  • New Delhi, India COZZERA INTERNATIONAL LLP Full time

    Job Title: Senior DevOps & Site Reliability Engineer (SRE) Experience: 5+ Years Location: RemoteJob Summary We are looking for an experienced and highly motivatedSenior DevOps & Site Reliability Engineer (SRE)to design, automate, and optimizehybrid and multi-cloud infrastructureacrossAWS and Azureenvironments. This role combinesDevOps engineering best...