Site Reliability Engineer

1 day ago


Hyderabad, Telangana, India Intraedge Technologies Ltd. Full time

Role Overview :

We are seeking a Site Reliability Engineer (SRE) with a strong focus on customer-facing technical support.

In this role, you will be the primary point of contact for our enterprise SaaS customers, addressing and resolving technical issues to ensure optimal system performance and user satisfaction.

Your responsibilities will encompass managing incoming support tickets, providing timely solutions, and maintaining high system uptime and application availability.

This position requires a deep understanding of systems engineering principles, extensive Linux system administration expertise, and the ability to monitor and manage large-scale cloud clusters.

Your technical acumen, combined with excellent communication skills, will be crucial in delivering a superior support experience and contributing to the reliability and efficiency of our SaaS platform.

Key Responsibilities :

Technical & Product Support :

- Serve as the first line of support for customer-reported technical issues related to our SaaS platform.

- This involves data connectivity issues, report errors, performance concerns, access problems, data inconsistencies, software bugs, integration challenges etec.

- Understand and empathize with the challenges ThoughtSpot users face, offering tailored solutions to improve their user experience.

- Ensure prompt and accurate updates, meet SLAs and provide timely resolution to customer issues via tickets and calls.

- Create knowledge-base articles to document knowledge and help customers self service.

System Reliability & Monitoring :

- Maintain, monitor, and troubleshoot ThoughtSpot cloud infrastructure.

- Monitor system health and performance through metrics, logs, and dashboards using tools like Prometheus, Grafana, to detect and prevent issues early.

- Work with Engineering teams to define, and implement tools to enhance debuggability, supportability, availability, scalability, and performance.

- Be an expert in cloud and on-premise infrastructure by developing automation and best practices.

- Participate in on-call rotation for critical SRE systems, lead the incident review and root cause analysis.

Required Skills & Experience :

- Exceptional communication skills, both written and verbal, to effectively engage with cross-functional teams, customers, and stakeholders.

- Relevant work experience troubleshooting complex Linux Systems and managing distributed systems.

- Experience in virtualization and Cloud technologies.

- Experience in enterprise customer support, on-call rotation for critical SRE systems, leading incident review and root cause analysis.

- Ability to diagnose technical problems and work with Engineering on escalated issues.

- Strong problem solving skills, algorithmic thinking and a strong foundation in how systems should work.

- Understanding of tools & frameworks required to Operate and manage Cloud infrastructure.

- Strong customer service skills.

- Solid communication skills and ability to work independently.

- Ability to leverage automation, monitoring and data analysis to ensure high availability.

- Familiarity with scripting languages such as Python, JavaScript or Bash.

- Exposure to infrastructure and service monitoring tools.

Ideal Candidate Profile :

You thrive in dynamic, customer-facing environments and are passionate about ensuring system reliability and customer satisfaction.

You have a balanced mix of technical expertise in cloud operations and a proven record in handling support incidents and end-user queries, setting you apart from candidates with purely systems or cloud engineering backgrounds.

(ref:hirist.tech)

  • Hyderabad, Telangana, India Talent Worx Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, Telangana, India Talent Worx Full time US$ 1,20,000 - US$ 2,00,000 per year

    Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services.Your work will involve both software engineering and systems operations as you strive to improve customer experiences and operational...


  • Hyderabad, Telangana, India Talent Worx Full time

    Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services.Your work will involve both software engineering and systems operations as you strive to improve customer experiences and operational...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:- Strong leadership and people management skills.- Exceptional technical proficiency in Pearson's technology stack.- Advanced project management capabilities.- Excellent communication and collaboration skills.- Adept at risk assessment and...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability Engineer Experience: 7+ Years Location: Hyderabad Skills for Principal: ~ Strong leadership and people management skills. ~ Exceptional technical proficiency in Pearson's technology stack. ~ Advanced project management capabilities. ~ Excellent communication and collaboration skills. ~ Adept at risk assessment and crisis management. ~...


  • Hyderabad, Telangana, India IntraEdge Full time US$ 1,20,000 - US$ 2,00,000 per year

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis management.Strategic thinking with a...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability Engineer Experience: 7+ Years Location: Hyderabad Hybrid 4-day office and 1 Day remote Skills for Principal: Strong leadership and people management skills. Exceptional technical proficiency in Pearson's technology stack. Advanced project management capabilities. Excellent communication and collaboration skills. Adept at risk assessment...


  • Hyderabad, Telangana, India ServiceNow Full time

    Site Reliability Engineer (SRE)Experience : 6+ YearsAbout the Role : We are seeking a seasoned SRE to ensure the reliability, availability, and performance of our critical services. You will combine software engineering with systems administration to create scalable and highly reliable software systems.Responsibilities : - Design, build, and maintain...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    SRE (Site Reliability Engineer)Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...