Senior Site Reliability Engineer

3 weeks ago


Hyderabad, Telangana, India Options Executive Search Private Limited Full time

Job Title : SRE Lead Engineer.

Location : Hyderabad, India.

We are seeking a DevOps / SRE Lead Engineer to architect and scale our client's multi-tenant SaaS platform with AI/ML at the core.

Our client, a fast-growing AI-powered SaaS company in the FinTech space, is looking for a Site Reliability Engineering (SRE) Lead Engineer to join their dynamic team.

This is an opportunity to design and operate large-scale SaaS systems that integrate cutting-edge AI/ML capabilities.

About the Role :

As the SRE Lead Engineer, you will be responsible for architecting, building, and maintaining infrastructure that powers a multi-tenant SaaS platform.

Youll drive reliability, scalability, and security, while supporting AI/ML pipelines in production.

This is a hands-on role with significant ownership, requiring both technical depth and leadership in site reliability practices.

Key Responsibilities :

- Architect, design, and deploy end-to-end infrastructure for large-scale, microservices-based SaaS platforms.

- Ensure system reliability, scalability, and security for AI/ML model integrations and data pipelines.

- Automate environment provisioning and management using Terraform in AWS (EKS-focused).

- Implement full-stack observability across applications, networks, and operating systems.

- Lead incident management and participate in 24/7 on-call rotation.

- Optimize SaaS reliability while enabling REST APIs, SSO integrations (Okta/Auth0), and cloud data services (RDS/MySQL, Elasticsearch).

- Define and maintain backup and disaster recovery for critical workloads.

Required Skills & Experience :

- 8+ years in SRE/DevOps roles, managing enterprise SaaS applications in production.

- Minimum 1 year experience with AI/ML infrastructure or model-serving environments.

- Strong expertise in AWS cloud, particularly EKS, container orchestration, and Kubernetes.

- Hands-on experience with Infrastructure as Code (Terraform), Docker, and scripting (Python, Bash).

- Solid Linux OS and networking fundamentals.

- Experience in monitoring and observability with ELK, CloudWatch, or similar tools.

- Strong track record with microservices, REST APIs, SSO, and cloud databases.

Nice-to-Have Skills :

- Experience with MLOps and AI/ML pipeline observability.

- Cost optimization and security hardening in multi-tenant SaaS.

- Prior exposure to FinTech or enterprise finance solutions.

Qualifications :

- Bachelors degree in Computer Science, Engineering, or related discipline.

- AWS Certified Solutions Architect (strongly preferred).

- Experience in early-stage or high-growth startups is an advantage.

Why Join?

- Be at the forefront of AI/ML-powered SaaS innovation in FinTech.

- Work with a high-energy, entrepreneurial team building next-gen infrastructure.

- Take ownership of mission-critical reliability challenges.

- Grow your career in an environment that values impact, adaptability, and innovation.

(ref:hirist.tech)

  • Hyderabad, Telangana, India JA Consulting Full time

    About the job : Role : Senior Site Reliability Engineer SaaS Real Estate Platform About the Client : We are hiring on behalf of our reputed SaaS product-based client based in Hyderabad. They are a global leader in real estate software development.The Role : Were seeking a Senior Site Reliability Engineer (SRE) with a strong Software Engineering background...


  • Hyderabad, Telangana, India Microsoft Full time

    The Windows Cloud division is looking for a Senior Site Reliability Engineer that will help us take the Windows Cloud platform as well as the Windows 365 Cloud PC and Azure Virtual Desktop business to the next level Windows 365 Cloud PC W365 and Azure Virtual Desktop AVD have recently been recognized as leaders in the Gartner Magic Quadrant TM for...


  • Hyderabad, Telangana, India Microsoft Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    The Windows Cloud division is looking for a Senior Site Reliability Engineer that will help us take the Windows Cloud platform, as well as the Windows 365 Cloud PC and Azure Virtual Desktop business to the next level.Windows 365 Cloud PC (W365) and Azure Virtual Desktop (AVD) have recently been recognized as leaders in the Gartner Magic Quadrant for Desktop...


  • Hyderabad, Telangana, India Goldman Sachs Services Pvt Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Engineering-L2-Hyderabad-Vice President-Software Engineering Senior Site Reliability Engineer (SRE) Job Description (12 Years Experience) Short Description for Internal Candidates The Senior Site Reliability Engineer (SRE) will serve as a technical leader and subject matter expert, responsible for defining, implementing, and optimizing the...


  • Hyderabad, Telangana, India Microsoft Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    The Windows Cloud division is looking for a Senior Site Reliability Engineer that will help us take the Windows Cloud platform, as well as the Windows 365 Cloud PC and Azure Virtual Desktop business to the next level.Windows 365 Cloud PC (W365) and Azure Virtual Desktop (AVD) have recently been recognized as leaders in the Gartner Magic Quadrant for Desktop...


  • Hyderabad, Telangana, India INDIGLOBE IT SOLUTIONS PRIVATE LIMITED Full time

    Job Summary :We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). Youll be responsible for owning application support, maintaining our microservices...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, Telangana, India Chase Bank Full time

    Job DescriptionGuide and shape the future of technology at a globally recognized firm, driven by pride in ownership.As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Consumer & Community Banking, youare the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team's...


  • Hyderabad, Telangana, India Cubic Corporation Full time

    Job DescriptionBusiness Unit:Cubic Transportation SystemsCompany Details:When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people's lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation. Led...


  • Hyderabad, Telangana, India Cubic Corporation Full time ₹ 1,20,000 - ₹ 2,60,000 per year

    Business Unit:Cubic Transportation SystemsCompany Details:When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people's lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation. Led by our...