Senior Site Reliability Engineer
2 weeks ago
Job Title: SRE Lead Engineer
Location: Hyderabad, India

We are seeking a DevOps / SRE Lead Engineer to architect and scale our client's multi-tenant SaaS platform with AI/ML at the core. Our client, a fast-growing AI-powered SaaS company in the FinTech space, is looking for a Site Reliability Engineering (SRE) Lead Engineer to join their dynamic team. This is an opportunity to design and operate large-scale SaaS systems that integrate cutting-edge AI/ML capabilities.

About the Role:
As the SRE Lead Engineer, you will be responsible for architecting, building, and maintaining the infrastructure that powers a multi-tenant SaaS platform. You'll drive reliability, scalability, and security while supporting AI/ML pipelines in production. This is a hands-on role with significant ownership, requiring both technical depth and leadership in site reliability practices.

Key Responsibilities:
- Architect, design, and deploy end-to-end infrastructure for large-scale, microservices-based SaaS platforms.
- Ensure system reliability, scalability, and security for AI/ML model integrations and data pipelines.
- Automate environment provisioning and management using Terraform in AWS (EKS-focused).
- Implement full-stack observability across applications, networks, and operating systems.
- Lead incident management and participate in a 24/7 on-call rotation.
- Optimize SaaS reliability while enabling REST APIs, SSO integrations (Okta/Auth0), and cloud data services (RDS/MySQL, Elasticsearch).
- Define and maintain backup and disaster recovery for critical workloads.

Required Skills & Experience:
- 8+ years in SRE/DevOps roles, managing enterprise SaaS applications in production.
- Minimum 1 year of experience with AI/ML infrastructure or model-serving environments.
- Strong expertise in AWS cloud, particularly EKS, container orchestration, and Kubernetes.
- Hands-on experience with Infrastructure as Code (Terraform), Docker, and scripting (Python, Bash).
- Solid Linux OS and networking fundamentals.
- Experience in monitoring and observability with ELK, CloudWatch, or similar tools.
- Strong track record with microservices, REST APIs, SSO, and cloud databases.

Nice-to-Have Skills:
- Experience with MLOps and AI/ML pipeline observability.
- Cost optimization and security hardening in multi-tenant SaaS.
- Prior exposure to FinTech or enterprise finance solutions.

Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related discipline.
- AWS Certified Solutions Architect (strongly preferred).
- Experience in early-stage or high-growth startups is an advantage.

Why Join?
- Be at the forefront of AI/ML-powered SaaS innovation in FinTech.
- Work with a high-energy, entrepreneurial team building next-gen infrastructure.
- Take ownership of mission-critical reliability challenges.
- Grow your career in an environment that values impact, adaptability, and innovation.

(ref:hirist.tech)
-
Senior Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India Jade Global Software Pvt Ltd Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog
Location: Hyderabad...
-
Senior Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India Jade Global Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog
Location: Hyderabad preferable but open for Pune and remote
Job Summary: We are seeking an...
-
Senior Site Reliability Engineer
6 days ago
Hyderabad, Telangana, India Jade Global Full time ₹ 1,00,00,000 - ₹ 2,00,00,000 per year
Job Title: Senior Site Reliability Engineer (SRE) – Datadog Observability
Experience Required: 8+ years overall in SRE and Infrastructure Operations with minimum 3+ years hands-on experience in Datadog
Location: Hyderabad preferable but open for Pune and remote
Job Summary: We are seeking an experienced Site Reliability Engineer (SRE) to lead end-to-end SRE...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India SID Global Solutions Full time
Job Role: Site Reliability Engineer (SRE) – GCP
Experience: 3+ years
Location: Hyderabad
About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India SID Global Solutions Full time
Job Role: Site Reliability Engineer (SRE) – GCP
Experience: 3+ years
Location: Hyderabad
About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...
-
Senior Site Reliability Engineer
2 days ago
Hyderabad, Telangana, India Instaresz Business Services Pvt Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year
Job Title: Senior Site Reliability Engineer (SRE)
Experience Required: 10+ Years
Location: Hyderabad (On-site)
Employment Type: Full-Time
About Instaresz
Instaresz Business Services Pvt. Ltd. focuses on building and scaling high-performance SaaS products with expertise in:
• SaaS Product Development
• Infrastructure & DevOps
• Data & Analytics
• AI & Automation
Our...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India SID Global Solutions Full time
Job Role: Site Reliability Engineer (SRE) – GCP
Experience: 3+ years
Location: Hyderabad
About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India Whatjobs IN C2 Full time
Job Role: Site Reliability Engineer (SRE) – GCP
Experience: 3+ years
Location: Hyderabad
About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India SID Global Solutions Full time
Job Role: Site Reliability Engineer (SRE) – GCP
Experience: 3+ years
Location: Hyderabad
About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...