
SRE DevOps Lead Engineer
3 weeks ago
We have an exciting role in Hyderabad with an AI SaaS fintech product firm.
SRE DevOps Lead Engineer (SaaS) || 8-12 Y || Hyderabad (Hybrid) || Quick Starter ||
Key Responsibilities :
- Architect, design, and deploy end-to-end infrastructure solutions for a multi-tenant microservices-based SaaS application with a focus on AI/ML model integration.
- Ensure system reliability, scalability, performance, and security, specifically enhancing AI/ML processing pipelines and workflows.
- Utilize Terraform scripting for on-demand environment provisioning within the AWS cloud, optimized for AI/ML workloads.
- Implement and refine monitoring and alerting systems across application, network, and OS layers to support AI model operations and data processing.
- Diagnose, support, and resolve production issues and alerts, participating in a 24/7 on-call rotation to maintain seamless AI/ML service operations.
Scope Of Work :
- Actively participate in the Scrum team, delivering test automation for sprint features and ensuring high-quality product increments by certifying new and regression features using automated test suites
- Integrate automated tests into the CI/CD pipeline and schedule them to run periodically in product development environments
- Identify defects, collaborate with development engineers to resolve them, and verify the fixes
- Maintain continuous availability in alignment with startup culture, staying informed and up to date with communications across various channels and email threads
- Focus on the primary goal of minimizing customer-reported bugs to near zero.
Required Qualification :
- 8+ years of experience in Site Reliability Engineering (SRE) and DevOps roles with a track record of managing large-scale enterprise SaaS services in production, including 1+ year in AI/ML infrastructure
- Demonstrated expertise with AWS public cloud technologies, including extensive experience in deploying and managing large-scale container clusters using AWS, EKS.
- Skilled in Infrastructure as Code (IaC) using Terraform, and container technologies such as Docker and Kubernetes.
- Proficient in scripting and programming for automation (Python, Bash, etc.), with strong Linux OS and networking fundamentals relevant to AI/ML workloads.
- Experience in establishing monitoring systems to ensure high availability, performance, and security integrity, using tools like ELK Stack, CloudWatch, and others tailored for AI/ML monitoring.
- Hands-on experience managing microservices architecture SaaS products, enabling RESTful web services, SSO integration (Okta, Auth0), and utilizing cloud databases like EC2-RDS, MySQL, and Elasticsearch, especially in AI/ML deployments.
- Proficient in backup and disaster recovery strategies specific to AI/ML data resources like RDS and Elasticsearch.
- AWS Certified Solutions Architect is strongly preferred.
- Self-driven, proactive, and adaptable to thrive in an early-stage startup environment, with a keen interest in integrating AI/ML technologies into modern SaaS solutions.
- Strong preference for applicants with a stable career (consistent employment) and a notice period (NP) of 0-30 days only.
(ref:hirist.tech)
-
SRE Lead
3 weeks ago
Hyderabad, Telangana, India People Prime Worldwide Full time
About Client
One of our MNC clients offers technology consulting and digital solutions to global enterprises across industries, enabling transformative scale at unparalleled speed. With 145,000 professionals across 90 countries helping 1,100 clients, it provides a full spectrum of services including consulting, information technology, enterprise...
-
Lead DevOps Engineer
2 days ago
Hyderabad, Telangana, India Horizontal Full time ₹ 20,00,000 - ₹ 25,00,000 per year
Role Overview: As a Lead DevOps Engineer, you will be responsible for the end-to-end lifecycle of our infrastructure, from design and implementation to maintenance and monitoring. You will lead a team of DevOps engineers, championing automation, CI/CD best practices, and a "fail-fast, learn-fast" mindset. This is a critical role that will directly impact...
-
SRE
14 hours ago
Hyderabad, Telangana, India TechVedika Full time US$ 90,000 - US$ 1,20,000 per year
Company Description
TechVedika is a technology services company specializing in AI/ML, Product Engineering, and Cloud-based solutions. Since our founding in 2010, we have been committed to providing innovative technology solutions to enterprise clients across various industries, including Manufacturing, BFSI, Healthcare, IT, Supply Chain & Logistics, Retail,...
-
SRE DevOps Engineer
2 days ago
Hyderabad, Telangana, India Cognizant Full time ₹ 4,00,000 - ₹ 12,00,000 per year
Primary Skills: Site Reliability Engineer (SRE) with Grafana, Datadog, Dynatrace, Splunk
Location: Hyderabad Only
Notice Period: Immediate Joiners only
-
SRE Architect
2 days ago
Hyderabad, Telangana, India Zensar Technologies Full time ₹ 15,00,000 - ₹ 20,00,000 per year
Job Title: DevOps/SRE Lead
Location: Pune / Hyderabad
Job Type: Full-time
Experience: 15+ years
Job Overview: We are seeking a highly experienced DevOps/SRE Lead with over 15 years of professional experience. The ideal candidate will possess a deep understanding of DevOps principles, extensive experience in Site Reliability Engineering (SRE), and a strong...
-
SRE Engineer Java + DevOps
3 weeks ago
Hyderabad, Telangana, India S&P Global Market Intelligence Full time
Job Description:
We are seeking a skilled and motivated Application Operations Engineer for an SRE role with a Java, React JS, and Spring Boot skill set, along with expertise in Databricks, particularly with Oracle integration, to join our dynamic SRE team. The ideal candidate should have 3 to 6 years of experience in supporting robust web...
-
Principal Engineer, SRE
2 weeks ago
Hyderabad, Telangana, India Talent500 Full time
About Talent500 INC
Talent500 helps companies hire, build, and manage global teams. We are trusted by the world's leading companies - from Fortune 500s to fast-growth startups - to help them build and run their high-impact remote teams. Today Talent500 is the fastest growing remote team builder in the world. Our suite of proprietary AI-enabled tools and...
-
SRE Lead Design
2 weeks ago
Hyderabad, Telangana, India PepsiCo Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Overview
We are looking for a self-driven SRE engineer with a software engineering mindset to:
- Drive new shift-left activities critical to applying Site Reliability Engineering (SRE) and quality assurance principles within the application design / project roadmap that enable resilient outcomes
- Apply a pre-emptive approach into production, minimizing business impact, via...
-
SRE Lead Design
2 weeks ago
Hyderabad, Telangana, India PepsiCo Full time
Overview
We are looking for a self-driven SRE engineer with a software engineering mindset to:
- Drive new shift-left activities critical to applying Site Reliability Engineering (SRE) and quality assurance principles within the application design / project roadmap that enable resilient outcomes
- Apply a pre-emptive approach into production, minimizing business impact,...
-
Lead DevOps
15 hours ago
Hyderabad, Telangana, India Epam Systems Full time ₹ 20,00,000 - ₹ 25,00,000 per year
- Proven experience (7+ years) as a DevOps Engineer / SRE, with at least 2+ years in a lead role.
- Strong expertise in on-premises infrastructure management and hybrid cloud environments.
- Hands-on experience with Ansible (automation/configuration management).
- Deep knowledge of Terraform for IaC and multi-cloud provisioning.
- Strong expertise in Kubernetes...