
Site Reliability Engineering Manager
4 days ago
About Business Unit: SaaSOps leads post-production support and the overall experience of Epsilon PeopleCloud products for our global clients. This function is responsible for product support, incident management, managed operations and the automation of processes. The team has successfully incubated and mainstreamed Site Reliability Engineering (SRE) as a practice to ensure reliable product operations on a global scale. The team is also actively leading the adoption of AI in operations (AIOps) and recently launched AI-driven self-service capabilities to enhance operational efficiency and improve client experiences.

This is a senior IC role responsible for driving strong operations engineering practices in SaaS product operations.
The role works closely with the engineering, delivery and operations teams to ensure streamlined release and change management processes.
The role works closely with the product operations team to deep dive into production issues, identify their root cause, and work with the concerned teams on a permanent fix for recurring issues.
The role contributes to the evolution of the AIOps strategy: identifying use cases and coming up with AI / agentic autonomous solutions.

The candidate will be a hands-on technology leader with proven experience working as an SRE leader in a product setup.
~ The ideal candidate should have a strong full stack engineering background with Cloud & AI / Gen AI experience
~ Must have strong development skills - at least two of Python, Java, C#; strong DB skills (RDBMS, NoSQL, Cloud DBs), Container / orchestration, Cloud Infrastructure
~ Highly proficient in at least one hyperscaler cloud (AWS, GCP, Azure)
~ Demonstrated real-world experience in traditional ML & Gen AI use case deployments in production
~ Candidate should have experience working closely with Engineering & Operations teams - must have strong DevOps, release management and change management experience
~ Experience in AIOps will be an added advantage.

Epsilon is a global data, technology and services company that powers the marketing and advertising ecosystem. For decades, we’ve provided marketers from the world’s leading brands the data, technology and services they need to engage consumers with 1 View, 1 Vision and 1 Voice. Epsilon’s comprehensive portfolio of capabilities across our suite of digital media, messaging and loyalty solutions bridges the divide between marketing and advertising technology. We process 400+ billion consumer actions every single day using advanced AI and hold many patents of proprietary technology, including real-time modeling languages and consumer privacy advancements. Epsilon is a global company with more than 9,000 employees around the world. We believe collaboration is the catalyst that unlocks our full potential. That’s why our work-world, aptly named ‘YOUniverse’, is passionate about crafting a nurturing environment that elevates your growth, wellbeing and work-life harmony. Epsilon is committed to promoting diversity, inclusion, and equal employment opportunities by using reasonable efforts to attract, recruit, engage and retain qualified individuals of all ethnicities and backgrounds, including, but not limited to, women, people of color, LGBTQ individuals, people with disabilities and any other underrepresented groups, traits or characteristics.
-
Site Reliability Engineering Manager
4 days ago
Bangalore, India Tata Consultancy Services Full time
Role: Manager, Site Reliability Engineering. Required Technical Skill Set: Manager, Site Reliability Engineering. Desired Experience Range: 12 - 18 yrs. Notice Period: Immediate to 90 days only. Location of Requirement: Bangalore. We are currently planning to do a Virtual Interview. Job Description: Describe what the person will do in the role - how he/she will impact...
-
Site Reliability Engineering Manager
19 hours ago
Bangalore, India Tata Consultancy Services Full time
Role: Manager, Site Reliability Engineering. Required Technical Skill Set: Manager, Site Reliability Engineering. Desired Experience Range: 12 - 18 yrs. Notice Period: Immediate to 90 days only. Location of Requirement: Bangalore. We are currently planning to do a Virtual Interview. Job Description: Describe what the person will do in the role - how he/she will...
-
Site Reliability Engineer
4 days ago
Bangalore, India IntraEdge Full time
Job Title: Site Reliability Engineer (SRE) – Production Support. Location: Bengaluru. Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong experience in production support, DevOps practices, and cloud infrastructure management. The ideal candidate will be responsible for maintaining the reliability, performance, and scalability...
-
Site Reliability Engineer
7 hours ago
Bangalore, India JRD Systems Full time
Position: Site Reliability Engineer (SRE). Role Overview: We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in Windows infrastructure to manage and optimize our cloud and on-premises environments. The ideal candidate will partner with development teams to improve service reliability, implement automation, and ensure...
-
Site reliability engineer
4 weeks ago
Bangalore, India Xebia Full time
We are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...
-
Site reliability engineer
2 weeks ago
Bangalore, India BayOne Solutions Full time
Role: Site Reliability Engineer. Location: Remote. Duration: Full Time. The CXE Site Reliability Engineering (SRE) team manages the CI/CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance. However, the team faces challenges in reliably managing AWS environments that support the CX Cloud, particularly in addressing...
-
Site Reliability Engineer
4 days ago
Bangalore, India Endpoint Clinical Full time
About Us: Endpoint is an interactive response technology (IRT®) systems and solutions provider that supports the life sciences industry. Since 2009, we have been working with a single vision in mind, to help sponsors and pharmaceutical companies achieve clinical trial success. Our solutions, realized through the proprietary PULSE® platform, have proven to...
-
Site Reliability Engineer
4 days ago
Bangalore, India HDFC Limited Full time
Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore Location. Experience - 8 - 14 Years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...
-
Site reliability engineer
4 weeks ago
Bangalore, India ViewSonic Full time
Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform...
-
Site reliability engineer
2 weeks ago
Bangalore, India ViewSonic Full time
Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform...