Senior Site Reliability Engineer I

3 weeks ago


Mumbai, India RELX India (Pvt) Ltd Risk div Company Full time

About the role

We are seeking a talented and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our Azure AI Services platform. You will work closely with cross-functional teams to design, implement, and maintain robust infrastructure and automation solutions.

Responsibilities:

Design, deploy, and maintain highly available and scalable data infrastructure on Azure OpenAI, databases, and event-driven services.

Monitor and optimize the performance of AI workloads.

Collaborate with cross-functional teams, including data engineers, data scientists, and developers, to provide technical guidance and support in implementing best practices.

Ensure data governance policies and practices are followed to maintain data integrity, security, and compliance.

Troubleshoot and resolve issues related to data infrastructure, working closely with operations and development teams.

Implement automation and monitoring tools to streamline operations and improve system reliability.

Plan and execute disaster recovery procedures and backup strategies for data platforms.

Stay up to date with industry trends and emerging technologies related to data management, analytics, and cloud computing.

Requirements:

Proven experience as an SRE or similar role, with a focus on data infrastructure and analytics.

Strong expertise in managing and optimizing Azure OpenAI or event-driven applications in Azure.

In-depth knowledge of data governance principles, data security, and compliance requirements.

Experience with performance optimization techniques for large-scale data processing and analytics workloads.

Experience managing Azure cloud services, including compute, storage, networking, and security.

Familiarity with AI services, particularly OpenAI, for implementing machine learning and natural language processing solutions.

Proficiency in Terraform for infrastructure as code management and automation.

Knowledge of databases, whether SQL or NoSQL, for data storage and management is required.

Proficiency in scripting and automation using languages such as Python, PowerShell, or Bash.

Familiarity with cloud platforms, preferably Microsoft Azure, and related services (Azure Data Factory, Azure Data Lake Analytics, etc.).

Solid understanding of containerization technologies, such as Docker and Kubernetes.

Strong problem-solving skills and the ability to troubleshoot complex issues in a distributed data environment.

Excellent communication and collaboration skills to work effectively with cross-functional teams.

Desirable Skills: AWS / Azure / Kubernetes certifications

Qualifications:

5+ years of software engineering experience.

BS in Engineering/Computer Science or equivalent experience required.



  • Mumbai, Maharashtra, India Antal Full time

    Job Title: Site Reliability Engineer About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will work closely with our engineering teams to design, implement, and operate...


  • Mumbai, Maharashtra, India Antal International Network Full time

    Job Title: Site Reliability Engineer About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Antal International Network. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and efficiency of our software solutions. Key Responsibilities: Monitor production environment...


  • Mumbai, Maharashtra, India IDFC FIRST Bank Full time

    Job Title: Senior Site Reliability Engineering Manager Function/ Department: Information Technology Job Purpose: IDFC FIRST Bank is seeking a seasoned Site Reliability Engineering Manager to lead our efforts in ensuring seamless customer experiences. As a key member of our IT team, you will be responsible for defining SRE principles, SLIs, and SLAs, and...


  • Mumbai, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities: Troubleshoots and owns issues in our development, test and production environments, including performance optimisation and continuous tuning. Works alongside the DevOps team in...


  • Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    Site Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in...


  • Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    Site Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are...


  • Mumbai, India Fynd Full time

    Site Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are...


  • Mumbai, India IDFC FIRST Bank Full time

    Role/ Job Title:  Senior Site Reliability Engineering Manager Function/ Department:  Information Technology Job Purpose: Site Reliability Engineering (SRE) department plays a pivotal role in providing seamless experience for our customers. With state-of-the-art technology and tools, we are transforming the overall application development and...


  • Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    About Fynd: Fynd is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni, Fynd is headquartered in Mumbai and has 1000+ brands under management, more than 10k stores, and servicing...


  • Navi Mumbai, Maharashtra, India Cyber Sphere LLC Full time

    Job Title: Site Reliability Engineer We are seeking a highly skilled and experienced Site Reliability Engineer to join our team at Cyber Sphere LLC. Job Summary: The successful candidate will play a crucial role in ensuring the reliability, scalability, and performance of our Azure AI Services platform. Key Responsibilities: Design, deploy, and maintain a highly...


  • Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    Site Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered...


  • Mumbai, India CSC Full time

    Role: Site Reliability Engineer Location: Mumbai/ Bangalore Working Model: Hybrid Shift: 12-9PM Intro: Do you want to be noticed for your work? Make a difference every day? Be Impactful? Work with cutting edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading provider of business,...


  • Mumbai, India CSC Full time

    Role: Site Reliability Engineer Location: Mumbai/ Bangalore Working Model: Hybrid Shift: 12-9PM Intro: Do you want to be noticed for your work? Make a difference every day? Be Impactful? Work with cutting edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading provider of...


  • Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    About Fynd: Fynd is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in Mumbai and have 1000+ brands under...


  • Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    About Fynd: Fynd is a leading omnichannel platform and tech company specializing in retail tech and innovative products in AI, ML, big data ops, gaming, crypto, image editing, and the learning space. Founded in 2012 by three IIT Bombay alumni, Fynd is headquartered in Mumbai and manages over 1000 brands, 10k stores, and 23k+ pin codes. Role Overview: As a Site...


  • Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full time

    Site Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in...


  • Mumbai, Maharashtra, India Antal International Network Full time

    Key Responsibilities: We are seeking a skilled Site Reliability Engineer to join our team at Antal International Network. The successful candidate will be responsible for ensuring the availability, scalability, and efficiency of our software solutions. Key Responsibilities: Run the production environment by monitoring availability and taking a holistic view of...


  • Mumbai, Maharashtra, India May I Help You Full time

    Job Summary: At May I Help You, we are seeking a skilled Senior Analyzer Engineer to join our team. As a key member of our team, you will be responsible for analyzing, troubleshooting, and maintaining the performance of our analyzer systems. About the Role: Analyze and troubleshoot analyzer systems to ensure optimal performance. Perform regular calibration and...


  • Mumbai, Maharashtra, India Antal Full time

    Job Overview: As a Site Reliability Engineer at Antal, you will be responsible for ensuring the availability, scalability, and performance of our software systems. Key Responsibilities: * Monitor and maintain production environment, identifying and resolving issues to ensure high uptime * Improve system reliability, quality, and time-to-market of software...


  • Mumbai, Maharashtra, India Session AI Full time

    Job Title: Site Reliability Engineer II We are seeking a highly skilled Site Reliability Engineer II to join our team at Session AI. As a key member of our Site Reliability Engineering Group, you will play a vital role in ensuring the seamless operation of our Cloud platform. Key Responsibilities: Design and implement solutions to enhance the availability,...