Site Reliability Engineer

2 months ago


Mumbai, India Jio Full time
Site Reliability Engineer (SRE) with Automation

Job Overview

As a Site Reliability (SRE)/DevOps Automation Engineer, you will be responsible for the availability, automation, performance, efficiency, scaling, monitoring, and emergency response for any incidents or issues in applications. You will use your deep understanding of platforms, architecture, people, systems, and processes to both establish and continuously improve SLIs and SLOs for uptime, performance, deployment, monitoring, and troubleshooting. You are interested in setting direction and leading the day-to-day processes that shape our vision for reliability.

Responsibilities and Duties

Design and implement automation projects according to requirements, and take responsibility for end-to-end delivery up to the production environment.

Be willing to do hands-on coding to deliver assigned projects.

Work collaboratively with OEMs, vendors, and partners to deploy IT infrastructure automation and self-service tools for capacity forecasting, failure prediction, zero-touch operation, and auto-healing.

Build standard documentation for automation.

Participate in root cause analysis (RCA) and identify gaps in monitoring automation for operations.

Maintain and support the Product and Data systems: proactively monitor events, investigate issues, analyze solutions, and drive problems through to resolution.

Experience with configuration management tools like Chef, Puppet, Salt or equivalent

Experience administering AWS, Google Cloud, or Azure

Define requirements and develop tools and reporting as needed by projects and operations.

Participate in 24x7 on-call rotation for after-hours emergencies

Use operational tools and monitoring platforms to gain in-depth knowledge, understanding, and ongoing monitoring of system availability, performance, and capacity.

Implement an alerting strategy that makes alerts actionable and unique.

Provide follow-through to ensure issues are resolved to satisfaction

Drive continuous improvement and innovation within the team.

A sense of ownership, initiative and drive.

Qualifications

Bachelor's degree in Computer Science, or a related technical field involving software or systems engineering, or equivalent practical experience

5+ years of hands-on experience with Linux/UNIX/Windows operating systems

Strong Shell/Python/PowerShell skills.

Experience with infrastructure orchestration and automation tools, e.g. Ansible, Terraform.

Good understanding of Git, DevOps methodology, and CI/CD for automation projects.

Hands-on experience managing web servers, application servers, and databases (SQL/NoSQL)

Experience with Docker/Kubernetes

Knowledge of monitoring tools and strategy

Experience with incident management and running incident post-mortems

Solid understanding of automated deployment processes

Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.

Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.

Experience designing and developing software oriented towards systems or network automation.


