Site Reliability Engineering Manager

2 weeks ago


Delhi, India o9 Solutions, Inc. Full time
Be part of something revolutionary

At o9 Solutions, our mission is clear: be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform, the o9 Digital Brain, we integrate global enterprises’ siloed planning capabilities, helping them capture millions and, in some cases, billions of dollars in value leakage. But our impact doesn’t stop there. Businesses that plan better and faster also reduce waste, which drives better outcomes for the planet, too.

We're on the lookout for the brightest, most committed individuals to join us on our mission. Along the journey, we’ll provide you with a nurturing environment where you can be part of something truly extraordinary and make a real difference for companies and the planet.

About the role...

As an SRE Manager, you will have the opportunity to work for an AI-based unicorn recognized as one of the fastest-growing companies on the Inc. 5000 list.

You will be responsible for leading a team of talented SRE professionals to maintain and execute organizational policies and procedures for change management, configuration management, release and deployment management, service monitoring, and problem management, and to maintain and support the o9 Digital Brain Platform across all major cloud providers (AWS, GCP, Azure and Samsung Cloud) using state-of-the-art CI/CD tools.

This role will empower you to continuously challenge the status quo and implement the great ideas you may have to create value for o9 clients.

What you will do in this role:
  • Deploy, maintain and support o9 Digital Brain SaaS environments on all major clouds
  • Manage the SRE team, assigning and monitoring work tasks and ensuring quality and efficiency are maintained
  • Hire and grow SRE talent
  • Lead, plan, build, configure, test, and deploy software and systems to manage platform infrastructure and applications
  • Collaborate with internal and external customers to manage o9 platform deployment, maintenance and support needs
  • Improve reliability, quality, cost, time-to-deploy, and time-to-upgrade
  • Monitor, measure and optimize system performance
  • Provide on-call support on a rotation/shift basis
  • Analyze and approve code and configuration changes
  • Work flexibly with teams globally, across time zones

What you’ll have...

Primary Skill: Strong in operating system concepts, Linux and troubleshooting.

Secondary Skill: Automation and cloud

Education: Bachelor’s degree in Computer Science, Software Engineering, Information Technology, Industrial Engineering, or Engineering Management

Management Experience: 9+ years of experience building and leading high-performing, diverse teams as either an SRE Manager or a DevOps Manager. Cloud (at least one) and Kubernetes administration certification

Experience:
  • 5+ years of experience in an SRE role, deploying and maintaining applications, performance tuning, conducting application upgrades and patches, and supporting continuous integration and deployment tooling
  • 4+ years of experience deploying and maintaining applications in any one of the clouds (AWS, Azure, GCP)
  • Experience with Docker or similar and experience with Kubernetes or similar
  • Experience supporting Hadoop or any other big data platform

Skills:
  • Ability to debug issues and solve problems
  • Good hands-on experience with Jenkins, Ansible, Terraform, ArgoCD
  • Administration of databases (MS SQL, Mongo, SSIS)
  • Working knowledge of Linux and Windows operating systems
  • Strong decision-making, problem-solving, critical thinking, and testing skills
  • Ability to deliver independently and to translate requirements into technical solutions with minimal supervision

Characteristics:
  • Passion to learn and adapt to new technology
  • We really value team spirit: transparency and frequent communication are key. At o9, this is not limited by hierarchy, distance, or function

What we’ll do for you:
  • Flat organization: With a very strong entrepreneurial culture (and no corporate politics).
  • Great people and unlimited fun at work.
  • Possibility to really make a difference in a scale-up environment.
  • Support network: Work with a team you can learn from every day.
  • Diversity: We pride ourselves on our international working environment.
  • Work-Life Balance:

part of A team:

the process works...
  • Respond with your interest to us.
  • We’ll contact you either via video call or phone call, whichever you prefer, with the further schedule status.
  • During the interview phase, you will meet with the technical panel for 60 minutes. We will contact you after the interview to let you know if we’d like to progress your application.
  • There will be an intro call and a technical discussion, followed by a techno-managerial round and a managerial round.
  • We will let you know if you’re the successful candidate.

Good luck

More about us…

With the latest increase in our valuation from $2.7B to $3.7B despite challenging global macroeconomic conditions, o9 Solutions is one of the fastest-growing technology companies in the world today. Our mission is to digitally transform planning and decision-making for the enterprise and the planet. Our culture is high-energy and drives us to aim 10x in everything we do.

Our platform, the o9 Digital Brain, is the premier AI-powered, cloud-native platform driving the digital transformations of major global enterprises including Google, Walmart, ABInBev, Starbucks and many others.

Our headquarters are located in Dallas, with offices in Amsterdam, Paris, London, Barcelona, Madrid, São Paulo, Bengaluru, Tokyo, Seoul, Milan, Stockholm, Sydney, Shanghai, Singapore and Munich.

o9 is an equal opportunity employer. We seek applicants of diverse backgrounds and hire without regard to race, colour, gender, religion, national origin, citizenship, age, sexual orientation or any other characteristic protected by law.
