Current jobs related to Site Reliability Engineer - Mumbai - Antal
-
Site Reliability Engineer
4 weeks ago
Mumbai, Maharashtra, India Antal Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will work closely with our engineering teams to design, implement, and operate...
-
Site Reliability Engineer
4 weeks ago
Mumbai, Maharashtra, India antal international network Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Antal International Network. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and efficiency of our software solutions.Key Responsibilities:Monitor production environment...
-
Site Reliability Engineer
5 months ago
Mumbai, India dentsu Full timeThe purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities:Troubleshoots and owns issues in our development, test and production environments. Including performance optimisation and continuous tuningWorks alongside the DevOps team in...
-
Site Reliability Engineering Manager
3 weeks ago
mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full timeSite Reliability Engineering ManagerAbout Fynd:Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in...
-
Site Reliability Engineer
2 days ago
Mumbai, India CSC Full timeRole: Site Reliability EngineerLocation: Mumbai/ BangaloreWorking Model: HybridShift: 12-9PMIntro:Do you want to be noticed for your work? Make a difference every day? Be Impactful? Work with cutting edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading provider of business,...
-
Site reliability engineering manager
3 weeks ago
Mumbai, India Fynd Full timeSite Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG . We are...
-
Site Reliability Engineering Manager
3 weeks ago
Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full timeSite Reliability Engineering Manager About Fynd: Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG . We are...
-
Site Reliability Engineering Manager
4 weeks ago
Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full timeAbout FyndFynd is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni, Fynd is headquartered in Mumbai and has 1000+ brands under management, more than 10k stores, and servicing...
-
Site Reliability Engineer
4 weeks ago
Navi Mumbai, Maharashtra, India Cyber Sphere LLC Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled and experienced Site Reliability Engineer to join our team at Cyber Sphere LLC.Job Summary:The successful candidate will play a crucial role in ensuring the reliability, scalability, and performance of our Azure AI Services platform.Key Responsibilities:Design, deploy, and maintain a highly...
-
Site Reliability Engineering Manager
1 month ago
Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full timeSite Reliability Engineering ManagerAbout Fynd:Fyndis India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni:Farooq Adam, Harsh Shah, and Sreeraman MG . We are headquartered...
-
Site Reliability Engineer
7 days ago
mumbai, India CSC Full timeRole: Site Reliability Engineer Location: Mumbai/ Bangalore Working Model: Hybrid Shift: 12-9PM Intro: Do you want to be noticed for your work? Make a difference every day? Be Impactful? Work with cutting edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading provider of...
-
Site Reliability Engineer
1 week ago
Mumbai, India CSC Full timeRole: Site Reliability EngineerLocation: Mumbai/ BangaloreWorking Model: HybridShift: 12-9PM Intro:Do you want to be noticed for your work? Make a difference every day? Be Impactful? Work with cutting edge technology? If so, you will fit in perfectly at CSC and especially within the Regulatory Technology Team. The world’s leading provider of business,...
-
Site Reliability Engineering Manager
3 weeks ago
Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full timeAbout FyndFynd is a leading omnichannel platform and tech company specializing in retail tech and innovative products in AI, ML, big data ops, gaming, crypto, image editing, and the learning space. Founded in 2012 by three IIT Bombay alumni, Fynd is headquartered in Mumbai and manages over 1000 brands, 10k stores, and 23k+ pin codes.Role OverviewAs a Site...
-
Site Reliability Engineering Manager
4 weeks ago
Mumbai, Maharashtra, India Fynd (Shopsense Retail Technologies Ltd.) Full timeAbout FyndFynd is India's largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in Mumbai and have 1000+ brands under...
-
Site Reliability Engineering Manager
1 month ago
Mumbai, India Fynd (Shopsense Retail Technologies Ltd.) Full timeSite Reliability Engineering ManagerAbout Fynd:Fynd is India’s largest omnichannel platform and multi-platform tech company with expertise in retail tech and products in AI, ML, big data ops, gaming + crypto, image editing, and the learning space. Founded in 2012 by 3 IIT Bombay alumni: Farooq Adam, Harsh Shah, and Sreeraman MG. We are headquartered in...
-
Site Reliability Engineer
2 weeks ago
Mumbai, Maharashtra, India antal international network Full timeKey Responsibilities:We are seeking a skilled Site Reliability Engineer to join our team at Antal International Network. The successful candidate will be responsible for ensuring the availability, scalability, and efficiency of our software solutions.Key Responsibilities:Run the production environment by monitoring availability and taking a holistic view of...
-
Site Reliability Engineering Expert
2 weeks ago
Mumbai, Maharashtra, India Antal Full time{"Job OverviewAs a Site Reliability Engineer at Antal, you will be responsible for ensuring the availability, scalability, and performance of our software systems.Key Responsibilities* Monitor and maintain production environment, identifying and resolving issues to ensure high uptime* Improve system reliability, quality, and time-to-market of software...
-
Site Reliability Engineer II
4 weeks ago
Mumbai, Maharashtra, India Session AI Full timeJob Title: Site Reliability Engineer IIWe are seeking a highly skilled Site Reliability Engineer II to join our team at Session AI. As a key member of our Site Reliability Engineering Group, you will play a vital role in ensuring the seamless operation of our Cloud platform.Key Responsibilities:Design and implement solutions to enhance the availability,...
-
Site Reliability Engineering Manager
2 weeks ago
Mumbai, Maharashtra, India IDFC FIRST Bank Full timeJob Title: Senior Site Reliability Engineering ManagerFunction/ Department: Information TechnologyJob Purpose:IDFC FIRST Bank is seeking a seasoned Site Reliability Engineering Manager to lead our efforts in ensuring seamless customer experiences. As a key member of our IT team, you will be responsible for defining SRE principles, SLIs, and SLAs, and...
-
Cloud Site Reliability Engineer
4 weeks ago
Mumbai, Maharashtra, India M&G Full timeAbout the RoleWe are seeking a highly skilled Cloud Site Reliability Engineer to join our team at M&G Global Services. As a Cloud Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key ResponsibilitiesDesign, implement, and maintain cloud-based systems and infrastructure to...
Site Reliability Engineer
3 months ago
Job Description :
A major player in the tech industry, which specializes in retail technology, AI, ML, and big data, is seeking new talent. Established by alumni from a top engineering institute, this organization manages a vast network of brands and stores. Headquartered in Mumbai, it is recognized for its innovation and expertise across multiple tech domains.
What will you do ?
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Improve reliability, quality, and time-to-market of our suite of software solutions
- Be the 1st person to report the incident.
- Debug production issues across services and levels of the stack.
- Envisioning the overall solution for defined functional and non-functional requirements, and being able to define technologies, patterns and frameworks to realise it.
- Building automated tools in Python / Java / GoLang / Ruby etc.
- Help Platform and Engineering teams gain visibility into our infrastructure.
- Lead design of software components and systems, to ensure availability, scalability, latency, and efficiency of our services.
- Participate actively in detecting, remediating and reporting on Production incidents, ensuring the SLAs are met and driving Problem Management for permanent remediation.
- Participate in on-call rotation to ensure coverage for planned/unplanned events.
- Perform other task like load-test & generating system health reports.
- Periodically check for all dashboards readiness.
- Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
- Working with your SRE and Engineering counterparts for driving Game days, training and other response readiness efforts.
- Participate in the 24x7 support coverage as needed Troubleshooting and problem-solving complex issues with thorough root cause analysis on customer and SRE production environments
- Collaborate with Service Engineering organizations to build and automate tooling, implement best practices to observe and manage the services in production and consistently achieve our market leading SLA.
- Improving the scalability and reliability of our systems in production.
- Evaluating, designing and implementing new system architectures.
Some specific Requirements :
- B.E./B.Tech. in Engineering, Computer Science, technical degree, or equivalent work experience
- At least 3 years of managing production infrastructure. Leading / managing a team is a huge plus.
- Experience with cloud platforms like - AWS, GCP.
- Experience developing and operating large scale distributed systems with Kubernetes, Docker and and Serverless (Lambdas)
- Experience in running real-time and low latency high available applications (Kafka, gRPC, RTP)
- Comfortable with Python, Go, or any relevant programming language.
- Experience with monitoring alerting using technologies like Newrelic / zybix /Prometheus / Garafana / cloudwatch / Kafka / PagerDuty etc.
- Experience with one or more orchestration, deployment tools, e.g. CloudFormation / Terraform / Ansible / Packer / Chef.
- Experience with configuration management systems such as Ansible / Chef / Puppet.
- Knowledge of load testing methodologies, tools like Gating, Apache Jmeter.
- Work your way around Unix shell.
- Experience running hybrid clouds and on-prem infrastructures on Red Hat Enterprise Linux / CentOS
- A focus on delivering high-quality code through strong testing practices.
What do we Growth :
Growth knows no bounds, as we foster an environment that encourages creativity, embraces challenges, and cultivates a culture of continuous expansion. We are looking at new product lines, international markets and brilliant people to grow even further. We teach, groom and nurture our people to become leaders. You get to grow with a company that is growing exponentially.
2. Flex University :
- We help you upskill by organising in-house courses on important subjects
- Learning Wallet: You can also do an external course to upskill and grow, we reimburse it for you.
3. Culture :
- Community and Team building activities
- Host weekly, quarterly and annual events/parties.
4. Wellness :
- Mediclaim policy for you + parents + spouse + kids
- Experienced therapist for better mental health, improve productivity & work-life balance
We work 5 days from the office and we make sure people have everything they need :
- Free meals
- Snacks, goodies & a lot of fun culture