Site Reliability Engineer

1 day ago

delhi, India noon Full time

Job Description- Site Reliability Engineer

About noon
noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.

noon operates without boundaries; we are aggressively and voraciously ambitious. Starting in 2017 with noon.com, the region’s homegrown e-commerce platform and leading online shopping destination, noon is now a digital ecosystem of products and services - noon, noon Food, Noon in Minutes, NowNow, SIVVI, noon One, and noon Pay.

At noon we have the courage to pursue what seems impossible, we work hard to get things done, we go to great lengths to ensure that the experience of everyone from our customers to our sellers or noon Bandidos is stellar but above all, we are grateful for the opportunities we have. If you feel the above values resonate with you – you will enjoy this incredible journey with us

Job Description
As a Site Reliability Engineer (SRE) at noonpayments, you will play a crucial role in maintaining and enhancing the reliability, availability, and performance of our cloud-based infrastructure and services.

You will be responsible for automating deployments, optimizing systems, and ensuring seamless performance across our platforms. This position requires a strong foundation in cloud infrastructure management, particularly with Azure - AKS and GCP-GKE, alongside hands-on experience with Azure DevOps and monitoring tools like Datadog.

You will:
Cloud Infrastructure Management: Manage and optimize cloud environments across Azure and GCP, ensuring efficient resource utilization, high system availability, and scalability (AKS-GKE).
Infrastructure as Code: Utilize Terraform for infrastructure provisioning, ensuring consistent and scalable deployments, and managing infrastructure via Azure DevOps pipelines.
Configuration Management: Implement and manage system configurations using Ansible to ensure consistency and streamline updates across different environments.
Continuous Integration/Continuous Deployment (CI/CD): Develop, maintain, and optimize CI/CD pipelines within Azure DevOps to automate testing and deployment processes, reducing time from development to production.
Monitoring and Observability: Set up and maintain comprehensive monitoring and observability solutions using Datadog to track system health, performance, and proactively detect issues.
Container Orchestration: Deploy, manage, and optimize Kubernetes clusters to support scalable and resilient application deployments.
Incident Management: Participate in a 24/7 on-call or roster-based team to respond to incidents, conduct root cause analysis, and implement solutions to minimize downtime and ensure system reliability.
Performance Tuning: Continuously monitor system performance, identify bottlenecks, and implement optimizations to improve efficiency and response times.
Capacity Planning: Plan and manage system capacity to ensure resources meet current and future demands, enabling seamless service delivery.
Collaboration: Work closely with Network Operations Center (NOC) and DevOps teams to troubleshoot issues, optimize deployment processes, and drive continuous improvement .
Documentation: Create and maintain detailed documentation for system configurations, deployment processes, and incident reports.

Skill Requirements
Bachelor’s degree in computer science, Information Technology or any other related discipline or equivalent related experience.
Certifications in Cloud, ITIL, CKA are a plus.
6+ years of directly related or relevant experience, preferably in information security.
Extensive experience with cloud platforms such as Azure, GCP, and Huawei Cloud.
Proficiency with Terraform for infrastructure automation and Ansible for configuration management.
Hands-on experience with Kubernetes for container orchestration mainly AKS and GKE.
Expertise in monitoring and observability tools such as Datadog.
Familiarity with Azure VMSS, GCP MIG for virtual machine scaling and management.
Experience in a 24/7 on-call or roster-based team environment, focusing on system uptime and incident response.
Strong understanding of SRE processes and best practices for system reliability, availability, and performance.
Excellent problem-solving skills and the ability to handle complex technical issues under pressure.
Effective communication skills and a collaborative approach to working with diverse teams.
Experience with payment gateway projects or similar high-transaction systems is preferred.
Additional knowledge in advanced monitoring techniques, performance tuning, and capacity planning is a plus.

Who will excel?
We’re looking for candidates who thrive in a fast-paced, dynamic start-up environment. We’re searching for problem solvers, people who operate with a bias for action and have a deep understanding of the importance of resourcefulness over reliance.

Candor is our only default. Demanding unequivocal high standards should be non-negotiable because quality matters. We want people who are radically candid, cohorts who commit to settling for nothing but the best - in hiring, in accepting work from colleagues, and in your own work.

Ours is not an easy mission, but it is a meaningful one. Every hire must actively raise the bar of talent in the company to help us reach our vision.

Site Reliability Engineer

1 week ago

Delhi, India Integra Connect Full time

About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
Site Reliability Engineer

1 week ago

Delhi, India Integra Connect Full time

About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
Site Reliability Engineer

2 weeks ago

Delhi, India Persistent Systems Full time

About Position:We are looking for Site Reliability Engineers who are proficient with monitoring tools, preferably New Relic. The person should have experience with Terraform, Docker, Kubernetes, and any cloud. Python coding experience is very much preferred.Role: Site Reliability EngineerLocation: HyderabadExperience: 8+ Yrs.Job Type: Full Time...
Site Reliability Engineer

4 months ago

new delhi, India dentsu Full time

The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities:Troubleshoots and owns issues in our development, test and production environments. Including performance optimisation and continuous tuningWorks alongside the DevOps team in...
Site Reliability Engineer

7 days ago

new delhi, India Antal International Full time

Job Description Summary role description: Hiring for a Site Reliability Engineer for a fastest-growing energy technology company. Company description: Our client is one of the fastest-growing energy technology companies in India, founded by some of the leaders in this space. They lead technological innovation for the most effective energy...
Site Reliability Engineer

2 days ago

new delhi, India Antal International Full time

Job Description Summary role description: Hiring for a Site Reliability Engineer for a fastest-growing energy technology company. Company description: Our client is one of the fastest-growing energy technology companies in India, founded by some of the leaders in this space. They lead technological innovation for the most effective energy...
Site Reliability Engineer

2 weeks ago

delhi, India Insight Global Full time

Required Skills & ExperienceBachelor's degree in Computer Science, Engineering, or a related field.3+ years of experience in Systems Engineering or Site Reliability Engineering.Strong proficiency in GoLang programming.Experience with Red Hat OpenShift and container technologies (Docker, Kubernetes).Understanding of cloud platforms (AWS, Azure,...
Site Reliability Engineer

19 hours ago

delhi, India TrueBlue Inc. Full time

ROLE: The Site Reliability Engineer will participate in the monitoring and improvement of one of PeopleReady’s enterprise applications. She/he will use infrastructure monitoring tools to assess the overall system health providing metrics and visualization to report health to development/support teams. This role will also work closely with engineering...
Staff Site Reliability Engineer

24 hours ago

delhi, India Moveworks Full time

Who We Are Moveworks is the universal AI copilot for search and automation across all your business applications. We give employees one place to go to find information and get support while reducing costs for your business. The Moveworks Copilot is powered by an industry-leading Reasoning Engine that uses a combination of public and proprietary language...
Site Reliability Engineer

2 weeks ago

Delhi, India noon Full time

Job Description- Site Reliability EngineerAbout noonnoon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.noon operates without boundaries; we are...
Site Reliability Engineer

1 day ago

delhi, India noon Full time

Job Description- Site Reliability EngineerAbout noonnoon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.noon operates without boundaries; we are...
Site Reliability Engineer

1 week ago

Delhi, India noon Full time

Job Description- Site Reliability EngineerAbout noon noon.Com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.noon operates without boundaries;we are...
Manager- Site Reliability Engineering

1 week ago

new delhi, India Mrsool Full time

Who Are We❓ Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...
Manager- Site Reliability Engineering

2 weeks ago

New Delhi, India Mrsool Full time

Who Are We❓Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...
Manager- Site Reliability Engineering

2 days ago

new delhi, India Mrsool Full time

Who Are We❓Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...
SDIII Engineer

2 weeks ago

New Delhi, India Mrsool Full time

Who Are We❓Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...
Senior Site Reliability Engineer

1 month ago

Delhi, India Vimeo Full time

We are looking for Self starter, motivated and extraordinary individuals with strong communication and interpersonal skills to join our Site Reliability Engineering team that supports the database infrastructure, as well as builds and runs a platform that delivers Vimeo product/ services to all of its customers around the world.What you’ll do:Gain a deep...
Senior Site Reliability Engineer

1 week ago

Delhi, India Vimeo Full time

We are looking for Self starter, motivated and extraordinary individuals with strong communication and interpersonal skills to join our Site Reliability Engineering team that supports the database infrastructure, as well as builds and runs a platform that delivers Vimeo product/ services to all of its customers around the world.What you’ll do:Gain a deep...
Senior Site Reliability Engineer

12 hours ago

delhi, India C&R Software Full time

Job Description Summary The Cloud Operations team is accountable for the operational excellence of the C&R cloud platform, which hosts several business-critical, client-facing applications. The objective of the SRE within Cloud Operations is to coordinate a timely and focused organisational-wide response to severe/high-impact technical incidents airing from...
Senior Engineer

1 month ago

Delhi, India C&R Software Full time

Job Description SummaryThe Cloud Operations team is accountable for the operational excellence of the C&R cloud platform, which hosts several business-critical, client-facing applications. The objective of the SRE within Cloud Operations is to coordinate a timely and focused organisational-wide response to severe/high-impact technical incidents airing from...

Americas

Europe

Asia / Oceania

Africa

Site Reliability Engineer