Site Reliability Engineer

2 days ago


Delhi, India noon Full time
Job Description - Site Reliability Engineer

About noon

noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.

noon operates without boundaries; we are aggressively and voraciously ambitious. Starting in 2017 with noon.com, the region’s homegrown e-commerce platform and leading online shopping destination, noon is now a digital ecosystem of products and services: noon, noon Food, Noon in Minutes, NowNow, SIVVI, noon One, and noon Pay.

At noon we have the courage to pursue what seems impossible, we work hard to get things done, and we go to great lengths to ensure that the experience of everyone, from our customers to our sellers or noon Bandidos, is stellar. Above all, we are grateful for the opportunities we have. If the above values resonate with you, you will enjoy this incredible journey with us.

Job Description

As a Site Reliability Engineer (SRE) at noonpayments, you will play a crucial role in maintaining and enhancing the reliability, availability, and performance of our cloud-based infrastructure and services. You will be responsible for automating deployments, optimizing systems, and ensuring seamless performance across our platforms.

This position requires a strong foundation in cloud infrastructure management, particularly with Azure (AKS) and GCP (GKE), alongside hands-on experience with Azure DevOps and monitoring tools like Datadog.

You will:

  • Cloud Infrastructure Management: Manage and optimize cloud environments across Azure and GCP (AKS, GKE), ensuring efficient resource utilization, high system availability, and scalability.
  • Infrastructure as Code: Use Terraform for infrastructure provisioning, ensuring consistent and scalable deployments, and manage infrastructure via Azure DevOps pipelines.
  • Configuration Management: Implement and manage system configurations using Ansible to ensure consistency and streamline updates across different environments.
  • Continuous Integration/Continuous Deployment (CI/CD): Develop, maintain, and optimize CI/CD pipelines within Azure DevOps to automate testing and deployment processes, reducing time from development to production.
  • Monitoring and Observability: Set up and maintain comprehensive monitoring and observability solutions using Datadog to track system health and performance and to detect issues proactively.
  • Container Orchestration: Deploy, manage, and optimize Kubernetes clusters to support scalable and resilient application deployments.
  • Incident Management: Participate in a 24/7 on-call or roster-based team to respond to incidents, conduct root cause analysis, and implement solutions to minimize downtime and ensure system reliability.
  • Performance Tuning: Continuously monitor system performance, identify bottlenecks, and implement optimizations to improve efficiency and response times.
  • Capacity Planning: Plan and manage system capacity to ensure resources meet current and future demands, enabling seamless service delivery.
  • Collaboration: Work closely with Network Operations Center (NOC) and DevOps teams to troubleshoot issues, optimize deployment processes, and drive continuous improvement.
  • Documentation: Create and maintain detailed documentation for system configurations, deployment processes, and incident reports.

Skill Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or any other related discipline, or equivalent related experience.
  • Certifications in Cloud, ITIL, or CKA are a plus.
  • 6+ years of directly related or relevant experience, preferably in information security.
  • Extensive experience with cloud platforms such as Azure, GCP, and Huawei Cloud.
  • Proficiency with Terraform for infrastructure automation and Ansible for configuration management.
  • Hands-on experience with Kubernetes for container orchestration, mainly AKS and GKE.
  • Expertise in monitoring and observability tools such as Datadog.
  • Familiarity with Azure VMSS and GCP MIG for virtual machine scaling and management.
  • Experience in a 24/7 on-call or roster-based team environment, focusing on system uptime and incident response.
  • Strong understanding of SRE processes and best practices for system reliability, availability, and performance.
  • Excellent problem-solving skills and the ability to handle complex technical issues under pressure.
  • Effective communication skills and a collaborative approach to working with diverse teams.
  • Experience with payment gateway projects or similar high-transaction systems is preferred.
  • Additional knowledge of advanced monitoring techniques, performance tuning, and capacity planning is a plus.

Who will excel?

We’re looking for candidates who thrive in a fast-paced, dynamic start-up environment. We’re searching for problem solvers, people who operate with a bias for action and have a deep understanding of the importance of resourcefulness over reliance.

Candor is our only default. Demanding unequivocally high standards should be non-negotiable because quality matters. We want people who are radically candid, cohorts who commit to settling for nothing but the best: in hiring, in accepting work from colleagues, and in your own work.

Ours is not an easy mission, but it is a meaningful one. Every hire must actively raise the bar of talent in the company to help us reach our vision.

  • Delhi, India TechBlocks Full time

    Seeking a skilled Senior Site Reliability Engineer with expertise in Google Cloud Platform (GCP) to join our dynamic team. As a Senior SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure and applications hosted on GCP. Responsibilities: Design, build, and maintain the core infrastructure used by all...


  • Delhi, India Integra Connect Full time

    About IntegraConnect: Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Delhi, India Integra Connect Full time

    About IntegraConnect: Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...


  • Delhi, India Persistent Systems Full time

    About Position: We are looking for Site Reliability Engineers who are proficient with monitoring tools, preferably New Relic. The person should have experience with Terraform, Docker, Kubernetes, and any cloud. Python coding experience is very much preferred. Role: Site Reliability Engineer. Location: Hyderabad. Experience: 8+ Yrs. Job Type: Full Time...


  • New Delhi, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer. Job Description: Key responsibilities: Troubleshoots and owns issues in our development, test and production environments, including performance optimisation and continuous tuning. Works alongside the DevOps team in...


  • New Delhi, India Antal International Full time

    Job Description Summary. Role description: Hiring a Site Reliability Engineer for a fast-growing energy technology company. Company description: Our client is one of the fastest-growing energy technology companies in India, founded by some of the leaders in this space. They lead technological innovation for the most effective energy...


  • Delhi, India Sigmaways Inc Full time

    Background: As a developer, you will work with a team of skilled Site Reliability Engineers and help them improve application reliability. You will play a critical role in the reliability of a massive-scale application that processes billions of events every day. You will collaborate with multiple stakeholders and help the team write...


  • Delhi, India Insight Global Full time

    Required Skills & Experience: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in Systems Engineering or Site Reliability Engineering. Strong proficiency in GoLang programming. Experience with Red Hat OpenShift and container technologies (Docker, Kubernetes). Understanding of cloud platforms (AWS, Azure,...


  • Delhi, India noon Full time

    Job Description - Site Reliability Engineer. About noon: noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs. noon operates without boundaries; we are...


  • New Delhi, India Mrsool Full time

    Who Are We❓ Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...


  • New Delhi, India Mrsool Full time

    Who Are We❓ Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...

  • SDIII Engineer

    5 days ago


    New Delhi, India Mrsool Full time

    Who Are We❓ Welcome to the world of Mrsool! ✨ Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery...


  • Delhi, India Vimeo Full time

    We are looking for self-starting, motivated, and extraordinary individuals with strong communication and interpersonal skills to join our Site Reliability Engineering team, which supports the database infrastructure and builds and runs a platform that delivers Vimeo products and services to customers around the world. What you’ll do: Gain a deep...


  • Delhi, India Vimeo Full time

    We are looking for self-starting, motivated, and extraordinary individuals with strong communication and interpersonal skills to join our Site Reliability Engineering team, which supports the database infrastructure and builds and runs a platform that delivers Vimeo products and services to customers around the world. What you’ll do: Gain a deep...

  • Senior Engineer

    4 weeks ago


    Delhi, India C&R Software Full time

    Job Description Summary: The Cloud Operations team is accountable for the operational excellence of the C&R cloud platform, which hosts several business-critical, client-facing applications. The objective of the SRE within Cloud Operations is to coordinate a timely and focused organisation-wide response to severe/high-impact technical incidents arising from...


  • Delhi, India Airtel International LLP-Airtel Africa Full time

    Job title: Site Reliability Engineer - Airtel Money. Work Location: Gurgaon. Division/Department: Engineering. Why Airtel Africa? At Airtel, we don’t just make things; we make things possible. Airtel Africa is on a mission to change the world by connecting people with ideas. We are building next generation systems to improve the quality of life for millions of...


  • Delhi, India McCain Foods Full time

    JOB RESPONSIBILITIES: Work with stakeholders such as product owners and Engineering to define service level objectives (SLOs) for system operations. Track performance against SLOs in partnership with monitoring teams or other stakeholders, and ensure systems continue to meet SLOs over time. Create dashboards and reports to communicate key metrics. Create...


  • Delhi, India Alp Consulting Ltd. Full time

    Experienced L3 SRE engineer based on business-critical SaaS application. Capacity to L3 across the full stack including infra backend and front-end, before escalation to engineering business unit. Capacity to automate SRE tools to provide proactive L3 support, close to our tech monitoring strategy. Capacity to work under business pressure for business-critical...

  • Site Engineer

    1 week ago


    Delhi, India Bare Wall Studio Full time

    Company Description: Bare Wall Studio is a dynamic multi-disciplinary design studio based in Bengaluru, India. Our team of passionate architects and designers is committed to delivering innovative and sustainable design solutions. We believe in leveraging technology to create impactful and lasting designs for the future. Role Description: This is a full-time...


  • Delhi, India Castlight Health Full time

    Job Description: Experience Level: 5 - 7 years. Responsibilities: ● Create reusable solutions using Terraform plans, Chef recipes and cookbooks, DSL for provisioning, maintaining and decommissioning the infrastructure ● Provide day-to-day support of multiple environments such as: production, staging, and development ● Provide 24x7 support for platform...