Cloud Operations L2 Support Engineer

3 weeks ago


New Delhi, India Rakuten Symphony Full time

Job Summary :We are seeking a highly skilled and experienced Cloud Engineer with a strong Site Reliability Engineering (SRE) mindset to join our team. This role will be critical in ensuring the availability, reliability, and performance of our platform services and applications, particularly those supporting our Radio Access Network (RAN) and Core Network functions deployed on cloud infrastructure. The ideal candidate will possess deep expertise in Kubernetes, cloud operations, and a passion for optimizing complex distributed systems. You will be instrumental in running our production environment, responding to critical incidents, and driving continuous improvement in system reliability and efficiency across both RAN and Core cloud deployments.Key Responsibilities:- Platform Reliability & Availability (SRE Focus): - Run the production environment by proactively monitoring availability and taking a holistic view of system health for our cloud-based RAN and Core Network platforms. - Improve the reliability and quality of the system through automation, process refinement, and best practices for both RAN and Core cloud components. - Measure and optimize system performance to ensure efficient resource utilization and optimal user experience for network services. - Ensure services are available, the underlying infrastructure is properly functioning and monitor critical applications and related services to guarantee system availability for RAN and Core functions. - Cloud Operations & Kubernetes Management: - Design, deploy, and manage Kubernetes clusters and related cloud infrastructure for both RAN and Core Network application deployments. - Implement and maintain containerization strategies and orchestration best practices for telecom workloads. - Manage and troubleshoot Robin storage solutions within the Kubernetes environment, supporting the unique storage needs of RAN and Core applications. - Implement and manage CI/CD pipelines for cloud-native RAN and Core applications. - Responsible for cloud resource provisioning, scaling, and cost optimization for all deployed network functions. - Incident & Problem Management: - Collaborate for high-priority incident tickets (e.g., MIC Reported Incident, Serious/Medium/Small Network Incidents, RIUD Faults), ensuring rapid system recovery for both RAN and Core impacted services. - Be on standby to interface with developers when issues arise and get escalated, providing immediate technical insights and support for cloud-native network functions. - Lead Problem Management efforts, including Root Cause Analysis (RCA), for complex incidents affecting RAN and Core cloud deployments. - Identify bugs and work with development teams to prioritize and implement fixes for cloud-native network elements. - Monitoring & Alerting: - Implement and maintain robust monitoring, logging, and alerting solutions for cloud infrastructure and applications supporting RAN and Core services. - Define and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for critical RAN and Core services running in the cloud. - Automation & Tooling: - Develop and implement automation scripts and tools to streamline operational tasks, deployments, and incident response for cloud-native RAN and Core components. - Evaluate and integrate new tools and technologies to enhance operational efficiency. - Collaboration & Knowledge Sharing: - Support for Governance Reports, providing technical data and insights on cloud platform performance for RAN and Core. - Handle customer queries with technical expertise and provide timely resolutions related to cloud-deployed network services. - Provide training and mentorship to junior team members on cloud technologies and SRE practices, specifically in the context of telecom networks. - Work closely with development, network, and security teams to ensure seamless service delivery across the entire network architecture.Technical Requirements (Most Visible):- Deep expertise in Kubernetes: - Cluster deployment, management, and troubleshooting for high-performance telecom workloads. - Container orchestration, Pod lifecycle, Deployments, Services, Ingress. - Helm charts, Kustomize. - Advanced networking within Kubernetes (CNI, CoreDNS, service mesh concepts). - Security best practices in Kubernetes, especially for critical network functions. - Proficiency in Cloud Platforms: Experience with at least one major cloud provider (e.g., AWS, Azure, GCP) with focus on enterprise-grade infrastructure. - Containerization Technologies: Docker, container. - Robin Storage: Hands-on experience with Robin.io or similar distributed persistent storage solutions for Kubernetes, particularly for stateful RAN and Core applications. - Infrastructure as Code (IaC): Terraform, Ansible, or similar tools for automating cloud and Kubernetes deployments. - Scripting & Automation: Strong proficiency in Python, Go, Bash, or similar for developing automation and tooling. - Monitoring & Logging Tools: Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, or similar, with experience in large-scale data ingestion and analysis. - CI/CD Tools: Jenkins, GitLab CI/CD, Argo CD, or similar, for continuous deployment of network functions. - Operating Systems: Linux (e.g., CentOS, Ubuntu, RHEL) expert-level knowledge. - Networking Fundamentals: Deep understanding of TCP/IP, DNS, Load Balancing, Firewalls, VPNs, and advanced network concepts relevant to telecom (e.g., SRv6, Segment Routing, GTP-U/C). - Telecommunications Network Knowledge: - Strong understanding of Radio Access Network (RAN) architecture, components, and interfaces (e.g., O-RAN, vRAN concepts). - Strong understanding of Core Network (EPC/5GC) architecture, functions (e.g., AMF, SMF, UPF, MME, SGW, PGW), and protocols. - Familiarity with network function virtualization (NFV) and software-defined networking (SDN) principles.Qualifications:- Education: Bachelor’s degree in computer science, Engineering, or a related field. - Experience: Minimum of 5-6 years of experience in a Cloud Engineering, DevOps, or SRE role, with a significant focus on Kubernetes and cloud operations, ideally within a telecommunications or high-availability environment. - Problem-Solving: Exceptional analytical and problem-solving skills, with a methodical approach to debugging complex distributed systems. - Communication: Excellent verbal and written communication skills, capable of effectively collaborating with technical and non-technical stakeholders. - Proactive Mindset: Ability to anticipate issues, identify risks, and propose preventative solutions. - Incident Response: Proven experience in responding to and resolving critical production incidents in a fast-paced environment. - Continuous Improvement: A strong desire to learn, adapt, and drive continuous improvement in processes and systems.



  • New Delhi, India Ingenico Full time

    Position Title:Technical Support Engineer L2 Cloud Solution Location:Noida – Hybrid Reports to:Service Delivery Manager / L2 ManagerAbout the Role: As a Technical Support Engineer L2 Cloud Solution at Ingenico, you will provide cloud platform and production support for Ingenico devices and platforms. You will manage incidents and technical issues for...

  • L2 Support Engineer

    2 weeks ago


    New Delhi, India TECEZE Full time

    Job Title: L2 Support EngineerLocation: PunePosition Type: Full TimeExperience: 3-5 YearsJob Overview:We are seeking an experienced L2 Support Engineer to join our team. The candidate will provide advanced technical support, manage IT infrastructure, and ensure smooth daily operations across networks, servers, and end-user systems.Key Responsibilities:-...

  • L2 Support Engineer

    2 weeks ago


    New Delhi, India Insight Global Full time

    Required Skills & Experience- 3-7+ years of experience providing L2 Incident Support- Experience supporting apps with the following tech stack: Spring boot, Microservices, Java- Experience supporting SQL databases- Experience in L2 IT support, including hardware/software troubleshooting, check logs, system monitoring, patch deployment, etc- Experience with...


  • New Delhi, India TRDFIN Support Services Pvt Ltd Full time

    Job Role: L2 - Desktop Support EngineerExperience : 1 yearsBudget : 17k to 20k as a take homeLocation: Chennai / GurugramFlexible to work in rotational shift with 5 days workingRoles and Responsibilities:-Hands on experience in Active Directory- user id (Addition / Deletion / Disable)-OS installation and in Bit locker encryption Assembling-Hardware...

  • L2 Engineer

    2 days ago


    New Delhi, India SourceFuse Full time

    SourceFuse Technologies hiring L2 Engineer - OSS Support with 5+ years of experience.Overview: We are seeking a highly motivated and experienced Open-Source Software (OSS) Support Engineer with a strong background in the Telecom domain to join our growing team. In this role, you will be responsible for providing technical support and guidance to our users...


  • New Delhi, India TECEZE Full time

    Job Title: Network Engineer – Cisco ACI (L2 Operations)Company: TECEZEExperience: 5–8 YearsLocation: MumbaiAvailability: Immediate Joiners PreferredOverviewWe are seeking a skilled Cisco ACI Network Operations Engineer with hands-on experience in L2 Data Center Networking to support a mission-critical, low-latency trading environment adhering to National...


  • New Delhi, India Sharp Brains Full time

    Position: Network Engineer - L2 Support Job Type: Full - Time (On-Site) Contract Duration: 01 - Year Compensation: ₹ 60,000 to ₹ 70,000 INR Monthly Location: Navi Mumbai, Maharashtra, IndiaPosition Overview: We are looking for aNetwork Engineer (Level 2)must proficient inEnglishto provide end-user IT support for our client in Navi Mumbai. The candidate...


  • New Delhi, India Signzy Full time

    Job Title: Product Support Engineer( L2 )About SignzySignzy is an AI-powered RPA platform designed for financial services. Our platform can automate even the most complex workflows and decision-making processes into real-time APIs. Powered by Nebula, our no-code AI model builder, and a Fintech API Marketplace with over 200+ APIs, Signzy serves 90+ financial...


  • New Delhi, India Signzy Full time

    Job Title: Product Support Engineer( L2 )About SignzySignzy is an AI-powered RPA platform designed for financial services. Our platform canautomate even the most complex workflows and decision-making processes into real-time APIs.Powered by Nebula, our no-code AI model builder, and a Fintech API Marketplace with over200+ APIs, Signzy serves 90+ financial...

  • L2 Support Engineer

    3 weeks ago


    Delhi, India Insight Global Full time

    Required Skills & Experience- 3-7+ years of experience providing L2 Incident Support- Experience supporting apps with the following tech stack: Spring boot, Microservices, Java- Experience supporting SQL databases- Experience in L2 IT support, including hardware/software troubleshooting, check logs, system monitoring, patch deployment, etc- Experience with...