
Highly Skilled Cloud Operations Engineer Wanted
1 week ago
We are seeking an experienced cloud operations engineer with expertise in Site Reliability Engineering (SRE) to ensure the availability, reliability, and performance of our platform services and applications.
This role will be critical in ensuring the high availability and reliability of our cloud-based RAN and Core Network platforms. The successful candidate will have a strong understanding of Kubernetes, container orchestration, and automation best practices.
Key Responsibilities:
- Platform Reliability & Availability:
- Run the production environment by proactively monitoring availability and taking a holistic view of system health for our cloud-based RAN and Core Network platforms.
- Improve the reliability and quality of the system through automation, process refinement, and best practices for both RAN and Core cloud components.
- Measure and optimize system performance to ensure efficient resource utilization and optimal user experience for network services.
- Ensure services are available, the underlying infrastructure is properly functioning and monitor critical applications and related services to guarantee system availability for RAN and Core functions.
- Cloud Operations & Kubernetes Management:
- Design, deploy, and manage Kubernetes clusters and related cloud infrastructure for both RAN and Core Network application deployments.
- Implement and maintain containerization strategies and orchestration best practices for telecom workloads.
- Manage and troubleshoot Robin storage solutions within the Kubernetes environment, supporting the unique storage needs of RAN and Core applications.
- Implement and manage CI/CD pipelines for cloud-native RAN and Core applications.
- Responsible for cloud resource provisioning, scaling, and cost optimization for all deployed network functions.
- Incident & Problem Management:
- Collaborate on high-priority incident tickets, ensuring rapid system recovery for both RAN and Core impacted services.
- Be on standby to interface with developers when issues arise and provide immediate technical insights and support for cloud-native network functions.
- Lead Problem Management efforts, including Root Cause Analysis (RCA), for complex incidents affecting RAN and Core cloud deployments.
- Identify bugs and work with development teams to prioritize and implement fixes for cloud-native network elements.
- Monitoring & Alerting:
- Implement and maintain robust monitoring, logging, and alerting solutions for cloud infrastructure and applications supporting RAN and Core services.
- Define and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for critical RAN and Core services running in the cloud.
- Automation & Tooling:
- Develop and implement automation scripts and tools to streamline operational tasks, deployments, and incident response for cloud-native RAN and Core components.
- Evaluate and integrate new tools and technologies to enhance operational efficiency.
- Collaboration & Knowledge Sharing:
- Support governance reports, providing technical data and insights on cloud platform performance for RAN and Core.
- Handle customer queries with technical expertise and provide timely resolutions related to cloud-deployed network services.
- Provide training and mentorship to junior team members on cloud technologies and SRE practices, specifically in the context of telecom networks.
- Work closely with development, network, and security teams to ensure seamless service delivery across the entire network architecture.
Technical Requirements:
- Deep expertise in Kubernetes:
- Cluster deployment, management, and troubleshooting for high-performance telecom workloads.
- Container orchestration, Pod lifecycle, Deployments, Services, Ingress.
- Helm charts, Kustomize.
- Advanced networking within Kubernetes (CNI, CoreDNS, service mesh concepts).
- Security best practices in Kubernetes, especially for critical network functions.
- Proficiency in Cloud Platforms: Experience with at least one major cloud provider (e.g., AWS, Azure, GCP) with focus on enterprise-grade infrastructure.
- Containerization Technologies: Docker, container.
- Robin Storage: Hands-on experience with Robin.io or similar distributed persistent storage solutions for Kubernetes, particularly for stateful RAN and Core applications.
- Infrastructure as Code (IaC): Terraform, Ansible, or similar tools for automating cloud and Kubernetes deployments.
- Scripting & Automation: Strong proficiency in Python, Go, Bash, or similar for developing automation and tooling.
- Monitoring & Logging Tools: Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, or similar, with experience in large-scale data ingestion and analysis.
- CI/CD Tools: Jenkins, GitLab CI/CD, Argo CD, or similar, for continuous deployment of network functions.
- Operating Systems: Linux (e.g., CentOS, Ubuntu, RHEL) expert-level knowledge.
- Networking Fundamentals: Deep understanding of TCP/IP, DNS, Load Balancing, Firewalls, VPNs, and advanced network concepts relevant to telecom (e.g., SRv6, Segment Routing, GTP-U/C).
- Telecommunications Network Knowledge:
- Strong understanding of Radio Access Network (RAN) architecture, components, and interfaces (e.g., O-RAN, vRAN concepts).
- Strong understanding of Core Network (EPC/5GC) architecture, functions (e.g., AMF, SMF, UPF, MME, SGW, PGW), and protocols.
- Familiarity with network function virtualization (NFV) and software-defined networking (SDN) principles.
Qualifications:
- Education: Bachelor's degree in computer science, Engineering, or a related field.
- Experience: Minimum of 5-6 years of experience in a Cloud Engineering, DevOps, or SRE role, with a significant focus on Kubernetes and cloud operations, ideally within a telecommunications or high-availability environment.
- Problem-Solving: Exceptional analytical and problem-solving skills, with a methodical approach to debugging complex distributed systems.
- Communication: Excellent verbal and written communication skills, capable of effectively collaborating with technical and non-technical stakeholders.
- Proactive Mindset: Ability to anticipate issues, identify risks, and propose preventative solutions.
- Incident Response: Proven experience in responding to and resolving critical production incidents in a fast-paced environment.
- Continuous Improvement: A strong desire to learn, adapt, and drive continuous improvement in processes and systems.
-
Highly Skilled Cloud Operations Specialist
1 week ago
Mumbai, Maharashtra, India beBeeSiteReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Senior Cloud Engineer PositionWe are seeking a skilled and experienced Senior Cloud Engineer to join our team. In this role, you will be responsible for ensuring the smooth operation of our cloud-based infrastructure, identifying potential issues before they occur, and implementing solutions to prevent downtime.Key Responsibilities:Maintain and enhance...
-
Highly Skilled Cloud Native Engineer
4 days ago
Mumbai, Maharashtra, India beBeeReliability Full time ₹ 15,00,000 - ₹ 18,00,000We are seeking a highly skilled Site Reliability Engineer to join our team. This role requires 6+ years of experience in designing and implementing scalable, resilient cloud-native infrastructure across multiple platforms.The successful candidate will own the SRE function, including availability, latency, performance monitoring, emergency response, and...
-
Mumbai, Maharashtra, India beBeeSkills Full time ₹ 9,00,000 - ₹ 12,00,000Job Title Network Operations Engineer Position About the Role We are seeking a skilled Network Operations Engineer to join our Infrastructure Engineering team. As a key member, you will play a vital role in supporting critical services across network infrastructure, cloud platforms, and operational tasks. Key Responsibilities: Incident Management: Provide...
-
Mumbai, Maharashtra, India beBeeCloud Full time US$ 1,00,000 - US$ 1,50,000Senior Cloud Software DeveloperWe are seeking a highly skilled and motivated Senior Cloud Software Developer to join our team. As a senior member of our team, you will play a key role in designing, developing, and deploying cloud-based software solutions.The ideal candidate will have a strong background in software development, with a focus on cloud...
-
Highly Skilled Cloud Computing Professional
1 week ago
Mumbai, Maharashtra, India beBeeCloud Full time ₹ 15,000 - ₹ 28,00,000Cloud Network DevOps EngineerWe are seeking a highly skilled Cloud Network DevOps Engineer to join our team. As a Cloud Network DevOps Engineer, you will be responsible for implementing and maintaining the Azure network infrastructure using Infrastructure as Code (IaC), with CI/CD pipelines using GitHub Actions Workflow.In this role, you will develop and...
-
Mumbai, Maharashtra, India beBeeSoftwareEngineer Full time US$ 1,04,000 - US$ 1,30,878Job OpportunityCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, powers complex distributed software systems across the world.The company is a pioneer of global collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet...
-
Highly Skilled Process Engineer Wanted
5 days ago
Mumbai, Maharashtra, India beBeeProcess Full time ₹ 1,20,00,000 - ₹ 1,50,00,000Process Engineer OpportunityWe are seeking a highly skilled Process Engineer to join our organization.The ideal candidate will have experience in developing complex dynamic simulation process models and smart applications using Simulation and Optimization tools.Key Responsibilities:Translate project requirements into specifications and deliverables.Produce...
-
Highly Skilled Cloud Architect
2 weeks ago
Mumbai, Maharashtra, India beBeeCloudInfrastructure Full time ₹ 15,00,000 - ₹ 25,00,000We are seeking a skilled and experienced Cloud Infrastructure ArchitectJob Description:Design, implement, and maintain scalable infrastructure for delivering web, mobile, and big data applications on cloud platforms.Create highly available and fault-tolerant systems using cloud-native services.Scale and optimize various SQL and NoSQL databases, web servers,...
-
Mumbai, Maharashtra, India beBeeEngineering Full time ₹ 20,00,000 - ₹ 25,00,000Infrastructure Operations SpecialistWe are seeking an experienced expert to join our organization.Job Summary:The ideal candidate will have a strong background in systems engineering, with expertise in designing and implementing scalable and reliable infrastructure. They will be responsible for ensuring smooth operations, identifying potential issues before...
-
Highly Skilled Deployment Specialist Wanted
2 weeks ago
Mumbai, Maharashtra, India beBeeDeployment Full time ₹ 15,00,000 - ₹ 20,00,000Deployment Engineer RoleWe are seeking a highly skilled professional to fill this key position.Key ResponsibilitiesDesign and implement technical solutions using latest technologies and tools.Monitor availability and take a holistic view of application and system health.Implement automated software/tools to manage platform infrastructure and applications...