
Expert in Large Scale Cluster Management
2 weeks ago
We are seeking an experienced Kubernetes Expert who will be responsible for designing, implementing, and managing large-scale Kubernetes clusters with a strong focus on performance, security, and reliability.
Key Responsibilities:
- Design, deploy, and manage highly available Kubernetes clusters across multi-cloud and on-prem environments.
- Implement security best practices, role-based access control (RBAC), and compliance policies.
- Ensure smooth scaling, monitoring, and troubleshooting of clusters to meet enterprise-grade requirements.
- Integrate GPU support within Kubernetes clusters to optimize performance for AI/ML workloads.
- Collaborate with data science and engineering teams to ensure seamless execution of GPU-intensive applications.
- Develop and implement metering and monitoring solutions to track cloud resource consumption.
- Optimize resource allocation and provide insights for cost optimization and efficiency.
- Provide expertise on integrating Kubernetes with OpenStack environments.
- Manage and optimize hybrid cloud deployments leveraging both Kubernetes and OpenStack.
- Work closely with DevOps, Cloud, and Infrastructure teams to implement best practices.
- Prepare detailed documentation, runbooks, and guidelines for cluster operations.
Required Expertise & Skills:
- Proven experience in designing, deploying, and managing Kubernetes clusters at scale.
- Hands-on experience in enabling GPU support in Kubernetes for AI/ML workloads.
- Strong knowledge of containerization technologies (Docker, CRI-O, containerd, etc.).
- Experience with monitoring and metering solutions (Prometheus, Grafana, custom tooling, etc.) for cloud resource utilization.
- Understanding of networking concepts within Kubernetes (CNI plugins, ingress, service mesh, etc.).
- Good knowledge of OpenStack services and experience with Kubernetes-OpenStack integration (preferred).
- Strong problem-solving, debugging, and performance-tuning skills.
- Familiarity with CI/CD pipelines and automation tools (Helm, Ansible, Terraform, ArgoCD, etc.).
-
Optimizing Large-Scale HPC Platforms
2 weeks ago
Chennai, Tamil Nadu, India beBeeHighperformance Full time ₹ 1,80,00,000 - ₹ 2,50,00,000
Senior Linux Engineer - Quantitative Research
We are seeking an experienced Linux engineer to lead the design, engineering, and automation of large-scale High-Performance Computing (HPC) platforms for quantitative finance and advanced research.
- Develop and manage scalable HPC environments with a focus on performance optimization.
- Research and experiment with...
-
Senior Kafka Cluster Architect
2 weeks ago
Chennai, Tamil Nadu, India beBeeKafka Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
We are seeking a seasoned Kafka Administrator to join our team. The ideal candidate will have extensive experience in designing, implementing, and managing large-scale Kafka clusters.
Key Responsibilities:
- Design, deploy, and manage high-performance Kafka clusters across on-prem and cloud-based environments.
- Perform regular upgrades, patches, and disaster recovery...
-
Expert Large Scale Data Processing Professional
2 weeks ago
Chennai, Tamil Nadu, India beBeeSoftwareEngineer Full time US$ 1,50,000 - US$ 2,00,000
Job Title: Big Data Engineer
The role of a Software Engineer (Data) is crucial in the development of large-scale data processing systems for LLM research. We are seeking an experienced professional with deep expertise in designing, building, and operating scalable data infrastructure to support distributed computing and data orchestration.
Key...
-
Cluster Head
3 weeks ago
Chennai, Tamil Nadu, India NetAmbit Full time
We're Hiring: Cluster Head – GPay Revisit Process
NetAmbit is looking for a dynamic and experienced professional to lead our GPay Revisit Process Operations.
Role: Cluster Head
Location: Bangalore, Chennai, Hyd, Kanchipuram
Team Size: 150+
Key Responsibilities:
- Lead and manage a team of 150+ associates in the GPay Revisit Process.
- Drive operational...
-
Cluster Head
3 weeks ago
Chennai, Tamil Nadu, India NetAmbit Full time
We're Hiring: Cluster Head – GPay Revisit Process
NetAmbit is looking for a dynamic and experienced professional to lead our GPay Revisit Process Operations.
Role: Cluster Head
Location: Bangalore, Chennai, Hyd, Kanchipuram
Team Size: 150+
Key Responsibilities:
- Lead and manage a team of 150+ associates in the GPay Revisit Process.
- Drive operational...
-
Cluster Head
2 weeks ago
Chennai, Tamil Nadu, India NetAmbit Full time
We're Hiring: Cluster Head – GPay Process
NetAmbit is looking for a dynamic and experienced professional to lead our GPay Process Operations.
Role: Cluster Head
Location: Bangalore, Chennai, Hyd, Kanchipuram
Team Size: 150+
Key Responsibilities:
- Lead and manage a team of 150+ associates in the GPay Process.
- Drive operational excellence, ensure process...
-
Cluster Head
2 weeks ago
Chennai, Tamil Nadu, India NetAmbit Full time
We're Hiring: Cluster Head – GPay Revisit Process
NetAmbit is looking for a dynamic and experienced professional to lead our GPay Revisit Process Operations.
Role: Cluster Head
Location: Bangalore, Chennai, Hyd, Kanchipuram
Team Size: 150+
Key Responsibilities:
- Lead and manage a team of 150+ associates in the GPay Revisit Process.
- Drive operational excellence,...
-
Cluster Head
2 weeks ago
Chennai, Tamil Nadu, India NetAmbit Full time
We're Hiring: Cluster Head – GPay Revisit Process
NetAmbit is looking for a dynamic and experienced professional to lead our GPay Revisit Process Operations.
Role: Cluster Head
Location: Bangalore, Chennai, Hyd, Kanchipuram
Team Size: 150+
Key Responsibilities:
- Lead and manage a team of 150+ associates in the GPay Revisit Process.
- Drive operational...
-
Cluster Head
2 weeks ago
Chennai, Tamil Nadu, India NetAmbit Full time
We're Hiring: Cluster Head – GPay Process
NetAmbit is looking for a dynamic and experienced professional to lead our GPay Process Operations.
Role: Cluster Head
Location: Bangalore, Chennai, Hyd, Kanchipuram
Team Size: 150+
Key Responsibilities:
- Lead and manage a team of 150+ associates in the GPay Process.
- Drive operational excellence, ensure process...