Sr Systems Engineer Linux – AI Infrastructure
1 week ago
Position: Senior Linux Administrator – AI/ML InfrastructureLocation: Remote Experience: 5+ Years Type: Full-timeRole OverviewWe are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises Linux servers optimized for AI/ML workloads.The ideal candidate will have deep expertise in Linux system administration, Kubernetes cluster management, and a strong understanding of data center infrastructure components including servers, networking, storage, and virtualization technologies.This role requires hands-on experience in automating infrastructure, optimizing performance, and ensuring reliability for high-performance computing (HPC) and AI/ML pipelines.Key ResponsibilitiesDeploy, configure, and manage on-premises Linux servers supporting AI/ML workloads.Set up, manage, and troubleshoot Kubernetes clusters for containerized workloads.Optimize system and network performance for compute-intensive applications.Automate provisioning and configuration using Ansible, Terraform, and scripting (Bash/Python).Administer and monitor data center components such as servers, storage arrays, switches, and power systems.Ensure system security, patch management, and compliance across environments.Collaborate with DevOps, Data Science, and AI engineering teams to enable seamless integration with ML pipelines.Plan and implement scalability strategies, maintaining uptime and redundancy.Maintain comprehensive documentation of configurations, policies, and network diagrams.Required Skills & Qualifications7+ years of experience in Linux system administration (RHEL, Ubuntu, CentOS).Proven hands-on experience with Kubernetes cluster management (setup, scaling, troubleshooting).CKA (Certified Kubernetes Administrator) certification is mandatory.Strong knowledge of data center components – servers, racks, networking switches, storage systems, and virtualization layers.Experience with Ansible, Terraform, CI/CD pipelines, and infrastructure automation.Proficiency in scripting languages (Bash, Python).Understanding of performance tuning, system optimization, and fault diagnosis.Excellent problem-solving, communication, and collaboration skills.Preferred / Good to HaveExposure to NVIDIA GPU management, CUDA environments, and AI/ML compute nodes.Familiarity with HPC environments and distributed computing frameworks.Experience managing monitoring systems (Prometheus, Grafana) and backup solutions.Knowledge of DevOps practices, containerization, and hybrid cloud environments.
-
Senior Linux System Administrator
4 weeks ago
Delhi, India GoodSpace AI Full timeJob Title: Linux Infrastructure Engineer – HPC & CloudLocation: Kurla, MumbaiType: Onsite, 5 days a weekPosition Overview:We are seeking a skilled HPC Linux System Administrator to manage and optimize our high-performance computing infrastructure. In this role, you’ll be responsible for deploying, configuring, and maintaining scalable Linux-based...
-
Linux System Engineer
6 days ago
New Delhi, India PlusWealth Capital Management LLP Full timeJob Description for Devops /Linux EngineerAbout Us PlusWealth Capital Management LLP is a proprietary high-frequency trading firm, active in multiple markets including equities, options, and futures. We thrive on building cutting edge, data-driven, and tech-based trading algorithms. As a dynamic, machine-learning oriented trading platform, we embody the...
-
Linux System Engineer
5 days ago
New Delhi, India PlusWealth Capital Management LLP Full timeJob Description for Devops /Linux EngineerAbout Us PlusWealth Capital Management LLP is a proprietary high-frequency trading firm, active in multiple markets including equities, options, and futures. We thrive on building cutting edge, data-driven, and tech-based trading algorithms. As a dynamic, machine-learning oriented trading platform, we embody the...
-
New Delhi, India Trident Consulting Full timeTrident Consulting is seeking an”Systems Engineer(AI, GPU and Cloud Infrastructure)"for one of our clients in"Remote".A global leader in business and technology services.Role:Systems Engineer(AI, GPU and Cloud Infrastructure) Location:India(Remote) Type:Contract to hire/FulltimeRequired Skills: Minimum 4 years of experience in systems engineering,...
-
New Delhi, India Trident Consulting Full timeTrident Consulting is seeking an”Systems Engineer(AI, GPU and Cloud Infrastructure)"for one of our clients in"Remote".A global leader in business and technology services.Role:Systems Engineer(AI, GPU and Cloud Infrastructure) Location:India(Remote) Type:Contract to hire/FulltimeRequired Skills: Minimum 4 years of experience in systems engineering,...
-
System Engineer L2 Linux Kubernetes
4 weeks ago
New Delhi, India SpeedMart Full timeCompany Profile Our client is a global IT services company that helps businesses with digital transformation with offices in India and the United States. It helps businesses with digital transformation, provide IT collaborations and uses technology, innovation, and enterprise to have a positive impact on the world of business. With expertise is in the fields...
-
Linux System Administrator
4 weeks ago
New Delhi, India Paroscale Technologies Pvt Ltd Full timeJob Description – System Administrator (Linux, Parallel Filesystems)Company: Paroscale Technologies Pvt LtdPosition Type: Full-timeAbout the RoleWe are looking for highly skilled System Administrator Engineers with deep expertise in Linux systems and hands-on experience deploying, configuring, and troubleshooting Parallel File Systems (PFS) such as Lustre...
-
AI System Test Engineer
4 weeks ago
New Delhi, India MeshDefend Full timeMeshDefend is an early-stage VC funded startup that is building AI-Native Agentic Operating System forEnterprise Data Infrastructure. We are creating AI Agents to make Data Infrastructure Intelligent,Autonomous and Secure. Our founding team is composed of seasoned technologists and inventors, eachwith a track record of building global product portfolios...
-
Senior AI Infrastructure
2 weeks ago
new delhi, India DeepSource Technologies Full timeRole OverviewWe are seeking a highly skilled Senior AI Infrastructure & Platform Engineer to join our client's team in Riyadh. In this role, you'll be responsible for building, managing, and optimizing scalable AI infrastructure and compute environments that support high-performance workloads, including GPU-accelerated AI/ML pipelines, cluster scheduling,...
-
Sr. AI Engineer
4 weeks ago
New Delhi, India AI Planet Full time|• Location: Hyderabad, India (Hybrid)• Experience: 4+ Years (Mandatory)• Employment Type: Full-Time• Department: AI Engineering / Product InnovationAt AI Planet, we’re building the future of human-like AI systems, from intelligent agents to cutting-edge generative tools that redefine how people create, collaborate, and innovate. Our team blends...