Sr Systems Engineer Linux – AI Infrastructure

4 weeks ago


Guntur, India DC Tech Consulting Full time

Position: Senior Linux Administrator – AI/ML InfrastructureLocation: RemoteExperience: 5+ YearsType: Full-timeRole OverviewWe are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises Linux servers optimized for AI/ML workloads.The ideal candidate will have deep expertise in Linux system administration, Kubernetes cluster management, and a strong understanding of data center infrastructure components including servers, networking, storage, and virtualization technologies.This role requires hands-on experience in automating infrastructure, optimizing performance, and ensuring reliability for high-performance computing (HPC) and AI/ML pipelines.Key ResponsibilitiesDeploy, configure, and manage on-premises Linux servers supporting AI/ML workloads.Set up, manage, and troubleshoot Kubernetes clusters for containerized workloads.Optimize system and network performance for compute-intensive applications.Automate provisioning and configuration using Ansible, Terraform, and scripting (Bash/Python).Administer and monitor data center components such as servers, storage arrays, switches, and power systems.Ensure system security, patch management, and compliance across environments.Collaborate with DevOps, Data Science, and AI engineering teams to enable seamless integration with ML pipelines.Plan and implement scalability strategies, maintaining uptime and redundancy.Maintain comprehensive documentation of configurations, policies, and network diagrams.Required Skills & Qualifications7+ years of experience in Linux system administration (RHEL, Ubuntu, CentOS).Proven hands-on experience with Kubernetes cluster management (setup, scaling, troubleshooting).CKA (Certified Kubernetes Administrator) certification is mandatory.Strong knowledge of data center components – servers, racks, networking switches, storage systems, and virtualization layers.Experience with Ansible, Terraform, CI/CD pipelines, and infrastructure automation.Proficiency in scripting languages (Bash, Python).Understanding of performance tuning, system optimization, and fault diagnosis.Excellent problem-solving, communication, and collaboration skills.Preferred / Good to HaveExposure to NVIDIA GPU management, CUDA environments, and AI/ML compute nodes.Familiarity with HPC environments and distributed computing frameworks.Experience managing monitoring systems (Prometheus, Grafana) and backup solutions.Knowledge of DevOps practices, containerization, and hybrid cloud environments.



  • guntur, India beBeeinfrastructure Full time

    Infrastructure Engineer PositionWe are seeking a highly skilled Senior Linux Administrator to fill the role of an Infrastructure Engineer, focusing on the implementation and management of on-premises Linux servers optimized for AI/ML workloads.Main Responsibilities:Server Management: Deploy, configure, and manage on-premises Linux servers to support AI/ML...


  • guntur, India beBeeAI Full time

    Career OpportunityJob Description:Antriksh Cloud is a pioneer in sustainable AI infrastructure, focusing on eco-efficient GPU data centers powered by hydroelectric energy. We provide scalable and energy-efficient solutions tailored to the diverse needs of our clients. With a focus on environmental responsibility, Antriksh Cloud utilizes state-of-the-art...


  • guntur, India beBeeArtificialIntelligence Full time

    Job Title: AI System ArchitectLocation: Flexible/Hybrid/RemoteExperience: 6+ YearsEmployment Type: Full-timeAbout the RoleWe are seeking an experienced AI System Architect to lead the development and scaling of cutting-edge AI solutions. You will play a key role in designing architecture, building scalable pipelines, and enabling production-grade AI systems...


  • guntur, India beBeeAI Full time

    Job Title: AI Infrastructure SpecialistAbout the Role:This is a full-time remote opportunity for an AI Platform Engineer at a leading sustainable AI infrastructure provider. As an AI Platform Engineer, you will be responsible for designing, implementing, and managing scalable AI infrastructure solutions.Key Responsibilities:Developing and maintaining...


  • guntur, India beBeeTechOps Full time

    AI Tech Operations SpecialistWe are seeking a highly skilled AI Tech Operations Specialist to join our team. The successful candidate will be responsible for ensuring the reliability, scalability, and efficiency of AI systems in production.Key Responsibilities:Manage deployments, monitor performance, troubleshoot issues, and implement best practices for Tech...


  • guntur, India beBeeEngineer Full time

    Job Title:AI Systems Optimization EngineerAs an AI Systems Optimization Engineer, you will be responsible for optimizing our AI systems and building the next generation of intelligence capabilities.Key ResponsibilitiesAI Inference Pipeline Optimization:Design and optimize AI inference pipelines to deliver maximum quality at minimal cost.Natural Language...


  • guntur, India beBeecloud Full time

    Cloud Infrastructure Specialist OpportunityWe are seeking a highly skilled Cloud Infrastructure Specialist to join our team. The ideal candidate will have a strong background in cloud infrastructure, including OpenStack, Linux systems, and Kubernetes.The role involves providing technical support for cloud environments, managing and maintaining Linux-based...

  • Ai Systems Developer

    4 weeks ago


    Guntur, India Whatjobs IN C2 Full time

    Company Description BraindAI is an AI Consultancy transforming how enterprises leverage artificial intelligence. We bridge the gap between strategy and execution by delivering sophisticated automation solutions for companies and high-growth organisations across Ireland and the UK. Our team builds enterprise-grade AI systems that integrate seamlessly with...


  • guntur, India beBeeCloudInfrastructure Full time

    Job Title: Cloud Infrastructure SpecialistJob Description:MGT-Commerce GmbH specializes in delivering high-performance managed cloud hosting solutions powered by Amazon Web Services (AWS). Founded in 2010 and located in Berlin, the company serves a global client base with its expert Single Server and Multi Server Hosting services, tailored to individual...


  • Guntur, India Stoopa AI Full time

    Company Description Stoopa.AI is building next-generation AI-driven platforms for ports and is focused on reliability, speed, and intelligent automation. As we scale our next generation smart port product Turi, we are hiring our first dedicated SRE/DevOps Engineer to build, optimize, and own our reliability engineering function from the ground up. This is a...