DC Tech Consulting | Senior Systems Engineer

2 weeks ago


india DC Tech Consulting Full time

Job Profile: Senior Systems Engineer - Kubernetes & Linux Platform

Summary:

An experienced Systems Engineer with over 10 years of specialized expertise in Linux platforms, Kubernetes cluster management, and advanced troubleshooting. Skilled in Kubernetes Day 2 operations, Linux networking, Linux storage, and Nvidia GPU configurations within Kubernetes environments. Proficient in supporting AI/ML workflows, optimizing infrastructure for performance and reliability, and collaborating with data science teams to facilitate AI/ML tools on Linux and Kubernetes infrastructure.

Key Responsibilities:


Kubernetes Cluster Management:

• Design, deploy, and manage Kubernetes clusters, ensuring optimal performance, scalability, and security.

• Oversee Kubernetes Day2 operations, including lifecycle management, patching, upgrading, and continuous monitoring.

• Develop and implement best practices for Kubernetes cluster security, multi-tenancy, and resource optimization.

Kubernetes Troubleshooting & Maintenance:

• ProactivelytroubleshootandresolvecomplexissuesrelatedtoKubernetesclusters, containers, and networking.

• Perform in-depth diagnostics of cluster performance, node scaling, and API responsiveness.

• Support teams in resolving persistent Kubernetes issues and providing documentation for recurring issues.

Linux Systems Administration:

• Manage Linux-based servers and configurations, ensuring systemst ability and security across all environments.

• Oversee network configuration, DNS management, and performancetuningforLinux systems.

• CoordinatewithstorageteamstomanageLinux-basedstorage,optimizefilesystems,and ensure high availability.

Nvidia GPU Configurations & AI/ML Support:

• Configure and manage Nvidia GPUs in Kubernetes clusters to support GPU-intensive workloads.

• Collaborate with datascience teams tosupport AI/ML workflows,i ncluding model training, inference, and scaling on Kubernetes.

• Understandand troubleshoot AI/ML tools and platforms deployedon Kubernetes, optimizing resources for high-performance computing.

Cross-Functional Collaboration:

• Partner with DevOps, DataEngineering, and DataScience teams to ensure infrastructure supports AI/ML requirements.

• Document and provide knowledge sharing on best practices, configurations, and troubleshooting procedures.

• Participate in on-callrotation to provide high-priority support for production incidents.

Required Skills:

Technical Expertise:

• 10+years of experience withLinux systems administration and troubleshooting.

• Strong knowledgeof Kubernetescluster management, including Day2 operations and troubleshooting.

• Deep understanding of Linux networking, storage, and security practices.

• Hands-on experience with Nvidia GPU configurationin Kubernetes environments.

• Familiarity with AI/ML workflows, tools, and best practices in a Linux/Kubernetes environment.

Problem-Solving Abilities:

  • Demonstrated experience with advanced troubleshooting in both Kubernetes andLinux.
  • Ability to analyze and resolve complex technical issues efficiently.
  • Strong focus on performance tuning and optimization.

Preferred Qualifications:

  • Certifications in Kubernetes (CKA/CKAD/CKS) or Linux (RHCE).
  • Familiarity with monitoring tools like Prometheus, Grafana, or ELK stack.
  • Experience with AI/ML frameworks such as TensorFlow, PyTorch, or NVIDIA Triton Inference Server.
  • Soft Skills:
  • Strong analytical and communication skills.
  • Ability to work independently and within a collaborative team environment.
  • Adaptable to changing priorities and a fast-paced work environment.
  • This role offers the opportunity to work on cutting-edge technology infrastructure, playing a critical part in enabling and supporting the organization’s AI and ML goals. 




  • India DC Tech Consulting Full time

    Job Profile: Senior Systems Engineer - Kubernetes & Linux Platform Summary: An experienced Systems Engineer with over 10 years of specialized expertise in Linux platforms, Kubernetes cluster management, and advanced troubleshooting. Skilled in Kubernetes Day 2 operations, Linux networking, Linux storage, and Nvidia GPU configurations within Kubernetes...


  • india Smart Tech LLC Full time

    Company Description: Smart Tech LLC specializes in developing AI-driven solutions to address the world's most challenging data and AI problems. We are on a mission to collaborate with the brightest minds to create real-world solutions that enhance the lives of millions and contribute to a sustainable world. Role Description: This is a remote contract role...


  • india Smart Tech LLC Full time

    Company Description:Smart Tech LLC specializes in developing AI-driven solutions to address the world's most challenging data and AI problems. We are on a mission to collaborate with the brightest minds to create real-world solutions that enhance the lives of millions and contribute to a sustainable world.Role Description:This is a remote contract role for a...


  • india Smart Tech LLC Full time

    Company Description: Smart Tech LLC specializes in developing AI-driven solutions to address the world's most challenging data and AI problems. We are on a mission to collaborate with the brightest minds to create real-world solutions that enhance the lives of millions and contribute to a sustainable world. Role Description: This is a remote contract role...


  • india Senior Data Integration Engineer Full time

    Must Have Skills/Skill Requirement:Design and architect integration solutions to connect various enterprise applications, systems, and databases.Develop and implement integration workflows, APIs, and data pipelines to enable smooth communication and data exchange between different applications.Utilize Azure Integration Services such as Azure Logic Apps,...


  • india Senior Data Integration Engineer Full time

    Must Have Skills/Skill Requirement: Design and architect integration solutions to connect various enterprise applications, systems, and databases. Develop and implement integration workflows, APIs, and data pipelines to enable smooth communication and data exchange between different applications. Utilize Azure Integration Services such as Azure Logic Apps,...


  • India Senior Data Integration Engineer Full time

    Must Have Skills/Skill Requirement: Design and architect integration solutions to connect various enterprise applications, systems, and databases. Develop and implement integration workflows, APIs, and data pipelines to enable smooth communication and data exchange between different applications. Utilize Azure Integration Services such as Azure Logic...


  • india Tech Consulting Full time

    Technical Interview Coach Are you an interview specialist? Do you know what it takes to coach adults on effectively conveying their experience and skill sets in the best way possible to win an interview? Do you have proven, documented, expert experience and methods in this area? Are you driven and excited by coaching, mentoring, and directing adults in...


  • india Tech Mahindra Full time

    Responsibilities: • Provide technical customer service support within the IFS landscape • Perform troubleshooting to determine and resolve the root cause of most hardware and software issues • Monitor and test network performance and provide reports and statistics on results • Spanning Tier 1, 2 and 3 support responsibilities including engaging...

  • Data and AI Engineer

    2 weeks ago


    India Smart Tech LLC Full time

    Company Description:Smart Tech LLC specializes in developing AI-driven solutions to address the world's most challenging data and AI problems. We are on a mission to collaborate with the brightest minds to create real-world solutions that enhance the lives of millions and contribute to a sustainable world.Role Description:This is a remote contract role for a...

  • Data and AI Engineer

    2 weeks ago


    India Smart Tech LLC Full time

    Company Description: Smart Tech LLC specializes in developing AI-driven solutions to address the world's most challenging data and AI problems. We are on a mission to collaborate with the brightest minds to create real-world solutions that enhance the lives of millions and contribute to a sustainable world. Role Description: This is a remote...

  • Data and AI Engineer

    2 weeks ago


    India Smart Tech LLC Full time

    Company Description: Smart Tech LLC specializes in developing AI-driven solutions to address the world's most challenging data and AI problems. We are on a mission to collaborate with the brightest minds to create real-world solutions that enhance the lives of millions and contribute to a sustainable world. Role Description: This is a remote...


  • india Prudence Tech Solutions Full time

    Job description Company Description Prudence Tech Solutions is a leading IT Consulting and Services company, headquartered in Columbus, Ohio, serving clients across the United States since 2010. We specialize in delivering innovative IT solutions designed to drive digital transformation, enhance operational efficiency, and support business growth. Our...


  • india HCLTech Full time

    Urgent Opening for Cloud Senior Site Reliability Engineer role for Pan India location with HCL TechInterested candidates kindly share your updated resume to sagardo@hcltech.com with the subject line "Cloud Senior Site Reliability Engineer Role_ your name & preferred location"Job Description: Ability to learn SRE practices across Red Hat Open Shift, Google...

  • Firmware Engineer

    1 month ago


    india Drones Tech Lab Full time

    Job Title – Firmware Engineer (UAV Systems)Company – Drones Tech LabTMExperience – 3+ yearsIndustry – Aerospace & Defence (UAV)Location – Kolkata or RemoteCompany DescriptionDrones Tech LabTM is a pioneer in drone manufacturing, drone pilot training, drone forensics and executes drone-as-a-service projects such as mapping, surveillance,...


  • india ManpowerGroup India Full time

    Position: Instrumentation and Control System EngineerJob Description: Assisting the Lead Engineer in the creation of all Instrument Deliverables based on the assigned Projects.Qualification: B.E / B.Tech. in Instrumentation Engineering or related field.Experience: 5 to 8 years in Industry experience (Power plants/ Fertilizer/Petrochemical/oil and gas...


  • india ManpowerGroup India Full time

    Position: Instrumentation and Control System Engineer Job Description: Assisting the Lead Engineer in the creation of all Instrument Deliverables based on the assigned Projects. Qualification: B.E / B.Tech. in Instrumentation Engineering or related field. Experience: 5 to 8 years in Industry experience (Power plants/ Fertilizer/Petrochemical/oil and gas...


  • India ManpowerGroup India Full time

    Position: Instrumentation and Control System Engineer Job Description: Assisting the Lead Engineer in the creation of all Instrument Deliverables based on the assigned Projects. Qualification: B.E / B.Tech. in Instrumentation Engineering or related field. Experience: 5 to 8 years in Industry experience (Power plants/ Fertilizer/Petrochemical/oil and...


  • india Celito Tech, Inc. Full time

    Job DescriptionJob Title: Sr. Infrastructure and Cybersecurity EngineerReports To: Director, Infrastructure and CybersecurityEmployment Type: Full-timeTimings: Upto 2.30 AM ISTTHE CELITO TEAMThe Celito Team architects the buildout of simplified, integrated, and compliant technology stacks. With both consulting and products, our expertise can help our...


  • india Celito Tech, Inc. Full time

    Job Description Job Title: Sr. Infrastructure and Cybersecurity Engineer Reports To: Director, Infrastructure and Cybersecurity Employment Type: Full-time Timings: Upto 2.30 AM IST THE CELITO TEAM The Celito Team architects the buildout of simplified, integrated, and compliant technology stacks. With both consulting and products, our expertise can help our...