HPC Software Systems Engineer

3 weeks ago


Chennai, India KLA Full time

KLA Overview:


KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays.


The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and in 2019 we invested 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us. You will be part of digital and mainstream technology staff as well as Digital Transformation Services.


You will play a vital role in crafting delivering solutions to make our employees and customers more agile and productive. Emphasizing teamwork and workflow improvements. This technical position provides an outstanding opportunity for those who are motivated to face challenges both in technical leadership and also in project management fronts. The position will work with multi-functional teams and across multiple IT technologies to define the product requirements, plan resources, and drive the development process from design to final field deployment. In addition, this position is also responsible for working with business teams to solidify the roadmap of the projects as we continue to support the business.


Key Responsibilities:


Architect and Design High-Performance Compute Clusters:

  • Collaborate with cross-functional teams to design, implement, and support HPC clusters.
  • Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects.


Project Specifications and Timelines:

  • Understand project specifications and performance requirements at both subsystem and system levels.
  • Drive adherence to project timelines, ensuring program milestones are achieved on schedule.


Efficiency and Optimization:

  • Dive deep into system performance to identify bottlenecks and inefficiencies.
  • Optimize job scheduling, infrastructure and parallel execution.
  • Evaluate detailed timing of cluster operations and data transfer.


Linux OS Configuration:

  • Leverage your strong Linux skills to configure appropriate operating systems for the HPC environment.


Skills and Abilities:

  • We’re looking for inquisitive problem-solvers who thrive on challenges.
  • Excellent written and verbal communication skills are essential.
  • Grow to lead a team of software engineers
  • Team-oriented and highly motivated.
  • Effective organization and time management.
  • Adaptability to change in a rapidly growing environment.


Required Qualifications:


  • Strong Object Oriented programming skills in Java and/or C++
  • Proficiency in parallel programming and distributed computing.
  • In-depth knowledge of Linux systems and internals.
  • Ability to find and address bottlenecks in data movement, code execution, and job scheduling.
  • Knowledge of libraries such as SIMD, AVX, IPP, MKL, openCV, openMP, OpenCL, MPI, and CUDA.
  • Familiarity with performance profilers (Intel vTune, Nvidia Nsight compute, AMD uProf, perf).
  • Strong understanding of HPC hardware (servers, GPUs, networking, storage, BIOS, BMC).
  • Proficiency in Shell and Python scripting for test environments.


Preferred Qualifications:


  • Experience with Kubernetes, Prometheus, and Grafana.
  • BS or MS degree with 3 to 5 years of validated experience.
  • Background in Computer Engineering or Electrical Engineering.


Education & Experience:


Doctorate (Academic) Degree and related work experience of 3 years

OR

Master's Level Degree and related work experience of 6 years

OR

Bachelor's Level Degree and related work experience of 8 years


Equal Employment Opportunity:


We offer a competitive, family friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.


KLA is proud to be an Equal Opportunity Employer. We do not discriminate on the basis of race, religion, color, national origin, sex, gender identity, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other status protected by applicable law. We will ensure that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment.


  • HPC system engineer

    1 month ago


    Chennai, India KLA Full time

    Key Responsibilities:Architect and Design High-Performance Compute Clusters:• Collaborate with cross-functional teams to design, implement, and support HPC clusters.• Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects.Project Specifications and Timelines:•...

  • HPC system engineer

    1 month ago


    Chennai, India KLA Full time

    Key Responsibilities:Architect and Design High-Performance Compute Clusters:• Collaborate with cross-functional teams to design, implement, and support HPC clusters.• Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects.Project Specifications and Timelines:•...

  • HPC system engineer

    1 month ago


    Chennai, India KLA Full time

    Key Responsibilities: Architect and Design High-Performance Compute Clusters: • Collaborate with cross-functional teams to design, implement, and support HPC clusters. • Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects. Project Specifications and Timelines: •...

  • HPC system engineer

    1 month ago


    Chennai, India KLA Full time

    Key Responsibilities:Architect and Design High-Performance Compute Clusters:• Collaborate with cross-functional teams to design, implement, and support HPC clusters.• Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects.Project Specifications and Timelines:•...


  • Chennai, India KLA Full time

    KLA Overview:KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents...


  • Chennai, India KLA Full time

    KLA Overview: KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA...


  • Chennai, India KLA Full time

    KLA Overview: KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents...


  • Chennai, India KLA Full time

    KLA Overview: KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents...


  • chennai, India KLA Full time

    KLA Overview: KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents...


  • Chennai, India Cephas Consultancy Services Private Limited Full time

    Job title HPC Systems R&DEngineerOur AI Advanced Computing Labs islooking for an extraordinary HPC System R&D Engineer tojoin its team to develop systemlevel HPC technologies that wouldform the foundation of nextgeneration clusters used in our toolsthat leverage AI to push the boundaries of process control forconductor manufacturing. The technologies would...


  • Chennai, India 3110 K-T India Full time

    Description Architect and Design High-Performance Compute Clusters : Collaborate with cross-functional teams to design, implement, and support HPC clusters. Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects. Project Specifications and Timelines : Understand...

  • HPC Admin

    2 months ago


    Chennai, India ScaleneWorks Full time

    Infrastructural Skills : Experience 5 to 7 years Mandatory: - Excellent (L2+) LINUX Skills for Red Hat and SLES (Ubuntu – optional) - Excellent Hardware knowledge - HP range of serves X86 architecture, C7000 Enclosures, Gen 9/10 Blades, SL server architecture, Moonshot etc. Tooling : Scheduling: Altair PBS Pro and Data Synapse - Required Essential –(Any...

  • HPC Systems Engineer

    1 month ago


    Chennai, India PeopleGene Full time

    Looking for an extraordinary HPC System R&D Engineer to develop system-level HPC technologies that leverage AI to push the boundaries of process control for conductor manufacturing.Responsibilities:Expose limitations in existing solutions, based on clusters of CPUs & GPUs, to deploy AI-based solutions on on-prem & cloud infrastructures at scale.Develop...

  • HPC Systems Engineer

    3 weeks ago


    Chennai, India PeopleGene Full time

    Looking for an extraordinary HPC System R&D Engineer to develop system-level HPC technologies that leverage AI to push the boundaries of process control for conductor manufacturing. Responsibilities: Expose limitations in existing solutions, based on clusters of CPUs & GPUs, to deploy AI-based solutions on on-prem & cloud infrastructures at scale. ...

  • HPC Systems Engineer

    1 month ago


    Chennai, India PeopleGene Full time

    Looking for an extraordinary HPC System R&D Engineer to develop system-level HPC technologies that leverage AI to push the boundaries of process control for conductor manufacturing. Responsibilities:Expose limitations in existing solutions, based on clusters of CPUs & GPUs, to deploy AI-based solutions on on-prem & cloud infrastructures at scale.Develop...

  • HPC Systems Engineer

    1 month ago


    Chennai, India PeopleGene Full time

    Looking for an extraordinary HPC System R&D Engineer to develop system-level HPC technologies that leverage AI to push the boundaries of process control for conductor manufacturing. Responsibilities: Expose limitations in existing solutions, based on clusters of CPUs & GPUs, to deploy AI-based solutions on on-prem & cloud infrastructures at scale. Develop...

  • HPC Systems Engineer

    1 month ago


    Chennai, India PeopleGene Full time

    Looking for an extraordinary HPC System R&D Engineer to develop system-level HPC technologies that leverage AI to push the boundaries of process control for conductor manufacturing. Responsibilities:Expose limitations in existing solutions, based on clusters of CPUs & GPUs, to deploy AI-based solutions on on-prem & cloud infrastructures at scale.Develop...


  • Chennai, India KLA Full time

    Job DescriptionArchitect and Design High-Performance Compute Clusters:Collaborate with cross-functional teams to design, implement, and support HPC clusters.Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects.Project Specifications and Timelines:Understand project...


  • Chennai, India KLA Full time

    Job Description Architect and Design High-Performance Compute Clusters : Collaborate with cross-functional teams to design, implement, and support HPC clusters. Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects. Project Specifications and Timelines : Understand project...


  • chennai, India KLA Full time

    Job DescriptionArchitect and Design High-Performance Compute Clusters:Collaborate with cross-functional teams to design, implement, and support HPC clusters.Optimize compute resources for maximum efficiency, considering CPU/GPU architecture, storage scalability, and high-bandwidth interconnects.Project Specifications and Timelines:Understand project...