Senior Infrastructure Engineer for Large Scale GPU Clusters

3 weeks ago


Bengaluru, Karnataka, India NVIDIA Full time

NVIDIA Job Overview:

We are a pioneering company in the field of visual computing, and we are looking for a highly skilled Senior Infrastructure Engineer to join our team. As a key member of our Farm GPU SRE group, you will be responsible for designing, implementing, and supporting large-scale infrastructure with high uptime.

Job Responsibilities:

  • Design and implement scalable infrastructure solutions for our GPU clusters
  • Develop software platforms and frameworks to improve cluster efficiency and user experience
  • Maintain and optimize existing infrastructure to ensure high availability and performance
  • Engage in incident response and postmortem activities to identify areas for improvement
  • Collaborate with cross-functional teams to design and deploy new services

Requirements:

  • Bachelor's degree in Computer Science or related technical field
  • 3+ years of industry experience in Linux system administration, HPC cluster scheduling, and IT automation tools
  • Strong understanding of open-source technologies and distributed systems
  • Excellent problem-solving skills, communication skills, and sense of ownership

Preferred Qualifications:

  • Experience with Bright Cluster Manager (BCM)
  • Understanding of InfiniBand or Ethernet concepts
  • Experience with high-speed storage solutions like Lustre and GPFS

About NVIDIA:

We are a leader in the visual computing market, and we are committed to innovation and excellence. Our engineers work on cutting-edge projects that have a significant impact on the industry. We offer competitive compensation, excellent benefits, and opportunities for professional growth and development.

Estimated Salary: $140,000 - $180,000 per year, depending on location and experience.



  • Bengaluru, Karnataka, India Oracle Full time

    We are seeking an experienced Senior Cluster Networking Engineer to join our team at Oracle OCI. About the Role: This position involves provisioning, securing, scaling, and operating the network stack required to run distributed AI workloads across a cluster spanning thousands of GPUs. Our customers expect auto-remediation of incidents, touchless upgrades across...


  • Bengaluru, Karnataka, India LinkedIn Full time

    About the Role: Are you a seasoned software engineer with a passion for building scalable and efficient systems? Do you have expertise in designing and developing large-scale search infrastructure? We're seeking a skilled DevOps Engineer to join our team at LinkedIn, where you'll play a key role in shaping the future of search technology. Salary Range: $180,000...


  • Bengaluru, Karnataka, India Databricks Full time

    About the Role: We are seeking a highly skilled Senior Staff Software Engineer for our Databricks Engineering team. As a key member of our engineering organization, you will work with teams that develop Databricks products and features for thousands of enterprises worldwide. Key Responsibilities: As an executive engineering individual contributor, you will have...


  • Bengaluru, Karnataka, India Synopsys Inc Full time

    At Synopsys Inc, we are seeking a highly skilled GPU Staff/Senior Staff Engineer to join our team. About the Role: This is a challenging opportunity for an expert in GPU-accelerated algorithms to optimize and implement performance improvements for OPC software in the EDA industry. The ideal candidate will work closely with cross-functional teams to ensure...


  • Bengaluru, Karnataka, India The Nielsen Company Full time

    As a Cloud Engineer at The Nielsen Company, you will play a crucial role in maintaining the infrastructure that supports our television audience measurement services. With nearly 14,000 associates globally, we are a leading player in the media industry. About the Role: We are looking for an experienced DevOps engineer to join our team of cloud experts. As a...


  • Bengaluru, Karnataka, India LinkedIn Full time

    About Us: At LinkedIn, we are passionate about empowering professionals to achieve their goals. Our vision is to create economic opportunity for every member of the global workforce. Job Summary: We are seeking a highly experienced Software Architect to lead the design and development of our large-scale infrastructure systems. As a key member of our software...


  • Bengaluru, Karnataka, India TEKsystems Full time

    TEKsystems is a leading technology services company that helps organizations solve complex challenges with unparalleled expertise, innovative approaches, and unparalleled scale. As a Cloud Engineer for Large Scale Banking Infrastructure, you will play a crucial role in designing, implementing, and maintaining highly available, secure, and scalable...


  • Bengaluru, Karnataka, India Oracle Full time

    Oracle's Data Services Team is seeking a motivated Cloud Infrastructure Engineer to join our fast-paced, rapidly evolving Cloud engineering team in Kiev. This individual will be a member of the SRE services focused on Cloud Services, building deployments, operations, mitigating security vulnerabilities, and automations. About the Role: This position will be...


  • Bengaluru, Karnataka, India LinkedIn Full time

    Role Overview: We are seeking a skilled Senior Software Engineer to join our team in Bangalore, India. The successful candidate will have expertise in designing and developing large-scale distributed systems, with a strong focus on observability. Job Description: This is a unique opportunity to work with our MELT instrumentation team, responsible for creating the...


  • Bengaluru, Karnataka, India LinkedIn Full time

    About the Role: We're seeking a seasoned software engineer to join our world-class team in Bangalore, India. As a key member of our infrastructure team, you will design and build next-generation platforms for LinkedIn's core applications. Key Responsibilities: Design, develop, and operate large-scale data infrastructures that power all of LinkedIn's core...


  • Bengaluru, Karnataka, India LinkedIn Full time

    We are seeking an experienced Infrastructure Software Engineer to join our team in Bangalore, India. In this role, you will be responsible for designing, implementing, and optimizing large-scale distributed systems with a focus on security and compliance. As a member of our world-class software engineering team, you will work closely with the open-source...


  • Bengaluru, Karnataka, India LinkedIn Full time

    At LinkedIn, we're committed to creating economic opportunity for every member of the global workforce. Our mission is to help members achieve their career goals and build skills that matter most in a rapidly changing world. About the Role: This role will be based in Bangalore, India, where you'll have the opportunity to work with a talented team of software...


  • Bengaluru, Karnataka, India Synopsys Inc Full time

    Synopsys Inc is seeking a seasoned High-Performance Infrastructure Solutions Engineer to join its global software infrastructure team. About the Role: This senior-level position focuses on designing and optimizing high-performance infrastructure solutions for large-scale geometric data analysis in OPC and Proteus Mask Synthesis tools. The ideal candidate will...


  • Bengaluru, Karnataka, India Synopsys Inc Full time

    About the Role: We are seeking a highly skilled Senior Infrastructure Solutions Engineer to join our global software infrastructure team at Synopsys Inc. This is a unique opportunity to design and optimize cutting-edge infrastructure solutions for large-scale geometric data analysis in OPC and Proteus Mask Synthesis tools. Key Responsibilities: Design, implement,...


  • Bengaluru, Karnataka, India 6thStreet Full time

    About Us: 6thStreet.com is a leading online fashion platform in the UAE, KSA, and Kuwait. We offer a wide range of international brands and provide excellent customer service. Job Title: DevOps Engineer. About the Role: We are seeking an experienced DevOps Engineer to join our team. The successful candidate will be responsible for ensuring the smooth operation of our...


  • Bengaluru, Karnataka, India LinkedIn Full time

    As a Senior Staff Software Engineer, you will have the opportunity to work on building the next-generation infrastructure and platforms for LinkedIn, including an application and service delivery platform, massively scalable data storage and replication systems, cutting-edge search platform, best-in-class AI platform, experimentation platform, privacy and...


  • Bengaluru, Karnataka, India Sureminds Full time

    About Sureminds: Sureminds is a forward-thinking organization seeking an experienced HPC VDI Engineer to join our team. As a highly skilled professional, you will be responsible for managing and maintaining our complex infrastructure. Salary: $120,000 - $180,000 per annum. Job Description: As an HPC VDI Engineer at Sureminds, you will be tasked with installing,...


  • Bengaluru, Karnataka, India Synopsys Inc Full time

    Exciting opportunity to contribute to the cutting-edge of semiconductor development at Synopsys Inc. We are seeking a Senior Software Development Engineer to design and develop software for large-scale geometric data analysis and high-performance computing. About Us: Synopsys Inc is a leading provider of electronic design automation (EDA) software and services....


  • Bengaluru, Karnataka, India Neetable Full time

    About Neetable: Web and Mobile App Development Firm. We are a leading provider of enterprise technology solutions, serving Fortune 500 companies, SMBs, and Startups worldwide. Our team is dedicated to delivering impactful digital engineering solutions that set us apart from the competition. Job Title: Senior WordPress Developer. We seek an experienced Senior...


  • Bengaluru, Karnataka, India LinkedIn Full time

    Leading the Future of Next-Generation Infrastructure: We are seeking a highly skilled software engineer to join our world-class infrastructure team at LinkedIn. This role will involve building the next-generation infrastructure and platforms for LinkedIn, including application and service delivery platform, data storage and replication systems, search...