System Development Engineer, AGI Infrastructure
3 days ago
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive engineers to play a pivotal role in the development/maintenance of industry-leading multi-modal and multi-lingual large language models (LLM). AGI team's mission is to leverage our hyper-scalable, general-purpose large model training and inference systems to develop and deploy cutting-edge sensory AI foundational models that revolutionize machine perception, interpretation and interaction, with humans and with the physical world.
We believe in "Work Hard. Have Fun. Make History" value by having a strong focus on sharing learning experiences from the front line with the development teams. So, the options for people in the team are vast. If you like mastering a domain and going deep, we need you. If you can juggle three tasks and coordinate with multiple people in the heat of an incident, we need you. If you love the benefits of process and methodical improvement, you will love it here. If you want to keep your head down, headphones on, and bash out code to support the team, we have a spot for you too.
You will be required to deeply understand technology landscapes, and evaluate the use of new technologies. You will be influential within your team and work with peers and senior leaders to define and revise the standards for operational excellence across systems. You will consistently tackle abstract issues that span multiple functional areas and drive your team to push for improvements that can scale across other teams, services, and platforms.
Key job responsibilities
Provide support for cluster and node management, ensuring smooth operation of GenAI infrastructure.
Continuously improve and automate our cluster/capacity/maintenance upgrades.
Troubleshoot and research root causes throughly and fix defects.
Develop automation tools for improving operational excellence.
Candidates should be well-versed in core AWS services, including EC2 , Lambda , EKS etc.
Experienced in setting up and managing CI/CD pipelines using tools such as AWS CodePipeline, GitHub Actions, or similar platforms.
Familiarity with Infrastructure as Code (IaC) tools like AWS CloudFormation, Terraform, or the AWS CDK is a valuable asset. Furthermore, understanding of networking concepts like VPC, subnets, and security groups, Load Balancers and Route 53, is desirable.
Should have hands-on experience in Kubernetes.
About the team
Join our AGI team and work at the forefront of AI. Collaborate with top minds pushing boundaries in deep learning, reinforcement learning, and more. Gain valuable experience and accelerate your career growth. This is a unique opportunity to create history and shape the future of artificial intelligence.
- 1+ years of systems development experience
- Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust
- Experience with Linux/Unix
- Experience with CI/CD pipelines build processes
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
-
Chennai, Tamil Nadu, India Amazon Full time ₹ 5,00,000 - ₹ 25,00,000 per yearDescriptionThe Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive engineers to play a pivotal role in the development/maintenance of industry-leading multi-modal and multi-lingual large language models (LLM). AGI team's mission is to leverage our hyper-scalable, general-purpose large model training and inference...
-
IT Infrastructure System Engineer
1 week ago
Chennai, Tamil Nadu, India DNV careers Full time ₹ 9,00,000 - ₹ 12,00,000 per yearYou will join a dynamic team dedicated to ensuring the reliable and secure operation of our cutting-edge GPM products and maintaining the robustness and scalability of the IT infrastructure, ultimately enhancing efficiency. We work together to design and manage a scalable and dynamic infrastructure that seamlessly integrates both on-site and cloud solutions,...
-
IT Infrastructure System Engineer
3 days ago
Chennai, Tamil Nadu, India DNV Full time ₹ 12,00,000 - ₹ 25,00,000 per yearYou will join a dynamic team dedicated to ensuring the reliable and secure operation of our cutting-edge GPM products and maintaining the robustness and scalability of the IT infrastructure, ultimately enhancing efficiency. We work together to design and manage a scalable and dynamic infrastructure that seamlessly integrates both on-site and cloud solutions,...
-
IT Infrastructure System Engineer
13 hours ago
Chennai, Tamil Nadu, India DNV Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescriptionYou will join a dynamic team dedicated to ensuring the reliable and secure operation of our cutting-edge GPM products and maintaining the robustness and scalability of the IT infrastructure, ultimately enhancing efficiency. We work together to design and manage a scalable and dynamic infrastructure that seamlessly integrates both on-site and cloud...
-
Infrastructure Engineer
2 days ago
Chennai, Tamil Nadu, India Rotork Full time ₹ 6,00,000 - ₹ 18,00,000 per yearJob DescriptionWe are seeking a highly skilled Infrastructure Engineer (On-Prem) to join our team in Chennai, India. In this role, you will be responsible for designing, implementing, and maintaining our on-premises infrastructure to ensure optimal performance, reliability, and security of our IT systems.Design and implement on-premises infrastructure...
-
Infrastructure Engineer
16 hours ago
Chennai, Tamil Nadu, India Rotork Full time ₹ 6,00,000 - ₹ 18,00,000 per yearJob Description We are seeking a highly skilled Infrastructure Engineer (On-Prem) to join our team in Chennai, India. In this role, you will be responsible for designing, implementing, and maintaining our on-premises infrastructure to ensure optimal performance, reliability, and security of our IT systems.Design and implement on-premises infrastructure...
-
Infrastructure Architect
1 week ago
Chennai, Tamil Nadu, India AdeptView Full time ₹ 15,00,000 - ₹ 25,00,000 per yearPosition :Infrastructure Architect AI Systems (NVIDIA HGX & Dell PowerEdge)Experience :5+ YearsLocation :IndiaJob SummaryWe are seeking a seasoned Infrastructure Architect AI Systems with over 5 years of experience in designing and managing high-performance computing environments. The ideal candidate will have extensive hands-on expertise with NVIDIA HGX...
-
Infrastructure Engineer, Database
1 week ago
Chennai, Tamil Nadu, India NatWest Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJoin us as an Infrastructure EngineerYou'll engineer infrastructure technology for public and private cloud environments, complying with security, resilience, sustainability, and operational requirements with observability and guardrails built inYou'll also use automation to provide testing and a route to live for the product, working with customers to help...
-
Associate Systems Engineer
16 hours ago
Chennai, Tamil Nadu, India SignaTech Full time ₹ 2,00,000 - ₹ 6,00,000 per yearJob Title: Associate Systems EngineerLocation: Navalur, Chennai.Company DescriptionWe are looking for an enthusiastic and detail-oriented Associate Systems Engineer to join our dynamic IT infrastructure team. This role will be instrumental in maintaining and enhancing our server, workstation, and cloud environments while ensuring reliability, security, and...
-
Network Infrastructure Engineer
6 days ago
Chennai, Tamil Nadu, India NatWest Group Full time ₹ 12,00,000 - ₹ 36,00,000 per yearMeraki Network Infrastructure Engineer Join us as a Network Infrastructure EngineerYou'll collaborate in building the best possible enterprise network solutions and engineer infrastructure technology to comply with security, resilience, sustainability, and operational requirements with observability and guardrails built in Using automation, you'll...