SME AI Cloud

4 weeks ago


Mumbai, Maharashtra, India Yotta Data Services Private Limited Full time
Job Scope:

This is a full-time on-site role as an AI Cloud Engineering SME, as a member of the Yotta IT Engineering team located in Panvel, you will be responsible for evaluating, developing, GPU Supercomputing clusters and networking infrastructure based on NVIDIA reference architecture, infrastructure, platforms, and applications. Design, Evaluate, Enhance, Automate, related to systems on AI platform you will also be responsible for providing technical leadership to the team and working collaboratively with cross-functional teams to identify business opportunities and provide innovative services in the market, to keep the first mover advantage.

Job Responsibilities:

- Provides technical expertise in IT for AI Cloud, (NVIDIA, HPC) systems (Servers, Applications, Cloud, Tools, Automation) development of program requirements relating to information technology, application analysis, software development, and systems integration.
- Designs, plans, implements, handover service or platforms.

Collaborate with NVIDIA Solution Architect & Engineering teams on large-scale GPU-as-a-service projects, both on-premises and in cloud deployments.

- Provides specifications and detailed schematics for system architecture.
- Provides specific detailed information for hardware and software selection, implementation techniques and tools for the most efficient solution to meet business needs, including present and future capacity requirements.
- Desings and implements and optimize software stacks including MaaS (metal-as-a-service), Job Scheduler (SLURM/PBS), Cloud Orchestration (Kubernetes), and Network Management (NetQ for Ethernet fabric and UFM for InfiniBand).
- Well versed with NVIDIA platform both hardware and software layer. Including following
- H100, L40, A100, LMPerf, Pytorch, tensorflow, nemo, LLM models
- UFM, BCM, Kubernetes, NVCF, SLURM
- Conducts testing of system design.
- Evaluates and reports on new technologies to enhance capabilities of the existing systems.
- Ensures the quality of the work produced maintains the highest standards.
- Communicated with cross functional teams to better functioning.
- Participates in the development of the technical solution through the mandated processes.
- Participate in program readiness and design reviews for marquee customers
- Attend technical and programmatic meetings with customers, system users, development team members and represent the organization in matters pertaining to the project.

Participates in the Engineering Review Board for technical review of Engineering Change Requests

Must-Have Skill:

- Hands-on experience with NVIDIA GPU & Networking Technologies, particularly NVIDIA Data Centre GPUs (A100/H100/L40) and NVIDIA Networking Technologies (InfiniBand, Ethernet).
- Proficiency in provisioning and managing software stacks like MaaS, Job Scheduler (SLURM/PBS), Cloud Orchestration (Kubernetes), and Network Management (NetQ for Ethernet fabric and UFM for InfiniBand).
- Prior experience collaborating with NVIDIA Solution Architect & Engineering teams on large-scale GPU-as-a-service projects.
- Familiarity with benchmarking applications from widely-used platforms and frameworks, including MLPerf, PyTorch, TensorFlow, NeMo, Megatron-LM, TensorRT-LLM, Triton Inference Server, and vLLM.
- Experience in performance engineering, including debugging, profiling, benchmarking, and tuning various GPU applications on large-scale supercomputing clusters.

Good to Have Skill:

- Knowledge of other HPC technologies and architectures beyond NVIDIA, broadening expertise in the field.
- Experience with other cloud platforms and orchestration tools, expanding versatility in deployment environments.
- Strong problem-solving and troubleshooting abilities, enabling quick resolution of complex technical issues.
- Excellent communication and collaboration skills to work effectively within cross-functional teams and with external partners.

Behavioral Attributes:

- Strong problem-solving skills with a proactive and solution-oriented approach.
- Excellent communication and collaboration skills for effective customer support.
- Adaptability to handle a dynamic and fast-paced cloud administration environment.
- Commitment to security best practices and continuous improvement.

Qualification and Experience:

- Bachelor's or master's degree in computer science, Engineering, or equivalent industry experience.
- 10 to 15+ years of relevant experience in HPC engineering roles, with a focus on NVIDIA GPU and Networking Technologies.
- Demonstrated success in deploying and managing large-scale GPU Supercomputing clusters, preferably in collaboration with NVIDIA teams.
- Proven track record of performance engineering activities and optimizing GPU applications for high-performance computing workloads.
  • SME AI Cloud

    4 weeks ago


    Mumbai, Maharashtra, India Yotta Data Services Private Limited Full time

    Job Scope This is a full-time on-site role as an AI Cloud Engineering SME, responsible for evaluating, developing GPU Supercomputing clusters and networking infrastructure based on NVIDIA reference architecture, infrastructure, platforms, and applications. Design, Evaluate, Enhance, Automate systems on AI platform, provide technical leadership to the team,...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About Our MissionFair Cloud AI is dedicated to empowering businesses to optimize their cloud infrastructure costs through innovative AI technologies. Our mission is to deliver comprehensive solutions that cater to the unique needs of our clients. As a Sales Engineer, you will play a crucial role in bridging the gap between our innovative solutions and our...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About Us:Fair Cloud AI is an innovative startup revolutionizing how businesses optimize cloud infrastructure costs through cutting-edge AI technologies. We deliver comprehensive solutions, including custom applications, backend infrastructure, and full-scale technology stacks. Our mission is to empower businesses to scale efficiently with tailored, AI-driven...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    Our TeamWe are a passionate team of innovators who are dedicated to delivering exceptional results for our clients. Our team comprises experienced professionals with a strong background in cloud computing, AI, and software development. As a Sales Engineer, you will join a dynamic team that is committed to empowering businesses to optimize their cloud...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About Us: FairCloud AI is an innovative startup revolutionizing how businesses optimize cloud infrastructure costs through cutting-edge AI technologies. We deliver comprehensive solutions, including custom applications, backend infrastructure, and full-scale technology stacks. Our mission is to empower businesses to scale efficiently with tailored, AI-driven...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About Us: FairCloud AI is an innovative startup revolutionizing how businesses optimize cloud infrastructure costs through cutting-edge AI technologies. We deliver comprehensive solutions, including custom applications, backend infrastructure, and full-scale technology stacks. Our mission is to empower businesses to scale efficiently with tailored, AI-driven...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    Opportunity to GrowAs a Sales Engineer at Fair Cloud AI, you will have the opportunity to grow your technical and sales expertise while contributing to a fast-scaling startup. Our dynamic work environment is ideal for individuals who are passionate about delivering exceptional results and empowering businesses to optimize their cloud infrastructure costs...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About Us Fair Cloud AI is a pioneering startup that leverages cutting-edge AI technologies to revolutionize cloud infrastructure cost optimization. Our comprehensive solutions include custom applications, backend infrastructure, and full-scale technology stacks. We empower businesses to scale efficiently with tailored, AI-driven solutions backed by robust...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About Us: FairCloud AI is a pioneering startup revolutionizing the way businesses optimize cloud infrastructure costs with cutting-edge AI technologies. We deliver comprehensive solutions, including custom applications, backend infrastructure, and full-scale technology stacks, empowering businesses to scale efficiently with tailored, AI-driven solutions.Job...


  • Mumbai, Maharashtra, India FairCloud AI Full time

    About FairCloud AI: We're a dynamic startup pushing the boundaries of AI and cloud technology, collaborating with a passionate team to deliver impactful client solutions.Job Description: The Sales Engineer will work closely with sales reps, engineers, and clients to translate complex technical capabilities into compelling, client-focused solutions that drive...


  • Mumbai, Maharashtra, India MavenMagnet AI Full time

    MavenMagnet AI is a pioneering force in AI-based data analytics, empowering clients with transformative insights and unparalleled time and cost efficiency. Our cutting-edge approach leverages a vast array of digitally-sourced data points to deliver qualitative insights on a quantitative scale.We pride ourselves on utilizing a rich combination of AI and...

  • Lead Cloud Engineer

    3 weeks ago


    Mumbai, Maharashtra, India Netcore Cloud Full time

    About us:At Netcore, innovation isn't just a buzzword—it's the core of everything we do. As the pioneering force behind the first and leading AI/ML-powered Customer Engagement and Experience Platform (CEE), we're dedicated to revolutionizing how B2C brands interact with their customers. Our state-of-the-art SaaS products are designed to foster personalized...

  • Lead Cloud Engineer

    3 weeks ago


    Mumbai, Maharashtra, India Netcore Cloud Full time

    About us:At Netcore, innovation isn't just a buzzword—it's the core of everything we do. As the pioneering force behind the first and leading AI/ML-powered Customer Engagement and Experience Platform (CEE), we're dedicated to revolutionizing how B2C brands interact with their customers. Our state-of-the-art SaaS products are designed to foster personalized...


  • Mumbai, Maharashtra, India Jio Full time

    We are seeking an experienced and knowledgeable Veritas NetBackup Subject Matter Expert Lead SME to join our dynamic backup team As a Veritas NetBackup Lead SME you will play a crucial role in designing implementing and maintaining our backup and recovery infrastructure Your expertise will be instrumental in ensuring the security efficiency and...


  • Mumbai, Maharashtra, India CitiusTech Full time

    **Career Growth Opportunities:**CitiusTech offers comprehensive benefits to ensure you have a long and rewarding career with us. Our EVP, Be You Be Awesome, reflects our continuing efforts to create CitiusTech as a great workplace where our employees can thrive, personally and professionally.Job Requirements:Design and implement scalable, secure, and...


  • Mumbai, Maharashtra, India Orange Business Full time

    Job Title: SME- Cloud Network Security (Azure)Job Location: Mumbai/ Bangalore/Chennai/ Hyderabad/GurgaonAbout the role-We're searching for a talented and passionateSubject Matter Experthaving expertise in data center & cloud network and security technologies to join ourService Delivery , which is responsible for remotely managing, securing and supporting...


  • Mumbai, Maharashtra, India Es Magico AI Studio Full time

    About Es Magico AI Studio: We collaborate with early-stage startups to bring innovative products to life through product design and technology services. Our focus is on AI-driven solutions, B2B SaaS, and scalable technology products prioritizing user-friendly experiences, agile methodologies, and clean efficient designs.Job Overview: We are seeking a highly...

  • Netcore Cloud

    4 weeks ago


    Mumbai, Maharashtra, India NETCORE CLOUD PRIVATE LIMITED Full time

    Job Description :Netcore Cloud is a leading AI/ML-powered customer engagement and experience platform that helps B2C brands drive conversions, revenue, and retention. We are looking for a Backend Developer (Python/Go) who thrives in building high-performance, scalable systems and is eager to work on complex challenges in cloud-based environments.Key...

  • Cloud Architect

    43 minutes ago


    Mumbai, Maharashtra, India Es Magico AI Studio Full time

    We are a venture studio focused on developing innovative AI products through collaboration with early-stage startups. Our focus is on Generative AI-driven products, B2B SaaS, and scalable technology.About the Role:As a Backend Engineer (SDE 3), you will play a critical role in architecting, designing, and implementing cutting-edge solutions. You'll work...


  • Mumbai, Maharashtra, India NewtonAI Full time

    About NewtonAIWe are a cutting-edge tech company based in Mumbai, aiming to revolutionize the AI landscape with our innovative solutions. Our team of experts is passionate about building high-performance applications that make a real-world impact.**Job Description**We are seeking an exceptional FastAPI Backend Developer to join our team and contribute to the...