Cloud Machine Learning LLM Serving Staff engineer
2 weeks ago
Company:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Software EngineeringGeneral Summary:Job Overview:The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, automotive market. We are seeking ambitious, bright, and innovative engineers with experience in machine learning framework development. Job activities span the whole product life cycle from early design to commercial deployment. The environment is fast-paced and requires cross-functional interaction daily so good communication, planning and execution skills are a must.Key ResponsibilitiesAnalyze software requirements, determine the feasibility of design within the given constraints, consult with architecture and HW engineers, and implement software solutions best suited for Qualcomm's SOCs.Analyze and identify system level issues, interface with the software development, integration, and test teamsLead high performing teams towards system design and deliverables.Proven track record of leading teams in Machine learning software engineering.Strong foundation of Mathematical modeling of problems and linear algebra, coupled with state of the art algorithms in ML/AI space.Improve and optimize key Deep Learning models on Qualcomm AI 100.Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories.Collaborate and interact with internal teams to analyze and optimize training and inference for deep learning.Build software tools and ecosystem around AI SW Stack.Work on vLLM, Triton, ExecuTorch, Inductor, TorchDynamo to build abstraction layers for inference accelerator.Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems.Optimize the entire deep learning pipeline including graph compiler integration.Apply knowledge of software engineering best practices.Desirable Skills and AptitudesDeep Learning experience or knowledge – LLMs, Natural Language Processing, Vision, Audio, Recommendation systems.Knowledge of the structure and function of different components of Pytorch, TensorFlow software stacks.Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design.Ability to work independently, define requirements and scope, and lead your own development effort.Well versed with open-source development practices.Strong developer with a research mindset – strives to innovate.Avid problem solver – should be able to find solutions to key engineering and domain problems.Knowledge of tiling and scheduling a Machine learning operator is a plus.Experience in using C++ 14 (advanced features)Experience of profiling software and optimization techniquesHands on experience writing SIMD and/or multi-threaded high-performance code is a plus.Experience of ML compiler, Auto-code generation (using MLIR) is a plus.Experiences to run workloads on large scale heterogeneous clusters is a plus.Hands-on experience with CUDA, CUDNN is a plus.Qualifications:Bachelor's / Masters/ PHD degree in Engineering, Machine learning/ AI, Information Systems, Computer Science, or related field.8+ years Software Engineering or related work experience.8+ years' experience with Programming Language such as C++, Python.Minimum Qualifications:• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience. ORMaster's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience. ORPhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.• 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability- or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.If you would like more information about this role, please contact Qualcomm Careers.
-
Bengaluru, Karnataka, India Qualcomm Full time ₹ 12,00,000 - ₹ 24,00,000 per yearCompany:Qualcomm India Private LimitedJob Area:Engineering Group, Engineering Group > Software EngineeringGeneral Summary:The Qualcomm Cloud Computing team is developing hardware and software for Machine Learning solutions spanning the data center, edge, infrastructure, automotive market. We are seeking ambitious, bright, and innovative engineers with...
-
Staff Machine Learning Engineer
4 days ago
Bengaluru, Karnataka, India Zscaler Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout ZscalerServing thousands of enterprise customers around the world including 45% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world's largest security cloud, Zscaler accelerates digital...
-
Staff Machine Learning Engineer
2 weeks ago
Bengaluru, Karnataka, India Automation Anywhere Full time ₹ 8,00,000 - ₹ 24,00,000 per yearAbout UsAutomation Anywhere is the leader in Agentic Process Automation (APA), transforming how work gets done with AI-powered automation. Its APA system, built on the industry's first Process Reasoning Engine (PRE) and specialized AI agents, combines process discovery, RPA, end-to-end orchestration, document processing, and analytics—all delivered with...
-
Bengaluru, Karnataka, India Integers Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescriptionRole : Machine Learning & Generative AI EngineerJob Location : Bangalore (Hybrid 2-3 days from office)Experience : 3-7 yearsAbout The RoleWe are seeking a highly skilled Machine Learning & Generative AI Engineer to design, build, and deploy advanced ML and GenAI solutions.This role provides the opportunity to work on cutting-edge AI technologies...
-
Staff Machine Learning Engineer
2 weeks ago
Bengaluru, Karnataka, India Zscaler Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout ZscalerServing thousands of enterprise customers around the world including 45% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world's largest security cloud, Zscaler accelerates digital...
-
Machine Learning Engineer
4 days ago
Bengaluru, Karnataka, India HYrEzy Tech Solutions Full time ₹ 15,00,000 - ₹ 25,00,000 per yearLocation: Onsite - BengaluruCompany: :Y Combinator backed Insurtech Startup transforming the underwriting landscape with Generative AI. Our SaaS solutions help US-based insurance companies make smarter, faster decisions by optimizing underwriting processes, reducing risk, and improving premiums. We're looking for a Machine Learning Engineer - 1 to help us...
-
Senior Machine Learning Engineer
4 days ago
Bengaluru, Karnataka, India Trellix Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole Overview:We are seeking a highly skilled and experienced Senior Machine Learning Engineer to join our innovative Data Science and Engineering team. Reporting to the Data Science Director, you will play a critical role in building and scaling machine learning systems that power our cybersecurity products. You will work closely with data scientists and...
-
Machine Learning Engineer-2
7 days ago
Bengaluru, Karnataka, India YO HR Consultancy Full time ₹ 12,00,000 - ₹ 36,00,000 per yearPosition OverviewAs anMLE-2, you will be responsible for designing, implementing, and optimizing cutting-edge AI and machine learning solutions that deliver measurable business impact. You will lead theend-to-end ML lifecycle— from model development to deployment — while collaborating closely with cross-functional teams to enhance AI capabilities and...
-
Machine Learning Lead
5 days ago
Bengaluru, Karnataka, India Shashwath Solution Full time ₹ 1,50,00,000 - ₹ 2,50,00,000 per yearWe are seeking an experienced AI Lead with a minimum of 10 years of industry experience in the field of Artificial Intelligence. The ideal candidate will have a proven track record of successfully leading and delivering multiple projects in Machine Learning, Deep Learning, and Natural Language Processing (NLP). Additionally, they should possess sound...
-
Machine Learning Engineer
1 week ago
Bengaluru, Karnataka, India Capslock Marketplaces 🚀 Full time ₹ 12,00,000 - ₹ 24,00,000 per yearAbout the RoleAs a Machine Learning Engineer, you will be at the forefront of our AI initiatives, responsible for designing, developing, and deploying scalable AI/ML models for production. You will work on end-to-end machine learning pipelines, including fine-tuning LLMs and Indic TTS models, to build contextual memory and a great user experience for...