
Principal Systems Performance Engineer
5 days ago
Our vision is to transform how the world uses information to enrich life for all Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of information into intelligence inspiring the world to learn communicate and advance faster than ever Principal Senior Systems Performance Engineer Micron Data Center and Client Workload Engineering in Hyderabad India is seeking a senior principal engineer to join our dynamic team The successful candidate will primarily contribute to the ML development ML DevOps HBM program in the data center by analyzing how AI ML workloads perform on the latest MU-HBM Micron main memory expansion memory and near memory HBM LP solutions conduct competitive analysis showcase the benefits that workloads see with MU-HBM s capacity bandwidth thermals contribute to marketing collateral and extract AI ML workload traces to help optimize future HBM designs Job Responsibilities The Job Responsibilities include but are not limited to the following Design implement and maintain scalable reliable ML infrastructure and pipelines Collaborate with data scientists and ML engineers to deploy machine learning models into production environments Automate and optimize ML workflows including data preprocessing model training evaluation and deployment Monitor and manage the performance reliability and scalability of ML systems Troubleshoot and resolve issues related to ML infrastructure and deployments Implement and manage distributed training and inference solutions to enhance model performance and scalability Utilize DeepSpeed TensorRT vLLM for optimizing and accelerating AI inference and training processes Understand key care abouts when it comes to ML models such as transformer architectures precision quantization distillation attention span KV cache MoE etc Build workload memory access traces from AI models Study system balance ratios for DRAM to HBM in terms of capacity and bandwidth to understand and model TCO Study data movement between CPU GPU and the associated memory subsystems DDR HBM in heterogeneous system architectures via connectivity such as PCIe NVLINK Infinity Fabric to understand the bottlenecks in data movement for different workloads Develop an automated testing framework through scripting Customer engagements and conference presentations to showcase findings and develop whitepapers Requirements Strong programming skills in Python and familiarity with ML frameworks such as TensorFlow PyTorch or scikit-learn Experience in data preparation cleaning splitting and transforming data for training validation and testing Proficiency in model training and development creating and training machine learning models Expertise in model evaluation testing models to assess their performance Skills in model deployment launching server live inference batched inference Experience with AI inference and distributed training techniques Strong foundation in GPU and CPU processor architecture Familiarity with and knowledge of server system memory DRAM Strong experience with benchmarking and performance analysis Strong software development skills using leading scripting programming languages and technologies Python CUDA C C Familiarity with PCIe and NVLINK connectivity Preferred Qualifications Experience in quickly building AI workflows building pipelines and model workflows to design deploy and manage consistent model delivery Ability to easily deploy models anywhere using managed endpoints to deploy models and workflows across accessible CPU and GPU machines Understanding of MLOps the overarching concept covering the core tools processes and best practices for end-to-end machine learning system development and operations in production Knowledge of GenAIOps extending MLOps to develop and operationalize generative AI solutions including the management of and interaction with a foundation model Familiarity with LLMOps focused specifically on developing and productionizing LLM-based solutions Experience with RAGOps focusing on the delivery and operation of RAGs considered the ultimate reference architecture for generative AI and LLMs Data management collect ingest store process and label data for training and evaluation Configure role-based access control dataset search browsing and exploration data provenance tracking data logging dataset versioning metadata indexing data quality validation dataset cards and dashboards for data visualization Workflow and pipeline management work with cloud resources or a local workstation connect data preparation model training model evaluation model optimization and model deployment steps into an end-to-end automated and scalable workflow combining data and compute Model management train evaluate and optimize models for production store and version models along with their model cards in a centralized model registry assess model risks and ensure compliance with standards Experiment management and observability track and compare different machine learning model experiments including changes in training data models and hyperparameters Automatically search the space of possible model architectures and hyperparameters for a given model architecture analyze model performance during inference monitor model inputs and outputs for concept drift Synthetic data management extend data management with a new native generative AI capability Generate synthetic training data through domain randomization to increase transfer learning capabilities Declaratively define and generate edge cases to evaluate validate and certify model accuracy and robustness Embedding management represent data samples of any modality as dense multi-dimensional embedding vectors generate store and version embeddings in a vector database Visualize embeddings for improvised exploration Find relevant contextual information through vector similarity search for RAGs Education Bachelor s or higher with 12 years of experience in Computer Science or related field About Micron Technology Inc We are an industry leader in innovative memory and storage solutions transforming how the world uses information to enrich life for all With a relentless focus on our customers technology leadership and manufacturing and operational excellence Micron delivers a rich portfolio of high-performance DRAM NAND and NOR memory and storage products through our Micron and Crucial brands Every day the innovations that our people create fuel the data economy enabling advances in artificial intelligence and 5G applications that unleash opportunities - from the data center to the intelligent edge and across the client and mobile user experience To learn more please visit micron com careers All qualified applicants will receive consideration for employment without regard to race color religion sex sexual orientation gender identity national origin veteran or disability status To request assistance with the application process and or for reasonable accommodations please contact Micron Prohibits the use of child labor and complies with all applicable laws rules regulations and other international and industry labor standards Micron does not charge candidates any recruitment fees or unlawfully collect any other payment from candidates as consideration for their employment with Micron AI alert Candidates are encouraged to use AI tools to enhance their resume and or application materials However all information provided must be accurate and reflect the candidate s true skills and experiences Misuse of AI to fabricate or misrepresent qualifications will result in immediate disqualification Fraud alert Micron advises job seekers to be cautious of unsolicited job offers and to verify the authenticity of any communication claiming to be from Micron by checking the official Micron careers website in the About Micron Technology Inc
-
Principal Engineer
1 week ago
Hyderabad, Telangana, India Centroid Systems, Inc. Full time US$ 90,000 - US$ 1,20,000 per yearPrincipal Engineer (Full Stack Developer) - US shift – Full timeAbout the RoleWe are seeking a Principal Managed Services Engineer to join our Managed Services Team and take ownership of supporting diverse client environments. This role focuses on maintaining, triaging, and improving a variety of customer systems — from custom-built applications to...
-
Principal Software Engineer
2 days ago
Hyderabad, Telangana, India Tanisha Systems Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Description: Senior/Principal Software Engineer (C++ - ITSO) About the Role We are looking for a skilled Embedded Software Engineer with strong expertise in Embedded C/C++ and experience in POS, payment solutions, or ITSO-linked systems. The role involves designing, developing, testing, and optimizing high-performance embedded software while working...
-
Principal Engineer
5 days ago
Hyderabad, Telangana, India Mulya Technologies Full timePrincipal Engineer – Analog Design We are a global company that makes industry-leading memory interface chips and Silicon IP to advance data center connectivity and solve the bottleneck between memory and processing We are a premier chip and silicon IP provider, is seeking to hire an exceptional Principal Engineer – Analog Design to join our memory...
-
Principal Systems Engineer
4 weeks ago
Hyderabad, Telangana, India Medtronic Full timeAt Medtronic you can begin a life-long career of exploration and innovation while helping champion healthcare access and equity for all You ll lead with purpose breaking down barriers to innovation in a more connected compassionate world A Day in the Life Principal Systems Engineer is responsible for both leading and contributing to investigations and...
-
Principal Operation Project Engineer
7 days ago
Hyderabad, Telangana, India Cubic Transportation Systems Full timeHiring Principal Operations Project EngineerExperience: 15+ YearsLocation: HyderabadNotice: Immediate to 30 DaysKey Skills:- Production support expertise- Root cause analysis- System stability & performance- Technical leadership- Operational excellence- Change and project management- Tools & methodologies- Communication skillsKey Responsibilities:Production...
-
Principal Systems Engineer
2 weeks ago
Hyderabad, Telangana, India Medtronic Full time US$ 1,50,000 - US$ 2,00,000 per yearAt Medtronic you can begin a life-long career of exploration and innovation, while helping champion healthcare access and equity for all. You'll lead with purpose, breaking down barriers to innovation in a more connected, compassionate world.A Day in the LifePrincipal Systems Engineer is responsible for both leading and contributing to investigations and...
-
Principal Operation Project Engineer
2 days ago
Hyderabad, Telangana, India Cubic Transportation Systems Full time ₹ 15,00,000 - ₹ 20,00,000 per yearHiring Principal Operations Project EngineerExperience: 15+ YearsLocation: HyderabadNotice: Immediate to 30 DaysKey Skills:Production support expertiseRoot cause analysisSystem stability & performanceTechnical leadershipOperational excellenceChange and project managementTools & methodologiesCommunication skillsKey Responsibilities:Production Support & Issue...
-
Principal Operation Project Engineer
5 days ago
Hyderabad, Telangana, India Cubic Transportation Systems Full timeHiring Principal Operations Project Engineer Experience: 15+ Years Location: Hyderabad Notice: Immediate to 30 Days Key Skills: Production support expertise Root cause analysis System stability & performance Technical leadership Operational excellence Change and project management Tools & methodologies Communication skills Key...
-
Principal Systems Engineer
3 weeks ago
Hyderabad, Telangana, India Medtronic Full timeAt Medtronic you can begin a life-long career of exploration and innovation while helping champion healthcare access and equity for all Youll lead with purpose breaking down barriers to innovation in a more connected compassionate world A Day in the LifeThe Firmware Engineer will be a member of the Engineering R D team working on the development and...
-
Principal Engineer
1 day ago
Hyderabad, Telangana, India Cloud4C Services Full time ₹ 9,00,000 - ₹ 12,00,000 per yearJob Title: Principal Engineer - StorageExperience: 7 -10 YearsLocation: Hyderabad( Work From Office)Job Description:We are looking for a skilled NetApp Storage Engineer with a strong understanding of storage systems and Data ONTAP administration in SAN & NAS environments. The ideal candidate will excel in managing storage infrastructure, troubleshooting, and...