Data Scientist

3 weeks ago


Vellore, India FPX AI Full time
Role Overview

FPX is building an AI infrastructure marketplace that enables developers to access and deploy compute efficiently. As a PyTorch + CUDA Engineer focused on benchmarking and performance, you will be responsible for designing, running, and interpreting benchmarks across model, framework, and hardware stacks. Your role is critical in validating performance claims, detecting regressions, and guiding optimizations to ensure FPX remains at the cutting edge of compute efficiency.

You will collaborate closely with ML systems, compiler, hardware, and platform teams. You should have strong experience in PyTorch internals, GPU programming, and profiling tools.


Key Responsibilities
  • Define, build, and maintain a benchmark suite covering representative deep learning workloads (training, inference, mixed) across modalities (vision, NLP, recommendation, etc.).
  • Automate running of benchmarks across multiple hardware configurations (NVIDIA GPUs, possibly AMD, and future accelerators).
  • Use profiling, tracing, and performance tools (e.g. Nsight Systems, Nsight Compute, PyTorch Profiler, CUPTI, NVTX) to identify bottlenecks across layers (operator, kernel, memory, data movement).
  • Write and maintain scripts / harnesses that manage benchmark orchestration, result collection, and analysis (latency, throughput, memory usage, utilization metrics).
  • Detect and triage performance regressions (e.g. nightly, CI-integrated benchmarks).
  • Partner with compiler / runtime / kernel teams to propose optimizations, micro-bench kernel patches, fusion, operator-level improvements, or configuration tuning.
  • Validate performance improvements across scale (multi-GPU, distributed) and in production-like settings.
  • Publish benchmark results, document methodology, and communicate trade-offs to stakeholders (engineering, product, customers).
  • Occasionally assist in custom kernel development when needed (e.g. fused kernels, optimized CUDA code) or integrating specialized libraries (Triton, CUTLASS, cuBLAS, cuDNN).
  • Stay up-to-date on new features in PyTorch (e.g. torch.compile, CUDA Graphs, new backends) and evaluate their impact.


Required Qualifications
  • BS / MS / PhD in Computer Science, Electrical Engineering, or equivalent experience.
  • Solid experience (3+ years) in GPU programming: CUDA, kernel development, memory management, concurrency.
  • Deep familiarity with PyTorch internals (operators, autograd, dispatcher, JIT/inductor pipeline or equivalent).
  • Experience with profiling and analysis of GPU workloads (Nsight, CUPTI, NVTX, PyTorch Profiler).
  • Strong Python and C++ skills.
  • Ability to analyze low-level performance (latency, throughput, memory, occupancy) and correlate to high-level model behavior.
  • Experience writing benchmark harnesses, automation, and result pipelines.
  • Excellent communication skills — able to present performance trade-offs and complex analysis to technical and non-technical audiences.


Preferred / Nice-to-Have
  • Experience with distributed training/inference (DDP, FSDP, model parallelism).
  • Experience with PyTorch’s newer compilation pathways (e.g. torch.compile, Inductor, Dynamo).
  • Knowledge of CUDA Graphs, kernel fusion, memory optimizations, tensor core usage.
  • Experience with other ML frameworks and baselining comparisons (TensorFlow, JAX, ONNX).
  • Published benchmarks, open-source contributions, or performance tools development.
  • Prior experience in systems, compilers, or GPU runtime development.
  • Familiarity with scaling benchmarks, cluster deployments, and heterogeneous hardware.


Compensation
  • Competitive salary + equity + benefits.
  • Potential for bonuses tied to performance improvements and critical benchmark delivery.

  • Data Scientist

    2 weeks ago


    Vellore, India FactEntry Data Solutions Pvt Ltd Full time

    **Python NLP Engineer - Machine Learning** **Skills**: - Python developer - 3+yrs - Elastic Search - 1+yrs - Database - SQL, MongoDB 3+yrs - AWS or cloud / web hosting - 1+yrs **Desired Qualification**: - Proficient knowledge of programming in Python - Ability to wrangle both Unstructured and Structured data. - Work on research problems in information...

  • Data Scientist

    2 days ago


    Vellore, India STRYDO TECHNOLOGIES PVT.LTD Full time

    We are looking for an expert in machine learning to help us extract value from our data. You will lead all the processes from data collection, cleaning, and pre-processing, to training models and deploying them to production. Schedule: - Day shift

  • Vp - data science

    3 days ago


    Vellore, India Capri Global Capital Ltd. Full time

    The VP - Data Science will oversee the development and implementation of data-driven solutions across the organization. The role involves leading a team of data scientists, collaborating with cross-functional teams, and delivering actionable insights to support business decisions. The ideal candidate will have a deep understanding of machine learning,...


  • Vellore, India Ampstek Full time

    Title: Senior Generative AI Engineer (Databricks Data Lake)Location: Remote (India)Full Time Job Summary:About the RoleWe are seeking an experienced Senior Generative AI Engineer with a strong background in Databricks and data lake architectures. This individual will be responsible for designing, developing, and deploying cutting-edge Generative AI (GenAI)...


  • Vellore, India Chaitanya HR Consultancy Full time

    ***: - Gathering and analyzing infection data to make evidence-based decisions - Educating medical and public health professionals on infection prevention protocols to facilitate emergency preparedness - Isolating and treating infected individuals to contain the spread of infectious diseases - Assisting with the development of action plans in case of a...


  • Vellore, India Undocked Full time

    About Us At Undocked, we help companies excel in e-commerce by delivering bespoke optimizations and cutting-edge analytics. Our experiences in retail and supply chain product strategy, technology, and operations have helped organizations succeed in their e-commerce and digital transformation journeys. We are looking for an Azure AI Foundry Developer to join...

  • Software Engineer

    2 weeks ago


    Vellore, India TPI Global Solutions Full time

    Job Title: Engineer - Software Lvl 3 - MLOpsLocation: RemoteJob Type: ContractContract Duration: 6 MonthsShift: Night (8:30 PM to 5:30 AM)Duties:Build tools for automation around ML workflows orchestration and model deployments using CI/CD workflowsAutomate infrastructure deployments and tool configurations using Terraform or similar IaC toolsWrite clean,...


  • Vellore, India SPRINTPARK Full time

    Company Description:SprintPark is a comprehensive IT consulting and solutions provider located in Hyderabad. We focus on delivering tailored, efficient, and innovative solutions to meet the evolving needs of businesses. Our services include IT consulting, staffing, project management, and software solutions aimed at driving business success and fostering...