Senior Data Engineer

2 days ago


India MyRemoteTeam Inc Full time

About Us MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent. We empower businesses by providing world-class software engineers, operations support, and infrastructure to help them grow faster and better. Position: Senior Data Engineer (Python Coder) Location: India ( Remote ) Work Commitment: 40 Hrs / Week (full-time) Contract Duration: 3 - 6 Months Client: Wipro ( Google ) BGV: YES Role: Senior Data Engineer (Python Coder) Exp: Min. 8 Years Role Summary We are looking for a seasoned Senior Data Engineer to architect, build, and own the data pipelines that power our large language model (LLM) development. As a senior Individual Contributor (IC) , you will be the team's expert on data ingestion, processing, and quality for all AI training. Your primary mission is to build scalable, automated systems that transform massive, raw datasets into pristine, model-ready formats. While your focus will be on data engineering, your expertise will be valued in collaborating on model training runs and experiments. You're the perfect fit if you are a Python expert who thrives on solving large-scale data challenges and enjoys working at the intersection of data engineering and machine learning. Key Responsibilities Architect & Build: Design, develop, and own robust, scalable, and automated ETL/ELT pipelines in Python for ingesting and processing terabyte-scale text datasets. Data Quality: Implement rigorous data cleaning, deduplication, filtering, and normalization strategies. Define and enforce data quality standards to ensure the highest integrity for model training. Data Transformation : Efficiently structure and format diverse datasets (JSON, Parquet, etc.) for consumption by LLM training frameworks. Collaboration: Work closely with our team of AI researchers and ML engineers to understand data requirements, define metrics, and support the model training lifecycle. Optimization: Continuously optimize data processing workflows for speed, cost, and reliability. ML Support (Secondary): Occasionally assist in launching, monitoring, and debugging data-related issues during model training runs. Required Qualifications 8+ years of professional experience in data engineering, data processing, or backend software engineering. Expert-level proficiency in Python and its data ecosystem (e.g., Pandas, NumPy, Dask, Polars). Proven experience building and maintaining large-scale data pipelines. Deep understanding of data structures, data modeling, and software engineering best practices (Git, CI/CD, testing). Experience handling and parsing diverse data formats (JSON, CSV, XML, Parquet) at scale. Excellent problem-solving skills and a meticulous attention to detail. Strong communication and collaboration skills, with experience working in a team environment. Preferred Qualifications (Nice-to-Haves) Hands-on experience with the data preprocessing pipeline for an LLM (e.g., LLaMA, BERT, GPT-family). Strong experience with big data frameworks like Apache Spark or Ray. Experience with Hugging Face libraries (Transformers, Datasets, Tokenizers). Familiarity with ML frameworks like PyTorch or TensorFlow. Proficiency with cloud platforms (AWS, GCP, Azure) and their data/storage services.



  • Hyderabad, Telangana, , India The Modern Data Company Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Role Overview:The Senior Data Platform Engineer is an experienced professional who designs and optimizes complex data processing solutions to address sophisticated business needs. In this role, you will tackle large -scale batch and streaming data projects, ensuring that data pipelines and platforms are scalable, high -performing, and secure. A Senior Data...


  • Pune, India NTT Data Full time

    Job Description NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Data Engineer Senior Consultant to join our team in Pune, Mahrshtra (IN-MH), India (IN). Location: | Experience: 5+...


  • India Crest Data Full time

    Crest Data is the global leading provider of Data Analytics, Security, DevOps, Cloud Solutions, Software integrations, Analytics, and security-based technological services. With a clientele that includes several Fortune 500 corporations and some of the innovative Silicon Valley Startups. We are looking for an experienced and enthusiastic Senior DevOps...

  • Data Engineer

    2 weeks ago


    India Crayon Data Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    Job DescriptionData Engineer – ChennaiBuild the foundation of data intelligence — design, develop, and optimize data pipelines that power AI-driven insights.Location: Chennai, IndiaExperience: 2–3 years in data engineering or data-driven product developmentRole OverviewAs a Data Engineer, you will be responsible for building and maintaining scalable...

  • Senior Data Engineer

    3 weeks ago


    India AIQU Full time

    We are hiring for Senior Data Engineer to join one of our major clients based out of KSA.Job Details:Role: Senior Data EngineerWork Location: RemoteEmployment Type: Contract – 12 months & extendableRole Summary:The Senior Data Engineer plays a lead role in designing, building, optimizing, and governing data pipelines and architectures that power analytics,...

  • Data Engineer

    15 minutes ago


    India NTT DATA Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Req ID: 343254NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Data Engineer to join our team in Remote, Karnātaka (IN-KA), India (IN). "Key Responsibilities: Design and implement...


  • India RapidBrains Full time

    Job Title: Senior Data Engineer Experience: 6+ Years Employment Type: Contract Location: Remote Overview We are looking for a Senior Data Engineer with deep expertise in Azure Data Engineering to design, build, and optimize large-scale data pipelines. The ideal candidate will have strong experience with Azure Data Factory (ADF), Azure Synapse, PySpark, and...


  • India RapidBrains Full time

    Job Title: Senior Data Engineer Experience: 6+ Years Employment Type: Contract Location: Remote Overview We are looking for a Senior Data Engineer with deep expertise in Azure Data Engineering to design, build, and optimize large-scale data pipelines. The ideal candidate will have strong experience with Azure Data Factory (ADF), Azure Synapse, PySpark, and...


  • India Hunarstreet Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Senior Data Engineer Location: Remote We are hiring a Senior Data Engineer for one of our leading software development clients. The Senior Data Engineer will play a key role in building and extending data pipeline architecture, optimizing data flow, and supporting cross-functional teams. The ideal candidate has extensive experience in handling big data and...


  • Bengaluru, India NTT Data Full time

    Job Description NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Senior DevOps Engineer to join our team in Bangalore, Karntaka (IN-KA), India (IN). DevOps Engineer Senior Development...