Python Expert Large Scale Data Engineer

2 weeks ago


Alappuzha, India beBeeDataEngineering Full time

Senior Data Engineering Role

We are looking for a seasoned Senior Data Engineer to lead the data pipelines that power our large language model development. The ideal candidate will be the team's expert on data ingestion, processing, and quality for all AI training. Their primary mission is to build scalable, automated systems that transform massive, raw datasets into pristine, model-ready formats.

Key Responsibilities:

  • Data Ingestion and Processing: Design, develop, and own robust, scalable, and automated ETL/ELT pipelines in Python for ingesting and processing terabyte-scale text datasets.
  • Data Quality: Implement rigorous data cleaning, deduplication, filtering, and normalization strategies. Define and enforce data quality standards to ensure the highest integrity for model training.
  • Data Transformation: Efficiently structure and format diverse datasets (JSON, Parquet, etc.) for consumption by LLM training frameworks.
  • Collaboration: Work closely with our AI researchers and ML engineers to understand data requirements, define metrics, and support the model training lifecycle.
  • Optimization: Continuously optimize data processing workflows for speed, cost, and reliability.

Requirements:

  • 8+ years of professional experience in data engineering, data processing, or backend software engineering.
  • Expert-level proficiency in Python and its data ecosystem (e.g., Pandas, NumPy, Dask, Polars).
  • Proven experience building and maintaining large-scale data pipelines.
  • Deep understanding of data structures, data modeling, and software engineering best practices (Git, CI/CD, testing).
  • Experience handling and parsing diverse data formats (JSON, CSV, XML, Parquet) at scale.
  • Excellent problem-solving skills and meticulous attention to detail.
  • Strong communication and collaboration skills, with experience working in a team environment.

Benefits:

  • Opportunity to lead cutting-edge AI and ML projects.
  • Collaborative and innovative team culture.
  • Competitive compensation with continuous learning opportunities.

Talented individuals who thrive on solving large-scale data challenges and enjoy working at the intersection of data engineering and machine learning are encouraged to apply. A degree in Computer Science or a related field is preferred but not required.



  • Alappuzha, India beBeeDataScience Full time

    Job Description: We are seeking an expert Python developer to join our team as a Data Science Engineer. In this role, you will be responsible for designing and building high-performance Python systems used in large-scale AI data pipelines. The ideal candidate will have a strong background in Python development, with expertise in libraries such as Pandas, NumPy,...

  • Data Engineer

    2 weeks ago


    Alappuzha, India Aceolution Full time

    Job Title: Data Engineer – Python Expert (Freelance Role). Location: Remote / Hybrid. Employment Type: Contract / Freelance. Role Summary: We are looking for a seasoned Senior Data Engineer to architect, build, and own the data pipelines that power our large language model (LLM) development. As a senior Individual Contributor (IC), you will be the team's expert...


  • Alappuzha, India beBeeData Full time

    Senior Data Engineer Position: We are seeking an experienced Senior Data Engineer to join our team. This role will involve designing and building scalable data pipelines and cloud-based data solutions. Key Responsibilities: Design, build, and optimize large-scale data pipelines and ETL workflows. Develop efficient and reusable Python scripts for data...


  • Alappuzha, India beBeeDataEngineering Full time

    Job Title: Data Engineering Expert. Job Description: As a data engineering expert, you will be responsible for designing and implementing efficient data pipelines to extract insights from large datasets. Data extraction and transformation using SQL and ETL tools like SAP BODS. IDQ (Informatica Data Quality): Profiling, cleansing and validating data using business...


  • Alappuzha, India beBeeData Full time

    Job Title: Senior Data Architect. We are seeking an experienced Data Architecture Lead with 10+ years of experience in data engineering, data warehousing, and large-scale data platform design. The ideal candidate must have strong hands-on expertise in Snowflake, data architecture, ETL processes, and large data migration solutioning. The role involves leading...

  • Cloud Data Engineer

    1 week ago


    Alappuzha, India beBeeData Full time

    Top data engineering position available for a skilled professional seeking to leverage expertise in cloud computing, Python and SQL. A minimum of 3 years of experience in designing and implementing large-scale data systems using storage, virtual machines, serverless technologies and parallel processing. Proficiency in Python and supporting libraries including...


  • Alappuzha, India beBeeDevelopment Full time

    Job Title: Large Language Model (LLM) Training Expert for Coding Tasks. This role involves creating high-quality, contextually relevant code in PHP and curating datasets for training large language models. The expert will collaborate with other domain specialists to ensure accurate annotations and high-performance models suited for the coding...

  • Data engineer

    4 weeks ago


    Alappuzha, India Idyllic Services Full time

    Job Title: GCP Data Engineer. Experience: 5+ Years. Location: Pune. Employment Type: Full-time. Job Summary: We are looking for an enthusiastic GCP Data Engineer to join our dynamic and growing project team. In this role, you will have the opportunity to develop your skills while contributing to large-scale data projects on Google Cloud Platform. You will work...

  • Data Science Expert

    2 weeks ago


    Alappuzha, India beBeeResearch Full time

    Scientific Researcher Opportunity: Join our team of innovators and experts in the life sciences domain as a researcher. You will be working on developing automated workflows, using Python for data cleaning, wrangling, and analysis. Key responsibilities include working with large datasets, contributing to AI/ML-driven solutions, and applying logical thinking to...

  • Data Strategist

    1 week ago


    Alappuzha, India beBeeCloud Full time

    Job Title: Cloud Solutions Architect. USEReady is a leading provider of data and analytics solutions. We partner with top cloud ecosystem leaders like Snowflake, Tableau, and Amazon Web Services. We are seeking an experienced Principal Snowflake Architect to act as a Trusted Advisor and Global Evangelist for our data practice. This role requires a rare blend of...