Senior Big Data Specialist

2 weeks ago


tumakuru, India beBeeDataEngineer Full time

Key Data Engineer RoleWe are seeking an experienced Senior Data Engineer to design and implement large-scale data pipelines that power our language model development.The ideal candidate will have expertise in data ingestion, processing, and quality for all AI training, building scalable systems that transform massive datasets into pristine formats.Key Responsibilities:Design, develop, and own robust ETL/ELT pipelines in Python for ingesting and processing terabyte-scale text datasets.Implement rigorous data cleaning, deduplication, filtering, and normalization strategies to ensure high-quality model training.Efficiently structure and format diverse datasets for consumption by LLM training frameworks.Collaborate with researchers and ML engineers to understand data requirements and support model training lifecycle.Optimize data processing workflows for speed, cost, and reliability.Occasionally assist in launching, monitoring, and debugging data-related issues during model training runs.Required Qualifications:8+ years of professional experience in data engineering, data processing, or backend software engineering.Expert-level proficiency in Python and its data ecosystem.Proven experience building and maintaining large-scale data pipelines.Deep understanding of data structures, data modeling, and software engineering best practices.Experience handling and parsing diverse data formats at scale.Excellent problem-solving skills and attention to detail.Strong communication and collaboration skills.



  • tumakuru, India beBeeDataMigration Full time

    Big Data Migration SpecialistOur organization is looking to hire a skilled professional to spearhead the migration initiative from Teradata to Cerebro (BigQuery).Migrate reporting data sources from Teradata to BigQuery.Update and validate existing Power BI reports to align with new BigQuery data models.Design and develop new tables, views, and queries in...

  • Big Data Specialist

    3 days ago


    tumakuru, India TRDFIN Support Services Pvt Ltd Full time

    Role OverviewResponsible for designing distributed data ecosystems capable of processing huge data volumes. The role demands expertise in big data clusters, large scale computations, and high-performance job tuning.Key ResponsibilitiesBuild Hadoop/Spark-based data platforms and clusters.Configure and administer distributed file systems (HDFS).Build...


  • Tumakuru, India AMISEQ Full time

    SRE & Devops (Big Data) Bengaluru, Exp- 3-6 YearsRequired Skills: ● Demonstrated ability in designing, building, refactoring and releasing software written in Java, Python. ● Strong experience with Big Data technologies including Hadoop, Spark, Flink, Hive, HDFS. ● Ability to handle data integration, bug fixing, performance optimization, and feature...

  • Information Architect

    2 weeks ago


    tumakuru, India beBeeDataExpert Full time

    Job OpportunityAs a seasoned data professional, you will excel in designing and implementing robust serverless architectures that leverage the power of big data and real-time processing.


  • tumakuru, India beBeeData Full time

    As a seasoned data professional, we are seeking an experienced Senior Big Data Developer to join our team.This role will be responsible for analyzing, organizing and processing raw data using big data technologies such as Spark.The ideal candidate will have expertise in performing data validation, cleaning and transformation using big data technologies...


  • tumakuru, India beBeeData Full time

    Job OpportunityAs a key member of our team, you will engage in ongoing support activities and project efforts as needed.Triage identified issues across account source platforms, integrations, and customer data hubs. Analyze and triage API messages as part of ongoing platform support.Perform CDH initial data loads and data cleanup using Postman, Python, SQL,...


  • tumakuru, India beBeeData Full time

    Career OpportunitiesJob DescriptionWe are seeking a Senior Data Integration Specialist to join our team. The ideal candidate will have experience in designing and developing ETL processes using Oracle Data Integrator to facilitate seamless data integration between various sources and targets.Key Responsibilities:Design and develop ETL processes using Oracle...


  • tumakuru, India beBeeDataEngineer Full time

    The role of a Cloud Data Engineer involves the design, implementation, and maintenance of large-scale data pipelines using Google Cloud Platform (GCP) services.This encompasses setting up and configuring cloud storage classes, managing data flows with Dataflow, querying big datasets with BigQuery, and leveraging PySpark/Python for scalable data...


  • tumakuru, India beBeeData Full time

    Senior Data Architecture SpecialistA key position in our organization is focused on designing and implementing scalable data infrastructure, utilizing cloud-based solutions such as Databricks. This role requires expertise in transforming large datasets using Azure Databricks & Delta lake technologies.We are seeking a highly skilled professional to design and...


  • tumakuru, India beBeeData Full time

    Senior Data Architect - HR Data Conversion Role: Senior Data Architect - HR Data ConversionJob Type: Full-timeJob Location: RemoteExperience: 5–10 yearsJob Description: Key Responsibilities:Design, develop, and maintain large-scale ETL pipelines using Python, Airflow, and data warehousing solutions.Collaborate with cross-functional teams to integrate...