Master Data Pipeline Architect

3 days ago


Kanpur, India beBeeData Full time

Job Title: Data Engineer – Python Expert
Location: Remote / Hybrid
Employment Type: Contract / Freelance

Role Summary

We are seeking a seasoned senior data engineer to architect, build, and own scalable automated systems that transform raw datasets into model-ready formats. The primary mission is to design, develop, and own robust ETL/ELT pipelines in Python for ingesting and processing terabyte-scale text datasets.

Key Responsibilities

  • Architect & Build: Design, develop, and maintain large-scale data pipelines in Python for ingestion and processing of massive datasets.
  • Data Quality: Implement data cleaning, deduplication, filtering, and normalization strategies and define data quality standards.
  • Data Transformation: Efficiently structure and format diverse datasets for consumption by LLM training frameworks.
  • Collaboration: Work closely with AI researchers and ML engineers to understand data requirements, define metrics, and support the model training lifecycle.
  • Optimization: Continuously optimize data processing workflows for speed, cost, and reliability.

Required Skills and Qualifications

  • 8+ years of professional experience in data engineering, data processing, or backend software engineering.
  • Proven expertise in Python and its ecosystem (e.g., Pandas, NumPy, Dask, Polars).
  • Experience handling and parsing diverse data formats at scale.
  • Excellent problem-solving skills, attention to detail, and strong communication skills.

Benefits

  • Opportunity to lead cutting-edge AI projects.
  • Collaborative team environment.
  • Competitive compensation package with continuous learning opportunities.


  • Data Architect

    2 weeks ago


    Kanpur, India DataAlchemy.AI Full time

    Data Architect. Role Overview: We seek a skilled Data Architect to lead the design, structuring, and management of enterprise data solutions powering our AI/ML platforms. You’ll collaborate closely with engineers to ensure robust, scalable, and high-quality data pipelines for model development and analytics. Key Responsibilities: Design and oversee data models,...


  • Kanpur, India beBeeData Full time

    Data Architect Specialist. Job Overview: We are seeking an experienced data architect to develop and implement robust, scalable, high-performance data architectures that support our enterprise data initiatives. Design end-to-end data solutions using modern cloud data platforms. Lead the planning, execution, and delivery of multiple data projects, ensuring...


  • Kanpur, India beBeeData Full time

    Cloud Data Architect. A challenging role awaits an experienced Cloud Data Architect to design and implement scalable cloud-native data systems on AWS. Data Engineering Expertise: Implement robust data pipelines using Apache Iceberg, AWS Glue, Redshift, and Atlan for enhanced governance and scalability. Legacy System Migration: Migrate legacy tools like Siebel,...


  • Kanpur, India beBeeData Full time

    About the Role: We are seeking a highly skilled Data Engineer to join our team and contribute to building and optimizing data pipelines, designing data architectures, and working on cloud-native data solutions. This is a collaborative environment where you can thrive on solving complex problems and love optimizing systems for performance. Key responsibilities...


  • Kanpur, India beBeeDataEngineer Full time

    Role Overview: As a seasoned data architect, you will play a pivotal role in designing and implementing robust data pipelines across diverse Oracle ecosystems. Design and develop scalable, high-performance ETL/ELT data pipelines using Oracle Data Integrator (ODI) for seamless data processing. Develop efficient data ingestion workflows through Python and shell...


  • Kanpur, India beBeeData Full time

    Enterprise Data Architect Lead: We are seeking a visionary Senior Data Architect to spearhead enterprise data architecture and drive business transformation through strategic technology initiatives. About the Role: Define and maintain a comprehensive enterprise data architecture strategy, roadmap, and standards that align with our business objectives. Lead...


  • Kanpur, India beBeeDataEngineer Full time

    Job Title: Cloud Data Engineer. About the Role: As a Cloud Data Engineer, you will be responsible for designing, implementing, and maintaining robust data pipelines and building scalable data lakes. Key Responsibilities: Design, develop, and maintain scalable ETL pipelines using cloud-native tools (AWS DMS, AWS Glue, Kafka, Azure Data Factory, GCP Dataflow,...


  • Kanpur, India beBeeDataarchitect Full time

    Lead Enterprise Data Architect. We're seeking a visionary data architect to spearhead the design and delivery of large-scale enterprise data solutions. Create end-to-end data architectures and roadmaps for complex data platforms. Drive the implementation of robust ETL/ELT pipelines and orchestration frameworks. The ideal candidate will have 12–15 years of...


  • Kanpur, India beBeeDataEngineer Full time

    Unlocking Data Insights with Scalable Pipelines. We are seeking a skilled professional to develop and maintain large-scale data infrastructure that drives business growth. This role requires a deep understanding of data modeling, ETL frameworks, and cloud-based data platforms. Key Responsibilities: Data Engineering: Design and implementation of scalable data...


  • Kanpur, India beBeeBackend Full time

    AI Architect and Data Engineer. About the Role: We are seeking a highly skilled AI architect and data engineer to join our team. As a Founding Backend Engineer, you will be responsible for designing and implementing the architecture of our AI system. Your primary focus will be on building the context layer, which serves as the sensory system that determines how...