Senior Engineer, Data Engineering

5 days ago


India R Systems Full time
Job Title: Data Engineer Contract Period: 12 Months
Offshore candidates accepted (Singapore Based Company)
Experience
Minimum 4+ years as a Data Engineer or similar role.(Please don't apply if less than 4 years exp in Data Engineer)
Proven experience in Python, Spark, and PySpark (non-negotiable).(Hands-on in building ETL pipelines, real-time streaming, and data transformations .
Worked with data warehouses, cloud platforms (AWS/Azure/GCP) , and databases .

✅ Spark Core API : RDDs, transformations/actions, DAG execution.
Spark SQL : DataFrames, schema optimization, UDFs.
Data Handling : S3, HDFS, JDBC, Parquet, Avro, ORC.
Performance Tuning : Cloud Deployment : Databricks, AWS EMR, Azure HDInsight, GCP Dataproc.
Bachelor’s/Master’s in Computer Science, Computer Engineering, or equivalent.
We are seeking an experienced Data Engineer to join our team and support data-driven initiatives. This role involves building scalable pipelines, working with streaming data, and collaborating with data scientists and business stakeholders to deliver high-quality solutions.

Design, build, and optimize data pipelines and ETL workflows .
Manage and process large datasets using Spark, PySpark, and SQL .
Collaborate with data scientists and product teams to integrate AI/ML models into production.
Ensure data quality, scalability, and performance in all pipelines.
Deploy and manage Spark workloads on cloud platforms (AWS, Azure, GCP, Databricks) .
Automate testing and deployment of Spark jobs via CI/CD pipelines.

Bachelor’s/Master’s degree in Computer Science, Computer Engineering, or related field.
Minimum 4 years of professional experience as a Data Engineer.
Strong expertise in Python, Spark, PySpark .
Hands-on experience with Spark SQL, DataFrames, UDFs, DAG execution .
Knowledge of data ingestion tools (Kafka, Flume, Kinesis) and data formats (Parquet, Avro, ORC).
Familiarity with performance tuning in Spark (partitioning, caching, broadcast joins).
Experience deploying on Databricks, AWS EMR, Azure HDInsight, or GCP Dataproc .
Exposure to testing and CI/CD for data pipelines (pytest, Jenkins, GitHub Actions).

  • India Beige Bananas Full time

    Company Description Beige Bananas is a rapidly growing pure play AI consulting firm that focuses on building hyper custom AI products for Fortune 500 Retail, CPG, and Media companies. The company embraces an outcome-driven mindset to help clients accelerate value realization from their analytics investments. Role Description This is a full-time remote...


  • India Beige Bananas Full time

    Company DescriptionBeige Bananas is a rapidly growing pure play AI consulting firm that focuses on building hyper custom AI products for Fortune 500 Retail, CPG, and Media companies. The company embraces an outcome-driven mindset to help clients accelerate value realization from their analytics investments.Role DescriptionThis is a full-time remote role for...


  • Pune, India NTT Data Full time

    Job Description NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Senior GenAI Engineers to join our team in Pune, Mahrshtra (IN-MH), India (IN). Location: | Experience: 8+...


  • Mumbai, India NTT Data Full time

    Job Description NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Senior GenAI Engineers to join our team in Mumbai, Mahrshtra (IN-MH), India (IN). Location: | Experience: 8+...


  • Bengaluru, India NTT DATA, Inc. Full time

    Job Description Make an impact with NTT DATA Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion its a place where you can grow, belong and thrive. Your day at NTT...

  • Lead Data Engineer

    4 days ago


    India Eucloid Data Solutions Full time

    Eucloid is looking for a Lead Data Engineer to join our Data Platform team supporting various business applications. The ideal candidate will support development of data infrastructure on Databricks for our clients by participating in activities which may include starting from up- stream and down-stream technology selection to designing and building of...

  • Lead Data Engineer

    4 days ago


    India Eucloid Data Solutions Full time

    Eucloid is looking for a Lead Data Engineer to join our Data Platform team supporting various business applications. The ideal candidate will support development of data infrastructure on Databricks for our clients by participating in activities which may include starting from up- stream and down-stream technology selection to designing and building of...

  • Lead Data Engineer

    3 days ago


    India Eucloid Data Solutions Full time

    Eucloid is looking for a Lead Data Engineer to join our Data Platform team supporting various business applications. The ideal candidate will support development of data infrastructure on Databricks for our clients by participating in activities which may include starting from up- stream and down-stream technology selection to designing and building of...

  • Lead Data Engineer

    3 days ago


    India Eucloid Data Solutions Full time

    Eucloid is looking for a Lead Data Engineer to join our Data Platform team supporting various business applications. The ideal candidate will support development of data infrastructure on Databricks for our clients by participating in activities which may include starting from up- stream and down-stream technology selection to designing and building of...


  • India Vista Applied Solutions Group Inc Full time

    Job Summary: Our client is seeking a Senior Data Engineer to lead the modernization of our enterprise data platform using Microsoft Fabric, with a focus on building an AI-ready foundation. This role is key to replacing legacy ERP systems, retiring SSIS, and implementing scalable, code-driven data pipelines to support advanced analytics and future AI...