Data Engineer

2 days ago


Jamnagar, Gujarat, India Kalyani Technologies Full time

Overview:

We are looking for a highly skilled Python Data Engineer to join our team in an on-premise data engineering environment. The ideal candidate will have experience in ETL tools, data processing technologies, data orchestration, and relational databases. Additionally, you should be proficient in Python scripting for data engineering tasks and have experience working with Spark, PySpark, and other relevant data technologies. While cloud tools are a good-to-have, this position primarily focuses on on-premise data infrastructure.

This is an excellent opportunity to work on exciting projects that require developing scalable data pipelines, real-time data streaming, and optimizing data processing tasks using Python.

Key Responsibilities:

  • ETL Development & Optimization: Design, develop, and optimize ETL pipelines using open-source or cloud ETL tools (e.g., Apache Nifi, Talend, Pentaho, Airflow, AWS Glue).
  • Python Scripting for Data Engineering: Write Python scripts to automate data extraction, transformation, and loading (ETL) processes. Ensure that the code is optimized for performance and scalability.
  • Big Data Processing: Work with Apache Spark and PySpark to process large datasets in a distributed computing environment. Optimize Spark jobs for performance and resource efficiency.
  • Job Orchestration: Use Apache Airflow or other orchestration tools to schedule, monitor, and automate data pipeline workflows.
  • Data Streaming: Design and implement real-time data streaming solutions using technologies like Apache Kafka or AWS Kinesis for high-throughput, low-latency data processing.
  • File Formats & Table Formats: Work with open-source table formats like Apache Parquet, Apache Avro, or Delta Lake, and other structured/unstructured data formats for efficient data storage and access.
  • Database Management: Work with relational databases (e.g., PostgreSQL, MySQL, SQL Server) for data storage, management, and optimization. Understand database concepts such as normalization, indexing, and query optimization.
  • SQL Expertise: Write and optimize complex SQL queries for data extraction, transformations, and aggregation across large datasets. Ensure queries are efficient and scalable.
  • BI & Data Warehouse Knowledge: Exposure to BI tools and data warehousing concepts is a plus, ensuring the data is structured in a way that supports analytics and reporting.

Required Skills & Experience:

  • ETL Tools: Experience working with open-source ETL tools such as Apache Nifi, Talend, or Pentaho. Cloud-based tools like AWS Glue or Azure Data Factory are good to have.
  • Python Scripting: Proficiency in Python for automating data processing tasks, writing data pipelines, and working with libraries such as Pandas, Dask, PySpark, etc.
  • Big Data Technologies: Experience with Apache Spark and PySpark for distributed data processing, along with optimization techniques.
  • Data Orchestration: Experience using Apache Airflow or similar tools for scheduling and automating data pipelines.
  • Data Streaming: Experience with Apache Kafka or AWS Kinesis for building and managing real-time data pipelines.
  • Open-Source File Formats: Knowledge of Apache Parquet, Apache Avro, Delta Lake, or similar open-source table formats for efficient data storage and retrieval.
  • Relational Databases: Strong experience with at least one relational database (e.g., PostgreSQL, MySQL, SQL Server) and a solid understanding of database concepts like indexing, normalization, and query optimization.
  • SQL Expertise: Strong skills in writing and optimizing complex SQL queries for data extraction, transformations, and aggregation.

Nice to Have:

  • BI/Analytics Tools: Familiarity with BI tools like Power BI, Tableau, Looker, or similar reporting and data visualization platforms.
  • Data Warehousing: Knowledge of data warehousing principles, schema design (e.g., star/snowflake), and optimization techniques for large datasets.
  • Cloud Technologies: Experience with cloud data platforms like Databricks, Snowflake, or Azure Synapse is beneficial, though the role is focused on on-prem environments.
  • Containerization: Familiarity with containerization tools like Docker or Kubernetes for deploying data engineering workloads.

Educational Qualifications:

  • Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or a related field (or equivalent work experience).

Additional Qualities:

  • Excellent problem-solving and troubleshooting skills.
  • Ability to work both independently and in a collaborative environment.
  • Strong communication skills, both written and verbal.
  • Detail-oriented with a focus on data quality and performance optimization.
  • Proactive attitude and the ability to take ownership of projects.

  • Data Engineer

    4 days ago


    Jamnagar, Gujarat, India HCLTech Full time

    Role: Data EngineerTotal years of experience: 9+ YearsLocation: PAN India (Any HCL office)Relevant experience for engagement: 5 YearsNote: - We are conducting a virtual discussion on this coming Saturday, 13th of Sep 25 from 9:30 am to 6:30 pm, if interested please feel free to share your profilesJob Description:Relevant Skills:Maintain architecture...


  • Jamnagar, Gujarat, India beBeeDataEngineer Full time ₹ 40,00,000 - ₹ 50,00,000

    Job SummaryWe are seeking a seasoned data engineering leader to drive the development of scalable, high-performance data platforms and advanced analytics capabilities across the enterprise.You will be responsible for building a robust data practice, establishing technical standards, and empowering cross-functional teams to deliver production-grade data...

  • Cloud Data Engineer

    2 days ago


    Jamnagar, Gujarat, India beBeeDataEngineering Full time ₹ 15,00,000 - ₹ 30,00,000

    Data Engineering Expert - Cloud BasedWe are seeking a highly skilled Data Engineer to join our team.Design and implement scalable data pipelines using cloud services (e.g., Cloud Dataflow, Cloud Composer/Airflow, Pub/Sub).Develop and maintain ETL/ELT processes for ingesting, transforming, and loading data into Snowflake.Collaborate with cross-functional...


  • Jamnagar, Gujarat, India beBeeEngineering Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

    We are seeking a talented Data Engineer to join our dynamic team. As a key member of our data infrastructure, you will play a critical role in designing and implementing scalable data solutions to support our business initiatives.Job Responsibilities:Design and develop efficient data pipelines and ETL processes to ingest, transform, and store data from...


  • Jamnagar, Gujarat, India beBeeDataEngineering Full time ₹ 15,00,000 - ₹ 20,10,000

    As a Senior Data Engineer, you will lead the development of scalable data processing systems. Your expertise in cloud-based big data platforms and proficiency in programming languages like Python, Scala, and Spark will enable you to design and implement robust data pipelines.Key Responsibilities:Collaborate with cross-functional teams to integrate data...


  • Jamnagar, Gujarat, India beBeeDataEngineer Full time ₹ 15,00,000 - ₹ 20,00,000

    Job DescriptionWe are seeking a skilled Senior Data Engineer Lead to design, develop and optimize ETL/ELT data pipelines using Apache Spark (PySpark or Scala), AWS Glue, and Azure Data Factory.Responsibilities include working with structured and unstructured data to build scalable ingestion and transformation workflows across cloud platforms. You will...

  • Data Engineering Lead

    5 hours ago


    Jamnagar, Gujarat, India ZettaMine Labs Pvt. Ltd. Full time

    Job Description – Data Engineering Lead (AI for Data Quality & Analytics)Exp Range :7+years Location : Bengaluru/HyderabadRole OverviewWe are seeking an experienced Data Engineering Lead with a strong foundation in data pipelines, data validation, and AI-enhanced quality frameworks. This role is responsible for building and maintaining high-quality data...

  • Data Engineer

    1 week ago


    Jamnagar, Gujarat, India Enterprise Minds, Inc Full time

    Position Name- Data EngineerNo of Positions-2Exp-4-8 YearsMode-HybridLocation-PuneNP- 10-15 DaysYour PositionAs a data engineer, you will be responsible for delivering data intelligence solutions to our customers all around the globe, based on an innovative product, which provides insights into the performance of their material handling systems. You will be...


  • Jamnagar, Gujarat, India beBeeExpertise Full time ₹ 1,80,00,000 - ₹ 2,28,00,000

    Job DescriptionWe are seeking an experienced Senior Data Engineer to join our team. As a key member of our data engineering group, you will be responsible for designing, developing, and deploying large-scale data processing systems.Required Skills and QualificationsProven experience leading data engineering teams, including distributed teams across multiple...


  • Jamnagar, Gujarat, India beBeeDataEngineer Full time ₹ 23,04,000 - ₹ 2,59,20,000

    Job DescriptionAs a data engineer, you will play a pivotal role in unlocking the full potential of our organization's data assets.Our ideal candidate will possess a unique blend of technical expertise and business acumen, enabling them to design, develop, and maintain scalable data pipelines that drive business growth and inform strategic decision-making.Key...