Data Engineer

21 hours ago


Pune, Maharashtra, India Kalyani Technologies Full time

Overview:


We are looking for a highly skilled Python Data Engineer to join our team in an on-premise data engineering environment. The ideal candidate will have experience in ETL tools, data processing technologies, data orchestration, and relational databases. Additionally, you should be proficient in Python scripting for data engineering tasks and have experience working with Spark, PySpark, and other relevant data technologies. While cloud tools are a good-to-have, this position primarily focuses on on-premise data infrastructure.

This is an excellent opportunity to work on projects that involve developing scalable data pipelines, building real-time data streaming solutions, and optimizing data processing tasks using Python.


Key Responsibilities:

  • ETL Development & Optimization: Design, develop, and optimize ETL pipelines using open-source or cloud ETL tools (e.g., Apache NiFi, Talend, Pentaho, Airflow, AWS Glue).
  • Python Scripting for Data Engineering: Write Python scripts to automate data extraction, transformation, and loading (ETL) processes. Ensure that the code is optimized for performance and scalability.
  • Big Data Processing: Work with Apache Spark and PySpark to process large datasets in a distributed computing environment. Optimize Spark jobs for performance and resource efficiency.
  • Job Orchestration: Use Apache Airflow or other orchestration tools to schedule, monitor, and automate data pipeline workflows.
  • Data Streaming: Design and implement real-time data streaming solutions using technologies like Apache Kafka or AWS Kinesis for high-throughput, low-latency data processing.
  • File Formats & Table Formats: Work with open-source file formats such as Apache Parquet and Apache Avro, table formats such as Delta Lake, and other structured/unstructured data formats for efficient data storage and access.
  • Database Management: Work with relational databases (e.g., PostgreSQL, MySQL, SQL Server) for data storage, management, and optimization. Understand database concepts such as normalization, indexing, and query optimization.
  • SQL Expertise: Write and optimize complex SQL queries for data extraction, transformations, and aggregation across large datasets. Ensure queries are efficient and scalable.
  • BI & Data Warehouse Knowledge: Exposure to BI tools and data warehousing concepts is a plus, as it helps ensure that data is structured to support analytics and reporting.


Required Skills & Experience:

  • ETL Tools: Experience working with open-source ETL tools such as Apache NiFi, Talend, or Pentaho. Cloud-based tools like AWS Glue or Azure Data Factory are good to have.
  • Python Scripting: Proficiency in Python for automating data processing tasks, writing data pipelines, and working with libraries such as pandas, Dask, and PySpark.
  • Big Data Technologies: Experience with Apache Spark and PySpark for distributed data processing, along with optimization techniques.
  • Data Orchestration: Experience using Apache Airflow or similar tools for scheduling and automating data pipelines.
  • Data Streaming: Experience with Apache Kafka or AWS Kinesis for building and managing real-time data pipelines.
  • Open-Source File Formats: Knowledge of open-source file and table formats such as Apache Parquet, Apache Avro, and Delta Lake for efficient data storage and retrieval.
  • Relational Databases: Strong experience with at least one relational database (e.g., PostgreSQL, MySQL, SQL Server) and a solid understanding of database concepts like indexing, normalization, and query optimization.
  • SQL Expertise: Strong skills in writing and optimizing complex SQL queries for data extraction, transformations, and aggregation.


Nice to Have:

  • BI/Analytics Tools: Familiarity with BI tools like Power BI, Tableau, Looker, or similar reporting and data visualization platforms.
  • Data Warehousing: Knowledge of data warehousing principles, schema design (e.g., star/snowflake), and optimization techniques for large datasets.
  • Cloud Technologies: Experience with cloud data platforms like Databricks, Snowflake, or Azure Synapse is beneficial, though the role is focused on on-prem environments.
  • Containerization: Familiarity with containerization tools like Docker or Kubernetes for deploying data engineering workloads.


Educational Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, Information Systems, or a related field (or equivalent work experience).


Additional Qualities:

  • Excellent problem-solving and troubleshooting skills.
  • Ability to work both independently and in a collaborative environment.
  • Strong communication skills, both written and verbal.
  • Detail-oriented with a focus on data quality and performance optimization.
  • Proactive attitude and the ability to take ownership of projects.

