PySpark Hive Data Engineer

18 hours ago


Pune, Maharashtra, India Citi Full time ₹ 12,00,000 - ₹ 36,00,000 per year

The RoleThe Data Engineer is accountable for developing high quality data products to support the Bank's regulatory requirements and data driven decision making. A Data Engineer will serve as an example to other team members, work closely with customers, and remove or escalate roadblocks. By applying their knowledge of data architecture standards, data warehousing, data structures, and business intelligence they will contribute to business outcomes on an agile team.ResponsibilitiesDeveloping and supporting scalable, extensible, and highly available data solutionsDeliver on critical business priorities while ensuring alignment with the wider architectural visionIdentify and help address potential risks in the data supply chainFollow and contribute to technical standardsDesign and develop analytical data modelsRequired Qualifications & Work ExperienceFirst Class Degree in Engineering/Technology (4-year graduate course)8 to 12 years' experience implementing data-intensive solutions using agile methodologies, should be hands on on PySpark, Hive, HDFS, HadoopShould have strong understanding of AWS Glue Serverless Data Integration, Terraform, deploying Apache Spark on AWS, using Elastic Kubernetes Service (EKS), use of deployment tools LightSpeed TetkonExperience of relational databases and using SQL for data querying, transformation and manipulationExperience of modelling data for analytical consumersAbility to automate and streamline the build, test and deployment of data pipelinesExperience in cloud native technologies and patternsA passion for learning new technologies, and a desire for personal growth, through self-study, formal classes, or on-the-job trainingExcellent communication and problem-solving skillsAn inclination to mentor; an ability to lead and deliver medium sized components independentlyTechnical Skills (Must Have)ETL: Hands on experience of building data pipelines. Proficiency in two or more data integration platforms such as Ab Initio, Apache Spark, Talend and InformaticaBig Data: Experience of 'big data' platforms such as Hadoop, Hive or Snowflake for data storage and processingData Warehousing & Database Management: Expertise around Data Warehousing concepts, Relational (Oracle, MSSQL, MySQL) and NoSQL (MongoDB, DynamoDB) database designData Modeling & Design: Good exposure to data modeling techniques; design, optimization and maintenance of data models and data structuresLanguages: Proficient in one or more programming languages commonly used in data engineering such as Python, Java or ScalaDevOps: Exposure to concepts and enablers - CI/CD platforms, version control, automated quality control managementData Governance: A strong grasp of principles and practice including data quality, security, privacy and complianceTechnical Skills (Valuable)Ab Initio: Experience developing Co>Op graphs; ability to tune for performance. Demonstrable knowledge across full suite of Ab Initio toolsets e.g., GDE, Express>IT, Data Profiler and Conduct>IT, Control>Center, Continuous>FlowsCloud: Good exposure to public cloud data platforms such as S3, Snowflake, Redshift, Databricks, BigQuery, etc. Demonstratable understanding of underlying architectures and trade-offsData Quality & Controls: Exposure to data validation, cleansing, enrichment and data controlsContainerization: Fair understanding of containerization platforms like Docker, KubernetesFile Formats: Exposure in working on Event/File/Table Formats such as Avro, Parquet, Protobuf, Iceberg, DeltaOthers: Experience of using a Job scheduler e.g., Autosys. Exposure to Business Intelligence tools e.g., Tableau, Power BICertification on any one or more of the above topics would be an advantage.-Job Family Group:Technology-Job Family:Digital Software Engineering-Time Type:Full time-Most Relevant SkillsPlease see the requirements listed above.-Other Relevant SkillsFor complementary skills, please see above and/or contact the recruiter.-Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi. View Citi's EEO Policy Statement and the Know Your Rights poster.


  • Data Engineer

    2 weeks ago


    Pune, Maharashtra, India, Maharashtra Tata Consultancy Services Full time

    Job Title :- Data Engineer - PysparkExperience: 5 to 8 YearsLocation: Pune/HyderabadJob DescriptionRequired Skills:5+ years of experience in Big data and pysparkMust-HaveGood work experience on Big Data Platforms like Hadoop, Spark, Scala, Hive, Impala, SQLGood-to-HaveGood Spark, Pyspark,Big Data experienceSpark UI/Optimization/debugging techniquesGood...

  • Data Engineer

    7 days ago


    Pune, Maharashtra, India Tata Consultancy Services Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Job Title :- Data Engineer - PysparkExperience: 5 to 8 YearsLocation: Pune/HyderabadJob DescriptionRequired Skills:5+ years of experience in Big data and pysparkMust-HaveGood work experience on Big Data Platforms like Hadoop, Spark, Scala, Hive, Impala, SQLGood-to-HaveGood Spark, Pyspark,Big Data experienceSpark UI/Optimization/debugging techniquesGood...

  • Pyspark Developer

    3 days ago


    Pune, Maharashtra, India Tech Mahindra Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Pyspark DeveloperRequirements:Mandatory: Primary skill: Pyspark, Data Engineering, Azure Data BricksGood Experience of Hadoop, Hive, and Cloudera/ Azure/GCP 3+ years of experience in the design and implementation of Big Data systems using PySpark, database migration, transformation and integration solutions for any Data warehousing project.Must have...


  • Pune, Maharashtra, India Bitwise Solutions Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Role: Jr. Pyspark Data EngineerExperience: 2 to 4 yearsNotice Period: ImmediateWork Mode: HybridPosition OverviewWe are looking for a skilled PySpark Data Engineer with 2 to 4 years of experience. The ideal candidate should have strong expertise in building and optimizing data pipelines using PySpark and should have experience working on cloud platforms like...

  • Data Engineer

    2 weeks ago


    Pune, Maharashtra, India Binary Star SearchX Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Job Description : - Minimum 5 years of experience in data analytics field - Experience with Azure/AWS Databricks - Experience in building and optimizing data pipelines, architectures and data sets - Excellent experience in Scala or Python, PySpark and SQL - Ability to troubleshoot and optimize complex queries on the Spark platform -...


  • Pune, Maharashtra, India Lumiq Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Data Engineer - ClouderaLocation: Pune & MumbaiMode: 5 Days WFOWho we areWe are a leading Data and Analytics company helping enterprises across industries solve their complex data challenges. Our expertise lies in building next-gen data platforms and solutions that enable organizations to unlock the full potential of their data.We partner with enterprises...

  • Data Engineer

    3 days ago


    Pune, Maharashtra, India Syansoft Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Expertise in Hadoop ecosystem, PySpark, Python,SQL,develop,maintain scalable data pipelines ETL workflows.Ensuring data quality,optimizing performance,collaborating stakeholders,managing HDFS/Hive/HBase,supporting data-driven cloud (GCP preferred).


  • Pune, Maharashtra, India Futurz Staffing Solutions Full time ₹ 12,00,000 - ₹ 18,00,000 per year

    Note- Associate "Already applied in NCS, will not be eligible"Role & responsibilitiesEssential for this role:Education and Qualifications:• Bachelors degree in IT, Computer Science, Software Engineering, Business Analytics or equivalent.• Minimum seven plus years of experience in data analytics field• Experience with Azure/AWS Databricks• Experience...

  • Big Data Engineer_C

    3 days ago


    Pune, Maharashtra, India Kaizen Sra Technologies Full time ₹ 8,00,000 - ₹ 25,00,000 per year

    Hi All,Skill: Bigdata EngineerExp: 6-9 YearsLocation: Pune, ChennaiF2F Interview on 19th Jul 2025. Who are interested please send me your updated resume.Mandatory Skills: PySpark, spark, python , GCP, SCALA, SQL, Hadoop, Hive, AWS, GCPKey Responsibilities:Design, develop, and maintain scalable data pipelines and ETL workflows using PySpark, Hadoop, and...


  • Pune, Maharashtra, India Citi Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    At Citi we're not just building technology, we're building the future of banking. Encompassing a broad range of specialties, roles, and cultures, our teams are creating innovations used across the globe. Citi is constantly growing and progressing through our technology, with laser focused on evolving the ways of doing things. As one of the world's most...