Pyspark Engineer

7 days ago


Gurgaon, Haryana, India Citi Full time ₹ 8,00,000 - ₹ 16,00,000 per year

Responsibilities:

  • Engineering Degree with 1-2 years of experience in BigData systems, Hive, Hadoop, Spark (Python/ scala) and cloud based data management technologies
  • Hands-on experience in Unix Scripting, Python and Scala programing along with strong experience in SQL.
  • Comfortable working with completed unstructured, undocumented code and turning it around into best in class code redesigning costly compute and data processes and aligning to best development standards
  • Experienced in working with large and multiple datasets, data warehouses and ability to pull data using relevant programs and coding.
  • Well versed with necessary data preprocessing and application engineering skills
  • At least 3 years of experience designing software systems with intense computational needs across real time and batch process .
  • Experience and understanding of Supervised, unsupervised machine learning techniques
  • Exposure to data ingestion, ETL tools such as Talend, modeling tools, Performance Management tooling such as Pepper data, Cloudera stack will be a plus
  • Knowledge of data management, data governance, data security and regulatory practices
  • Ability to identify, clearly articulate and solve complex business problems and present them to the management in a structured and simpler form
  • Should have experience of working in onsite, offsite delivery model
  • Experience working with large and multiple datasets, data warehouses and ability to pull data using relevant programs and coding.
  • Experience in Credit Cards and Retail Banking
  • Should have excellent communication and inter-personal skills
  • Strong process/project management skills
  • Multiple stake holder management
  • Control orientated and Risk awareness

Qualifications:

  • Fast Learner with a desire to excel and attitude to partner and solve problems in complex environments placing business objectives at center or all activity.
  • Experience in Performance Tuning, Code Re-engineering is preferred.
  • Experience in broad IT architecture and design preferred across data and channels
  • Experience in query tuning, automation technologies (Autosys, Jenkins, Service Now) preferred
  • Exposure to container technology, Machine learning will be a plus

Education:

  • Bachelors/University degree or equivalent experience

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

Job Family Group:

Decision Management

Job Family:

Data/Information Management

Time Type:

Full time

Most Relevant Skills

Please see the requirements listed above.

Other Relevant Skills

Python (Programming Language), Spark SQL.

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View Citi's EEO Policy Statement and the Know Your Rights poster.



  • Gurgaon, Haryana, India RBS Full time ₹ 15,00,000 - ₹ 30,00,000 per year

    Join us as a Software Engineer, PySparkThis is an opportunity for a driven Software Engineer to take on an exciting new career challengeDay-to-day, you'll be engineering and maintaining innovative, customer centric, high performance, secure and robust solutionsIt's a chance to hone your existing technical skills and advance your career while building a wide...


  • Gurgaon, Haryana, India RBS Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Join us as a Software Engineer, PySparkThis is an opportunity for a technically minded individual to join us as a Software EngineerYou'll be designing, producing, testing and implementing working software, working across the lifecycle of the systemHone your existing software engineering skills and advance your career in this critical roleWe're offering this...

  • Pyspark Engineer

    5 days ago


    Gurgaon, Haryana, India Citi Full time

    Engineering Degree with 1-2 years of experience in BigData systems, Hive, Hadoop, Spark (Python/ scala) and cloud based data management technologies Hands-on experience in Unix Scripting, Python and Scala programing along with strong experience in SQL. Comfortable working with completed unstructured, undocumented code and turning it around into best in class...


  • Gurgaon, Haryana, India Impetus Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Job Title:Lead Data Engineer – GCP (BigQuery • Composer • Python • PySpark)Location:GurgaonExperience:8+ years (data engineering / analytics engineering), with previous lead responsibilitiesAbout the Role:You will lead the design, build and operation of large-scale data platforms on the Google Cloud Platform. You will manage a team of data engineers,...


  • Gurgaon, Haryana, India EXL Talent Acquisition Team Full time ₹ 1,50,00,000 - ₹ 3,00,00,000 per year

    Data Engineer Consultant – Job DescriptionJob Summary:Data Engineer (DE) Consultant is responsible for designing, developing, and maintaining data assets and data related products by liaising with multiple stakeholders..Qualifications and Skills:Strong knowledge on Python and Pyspark  Expectation is to have ability to write Pyspark scripts for developing...

  • AWS Data Engineer

    7 days ago


    Gurgaon, Haryana, India Imbibe Consultancy Services Pvt Ltd Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Experience:5 to 8 yearsLocation:Bengaluru, Gurgaon, PuneJob code:101356Posted on:Oct 27, 2025About UsAceNet Consulting is a fast-growing global business and technology consulting firm specializing in business strategy, digital transformation, technology consulting, product development, start-up advisory and fund-raising services to our global clients across...


  • Gurgaon, Haryana, India EXL Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    DescriptionData Engineer Consultant – Job DescriptionJob Summary:Data Engineer (DE) Consultant is responsible for designing, developing, and maintaining data assets and data related products by liaising with multiple stakeholders..Qualifications and Skills:Strong knowledge on Python and Pyspark  Expectation is to have ability to write Pyspark scripts for...


  • Gurgaon, Haryana, India Recruin Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Only Immediate JoineesResponsibilities:Skills – AWS, DE, AirflowKey ResponsibilitiesDesign, develop, and maintain data pipelines using AWS services such as Glue, Lambda, Step Functions, EMR, Kinesis, and S3.Build and optimize data warehouses and data lakes on AWS (Redshift, Lake Formation).Develop ETL/ELT jobs using PySpark, Spark, Python, and AWS native...

  • Azure Data Engineer

    2 weeks ago


    Gurgaon, Haryana, India Infogain Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    ROLES & RESPONSIBILITIESKey ResponsibilitiesAnalyze existing Hadoop, Pig, and Spark scripts from Dataproc and refactor them into Databricks-native PySpark.Implement data ingestion and transformation pipelines using Delta Lake best practices.Apply conversion rules and templates for automated code migration and testing.Conduct data validation between legacy...

  • Pyspark Developer

    5 days ago


    Gurgaon, Haryana, India Citi Full time

    Gather operational data from various cross functional stakeholders to examine past business performance. Interpret business requirements to engineer efficient code solutions, with a strong focus on modularity, scalability, and long-term maintainability Will be involved in exploratory data analysis, confirmatory data analysis and/or qualitative analysis....