PySpark Engineer
7 days ago
Responsibilities:
- Engineering degree with 1-2 years of experience in Big Data systems, Hive, Hadoop, Spark (Python/Scala) and cloud-based data management technologies
- Hands-on experience in Unix scripting, Python and Scala programming, along with strong experience in SQL
- Comfortable working with completely unstructured, undocumented code and turning it into best-in-class code, redesigning costly compute and data processes and aligning them with best development standards
- Experienced in working with large and multiple datasets and data warehouses, with the ability to pull data using relevant programs and coding
- Well versed in the necessary data preprocessing and application engineering skills
- At least 3 years of experience designing software systems with intense computational needs across real-time and batch processing
- Experience with and understanding of supervised and unsupervised machine learning techniques
- Exposure to data ingestion and ETL tools such as Talend, modeling tools, performance management tooling such as Pepperdata, and the Cloudera stack will be a plus
- Knowledge of data management, data governance, data security and regulatory practices
- Ability to identify, clearly articulate and solve complex business problems and present them to management in a structured, simpler form
- Should have experience working in an onsite/offsite delivery model
- Experience in Credit Cards and Retail Banking
- Should have excellent communication and interpersonal skills
- Strong process/project management skills
- Management of multiple stakeholders
- Control orientation and risk awareness
Qualifications:
- Fast learner with a desire to excel and an attitude of partnering to solve problems in complex environments, placing business objectives at the center of all activity
- Experience in performance tuning and code re-engineering is preferred
- Experience in broad IT architecture and design across data and channels is preferred
- Experience in query tuning and automation technologies (AutoSys, Jenkins, ServiceNow) is preferred
- Exposure to container technology and machine learning will be a plus
Education:
- Bachelor's degree/University degree or equivalent experience
This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.
Job Family Group:
Decision Management
Job Family:
Data/Information Management
Time Type:
Full time
Most Relevant Skills
Please see the requirements listed above.
Other Relevant Skills
Python (Programming Language), Spark SQL.
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi's EEO Policy Statement and the Know Your Rights poster.