Spark Developer

4 weeks ago


Chennai, India Citi Full time

ETL Developer will be responsible for designing, implementing, and optimizing distributed data processing jobs to handle large-scale data in Hadoop Distributed File System(HDFS) using Apache Spark and Python. This role required deep understanding of data engineering principles, proficiency in Python and hands-on experience with Spark and Hadoop ecosystems. Developer will collaborate with data engineers, analysts, and business stakeholders to process, transform and drive insights and data driven decisions.

Responsibilities:

  • Data Processing and Transformation:

Design and Implement of Spark applications to process and transform large datasets in HDFS.

Develop ETL Pipelines in Spark using Python for data Ingestion, cleaning, aggregation, and transformations.

Performance Optimization:

Optimize Spark jobs for efficiency, reducing run time and resource usage.

Finetune memory management, caching, and partitioning strategies for Optimal performance

Data Engineering with Hadoop and Spark:

Load data from different sources into HDFS, ensuring data accuracy and integrity.

Integrate Spark Applications with Hadoop frameworks like Hive, Sqoop etc.

Testing and debugging:

Troubleshoot and debug Spark Job failures, monitor job logs, and Spark UI to Identify Issues.

Qualifications:

  • 2-5 years of relevant experience
  • Experience in programming/debugging used in business applications
  • Working knowledge of industry practice and standards
  • Comprehensive knowledge of specific business area for application development
  • Working knowledge of program languages
  • Consistently demonstrates clear and concise written and verbal communication
  • Expertise in handling complex large-scale Warehouse environments
  • Hands-on experience writing complex SQL queries, exporting and importing large amounts of data using utilities

Education:

  • Bachelor's degree in a quantitative field (such as Engineering, Computer Science) or equivalent experience

This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

-

Job Family Group:

Technology

-

Job Family:

Applications Development

-

Time Type:

Full time

-

Most Relevant Skills

Please see the requirements listed above.

-

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

-

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi .

View Citi's EEO Policy Statement and the Know Your Rights poster.


  • Spark Developer

    4 days ago


    Chennai, Tamil Nadu, India Citi Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    ETL Developer will be responsible for designing, implementing, and optimizing distributed data processing jobs to handle large-scale data in Hadoop Distributed File System(HDFS) using Apache Spark and Python. This role required deep understanding of data engineering principles, proficiency in Python and hands-on experience with Spark and Hadoop ecosystems....

  • Scala Spark Developer

    4 weeks ago


    Chennai, India Path Mentors Staffing Solution Full time

    Scala/Spark Developer: 5+ yrs experience in designing scalable data pipelines with Scala, Spark, Hadoop, Kafka. Skilled in data formats(Avro,Parquet),HDFS, SQL, NoSQL. Proficient in troubleshooting, optimization, delivering reliable bigdata solutions Required Candidate profile Scala/Spark Developer with 5+ years' experience in building scalable data...


  • Chennai, India Rarr Technologies Full time

    Well versed with: Very good proficiency in Python and Spark programming. Pandas: Experience with data manipulation and analysis using Pandas. Implementation experience of Spark Core, Spark SQL and Spark Streaming Working with Spark in combination Hadoop Ecosystem Design and implementation of low-latency, high-availability, and performance applications....

  • Spark Scala Developer

    2 weeks ago


    Chennai, India Infotel UK Full time

    Position: Spark Scala Developer Company: Infotel India At Infotel UK, a premier technology consulting firm, we are seeking a skilled Spark Scala Developer to join our dynamic team. In this role, you will leverage your expertise in Spark and Scala to design and implement advanced data processing solutions that drive insights and provide value to our...

  • Spark Scala Developer

    2 weeks ago


    Chennai, India Infotel UK Full time

    Position: Spark Scala Developer Company: Infotel India At Infotel UK, a premier technology consulting firm, we are seeking a skilled Spark Scala Developer to join our dynamic team. In this role, you will leverage your expertise in Spark and Scala to design and implement advanced data processing solutions that drive insights and provide value to our...

  • Spark Scala Developer

    3 weeks ago


    Chennai, India Tata Consultancy Services Full time

    Dear Candidates,Greetings from TCS!!!!TCS is looking for Spark Scala DeveloperExperience: 6-8 yearsLocation: ChennaiMUST HAVE SKILLS:Data Engineer Design and develop scalable and efficient solutions using Spark, Scala, Python, Airflow.Good code and configuration spark knowledge (at least 1 or 2 profiles should be very good at spark configuration and cluster...


  • Chennai, India Tata Consultancy Services Full time

    Dear Candidates,Greetings from TCS!!!!TCS is looking for Spark Scala DeveloperExperience: 6-8 yearsLocation: ChennaiMUST HAVE SKILLS:Data Engineer Design and develop scalable and efficient solutions using Spark, Scala, Python, Airflow.Good code and configuration spark knowledge (at least 1 or 2 profiles should be very good at spark configuration and cluster...

  • Spark Scala Developer

    2 weeks ago


    Chennai, India Tata Consultancy Services Full time

    Dear Candidates, Greetings from TCS!!!! TCS is looking for Spark Scala Developer Experience: 6-8 years Location: Chennai MUST HAVE SKILLS: - Data Engineer Design and develop scalable and efficient solutions using Spark, Scala, Python, Airflow. - Good code and configuration spark knowledge (at least 1 or 2 profiles should be very good at spark configuration...


  • Chennai, India Tata Consultancy Services Full time

    Dear Candidates, Greetings from TCS!!!! TCS is looking for Spark Scala Developer Experience: 6-8 years Location: Chennai MUST HAVE SKILLS: Data Engineer Design and develop scalable and efficient solutions using Spark, Scala, Python, Airflow. Good code and configuration spark knowledge (at least 1 or 2 profiles should be very good at spark configuration and...

  • Spark Scala Developer

    2 weeks ago


    Chennai, India Tata Consultancy Services Full time

    Dear Candidates,Greetings from TCS!!!!TCS is looking for Spark Scala DeveloperExperience: 6-8 yearsLocation: ChennaiMUST HAVE SKILLS:- Data Engineer Design and develop scalable and efficient solutions using Spark, Scala, Python, Airflow.- Good code and configuration spark knowledge (at least 1 or 2 profiles should be very good at spark configuration and...