Pyspark data engineer

11 hours ago


Delhi, India ITI Data Full time
Location : India
Type : Full-time
Experience : 10 – 13 years
Functions : Consulting, Finance, Information Technology, Big Data Engineering
Industries : Capital Markets, Investment Banking, Alternative Investments, Financial Services, Management Consulting, Information Technology and Services, Healthcare
Job Description
We are looking for a Py Spark solutions developer and data engineer that is able to design and build solutions for one of our Fortune 500 Client programs, which aims towards building a data standardized and curation needs on Hadoop cluster. This is high visibility, fast-paced key initiative will integrate data across internal and external sources, provide analytical insights, and integrate with the customer’s critical systems.
Key Responsibilities
Ability to design, build and unit test applications on Spark framework on Python .
Build Py Spark based applications for both batch and streaming requirements, which will require in-depth knowledge on majority of Hadoop and No SQL databases as well.
Develop and execute data pipeline testing processes and validate business rules and policies
Optimize performance of the built Spark applications in Hadoop using configurations around Spark Context, Spark-SQL, Data Frame, and Pair RDD's .
Experience in skilled in Hadoop, Kafka, Scala, Spark.
Optimize performance for data access requirements by choosing the appropriate native Hadoop file formats ( Avro, Parquet, ORC etc ) and compression codec respectively.
Ability to design & build real-time applications using Apache Kafka & Spark Streaming Build integrated solutions leveraging Unix shell scripting, RDBMS, Hive, HDFS File System, HDFS File Types, HDFS compression codec. Build data tokenization libraries and integrate with Hive & Spark for column-level obfuscation Experience in processing large amounts of structured and unstructured data, including integrating data from multiple sources. Create and maintain integration and regression testing framework on Jenkins integrated with Bit Bucket and/or GIT repositories
Participate in the agile development process, and document and communicate issues and bugs relative to data standards in scrum meetings
Work collaboratively with onsite and offshore team. Develop & review technical documentation for artifacts delivered. Ability to solve complex data-driven scenarios and triage towards defects and production issues Ability to learn-unlearn-relearn concepts with an open and analytical mindset Participate in code release and production deployment
  • PySpark Data Engineer

    15 hours ago


    Delhi, India ITI Data Full time

    Location : IndiaType : Full-timeExperience : 10 – 13 yearsFunctions : Consulting, Finance, Information Technology, Big Data EngineeringIndustries : Capital Markets, Investment Banking, Alternative Investments, Financial Services, Management Consulting, Information Technology and Services, HealthcareJob DescriptionWe are looking for a PySpark...

  • PySpark Data Engineer

    17 hours ago


    Delhi, India ITI Data Full time

    Location:IndiaType: Full-timeExperience: 10 – 13 yearsFunctions: Consulting, Finance, Information Technology, Big Data EngineeringIndustries: Capital Markets, Investment Banking, Alternative Investments, Financial Services, Management Consulting, Information Technology and Services, HealthcareJob DescriptionWe are looking for a PySpark solutions developer...


  • Delhi, India ITI Data Full time

    Location:IndiaType: Full-timeExperience: 10 – 13 yearsFunctions: Consulting, Finance, Information Technology, Big Data EngineeringIndustries: Capital Markets, Investment Banking, Alternative Investments, Financial Services, Management Consulting, Information Technology and Services, HealthcareJob DescriptionWe are looking for a PySpark solutions developer...


  • Delhi, Delhi, India Tata Consultancy Services Full time

    About Tata Consultancy ServicesTata Consultancy Services (TCS) is a global leader in IT services, consulting, and business solutions. With over 50 years of experience, TCS has been at the forefront of driving innovation and growth through technology.Job Title: Data Engineer for Azure Synapse and Pyspark DevelopmentThis role offers an exciting opportunity to...

  • Data Engineer

    4 days ago


    Delhi, India Tata Consultancy Services Full time

    Job DescriptionName of the position: Data engineerLocation : Bangalore, Hyderabad,Mumbai, Pune ,Chennai,NCRSkill Requirements & Experience- Overall 8+ years of work experience in Data Warehouse(DWH) Development- Experience in Azure, Python , SQL , Pyspark- Hands on exp in Azure data factory, data bricks- Exposure and experience to Cosmos- Understanding of...

  • AWS Data Engineer

    4 weeks ago


    delhi, India ITI Data Full time

    Job Description We are looking for an AWS Data with primary skills on PySpark development who will be able to design and build solutions for one of our Fortune 500 Client programs, which aims towards building an Enterprise Data Lake on AWS Cloud platform, build Data pipelines by developing several AWS Data Integration, Engineering & Analytics resources....

  • AWS Data Engineer

    1 month ago


    delhi, India ITI Data Full time

    Job DescriptionWe are looking for an AWS Data with primary skills on PySpark development who will be able to design and build solutions for one of our Fortune 500 Client programs, which aims towards building an Enterprise Data Lake on AWS Cloud platform, build Data pipelines by developing several AWS Data Integration, Engineering & Analytics resources. There...

  • AWS Data Engineer

    4 months ago


    Delhi, India ITI Data Full time

    Job DescriptionWe are looking for an AWS Data with primary skills on PySpark development who will be able to design and build solutions for one of our Fortune 500 Client programs, which aims towards building an Enterprise Data Lake on AWS Cloud platform, build Data pipelines by developing several AWS Data Integration, Engineering & Analytics resources. There...


  • delhi, India Data Warehouse Engineer Full time

    Experience : 2- 5 YearsPrimary Skills : Strong SQL knowledge, Data warehouse concepts and ETL hands on experience, Azure services like Data Factory, Azure Data Lake, and Good analysis skills.Good to have skills: Power BI, Databricks, Python.


  • Delhi, India Data Warehouse Engineer Full time

    Experience : 2- 5 YearsPrimary Skills : Strong SQL knowledge, Data warehouse concepts and ETL hands on experience, Azure services like Data Factory, Azure Data Lake, and Good analysis skills.Good to have skills: Power BI, Databricks, Python.


  • Delhi, India KC Executive Search Full time

    Technical Stack: Python, Spark, AWS, PySparkResponsibilitiesDevelop and enhance data-processing, orchestration, monitoring, and more by leveraging popular open-source software, AWS, and GitLab automation.Collaborate with product and technology teams to design and validate the capabilities of the data platform Identify, design, and implement process...


  • Delhi, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose – the relentless pursuit of a world that works better for people – we...


  • Delhi, India Tata Consultancy Services Full time

    Dear Candidate,Greetings from TCS Human Resources Team!!Job Role: Pyspark Hadoop DeveloperJob Locations: BengaluruExperience Range: 5 to 10 years5+yrs of hands on experience in Hadoop Scala, Pyspark, Hive, Spark , Hive/Impala/SQL with Hadoop eco systemGood Data background. Hadoop is the primary skill and PySpark is secondary experience.Experience in design...


  • Delhi, India KC Executive Search Full time

    Technical Stack: Python, Spark, AWS, PySparkResponsibilitiesDevelop and enhance data-processing, orchestration, monitoring, and more by leveraging popular open-source software, AWS, and GitLab automation.Collaborate with product and technology teams to design and validate the capabilities of the data platform Identify, design, and implement process...


  • Delhi, India Tata Consultancy Services Full time

    Dear Candidate,Greetings from TCS Human Resources Team!!Job Role: Pyspark Hadoop DeveloperJob Locations: BengaluruExperience Range: 5 to 10 years- 5+yrs of hands on experience in Hadoop Scala, Pyspark, Hive, Spark , Hive/Impala/SQL with Hadoop eco system- Good Data background. Hadoop is the primary skill and PySpark is secondary experience.- Experience in...

  • Python +pyspark

    6 months ago


    Delhi, India Nityo Infotech Full time

    2 hours ago **Job Code**: JD-19623 **JOB DESCRIPTION**: Mandatory Skills - Python, Pyspark, DataBricks, SQL Primary Skills: - Hands-on Python, PySpark, Databricks, SQL, AWS (S3, Lambda, EC2, RDS), and CI-CD tools. Job description: - 6 or more years of experience developing, testing, and implementing major Information Technology programs or projects that...

  • PySpark Developer

    3 days ago


    Delhi, India Tata Consultancy Services Full time

    Role : PySpark Developer Technical Skill Set : PySpark, Python, HDFS, Hadoop, SQLLocation : Mumbai, Pune ,Chennai, Banglore , NCR, HyderabadMust-Have :Sound programming knowledge on PySpark & SQL in terms of processing large amount of semi structured & unstructured dataAbility to design data pipelines in end to end mannerKnowledge on Avro, Parquet...

  • PySpark Developer

    2 weeks ago


    Delhi, India Tata Consultancy Services Full time

    Role - PySpark DeveloperExperience - 5 TO 10 YRSLocation - Bangalore / HyderabadMust Have - PysparkDesired Competencies (Technical/Behavioral Competency)Must-Have · Hands-on experience in Pyspark including Dataframe core functions sparkSQL and SparkStreaming. · Minimum 3 years of experience HIVE, Hadoop, Kafka, YARN, HBase & MongoDB · Hands on...

  • PySpark Developer

    2 weeks ago


    Delhi, India Tata Consultancy Services Full time

    Role - PySpark DeveloperExperience- 5 TO 10 YRSLocation- Bangalore / HyderabadMust Have- PysparkDesired Competencies (Technical/Behavioral Competency)Must-Have · Hands-on experience in Pyspark including Dataframe core functions sparkSQL and SparkStreaming. · Minimum 3 years of experience HIVE, Hadoop, Kafka, YARN, HBase & MongoDB · Hands on experience in...

  • Pyspark Developer

    2 weeks ago


    Delhi, India Tata Consultancy Services Full time

    Greetings from Tcs !!!We are conducting interviews for Pyspark Developer .Role: Pyspark DeveloperExperience: 6-10 YearsLocation: BangaloreResponsibility of / Expectations from the Role- Minimum 5 years of PySpark Development experience, especially in Spark SQL and Complex Transformations- Minimum 3 years of Python development experience- Minimum 2 years of...