Data Engineer with Bigdata, Spark, Hive and Airflow | Basic knowledge working with Kubernetes IRC273751

2 weeks ago


Chennai, Tamil Nadu, India GlobalLogic Full time ₹ 40,00,000 - ₹ 1,20,00,000 per year

Description

  • Data engineer with 4 to 6 years of hands on experience working on Big Data Platforms
  • Experience building and optimizing Big data data pipelines and data sets ranging from Data ingestion to Processing to Data Visualization.
  • Good Experience in writing and optimizing Spark Jobs, Spark SQL etc. Should have worked on both batch and steaming data processing
  • Good experience in any one programming language -Scala/Python , Python preferred.
  • Experience in writing and optimizing complex Hive and SQL queries to process huge data. good with UDFs, tables, joins,Views etc
  • Experience in using Kafka or any other message brokers
  • Configuring, monitoring and scheduling of jobs using Oozie and/or Airflow
  • Processing streaming data directly from Kafka using Spark jobs, expereince in Spark- streaming is must
  • Should be able to handling different file formats (ORC, AVRO and Parquet) and unstructured data
  • Should have experience with any one No SQL databases like Amazon S3 etc
  • Should have worked on any of the Data warehouse tools like AWS Redshift or Snowflake or BigQuery etc
  • Work expereince on any one cloud AWS or GCP or Azure

Requirements

  • Data engineer with 4 to 6 years of hands on experience working on Big Data Platforms
  • Experience building and optimizing Big data data pipelines and data sets ranging from Data ingestion to Processing to Data Visualization.
  • Good Experience in writing and optimizing Spark Jobs, Spark SQL etc. Should have worked on both batch and steaming data processing
  • Good experience in any one programming language -Scala/Python , Python preferred.
  • Experience in writing and optimizing complex Hive and SQL queries to process huge data. good with UDFs, tables, joins,Views etc
  • Experience in using Kafka or any other message brokers
  • Configuring, monitoring and scheduling of jobs using Oozie and/or Airflow
  • Processing streaming data directly from Kafka using Spark jobs, expereince in Spark- streaming is must
  • Should be able to handling different file formats (ORC, AVRO and Parquet) and unstructured data
  • Should have experience with any one No SQL databases like Amazon S3 etc
  • Should have worked on any of the Data warehouse tools like AWS Redshift or Snowflake or BigQuery etc
  • Work expereince on any one cloud AWS or GCP or Azure

Good to have skills:

  • Experience in AWS cloud services like EMR, S3, Redshift, EKS/ECS etc
  • Experience in GCP cloud services like Dataproc, Google storage etc
  • Experience in working with huge Big data clusters with millions of records
  • Experience in working with ELK stack, specially Elasticsearch
  • Experience in Hadoop MapReduce, Apache Flink, Kubernetes etc

Job responsibilities

  • Data engineer with 4 to 6 years of hands on experience working on Big Data Platforms
  • Experience building and optimizing Big data data pipelines and data sets ranging from Data ingestion to Processing to Data Visualization.
  • Good Experience in writing and optimizing Spark Jobs, Spark SQL etc. Should have worked on both batch and steaming data processing
  • Good experience in any one programming language -Scala/Python , Python preferred.
  • Experience in writing and optimizing complex Hive and SQL queries to process huge data. good with UDFs, tables, joins,Views etc
  • Experience in using Kafka or any other message brokers
  • Configuring, monitoring and scheduling of jobs using Oozie and/or Airflow
  • Processing streaming data directly from Kafka using Spark jobs, expereince in Spark- streaming is must
  • Should be able to handling different file formats (ORC, AVRO and Parquet) and unstructured data
  • Should have experience with any one No SQL databases like Amazon S3 etc
  • Should have worked on any of the Data warehouse tools like AWS Redshift or Snowflake or BigQuery etc
  • Work expereince on any one cloud AWS or GCP or Azure

Good to have skills:

  • Experience in AWS cloud services like EMR, S3, Redshift, EKS/ECS etc
  • Experience in GCP cloud services like Dataproc, Google storage etc
  • Experience in working with huge Big data clusters with millions of records
  • Experience in working with ELK stack, specially Elasticsearch
  • Experience in Hadoop MapReduce, Apache Flink, Kubernetes etc

What we offer

Culture of caring.
At GlobalLogic, we prioritize a culture of caring. Across every region and department, at every level, we consistently put people first. From day one, you'll experience an inclusive culture of acceptance and belonging, where you'll have the chance to build meaningful connections with collaborative teammates, supportive managers, and compassionate leaders.

Learning and development.
We are committed to your continuous learning and development. You'll learn and grow daily in an environment with many opportunities to try new things, sharpen your skills, and advance your career at GlobalLogic. With our Career Navigator tool as just one example, GlobalLogic offers a rich array of programs, training curricula, and hands-on opportunities to grow personally and professionally.

Interesting & meaningful work.
GlobalLogic is known for engineering impact for and with clients around the world. As part of our team, you'll have the chance to work on projects that matter. Each is a unique opportunity to engage your curiosity and creative problem-solving skills as you help clients reimagine what's possible and bring new solutions to market. In the process, you'll have the privilege of working on some of the most cutting-edge and impactful solutions shaping the world today.

Balance and flexibility.
We believe in the importance of balance and flexibility. With many functional career areas, roles, and work arrangements, you can explore ways of achieving the perfect balance between your work and life. Your life extends beyond the office, and we always do our best to help you integrate and balance the best of work and life, having fun along the way

High-trust organization.
We are a high-trust organization where integrity is key. By joining GlobalLogic, you're placing your trust in a safe, reliable, and ethical global company. Integrity and trust are a cornerstone of our value proposition to our employees and clients. You will find truthfulness, candor, and integrity in everything we do.

About GlobalLogic
GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world's largest and most forward-thinking companies. Since 2000, we've been at the forefront of the digital revolution – helping create some of the most innovative and widely used digital products and experiences. Today we continue to collaborate with clients in transforming businesses and redefining industries through intelligent products, platforms, and services.


  • Data Engineer

    2 weeks ago


    Chennai, Tamil Nadu, India iAgami Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    We're Hiring: Data Engineer – Platform & Distributed SystemsAre you a builder at heart who loves working at the intersection of data engineering, cloud infrastructure, and ML platforms?We're looking for a Data Engineer to join our team and help scale our JupyterLab-based Notebook platform, enabling data scientists and engineers across cloud and on-premise...


  • Chennai, Tamil Nadu, India Prodapt Full time

    OverviewJoin the Prodapt team in supporting a unified, scalable, and secure Jupyter-based environment for data science and machine learning. You will help build, maintain, and optimize the platform that empowers analysts, engineers, and scientists to explore data, develop models, and collaborate at scale.ResponsibilitiesOverall experience of 5 years with...


  • Chennai, Tamil Nadu, India Prodapt Full time

    OverviewJoin the Prodapt team in building a unified, cloud-native environment for scalable machine learning training and experimentation. You will help design, develop, and optimize robust workflows that empower data scientists and engineers to efficiently explore, train, and validate ML models at scale.ResponsibilitiesOverall experience of 10+ years with...


  • Chennai, Tamil Nadu, India NielsenIQ Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Company DescriptionCompany DescriptionNIQ is the world's leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most...


  • Chennai, Tamil Nadu, India NielsenIQ Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Company Description Company DescriptionNIQ is the world's leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most...

  • GCP data engineer

    3 days ago


    Chennai, Tamil Nadu, India Prodapt Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    OverviewDesign and implement complex ETL/ELT pipelines using PySpark and Airflow for large-scale data processing on GCP.Lead data migration initiatives, including automating the movement of Teradata tables to BigQuery, ensuring data accuracy and consistency.Develop robust frameworks to streamline batch and streaming data ingestion workflows, leveraging...


  • Chennai, Tamil Nadu, India Tata Consultancy Services (TCS) Full time ₹ 4,00,000 - ₹ 6,00,000 per year

    Need to work as a developer in Cloudera Hadoop.Work on Hadoop, Python, PySpark, Hive SQL's, Bigdata Eco System Tools.Experience in working with teams in a complex organization involving multiple reporting lines.The candidate should have strong functional and technical knowledge to deliver what is required and he/she should be well acquainted with Banking...

  • Senior Data Engineer

    2 weeks ago


    Chennai, Tamil Nadu, India iAgami Full time

    We're Hiring – Senior Data Engineer (GCP) Are you passionate about building data platforms that process billions of transactions and drive real-time decisions? Join our growing team atiAgamiTechnologies, where innovation meets data engineering excellence. Location: Chennai / Trichy (Remote) Role: Senior Data Engineer Experience: 5+ years Type: Full-time...

  • Bigdata Pyspark

    5 days ago


    Chennai, Tamil Nadu, India Virtusa Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    7 to 11 years of Java based application development Hands on Java 8 2 to 4 years of Apache Spark based application development.2 to 4 years as Hadoop developerExperience in building batch and Streaming applications. Experience with Apache Spark batch frameworkKnowledge of Apache Spark structured streaming is a nice to haveKnowledge of advanced Spark usage...

  • sa

    1 week ago


    Chennai, Tamil Nadu, India LatentView Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    About Company :LatentView Analytics is a leading global analytics and decision sciences provider, delivering solutions that help companies drive digital transformation and use data to gain a competitive advantage. With analytics solutions that provide 360-degree view of the digital consumer, fuel machine learning capabilities and support artificial...