Data Engineer with Bigdata, Spark, Hive and Airflow | Basic knowledge working with Kubernetes IRC273751
2 weeks ago
Description
- Data engineer with 4 to 6 years of hands on experience working on Big Data Platforms
- Experience building and optimizing Big data data pipelines and data sets ranging from Data ingestion to Processing to Data Visualization.
- Good Experience in writing and optimizing Spark Jobs, Spark SQL etc. Should have worked on both batch and steaming data processing
- Good experience in any one programming language -Scala/Python , Python preferred.
- Experience in writing and optimizing complex Hive and SQL queries to process huge data. good with UDFs, tables, joins,Views etc
- Experience in using Kafka or any other message brokers
- Configuring, monitoring and scheduling of jobs using Oozie and/or Airflow
- Processing streaming data directly from Kafka using Spark jobs, expereince in Spark- streaming is must
- Should be able to handling different file formats (ORC, AVRO and Parquet) and unstructured data
- Should have experience with any one No SQL databases like Amazon S3 etc
- Should have worked on any of the Data warehouse tools like AWS Redshift or Snowflake or BigQuery etc
- Work expereince on any one cloud AWS or GCP or Azure
Requirements
- Data engineer with 4 to 6 years of hands on experience working on Big Data Platforms
- Experience building and optimizing Big data data pipelines and data sets ranging from Data ingestion to Processing to Data Visualization.
- Good Experience in writing and optimizing Spark Jobs, Spark SQL etc. Should have worked on both batch and steaming data processing
- Good experience in any one programming language -Scala/Python , Python preferred.
- Experience in writing and optimizing complex Hive and SQL queries to process huge data. good with UDFs, tables, joins,Views etc
- Experience in using Kafka or any other message brokers
- Configuring, monitoring and scheduling of jobs using Oozie and/or Airflow
- Processing streaming data directly from Kafka using Spark jobs, expereince in Spark- streaming is must
- Should be able to handling different file formats (ORC, AVRO and Parquet) and unstructured data
- Should have experience with any one No SQL databases like Amazon S3 etc
- Should have worked on any of the Data warehouse tools like AWS Redshift or Snowflake or BigQuery etc
- Work expereince on any one cloud AWS or GCP or Azure
Good to have skills:
- Experience in AWS cloud services like EMR, S3, Redshift, EKS/ECS etc
- Experience in GCP cloud services like Dataproc, Google storage etc
- Experience in working with huge Big data clusters with millions of records
- Experience in working with ELK stack, specially Elasticsearch
- Experience in Hadoop MapReduce, Apache Flink, Kubernetes etc
Job responsibilities
- Data engineer with 4 to 6 years of hands on experience working on Big Data Platforms
- Experience building and optimizing Big data data pipelines and data sets ranging from Data ingestion to Processing to Data Visualization.
- Good Experience in writing and optimizing Spark Jobs, Spark SQL etc. Should have worked on both batch and steaming data processing
- Good experience in any one programming language -Scala/Python , Python preferred.
- Experience in writing and optimizing complex Hive and SQL queries to process huge data. good with UDFs, tables, joins,Views etc
- Experience in using Kafka or any other message brokers
- Configuring, monitoring and scheduling of jobs using Oozie and/or Airflow
- Processing streaming data directly from Kafka using Spark jobs, expereince in Spark- streaming is must
- Should be able to handling different file formats (ORC, AVRO and Parquet) and unstructured data
- Should have experience with any one No SQL databases like Amazon S3 etc
- Should have worked on any of the Data warehouse tools like AWS Redshift or Snowflake or BigQuery etc
- Work expereince on any one cloud AWS or GCP or Azure
Good to have skills:
- Experience in AWS cloud services like EMR, S3, Redshift, EKS/ECS etc
- Experience in GCP cloud services like Dataproc, Google storage etc
- Experience in working with huge Big data clusters with millions of records
- Experience in working with ELK stack, specially Elasticsearch
- Experience in Hadoop MapReduce, Apache Flink, Kubernetes etc
What we offer
Culture of caring.
At GlobalLogic, we prioritize a culture of caring. Across every region and department, at every level, we consistently put people first. From day one, you'll experience an inclusive culture of acceptance and belonging, where you'll have the chance to build meaningful connections with collaborative teammates, supportive managers, and compassionate leaders.
Learning and development.
We are committed to your continuous learning and development. You'll learn and grow daily in an environment with many opportunities to try new things, sharpen your skills, and advance your career at GlobalLogic. With our Career Navigator tool as just one example, GlobalLogic offers a rich array of programs, training curricula, and hands-on opportunities to grow personally and professionally.
Interesting & meaningful work.
GlobalLogic is known for engineering impact for and with clients around the world. As part of our team, you'll have the chance to work on projects that matter. Each is a unique opportunity to engage your curiosity and creative problem-solving skills as you help clients reimagine what's possible and bring new solutions to market. In the process, you'll have the privilege of working on some of the most cutting-edge and impactful solutions shaping the world today.
Balance and flexibility.
We believe in the importance of balance and flexibility. With many functional career areas, roles, and work arrangements, you can explore ways of achieving the perfect balance between your work and life. Your life extends beyond the office, and we always do our best to help you integrate and balance the best of work and life, having fun along the way
High-trust organization.
We are a high-trust organization where integrity is key. By joining GlobalLogic, you're placing your trust in a safe, reliable, and ethical global company. Integrity and trust are a cornerstone of our value proposition to our employees and clients. You will find truthfulness, candor, and integrity in everything we do.
About GlobalLogic
GlobalLogic, a Hitachi Group Company, is a trusted digital engineering partner to the world's largest and most forward-thinking companies. Since 2000, we've been at the forefront of the digital revolution – helping create some of the most innovative and widely used digital products and experiences. Today we continue to collaborate with clients in transforming businesses and redefining industries through intelligent products, platforms, and services.
-
Data Engineer
2 weeks ago
Chennai, Tamil Nadu, India iAgami Full time ₹ 20,00,000 - ₹ 25,00,000 per yearWe're Hiring: Data Engineer – Platform & Distributed SystemsAre you a builder at heart who loves working at the intersection of data engineering, cloud infrastructure, and ML platforms?We're looking for a Data Engineer to join our team and help scale our JupyterLab-based Notebook platform, enabling data scientists and engineers across cloud and on-premise...
-
Senior Software Engineer
2 weeks ago
Chennai, Tamil Nadu, India Prodapt Full timeOverviewJoin the Prodapt team in supporting a unified, scalable, and secure Jupyter-based environment for data science and machine learning. You will help build, maintain, and optimize the platform that empowers analysts, engineers, and scientists to explore data, develop models, and collaborate at scale.ResponsibilitiesOverall experience of 5 years with...
-
Senior Software Engineer
2 weeks ago
Chennai, Tamil Nadu, India Prodapt Full timeOverviewJoin the Prodapt team in building a unified, cloud-native environment for scalable machine learning training and experimentation. You will help design, develop, and optimize robust workflows that empower data scientists and engineers to efficiently explore, train, and validate ML models at scale.ResponsibilitiesOverall experience of 10+ years with...
-
Senior Data Engineer
3 days ago
Chennai, Tamil Nadu, India NielsenIQ Full time ₹ 10,00,000 - ₹ 25,00,000 per yearCompany DescriptionCompany DescriptionNIQ is the world's leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most...
-
Senior Data Engineer
4 days ago
Chennai, Tamil Nadu, India NielsenIQ Full time ₹ 10,00,000 - ₹ 25,00,000 per yearCompany Description Company DescriptionNIQ is the world's leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior and revealing new pathways to growth. In 2023, NIQ combined with GfK, bringing together the two industry leaders with unparalleled global reach. With a holistic retail read and the most...
-
GCP data engineer
3 days ago
Chennai, Tamil Nadu, India Prodapt Full time ₹ 6,00,000 - ₹ 18,00,000 per yearOverviewDesign and implement complex ETL/ELT pipelines using PySpark and Airflow for large-scale data processing on GCP.Lead data migration initiatives, including automating the movement of Teradata tables to BigQuery, ensuring data accuracy and consistency.Develop robust frameworks to streamline batch and streaming data ingestion workflows, leveraging...
-
Big Data Hadoop Developer
1 week ago
Chennai, Tamil Nadu, India Tata Consultancy Services (TCS) Full time ₹ 4,00,000 - ₹ 6,00,000 per yearNeed to work as a developer in Cloudera Hadoop.Work on Hadoop, Python, PySpark, Hive SQL's, Bigdata Eco System Tools.Experience in working with teams in a complex organization involving multiple reporting lines.The candidate should have strong functional and technical knowledge to deliver what is required and he/she should be well acquainted with Banking...
-
Senior Data Engineer
2 weeks ago
Chennai, Tamil Nadu, India iAgami Full timeWe're Hiring – Senior Data Engineer (GCP) Are you passionate about building data platforms that process billions of transactions and drive real-time decisions? Join our growing team atiAgamiTechnologies, where innovation meets data engineering excellence. Location: Chennai / Trichy (Remote) Role: Senior Data Engineer Experience: 5+ years Type: Full-time...
-
Bigdata Pyspark
5 days ago
Chennai, Tamil Nadu, India Virtusa Full time ₹ 6,00,000 - ₹ 18,00,000 per year7 to 11 years of Java based application development Hands on Java 8 2 to 4 years of Apache Spark based application development.2 to 4 years as Hadoop developerExperience in building batch and Streaming applications. Experience with Apache Spark batch frameworkKnowledge of Apache Spark structured streaming is a nice to haveKnowledge of advanced Spark usage...
-
sa
1 week ago
Chennai, Tamil Nadu, India LatentView Full time ₹ 5,00,000 - ₹ 15,00,000 per yearAbout Company :LatentView Analytics is a leading global analytics and decision sciences provider, delivering solutions that help companies drive digital transformation and use data to gain a competitive advantage. With analytics solutions that provide 360-degree view of the digital consumer, fuel machine learning capabilities and support artificial...