
Data Engineering Lead
4 days ago
Immediate Joiners are preferrable Position Overview:
We seek a highly skilled and experienced Data Engineering Lead to join our team. This role demands deep technical expertise in Apache Spark, Hive, Trino (formerly Presto), Python, AWS Glue, and the broader AWS ecosystem. The ideal candidate will possess strong hands-on skills and the ability to design and implement scalable data solutions, optimise performance, and lead a high-performing team to deliver data-driven insights.
Key Responsibilities:
Technical Leadership
Lead and mentor a team of data engineers, fostering best practices in coding, design, and delivery.
Drive the adoption of modern data engineering frameworks, tools, and methodologies to ensure high-quality and scalable solutions.
Translate complex business requirements into effective data pipelines, architectures, and workflows.
Data Pipeline Development
Architect, develop, and optimize scalable ETL/ELT pipelines using Apache Spark, Hive, AWS Glue, and Trino.
Handle complex data workflows across structured and unstructured data sources, ensuring performance and cost-efficiency.
Develop real-time and batch processing systems to support business intelligence, analytics, and machine learning applications.
Cloud & Infrastructure Management
Build and maintain cloud-based data solutions using AWS services like S3, Athena, Redshift, EMR, DynamoDB, and Lambda.
Design and implement federated query capabilities using Trino for diverse data sources.
Manage Hive Metastore for schema and metadata management in data lakes.
Performance Optimization
Optimize Apache Spark jobs and Hive queries for performance, ensuring efficient resource utilization and minimal latency.
Implement caching and indexing strategies to accelerate query performance in Trino.
Continuously monitor and improve system performance through diagnostics and tuning.
Collaboration & Stakeholder Engagement
Work closely with data scientists, analysts, and business teams to understand requirements and deliver actionable insights.
Ensure that data infrastructure aligns with organizational goals and compliance standards.
Data Governance & Quality
Establish and enforce data quality standards, governance practices, and monitoring processes.
Ensure data security, privacy, and compliance with regulatory frameworks.
Innovation & Continuous Learning
Stay ahead of industry trends, emerging technologies, and best practices in data engineering.
Proactively identify and implement improvements in data architecture and processes.
Qualifications:
Required Technical Expertise
Advanced proficiency with Apache Spark (core, SQL, streaming) for large-scale data processing.
Strong expertise in Hive for querying and managing structured data in data lakes.
In-depth knowledge of Trino (Presto) for federated querying and high-performance SQL execution.
Solid programming skills in Python with frameworks like PySpark and Pandas.
Hands-on experience with AWS Glue, including Glue ETL jobs, Glue Data Catalog, and Glue Crawlers.
Deep understanding of data formats such as Parquet, ORC, Avro, and their use cases.
Cloud Proficiency
Expertise in AWS services, including S3, Redshift, Athena, EMR, DynamoDB, and IAM.
Experience designing scalable and cost-efficient cloud-based data solutions.
Performance Tuning
Strong ability to optimize Apache Spark jobs, Hive queries, and Trino workloads for distributed environments.
Experience with advanced techniques like partitioning, bucketing, and query plan optimization.
Leadership & Collaboration
Proven experience leading and mentoring data engineering teams.
Strong communication skills, with the ability to interact with technical and non-technical stakeholders effectively.
Education & Experience
Bachelors or Masters degree in Computer Science, Data Engineering, or a related field.
8+ years of experience in data engineering with a minimum of 2 years in a leadership role.
Qualifications:
8+ years of experience in building data pipelines from scratch in large data volume environments
AWS certifications, such as AWS Certified Data Analytics or AWS Certified Solutions Architect.
Experience with Kafka or Kinesis for real-time data streaming would be a plus.
Familiarity with containerization tools like Docker and orchestration platforms like Kubernetes.
Knowledge of CI/CD pipelines and DevOps practices for data engineering.
Prior experience with data lake architectures and integrating ML workflows.
-
Data Engineer
6 days ago
Bengaluru, Karnataka, India NTT DATA Full time ₹ 15,00,000 - ₹ 25,00,000 per yearDesign and implement tailored data solutions to meet customer needs and use cases, spanning from streaming to data lakes, analytics, and beyond within a dynamically evolving technical stack. Provide thought leadership by recommending the most appropriate technologies and solutions for a given use case, covering the entire spectrum from the application layer...
-
Lead Data Scientist
4 weeks ago
Bengaluru, Karnataka, India NTT DATA Full timeNTT DATA strives to hire exceptional innovative and passionate individuals who want to grow with us If you want to be part of an inclusive adaptable and forward-thinking organization apply now We are currently seeking a Lead Data Scientist - Computer Vision Generative AI to join our team in Bangalore Karn taka IN-KA India IN We are seeking a...
-
Lead Data Scientist
1 week ago
Bengaluru, Karnataka, India NTT DATA Full time ₹ 12,00,000 - ₹ 36,00,000 per yearNTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Lead Data Scientist - Computer Vision & Generative AI to join our team in Bangalore, Karnātaka (IN-KA), India (IN). We are seeking...
-
Data Engineer
4 days ago
Bengaluru, Karnataka, India Capgemini Engineering Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAt Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world's most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and...
-
Lead Data Scientist
5 days ago
Bengaluru, Karnataka, India NTT DATA, Inc. Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWe are currently seeking a Lead Data Scientist - Computer Vision & Generative AI to join our team in Bangalore, Karntaka (IN-KA), India (IN).We are seeking a highly skilled and experienced Lead Data Scientist with 7+ years of expertise in Machine Learning, Deep Learning (especially Computer Vision), and Generative AI. The ideal candidate will lead the design...
-
Lead Data Scientist
2 weeks ago
Bengaluru, Karnataka, India Enable Data Incorporated Full time ₹ 20,00,000 - ₹ 25,00,000 per yearEnable Data Incorporated is looking for a talented and experienced Lead Data Scientist to join our innovative team. In this role, you will be responsible for driving data science initiatives, developing predictive models, and providing insights that guide key business decisions. The ideal candidate has a strong background in statistical analysis, machine...
-
Digital Engineering Lead Engineer
4 weeks ago
Bengaluru, Karnataka, India NTT DATA Full timeReq ID 299336NTT DATA strives to hire exceptional innovative and passionate individuals who want to grow with us If you want to be part of an inclusive adaptable and forward-thinking organization apply now We are currently seeking a Digital Engineering Lead Engineer to join our team in Bangalore Karn xc4 x81taka IN-KA India IN How We Will Help...
-
Data Engineer
5 days ago
Bengaluru, Karnataka, India NTT DATA, Inc. Full time ₹ 15,00,000 - ₹ 25,00,000 per yearKey Responsibilities:Design and implement tailored data solutions to meet customer needs and use cases, spanning from streaming to data lakes, analytics, and beyond within a dynamically evolving technical stack.Provide thought leadership by recommending the most appropriate technologies and solutions for a given use case, covering the entire spectrum from...
-
Data Engineer
6 days ago
Bengaluru, Karnataka, India NTT DATA, Inc. Full time ₹ 1,04,000 - ₹ 1,30,878 per yearReq ID:321800We are currently seeking a Data Engineer (Talend &Pyspark) to join our team in Bangalore, Karntaka (IN-KA), India (IN)."Job Duties: Key Responsibilities: Design and implement tailored data solutions to meet customer needs and use cases, spanning from streaming to data lakes, analytics, and beyond within a dynamically evolving technical...
-
Data Engineer
1 week ago
Bengaluru, Karnataka, India NTT DATA Full time ₹ 15,00,000 - ₹ 25,00,000 per yearMigrate ETL workflows from SAP BODS to AWS Glue/dbt/Talend. Develop and maintain scalable ETL pipelines in AWS. Write PySpark scripts for large-scale data processing. Optimize SQL queries and transformations for AWS PostgreSQL. Work with Cloud Engineers to ensure smooth deployment and performance tuning. Integrate data pipelines with existing Unix systems....