Data Engineer
3 days ago
What we do: GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, food and health sectors. Its vision is to inspire people to win in ways that make the world better. Today, GMG’s investments span four key verticals: GMG Sports, GMG Food, GMG Health, and GMG Consumer Goods. Under the ownership and management of the Baker family, it has become a leading global company, affiliated with the world’s most successful and respected brands in the well-being sector. Working across the Middle East, North Africa, and Asia, GMG has introduced more than 120 brands into its markets. What will you do: We are seeking a highly skilled Data Engineer specializing in AWS and Databricks. The ideal candidate will design, build, and maintain scalable data pipelines, ensuring efficient data ingestion, processing, and integration from multiple sources. This role requires expertise in AWS services, PySpark, SQL, and Databricks, along with strong optimization, security, and cost management skills. Roles and Responsibilities: Data Engineering & Pipeline Development • Develop and manage ETL pipelines for structured, semi-structured, and unstructured data using AWS Glue, PySpark, and SQL. • Handle real-time event stream data ingestion and processing from multiple source systems. • Ensure efficient data integration into Databricks for advanced processing and analytics. Cloud & Infrastructure Management • Build and optimize backend systems leveraging AWS services (Glue, Athena, Lambda, SNS, S3). • Implement, configure, and manage Databricks environments, including clusters, notebooks, and libraries for performance optimization. • Ensure optimal resource utilization for AWS and Databricks clusters to improve efficiency and reduce costs. • Integrate Databricks with various cloud services while following governance and security best practices. Testing & CI/CD Best Practices • Write unit test cases and integration tests to ensure data pipeline reliability. • Establish best practices for Databricks CI/CD and implement automation for deployment. Optimization & Security • Apply performance tuning techniques to optimize queries, storage, and processing times. • Ensure compliance with security, governance, and industry best practices across AWS and Databricks environments. • Monitor system performance and proactively address issues to maintain high availability and reliability. Functional/Technical Competencies: Knowledge of Glue, PySpark, SQL, Athena, Lambda, SNS, S3 Knowledge of Databricks : Cluster setup, Notebooks, Libraries, CI/CD, Optimization Data Processing: Event stream ingestion and batch processing Testing: Writing unit test cases and integration tests Security & Governance: AWS/Databricks governance standards and best practices Performance Optimization: Query tuning, cluster performance improvements, cost reduction Strong problem-solving and analytical skills Ability to work in a fast-paced, cloud-based data environment Excellent collaboration and communication skills Strong attention to detail and commitment to best practices Educational Qualification: Bachelor's in computer science or computer engineering Certification in Data Engineering and Analytics Experience: Minimum 6 years' experience in Data engineering (Core development/design), in which 3+ years in AWS with command on (AWS glue, pyspark, SQL, Athena, lambda, SNS, S3) and 1+ year in Databricks.
-
Senior Data Engineer
17 hours ago
haryana, India Eucloid Data Solutions Full timeAbout EucloidAt Eucloid, innovation meets impact. As a leader in AI and Data Science, we create solutions that redefine industries—from Hi-tech and D2C to Healthcare and SaaS. With partnerships with giants like Databricks, Google Cloud, and Adobe, we’re pushing boundaries and building next-gen technology.Join our talented team of engineers, scientists,...
-
Senior Data Engineer
3 days ago
haryana, India Pacific Data Integrators Full timeRole: Senior Data EngineerLocation: RemoteJob Type: Full-timeShift time: Open to work in EST shift (5PM to 2AM IST) Key ResponsibilitiesLead the design, development, and implementation of complex data integration solutions using Informatica Intelligent Data Management Cloud (IDMC).Develop, document, unit test, and maintain high-quality ETL applications that...
-
Principal Data Consultant
2 weeks ago
haryana, India Eucloid Data Solutions Full timeJob description:The candidate will advise clients on multiple business problems and help them achieve desirable business outcomes through projects. The candidate is expected to be a highly motivated individual with an ability to provide strategic & operational leadership for a high-performing, diverse team of Data Analysts, Data Scientists & Data...
-
Senior Data Engineer
2 weeks ago
Gurugram, Haryana, India, IN Pacific Data Integrators Full timeRole: Senior Data EngineerLocation: RemoteJob Type: Full-timeShift time: Open to work in EST shift (5PM to 2AM IST) Key ResponsibilitiesLead the design, development, and implementation of complex data integration solutions using Informatica Intelligent Data Management Cloud (IDMC).Develop, document, unit test, and maintain high-quality ETL applications that...
-
Data Engineer
2 weeks ago
haryana, India CodeVyasa Full timeWe are looking for a skilled Data Engineer l Gurgaon ll 3+ yrs of experience to join our engineering team.About UsCodeVyasa is a mid-sized product engineering company that works with top-tier product and solutions organizations such as McKinsey, Walmart, RazorPay, Swiggy, and others. We are a team of 550+ engineers, driving innovation across Product & Data...
-
Data Engineer
2 weeks ago
Haryana, India Movate Full timePosition: Permanent Role: Data Engineer Experience: 5 - 6 Years Work Location: Gurugram Shift Timing – 3PM – 12AM (Work from office) Notice Period: Immediate to 15 days Must-Have Skills - Strong proficiency in SQL and Python for data transformation and automation. - Hands-on experience building and supporting ETL/data pipelines in AWS environments. -...
-
Data Engineer
2 weeks ago
haryana, India Movate Full timePosition: Permanent Role: Data Engineer Experience: 5 - 6 Years Work Location: Gurugram Shift Timing – 3PM – 12AM (Work from office) Notice Period: Immediate to 15 days Must-Have Skills • Strong proficiency in SQL and Python for data transformation and automation. • Hands-on experience building and supporting ETL/data pipelines in AWS environments....
-
Data Engineer
2 weeks ago
Haryana, India GMG Full timeWhat we do: GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, food and health sectors. Its vision is to inspire people to win in ways that make the world better. Today, GMG’s investments span four key verticals: GMG Sports, GMG Food, GMG Health, and GMG...
-
Data Engineer
1 day ago
haryana, India IGT Solutions Full timeJob Title: Senior Data Engineer Experience: 6 to 9 Years Location: Remote Employment Type: Full-time Primary Skills: Azure Databricks- Unity catalog in Databrick, workflow, job scheduling, Alert, Pyspark, SQL, ETL. Job Summary: We are seeking a highly skilled Senior Data Engineer with hands-on experience in Databricks, PySpark, ETL development, and SQL . The...
-
Data Engineer
3 days ago
haryana, India GMG Full timeWhat we do:GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, food and health sectors. Its vision is to inspire people to win in ways that make the world better. Today, GMG’s investments span four key verticals: GMG Sports, GMG Food, GMG Health, and GMG...