Manager - Data Lake and Data Architecture - AWS, Azure, Good to have GCP

4 weeks ago


Gurugram Gurugram India Sirius AI Full time

Job Description Key Responsibilities: - Data Architecture & Management: Design and implement scalable, cloud-agnostic Data Lake, Data LakeHouse, Data Mesh and Data Fabric architectures to efficiently store, process, and manage structured and unstructured data from various sources. - Data Pipeline Development: Design, develop, and maintain robust data pipelines to ingest, process, and transform data from multiple sources into usable formats for analytics and reporting using services like AWS Glue, Azure Data Factory, GCP Dataflow, Apache Spark, or Apache Airflow. - Data Integration and ETL: Develop and optimize Extract, Transform, Load (ETL) and ELT processes to integrate disparate data sources into the data lake, ensuring high data quality, consistency, and reliability across multiple cloud platforms. - Cloud-Agnostic Data Engineering: Develop data solutions that are cloud-agnostic, leveraging open-source technologies like Apache Spark, Delta Lake, Presto, and Kubernetes, ensuring compatibility across AWS, Azure, and GCP. - Big Data Processing & Analytics: Utilize big data technologies such as Apache Spark, Hive, and Presto for distributed computing, enabling large-scale data transformations and analytics. - Data Governance and Security: Implement robust data governance policies, security frameworks, and compliance controls, including role-based access control (RBAC), encryption, and monitoring to meet industry standards (GDPR, HIPAA, PCI-DSS). - DevOps Integration for Data Platforms: Leverage cloud-agnostic DevOps tools and practices for source control, build automation, release management, and Infrastructure as Code (IaC) to streamline the development, deployment, and management of data lake and data architecture solutions across multiple cloud providers. Solutions should support CI/CD pipelines, automated testing, and scalable data workflows. - Continuous Integration and Deployment (CI/CD): Establish automated CI/CD pipelines to streamline deployment, testing, and monitoring of data infrastructure and workflows. - Performance Optimization: Optimize data workflows and query performance using indexing, caching, and partitioning strategies to improve efficiency and cost-effectiveness. - Monitoring and Troubleshooting: Implement observability solutions using tools like Prometheus, Grafana, or cloud-native monitoring services to proactively detect and resolve data pipeline issues. - Collaboration and Documentation: Work with cross-functional teams, including data scientists, analysts, and business stakeholders, to design and implement scalable data solutions. Maintain comprehensive documentation of data architectures, processes, and best practices. Job Qualifications: - Bachelor's degree in Computer Science, Engineering, or a related field. - 6+ years of experience as a Data Engineer, specializing in cloud-agnostic data solutions and data lake architectures. - Strong expertise in cloud data platforms such as AWS, Azure, and Google Cloud, with hands-on experience in services like AWS S3, Azure Data Lake, Google Cloud Storage, and related data processing tools. - Proficiency in big data technologies such as Apache Spark, Hadoop, Kafka, Delta Lake, or Presto. - Experience with SQL and NoSQL databases, including PostgreSQL, MySQL, and DynamoDB. - Expertise in containerization and orchestration platforms such as Docker and Kubernetes. - Experience implementing DevOps and CI/CD practices using Terraform, CloudFormation, or other Infrastructure as Code (IaC) tools. - Knowledge of data visualization tools such as Power BI, Tableau, or Looker for presenting insights and reports. - Strong problem-solving and troubleshooting skills with a proactive approach to identifying and resolving issues. - Experience leading teams of 5+ cloud engineers. - Preferred certifications in AWS, Azure, or Google Cloud data engineering.


  • AWS Data Lake Admin

    1 week ago


    India Tata Consultancy Services Full time

    We await your innovation at TCS: Hiring | AWS Data Lake Admin| Greetings from TCS!! We are Hiring for AWS Data Lake Admin Required Experience: 7-12 years Work location: Bangalore, Hyderabad Job Description: We are seeking a highly skilled AWS Data Lake Administrator to manage and optimize our data lake architecture on AWS. The ideal candidate will have...


  • Gurugram, Gurugram, India Response Informatics Full time

    Job Description Databricks Data Engineer NP: Immediate joiner 30 days joiner Exp-6-9 yrs Location - Gurgaon Mandate- Spark, sql, pyspark, azure (all skills) Key Responsibilities: Design, build, and maintain scalable data pipelines using Databricks / Apache Spark. Develop and optimize data lake/warehouse architectures in Delta Lake. Implement ETL/ELT...


  • Gurugram, India EXL Full time

    Role Summary : We are looking for a skilled Data Engineer with solid experience in the Azure data ecosystem and hands-on expertise in Azure Databricks. The ideal candidate will have worked on designing and developing scalable, cloud-native data pipelines and solutions using Azure data services. You will contribute to building and optimizing data lakehouse...

  • Data Engineer

    6 days ago


    Gurugram, Gurugram, India GMG Full time

    Job Description What we do: GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, everyday goods, health and beauty, properties and logistics sectors. Under the ownership and management of the Baker family for over 45 years, GMG is a valued partner of choice for...

  • Azure Data Engineer

    13 hours ago


    Gurugram, India Epergne Solutions Full time

    Job Title : Azure Data Engineer Location : Remote Experience : 5 +Years Notice Period : 15 Days / Immediate Only Primary Responsibilities: Design, develop, and maintain data storage solutions using Azure SQL Database , Azure Data Lake , and Azure Blob Storage . Build and manage data pipelines for ingestion, processing, and transformation within Azure. Create...

  • Staff Data Engineer

    13 hours ago


    Gurugram, India NPS Prism Full time

    Job Description: Staff Data Engineer – NPS Prism  Location: India (Remote/Hybrid)  Experience: 6–8 Years  Employment Type: Full-time  Role Overview  We are seeking an experienced Staff Data Engineer to join the NPS Prism engineering team — a Bain platform that provides advanced analytics, benchmarking, and insights into customer experience metrics...

  • Data Engineer

    4 weeks ago


    Gurugram, India SkillKart Full time

    Description :Role Overview :We are seeking a skilled Data Engineer with expertise in Microsoft Azure to join our Data & Analytics team. In this role, you will be responsible for designing, building, and optimizing scalable data pipelines and architectures using Azure data services. You will collaborate with cross-functional teams including data scientists,...

  • Data Engineer

    1 week ago


    India Yoda Tech Full time

    Job Description We are seeking a skilled and motivated Data Engineer to join our team and help build scalable, secure, and efficient data pipelines and platforms. The ideal candidate will have 2 to 4 years of hands-on experience with modern data engineering tools and cloud platforms, particularly in Azure and AWS ecosystems. Key Responsibilities Design,...

  • Data Engineer

    4 weeks ago


    Gurugram, India Ixceed Solutions Full time

    Job Description : We are seeking a skilled Data Engineer with strong expertise in Azure data services to design and implement effective data solutions, support data quality, and contribute to cloud migration strategies. This role is key in building scalable data pipelines, maintaining data integrity, and collaborating on data architecture best practices...

  • Data Engineer

    2 weeks ago


    Gurugram, India One Degree North HR Services Full time

    Job Description : Function : Data Science and Analysis ? Data Analysis / Business Intelligence, Software Engineering ? Big Data / DWH / ETLDatabricks Data AnalysisETL+5 more Data Warehousing Spark SQLAzure Data Factory Azure SynapseRequirements : - 5 years of experience in data engineering, with at least 3+ years in Azure Databricks.- Strong proficiency in...