Manager - Data Lake and Data Architecture - Primary AWS, Secondary Azure, Good to have GCP

7 days ago


Gurgaon, Haryana, India Sirius AI Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Key Responsibilities:

  • Data Architecture & Management:
    Design and implement scalable, cloud-agnostic Data Lake, Data LakeHouse, Data Mesh and Data Fabric architectures to efficiently store, process, and manage structured and unstructured data from various sources.
  • Data Pipeline Development
    : Design, develop, and maintain robust data pipelines to ingest, process, and transform data from multiple sources into usable formats for analytics and reporting using services like AWS Glue, Azure Data Factory, GCP Dataflow, Apache Spark, or Apache Airflow.
  • Data Integration and ETL:
    Develop and optimize Extract, Transform, Load (ETL) and ELT processes to integrate disparate data sources into the data lake, ensuring high data quality, consistency, and reliability across multiple cloud platforms.
  • Cloud-Agnostic Data Engineering:
    Develop data solutions that are cloud-agnostic, leveraging open-source technologies like Apache Spark, Delta Lake, Presto, and Kubernetes, ensuring compatibility across AWS, Azure, and GCP.
  • Big Data Processing & Analytics:
    Utilize big data technologies such as Apache Spark, Hive, and Presto for distributed computing, enabling large-scale data transformations and analytics.
  • Data Governance and Security
    : Implement robust data governance policies, security frameworks, and compliance controls, including role-based access control (RBAC), encryption, and monitoring to meet industry standards (GDPR, HIPAA, PCI-DSS).
  • DevOps Integration for Data Platforms:
    Leverage cloud-agnostic DevOps tools and practices for source control, build automation, release management, and Infrastructure as Code (IaC) to streamline the development, deployment, and management of data lake and data architecture solutions across multiple cloud providers. Solutions should support CI/CD pipelines, automated testing, and scalable data workflows.
  • Continuous Integration and Deployment (CI/CD
    ): Establish automated CI/CD pipelines to streamline deployment, testing, and monitoring of data infrastructure and workflows.
  • Performance Optimization
    : Optimize data workflows and query performance using indexing, caching, and partitioning strategies to improve efficiency and cost-effectiveness.
  • Monitoring and Troubleshooting
    : Implement observability solutions using tools like Prometheus, Grafana, or cloud-native monitoring services to proactively detect and resolve data pipeline issues.
  • Collaboration and Documentation:
    Work with cross-functional teams, including data scientists, analysts, and business stakeholders, to design and implement scalable data solutions. Maintain comprehensive documentation of data architectures, processes, and best practices.

Job Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 6+ years of experience as a Data Engineer, specializing in cloud-agnostic data solutions and data lake architectures.
  • Strong expertise in cloud data platforms such as AWS, Azure, and Google Cloud, with hands-on experience in services like AWS S3, Azure Data Lake, Google Cloud Storage, and related data processing tools.
  • Proficiency in big data technologies such as Apache Spark, Hadoop, Kafka, Delta Lake, or Presto.
  • Experience with SQL and NoSQL databases, including PostgreSQL, MySQL, and DynamoDB.
  • Expertise in containerization and orchestration platforms such as Docker and Kubernetes.
  • Experience implementing DevOps and CI/CD practices using Terraform, CloudFormation, or other Infrastructure as Code (IaC) tools.
  • Knowledge of data visualization tools such as Power BI, Tableau, or Looker for presenting insights and reports.
  • Strong problem-solving and troubleshooting skills with a proactive approach to identifying and resolving issues.
  • Experience leading teams of 5+ cloud engineers.
  • Preferred certifications in AWS, Azure, or Google Cloud data engineering.

  • Data Engineer

    2 days ago


    Gurgaon, Haryana, India Sirius AI Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Data Engineer Job DescriptionAbout Sirius AISirius AI is a US headquartered AI Consulting services and products company with operations in India. Sirius AI focuses on Financial Services enterprises and solutions / services delivered across multiple geographies. We are an innovation-driven AI and data driven consulting services firm with a high focus on...

  • Data Engineer

    1 week ago


    Gurgaon, Haryana, India CoPoint Data Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Senior Data Engineer Location: India (Gurugram)About CoPointAIAI isn't coming — it's here. And we help enterprises make it real. At CoPointAI, we work inside the enterprise — not just around it — to turn AI potential into practical wins. From hands-on C-suite workshops (our AI Foundations series) to AI-native MVPs built in weeks, our team partners...

  • Staff Data Engineer

    2 weeks ago


    Gurgaon, Haryana, India Bain Full time ₹ 15,00,000 - ₹ 30,00,000 per year

    Key ResponsibilitiesData Architecture & Engineering LeadershipDesign and own scalable data architectures for ingestion, transformation, and analytics on Databricks.Build robust ETL/ELT pipelines using PySpark, SQL, and Databricks Workflows.Lead performance tuning, partitioning, and data optimization across large distributed systems.Mentor junior data...

  • Azure Data Engineer

    2 days ago


    Gurgaon, Haryana, India Infogain Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    ROLES & RESPONSIBILITIESKey ResponsibilitiesAnalyze existing Hadoop, Pig, and Spark scripts from Dataproc and refactor them into Databricks-native PySpark.Implement data ingestion and transformation pipelines using Delta Lake best practices.Apply conversion rules and templates for automated code migration and testing.Conduct data validation between legacy...

  • Azure Data Engineer

    1 week ago


    Gurgaon, Haryana, India Fiftyfive technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    About the Role:We are looking for an experienced Senior Data Engineer with strong hands-on expertise in Azure Data Services, particularly Azure Databricks. The ideal candidate will be responsible for building robust, scalable, and high-performance data solutions to support business analytics and insights. Databricks certification is highly preferred.Key...


  • Gurgaon, Haryana, India Xander Talent Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Experience: 5-8 years Location: Bangalore, Chennai, Delhi, Pune, Kolkata Primary Roles and Responsibilities: ● Developing Modern Data Warehouse solutions using Databricks and AWS/ Azure Stack ● Ability to provide solutions that are forward-thinking in data engineering and analytics space ● Collaborate with DW/BI leads to understand new ETL pipeline...


  • Gurgaon, Haryana, India Srijan Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Senior/Lead Data Engineer - DatabricksWe are seeking a highly skilled and experienced Data Engineering Lead with a strong background in the retail domain and exceptional programming abilities. As a Lead, you will play a pivotal role in implementing, and optimizing data architecture to support our retail business operations and analytics initiatives. Your...

  • Data Engineer

    2 weeks ago


    Gurgaon, Haryana, India GMG Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    What we do:GMG is a global well-being company retailing, distributing and manufacturing a portfolio of leading international and home-grown brands across sport, everyday goods, health and beauty, properties and logistics sectors. Under the ownership and management of the Baker family for over 45 years, GMG is a valued partner of choice for the world's most...


  • Gurgaon, Haryana, India Tata Consultancy Servicess Full time ₹ 1,00,00,000 - ₹ 3,00,00,000 per year

    Hello,Good dayJD Orientation: -Mandatory Skill Set:Extensive expertise in designing and implementing data load processes using Azure Data Factory , Azure Databricks , Delta Lake, Azure Delta Lake Storage and Python / PySpark .Proficient with Databricks & Python .Senior developers with Full Database / Datawarehouse / DataMart development...


  • Gurgaon, Haryana, India SRIJAN TECHNOLOGIES Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Job Description See all the jobs at Srijan Technologies PVT LTD here:Senior/Lead Data Engineer - Databricks Location: GurgaonTeam: Technology TeamJob Type: Full-time | Partially remoteApply by: No close date We are seeking a highly skilled and experienced Data Engineering Lead with a strong background in the retail domain and exceptional programming...