Manager - Data Lake and Data Architecture - AWS, Azure, Good to have GCP

1 day ago


Gurugram Gurugram India Sirius AI Full time

Job Description Key Responsibilities: - Data Architecture & Management: Design and implement scalable, cloud-agnostic Data Lake, Data LakeHouse, Data Mesh and Data Fabric architectures to efficiently store, process, and manage structured and unstructured data from various sources. - Data Pipeline Development: Design, develop, and maintain robust data pipelines to ingest, process, and transform data from multiple sources into usable formats for analytics and reporting using services like AWS Glue, Azure Data Factory, GCP Dataflow, Apache Spark, or Apache Airflow. - Data Integration and ETL: Develop and optimize Extract, Transform, Load (ETL) and ELT processes to integrate disparate data sources into the data lake, ensuring high data quality, consistency, and reliability across multiple cloud platforms. - Cloud-Agnostic Data Engineering: Develop data solutions that are cloud-agnostic, leveraging open-source technologies like Apache Spark, Delta Lake, Presto, and Kubernetes, ensuring compatibility across AWS, Azure, and GCP. - Big Data Processing & Analytics: Utilize big data technologies such as Apache Spark, Hive, and Presto for distributed computing, enabling large-scale data transformations and analytics. - Data Governance and Security: Implement robust data governance policies, security frameworks, and compliance controls, including role-based access control (RBAC), encryption, and monitoring to meet industry standards (GDPR, HIPAA, PCI-DSS). - DevOps Integration for Data Platforms: Leverage cloud-agnostic DevOps tools and practices for source control, build automation, release management, and Infrastructure as Code (IaC) to streamline the development, deployment, and management of data lake and data architecture solutions across multiple cloud providers. Solutions should support CI/CD pipelines, automated testing, and scalable data workflows. - Continuous Integration and Deployment (CI/CD): Establish automated CI/CD pipelines to streamline deployment, testing, and monitoring of data infrastructure and workflows. - Performance Optimization: Optimize data workflows and query performance using indexing, caching, and partitioning strategies to improve efficiency and cost-effectiveness. - Monitoring and Troubleshooting: Implement observability solutions using tools like Prometheus, Grafana, or cloud-native monitoring services to proactively detect and resolve data pipeline issues. - Collaboration and Documentation: Work with cross-functional teams, including data scientists, analysts, and business stakeholders, to design and implement scalable data solutions. Maintain comprehensive documentation of data architectures, processes, and best practices. Job Qualifications: - Bachelor's degree in Computer Science, Engineering, or a related field. - 6+ years of experience as a Data Engineer, specializing in cloud-agnostic data solutions and data lake architectures. - Strong expertise in cloud data platforms such as AWS, Azure, and Google Cloud, with hands-on experience in services like AWS S3, Azure Data Lake, Google Cloud Storage, and related data processing tools. - Proficiency in big data technologies such as Apache Spark, Hadoop, Kafka, Delta Lake, or Presto. - Experience with SQL and NoSQL databases, including PostgreSQL, MySQL, and DynamoDB. - Expertise in containerization and orchestration platforms such as Docker and Kubernetes. - Experience implementing DevOps and CI/CD practices using Terraform, CloudFormation, or other Infrastructure as Code (IaC) tools. - Knowledge of data visualization tools such as Power BI, Tableau, or Looker for presenting insights and reports. - Strong problem-solving and troubleshooting skills with a proactive approach to identifying and resolving issues. - Experience leading teams of 5+ cloud engineers. - Preferred certifications in AWS, Azure, or Google Cloud data engineering.



  • India Sirius AI Full time

    Key Responsibilities: Data Architecture & Management: Design and implement scalable, cloud-agnostic Data Lake, Data LakeHouse, Data Mesh and Data Fabric architectures to efficiently store, process, and manage structured and unstructured data from various sources. Data Pipeline Development : Design, develop, and maintain robust data pipelines to...

  • Azure Data Engineer

    3 weeks ago


    Hyderabad, India Fragma Data Systems Full time

    Job Description Must-Have Skills - Good experience in Pyspark - Including Dataframe core functions and Spark SQL - Good experience in SQL DBs - Be able to write queries including fair complexity. - Should have excellent experience in Big Data programming for data transformation and aggregations - Good at ELT architecture. Business rules processing and data...


  • Bengaluru, Karnataka, India, Karnataka Tata Consultancy Services Full time

    AWS Data Lake Administrator LOCATION – BLR / HYD Job Description: We are seeking a highly skilled AWS Data Lake Administrator to manage and optimize our data lake architecture on AWS. The ideal candidate will have strong experience with AWS data services such as S3, Lake Formation, Glue Catalog, Redshift Spectrum, and extensive SQL knowledge. The role...

  • Azure Data Engineer

    2 weeks ago


    Pune, India Fragma Data Systems Full time

    Job Description Technology Skills - Building and operationalizing large scale enterprise data solutions and applications using one or more of AZURE data and analytics services in combination with custom solutions - Azure Synapse/Azure SQL DWH, Azure Data Lake, Azure Blob Storage, Spark, HDInsights, Databricks, CosmosDB, EventHub/IOTHub. - Experience in...


  • Bengaluru, Karnataka, India, Karnataka Tata Consultancy Services Full time

    TCS Hiring for AWS Data Lake AdministratorRole - AWS Data Lake AdministratorExperience - 10 to 14 YearsLocation - Bengaluru, HyderabadRoles & ResponsibilitiesKey Responsibilities:Administer and manage the AWS Data Lake infrastructure, ensuring high availability, security, and performance.Configure and manage AWS S3, Lake Formation, and Glue Data Catalog to...

  • Azure Data Engineer

    7 days ago


    Gurugram, India Worksconsultancy Full time

    We are looking for a Senior Data Engineer with strong expertise in SQL, Python, Azure Synapse, Azure Data Factory, Snowflake, and Databricks. The ideal candidate should have a solid understanding of SQL (DDL, DML, query optimization) and ETL pipelines while demonstrating a learning mindset to adapt to evolving technologies.Key Responsibilities :- Collaborate...

  • Data MDM Architect

    2 weeks ago


    Hyderabad, Telangana, India, Telangana Tata Consultancy Services Full time

    Greetings from TCS!!!Job Title: Azure/AWS Data MDM ArchitectLocation - PAN IndiaYears of Experience - 10-16 yearsRole Overview We are seeking an accomplished Data MDM Architect with deep expertise in designing and implementing cloud-based Master Data Management (MDM) solutions across Azure and AWS ecosystems (experience with GCP is a plus). The ideal...

  • Data engineer

    2 weeks ago


    India Data-Hat AI Full time

    Department: Data Engineering & AI Solutions  Reports To: Lead Data Solutions Architect  Travel: International travel required (up to 30–40%)   Position Summary:   We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructure that underpins mission-critical AI systems. With 12+...

  • Data Management

    1 week ago


    India Zensar Full time

    Bachelor s or master s degree in computer science Information Systems or related field 12 years of experience in data architecture and or data management Expertise in data governance metadata management and data quality frameworks Expertise in data governance privacy and data lifecycle management policies Solid understanding of conceptual logical and...

  • Data Engineer

    3 days ago


    Gurugram, India SkillKart Full time

    Description :Role Overview :We are seeking a skilled Data Engineer with expertise in Microsoft Azure to join our Data & Analytics team. In this role, you will be responsible for designing, building, and optimizing scalable data pipelines and architectures using Azure data services. You will collaborate with cross-functional teams including data scientists,...