
Manager - Data Lake and Data Architecture - AWS, Azure, Good to have GCP
1 day ago
Job Description Key Responsibilities: - Data Architecture & Management: Design and implement scalable, cloud-agnostic Data Lake, Data LakeHouse, Data Mesh and Data Fabric architectures to efficiently store, process, and manage structured and unstructured data from various sources. - Data Pipeline Development: Design, develop, and maintain robust data pipelines to ingest, process, and transform data from multiple sources into usable formats for analytics and reporting using services like AWS Glue, Azure Data Factory, GCP Dataflow, Apache Spark, or Apache Airflow. - Data Integration and ETL: Develop and optimize Extract, Transform, Load (ETL) and ELT processes to integrate disparate data sources into the data lake, ensuring high data quality, consistency, and reliability across multiple cloud platforms. - Cloud-Agnostic Data Engineering: Develop data solutions that are cloud-agnostic, leveraging open-source technologies like Apache Spark, Delta Lake, Presto, and Kubernetes, ensuring compatibility across AWS, Azure, and GCP. - Big Data Processing & Analytics: Utilize big data technologies such as Apache Spark, Hive, and Presto for distributed computing, enabling large-scale data transformations and analytics. - Data Governance and Security: Implement robust data governance policies, security frameworks, and compliance controls, including role-based access control (RBAC), encryption, and monitoring to meet industry standards (GDPR, HIPAA, PCI-DSS). - DevOps Integration for Data Platforms: Leverage cloud-agnostic DevOps tools and practices for source control, build automation, release management, and Infrastructure as Code (IaC) to streamline the development, deployment, and management of data lake and data architecture solutions across multiple cloud providers. Solutions should support CI/CD pipelines, automated testing, and scalable data workflows. - Continuous Integration and Deployment (CI/CD): Establish automated CI/CD pipelines to streamline deployment, testing, and monitoring of data infrastructure and workflows. - Performance Optimization: Optimize data workflows and query performance using indexing, caching, and partitioning strategies to improve efficiency and cost-effectiveness. - Monitoring and Troubleshooting: Implement observability solutions using tools like Prometheus, Grafana, or cloud-native monitoring services to proactively detect and resolve data pipeline issues. - Collaboration and Documentation: Work with cross-functional teams, including data scientists, analysts, and business stakeholders, to design and implement scalable data solutions. Maintain comprehensive documentation of data architectures, processes, and best practices. Job Qualifications: - Bachelor's degree in Computer Science, Engineering, or a related field. - 6+ years of experience as a Data Engineer, specializing in cloud-agnostic data solutions and data lake architectures. - Strong expertise in cloud data platforms such as AWS, Azure, and Google Cloud, with hands-on experience in services like AWS S3, Azure Data Lake, Google Cloud Storage, and related data processing tools. - Proficiency in big data technologies such as Apache Spark, Hadoop, Kafka, Delta Lake, or Presto. - Experience with SQL and NoSQL databases, including PostgreSQL, MySQL, and DynamoDB. - Expertise in containerization and orchestration platforms such as Docker and Kubernetes. - Experience implementing DevOps and CI/CD practices using Terraform, CloudFormation, or other Infrastructure as Code (IaC) tools. - Knowledge of data visualization tools such as Power BI, Tableau, or Looker for presenting insights and reports. - Strong problem-solving and troubleshooting skills with a proactive approach to identifying and resolving issues. - Experience leading teams of 5+ cloud engineers. - Preferred certifications in AWS, Azure, or Google Cloud data engineering.
-
India Sirius AI Full timeKey Responsibilities: Data Architecture & Management: Design and implement scalable, cloud-agnostic Data Lake, Data LakeHouse, Data Mesh and Data Fabric architectures to efficiently store, process, and manage structured and unstructured data from various sources. Data Pipeline Development : Design, develop, and maintain robust data pipelines to...
-
Azure Data Engineer
3 weeks ago
Hyderabad, India Fragma Data Systems Full timeJob Description Must-Have Skills - Good experience in Pyspark - Including Dataframe core functions and Spark SQL - Good experience in SQL DBs - Be able to write queries including fair complexity. - Should have excellent experience in Big Data programming for data transformation and aggregations - Good at ELT architecture. Business rules processing and data...
-
AWS Data Lake Administrator
7 days ago
Bengaluru, Karnataka, India, Karnataka Tata Consultancy Services Full timeAWS Data Lake Administrator LOCATION – BLR / HYD Job Description: We are seeking a highly skilled AWS Data Lake Administrator to manage and optimize our data lake architecture on AWS. The ideal candidate will have strong experience with AWS data services such as S3, Lake Formation, Glue Catalog, Redshift Spectrum, and extensive SQL knowledge. The role...
-
Azure Data Engineer
2 weeks ago
Pune, India Fragma Data Systems Full timeJob Description Technology Skills - Building and operationalizing large scale enterprise data solutions and applications using one or more of AZURE data and analytics services in combination with custom solutions - Azure Synapse/Azure SQL DWH, Azure Data Lake, Azure Blob Storage, Spark, HDInsights, Databricks, CosmosDB, EventHub/IOTHub. - Experience in...
-
AWS Data Lake Administrator
5 days ago
Bengaluru, Karnataka, India, Karnataka Tata Consultancy Services Full timeTCS Hiring for AWS Data Lake AdministratorRole - AWS Data Lake AdministratorExperience - 10 to 14 YearsLocation - Bengaluru, HyderabadRoles & ResponsibilitiesKey Responsibilities:Administer and manage the AWS Data Lake infrastructure, ensuring high availability, security, and performance.Configure and manage AWS S3, Lake Formation, and Glue Data Catalog to...
-
Azure Data Engineer
7 days ago
Gurugram, India Worksconsultancy Full timeWe are looking for a Senior Data Engineer with strong expertise in SQL, Python, Azure Synapse, Azure Data Factory, Snowflake, and Databricks. The ideal candidate should have a solid understanding of SQL (DDL, DML, query optimization) and ETL pipelines while demonstrating a learning mindset to adapt to evolving technologies.Key Responsibilities :- Collaborate...
-
Data MDM Architect
2 weeks ago
Hyderabad, Telangana, India, Telangana Tata Consultancy Services Full timeGreetings from TCS!!!Job Title: Azure/AWS Data MDM ArchitectLocation - PAN IndiaYears of Experience - 10-16 yearsRole Overview We are seeking an accomplished Data MDM Architect with deep expertise in designing and implementing cloud-based Master Data Management (MDM) solutions across Azure and AWS ecosystems (experience with GCP is a plus). The ideal...
-
Data engineer
2 weeks ago
India Data-Hat AI Full timeDepartment: Data Engineering & AI Solutions Reports To: Lead Data Solutions Architect Travel: International travel required (up to 30–40%) Position Summary: We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructure that underpins mission-critical AI systems. With 12+...
-
Data Management
1 week ago
India Zensar Full timeBachelor s or master s degree in computer science Information Systems or related field 12 years of experience in data architecture and or data management Expertise in data governance metadata management and data quality frameworks Expertise in data governance privacy and data lifecycle management policies Solid understanding of conceptual logical and...
-
Data Engineer
3 days ago
Gurugram, India SkillKart Full timeDescription :Role Overview :We are seeking a skilled Data Engineer with expertise in Microsoft Azure to join our Data & Analytics team. In this role, you will be responsible for designing, building, and optimizing scalable data pipelines and architectures using Azure data services. You will collaborate with cross-functional teams including data scientists,...