Senior Lead Data Engineer

1 week ago


Bengaluru, Karnataka, India GSK Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Office name: Bengaluru Luxor North Tower
Posted Date: Nov 3 2025

Role Overview:

The Senior Lead Data Engineer will operate within a matrixed product team design, holding responsibility for the technical solution design, implementation, and ongoing enhancement of products and systems developed by Medical Digital and Tech. In line with Agile Ways of Working, DevSecOps principles, and the requirements for Compliance and Digital Certainty, the role will collaborate closely with a Product Manager.

The Senior Lead Data Engineer will serve as a "T-Shaped" engineer, demonstrating both deep expertise and broad proficiency across essential engineering competencies, such as Software Development, Automated Testing, DevOps, CI/CD, Data Science/Analytics, and Lifecycle Management.

We are seeking an exceptionally skilled and strategic Senior Lead Data Engineer with DevOps skills to lead the design, development, and optimization of our data and medical products and systems. The ideal candidate will possess significant expertise in Azure Databricks, modern Lakehouse architectures, infrastructure management, data pipeline automation, and advanced security practices, including the application of Generative AI to create advanced data and analytics solutions.

The role requires effective collaboration with experts from other tech teams and subject matter domains, as well as core engineering knowledge and experience with industry technologies, practices, and frameworks such as React, Azure Cloud Ops, AI/ML, CI/CD, DevOps, Automated Testing, and API Architectures.

Key Responsibilities
  • Azure Data Architecture: Define, architect, and implement scalable, secure, and cost-effective data solutions on Azure, utilizing Azure Data Lake Storage (ADLS) Gen2, Azure Data Factory (ADF), and Azure Synapse.

  • Databricks Lakehouse Implementation: Architect and optimize the Databricks Lakehouse platform, leveraging Delta Lake for transactional support and implementing robust data ingestion and transformation architectures.

  • GenAI Data Strategy: Lead data engineering initiatives for Generative AI projects, including the design and construction of data pipelines for Retrieval-Augmented Generation (RAG), feature engineering for large language model (LLM) fine-tuning, and managing vector databases and embedding workflows in both Databricks and Azure.

  • Advanced Data Processing: Develop, manage, and optimize large-scale batch and streaming data pipelines using Databricks notebooks with PySpark and SQL. Implement Databricks Workflows for job orchestration, ensuring robust monitoring, error handling, and alerting.

  • Data Governance and Security: Champion data governance best practices using Databricks Unity Catalog to manage permissions, enforce data quality, track lineage, and ensure compliance with security and privacy standards for all data assets.

  • Collaboration and Mentorship: Work closely with AI/ML engineers, data scientists, and business teams to understand data requirements for models and translate these into technical solutions. Provide technical leadership, mentorship, and guidance to the data engineering team.

  • Azure Cloud Architecture: Oversee the design, provisioning, and management of Azure cloud resources, including Azure Active Directory (AAD), networking, and security protocols. Manage Azure Databricks workspaces and clusters, monitor performance, troubleshoot issues, and optimize resource utilization. Utilize advanced Azure services such as Azure Functions, Logic Apps, and Synapse Analytics to construct robust, serverless solutions.

  • Databricks Pipeline Automation: Implement and manage end-to-end CI/CD pipelines for data and analytics projects on Azure Databricks using Azure DevOps and Databricks Asset Bundles (DABs) with Git integration. Automate the deployment of Databricks notebooks, libraries, and jobs across multiple environments (development, staging, production), and define/manage Databricks jobs using CI/CD practices to ensure version control and reliable, repeatable executions.

  • Infrastructure as Code (IaC) and Automation: Develop, implement, and maintain Infrastructure as Code for the entire cloud stack using advanced Azure Resource Manager (ARM) templates. Create complex automation scripts and playbooks with Python to automate infrastructure tasks and streamline workflows.

  • DevSecOps and Governance: Lead the integration of security best practices throughout the CI/CD pipeline and Azure environment. Establish and enforce governance policies for Databricks and Azure, manage access controls, compliance, and data privacy, and implement observability solutions for monitoring, logging, and alerting on Azure and Databricks using tools such as Azure Monitor, Log Analytics, and Grafana.

  • Collaboration and Problem-Solving: Serve as a technical liaison between data engineering, data science, and security teams to align best practices for data processing and MLOps. Provide expert-level troubleshooting and root cause analysis for performance and availability issues.

  • Cloud Infrastructure Management: Manage, optimize, and secure cloud environments on major platforms like Azure, with a focus on scalability and cost efficiency.

  • Process Improvement: Continuously evaluate and optimize existing processes to enhance the speed, quality, and reliability of software delivery.

Required Skills & Qualifications: Technical Skills
  • BE/B.Tech graduate with 6 to 8 years of progressive experience in data engineering, with significant expertise in building solutions on Azure using Databricks.

  • Azure Ecosystem: Expert-level knowledge of Azure Data Platform components, including ADLS Gen2, Azure Data Factory, Azure Synapse Analytics, and Azure Key Vault.

  • Databricks Mastery: Demonstrated expertise with Databricks, including Delta Lake, Unity Catalog, Databricks SQL, MLflow, and advanced Spark optimization techniques such as Photon Engine and Adaptive Query Execution (AQE).

  • GenAI Integration: Hands-on experience creating Generative AI-driven data solutions, such as Retrieval-Augmented Generation (RAG) pipelines, fine-tuning LLMs, and implementing vector search in production environments.

  • Programming Expertise: Mastery of Python (including PySpark and Pandas) and SQL.

  • Data Warehousing and Modeling: Strong understanding of dimensional modeling, data warehousing concepts, and implementing the Medallion architecture within a Lakehouse framework.

  • CI/CD Tools: In-depth, hands-on experience with CI/CD platforms such as GitLab CI and GitHub Actions, Infrastructure-as-Code (Terraform), and containerization (Docker, Kubernetes) for data and ML workloads.

  • Containerization: Mastery of container technologies like Docker and orchestration platforms like Kubernetes.

  • Monitoring and Observability: Expertise with observability tools such as Grafana.

  • Version Control: Strong proficiency with Git, including advanced workflow management.

  • Operating Systems: Deep knowledge of Linux/Unix administration.

  • GenAI Model Deployment: Lead the deployment of large language models (LLMs) and Generative AI applications on Azure, addressing challenges related to latency, cost, and security.

  • RAG System Implementation: Architect and implement Retrieval-Augmented Generation (RAG) systems on Azure, integrating vector databases (like Azure AI Search) and managing the associated data and infrastructure.

  • AI-Powered Automation: Utilize Generative AI tools to automate code generation, improve testing procedures, and develop intelligent automation for operational tasks.

Preferred Qualifications
  • Databricks certifications such as Databricks Certified Data Engineer Professional or Generative AI Engineer. Experience with Generative AI-related technologies and frameworks like Azure AI Search and LangChain.

Inclusion at GSK:

As an employer committed to Inclusion, we encourage you to reach out if you need any adjustments during the recruitment process.

Please contact our Recruitment Team at IN.recruitment- to discuss your needs.

Why GSK?

Uniting science, technology and talent to get ahead of disease together.

GSK is a global biopharma company with a purpose to unite science, technology and talent to get ahead of disease together. As a successful, growing company where people can thrive, we aim to positively impact the health of 2.5 billion people by the end of the decade.

We prioritise innovation in vaccines and specialty medicines to maximise the increasing opportunities to prevent and treat disease.

We focus on four therapeutic areas: respiratory and immunology; oncology; HIV; and infectious diseases, to impact health at scale.

People and patients around the world rely on the medicines and vaccines we make, so we are committed to creating an environment where our people can thrive and focus on what matters most. Our culture of being ambitious for patients, accountable for impact and doing the right thing is the foundation for how, together, we deliver for patients, shareholders and our people.

Important notice to employment businesses/agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. Obtaining prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/agency and GSK. In the absence of such written authorization, any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.

It has come to our attention that the names of GlaxoSmithKline or GSK or our group companies are being used in connection with bogus job advertisements or through unsolicited emails asking candidates to make some payments for recruitment opportunities and interviews. Please be advised that such advertisements and emails are not connected with the GlaxoSmithKline (or GSK) group in any way.

GlaxoSmithKline (or GSK) does not charge any fee whatsoever for the recruitment process. Please do not make payments to any individuals/entities in connection with recruitment with any GlaxoSmithKline (or GSK) group company at any worldwide location, even if they claim that the money is refundable.

If you come across unsolicited emails from addresses not ending in or job advertisements stating that you should contact an email address that does not end in "", you should disregard them and inform us by emailing , so that we can confirm whether the job offer is genuine.


