Data Engineer – Cloud Integration

2 weeks ago


Davangere, Karnataka, India Boston Insights Full time

Company Overview

Boston Insights is an innovative startup creating competitive advantage for pharmaceutical companies by unlocking their clinical supply chain data and enabling end-to-end visibility. We augment risk resiliency and agility to ensure uninterrupted supply of investigational drugs to patients on-time. Our mission is to transform how pharmaceutical companies manage their clinical supply chains through cutting-edge data solutions.

Position Overview

We are seeking a Data Engineer with 5+ years of experience, deep expertise in the Microsoft Azure technology stack, and proven ability in integrating data from external data lakes, AWS data warehouses, and enterprise supply chain solutions like SAP. The ideal candidate will also have strong experience in data governance and building data automation tools using Python and related languages.

Key Responsibilities

Data Integration & Pipeline Development

  • Design, build, and maintain scalable data pipelines using Azure Data Factory, Azure Synapse Analytics, and Azure Data Lake
  • Lead the integration strategy for ingesting and harmonizing data from external sources, including AWS-based data lakes/warehouses (such as S3, Redshift) and SAP systems.
  • Automate ETL processes for data extraction, transformation, and loading across hybrid and multi-cloud environments.
  • Build and maintain real-time and batch data integration workflows between Azure, AWS, and on-premises sources.

Data Architecture & Infrastructure

  • Design and implement data lake and data warehouse solutions on Azure platform
  • Establish data governance frameworks and ensure data quality across all pipelines
  • Implement security best practices for handling sensitive pharmaceutical data
  • Create and maintain data documentation and lineage tracking

Data Governance & Quality

  • Define and enforce data governance frameworks: data cataloging, lineage, quality, privacy, and compliance
  • Implement robust data validation, cleansing, and monitoring systems to ensure accuracy and reliability
  • Support security standards through effective data management practices.

Automation & Tooling

  • Develop data automation tools and reusable components using Python, PySpark (and other relevant frameworks/languages)
  • Enable end-to-end process automation for data ingestion, processing, and reporting.
  • Implement CI/CD processes for data solutions, including testing, monitoring, and alerting.

Analytics & Reporting Support

  • Collaborate with data scientists and analysts to support advanced analytics
  • Build data models that enable risk assessment and supply chain optimization
  • Develop APIs and data services to support front-end applications
  • Create monitoring and alerting systems for data pipeline health

Collaboration & Support

  • Partner with supply chain, analytics, and business stakeholders to understand business requirements and translate them into scalable technical solutions.
  • Collaborate with SAP functional and technical teams to optimize data extraction and synchronization.

Required Technical Qualifications

  • 5+ years of professional data engineering experience.
  • Data Integration: Proven track record integrating data from AWS services (S3, Redshift, Glue, etc.) into Azure or other cloud environments
  • Azure Data Services: Expert-level knowledge of Azure Data Factory, Azure Synapse Analytics, Data Bricks, Azure Data Lake Storage, and Azure SQL Database, Apache Spark
  • Database Technologies: Strong knowledge of both relational (SQL Server, PostgreSQL) and NoSQL (Cosmos DB) databases
  • Programming Languages: Proficiency in Python, SQL, PySpark and PowerShell for data automation and wrangling
  • Version Control: Proficiency with Git and Azure DevOps
  • Hands-on experience with SAP data models and integrating SAP data with Azure data lake

Preferred Technical Skills

  • Experience with API-based data integration for cloud and enterprise applications.
  • Experience with Infrastructure as Code (ARM templates, Terraform)
  • Familiarity with data quality tools, metadata management, and automated data lineage tracking.
  • Knowledge of containerization (Docker, Kubernetes) for data automation workflows.
  • Knowledge of machine learning pipelines and MLOps practices
  • Experience with data visualization tool, Power BI

Professional Skills

  • Strong problem-solving and analytical thinking abilities
  • Excellent communication skills with ability to explain technical concepts to non-technical stakeholders
  • Experience with Agile development methodologies
  • Attention to detail and commitment to data quality

The opportunity we offer

  • Competitive salary commensurate with experience.
  • Professional development training and certifications.

Work Environment

  • Remote work arrangement with flexible hours
  • State-of-the-art technology and tools
  • Collaborative, innovation-driven culture
  • Access to cutting-edge pharmaceutical industry data and challenges
  • Opportunity to shape an innovative pharma analytics platform.

Application Instructions

Please submit your resume and a cover letter highlighting:

  • Experience in Azure, AWS data integration, and SAP data extraction.
  • Examples of data automation tools or frameworks you have developed.

Join us to unlock new possibilities in pharmaceutical supply chain data through advanced engineering and multi-cloud innovation

Boston Insights is transforming pharmaceutical supply chains through innovative data solutions. Join us in ensuring that life-saving investigational drugs reach patients on-time, every time.



  • Davangere, Karnataka, India beBeeDataEngineer Full time US$ 1,20,000 - US$ 1,70,000

    As a seasoned data engineer, you will play a pivotal role in designing, building, and maintaining scalable data pipelines using Azure Data Factory, Azure Synapse Analytics, and Azure Data Lake.Our team is seeking an expert-level professional with 5+ years of experience in data integration, particularly integrating data from AWS services into Azure or other...


  • Davangere, Karnataka, India beBeeData Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Cloud Data Engineering ExpertWe are seeking a highly skilled Cloud Data Engineering Expert to join our team. As a key member of our data engineering team, you will be responsible for designing, developing, and maintaining scalable data pipelines and ETL processes using GCP services such as BigQuery, Cloud Data Fusion, Dataflow, Pub/Sub, Cloud Storage,...


  • Davangere, Karnataka, India beBeeCloud Full time ₹ 90,00,000 - ₹ 1,50,00,000

    Senior Cloud Data Engineer">OverviewAt our company, we are seeking a seasoned Senior Cloud Data Engineer to join our data engineering team. As a pivotal member of our team, you will play a crucial role in designing, implementing, and optimizing data pipelines, ensuring seamless integration with cloud platforms.Key ResponsibilitiesDesign, develop, and...


  • Davangere, Karnataka, India beBeeDataEngineer Full time ₹ 15,00,000 - ₹ 25,00,000

    Job OpportunityWe are seeking a skilled Data Engineer to join our team and help us drive data-driven decision making.Key Responsibilities:Design and implement scalable data pipelines using cloud-based technologies.Develop and maintain Extract, Transform, Load (ETL) processes to ensure seamless data transfer between systems.Collaborate with cross-functional...


  • Davangere, Karnataka, India beBeeData Full time ₹ 12,00,000 - ₹ 15,00,000

    Job Title: Cloud Data ArchitectWe are seeking a seasoned Cloud Data Architect to lead the design and implementation of large-scale data infrastructures on cloud platforms.About the Role:The ideal candidate will have extensive experience with cloud-based data engineering, including development, deployment, and management of scalable data pipelines using...

  • Cloud Data Modeler

    4 days ago


    Davangere, Karnataka, India beBeeDataModeler Full time ₹ 1,50,00,000 - ₹ 2,01,00,000

    Cloud Data Modeler Role OverviewOur organization is seeking an experienced Cloud Data Modeler to develop and optimize scalable data pipelines using cloud-native technologies.Design, develop, and implement robust data warehousing solutions utilizing Snowflake and other cloud-based tools.Collaborate with business stakeholders and consultants to understand data...


  • Davangere, Karnataka, India beBeeCloudEngineer Full time ₹ 1,80,00,000 - ₹ 2,10,00,000

    Senior Cloud Data Engineer PositionWe are seeking a highly skilled professional to lead complex data migration projects, with extensive experience in modern data engineering tools and platforms within the Azure ecosystem.The ideal candidate will have a strong foundation in data integration, transformation, and migration, along with a passion for working on...


  • Davangere, Karnataka, India beBeeOdi Full time ₹ 18,00,000 - ₹ 25,00,000

    Expert ODI DeveloperWe are seeking an experienced software developer with in-depth knowledge of Oracle Data Integrator (ODI) to design and implement data integration solutions. Key Responsibilities:Develop efficient ELT processes using ODI 11g/12c.Optimize Knowledge Module performance.Work on end-to-end ODI projects from requirement gathering to...


  • Davangere, Karnataka, India beBeeDataEngineer Full time ₹ 12,00,000 - ₹ 17,00,000

    Cloud Data Engineer Job DescriptionA Data Architect for Cloud Data Pipelines is needed to design, build, and maintain robust data pipelines for processing channel activity data.This role involves working on AWS cloud infrastructure and orchestration tools like Airflow for high availability, scalability, and performance.Main Responsibilities:Designing,...


  • Davangere, Karnataka, India beBeeInformatician Full time ₹ 80,00,000 - ₹ 1,50,00,000

    Informatica Developer Job DescriptionAs a highly skilled Informatica Developer, you will be responsible for designing and developing data integration solutions using Informatica PowerCenter, IDMC (IICS), Oracle Fusion Cloud data sources, and ETL frameworks.The ideal candidate will have expertise in SQL, PL/SQL, and relational databases (Oracle, SQL Server,...