Data Engineer – Cloud Integration

4 weeks ago


Agra, Uttar Pradesh, India | Boston Insights | Full-time

Company Overview

Boston Insights is an innovative startup that creates competitive advantage for pharmaceutical companies by unlocking their clinical supply chain data and enabling end-to-end visibility. We strengthen risk resilience and agility to ensure an uninterrupted, on-time supply of investigational drugs to patients. Our mission is to transform how pharmaceutical companies manage their clinical supply chains through cutting-edge data solutions.

Position Overview

We are seeking a Data Engineer with 5+ years of experience, deep expertise in the Microsoft Azure technology stack, and a proven ability to integrate data from external data lakes, AWS data warehouses, and enterprise supply chain solutions such as SAP. The ideal candidate will also have strong experience in data governance and in building data automation tools using Python and related languages.

Key Responsibilities

Data Integration & Pipeline Development

  • Design, build, and maintain scalable data pipelines using Azure Data Factory, Azure Synapse Analytics, and Azure Data Lake
  • Lead the integration strategy for ingesting and harmonizing data from external sources, including AWS-based data lakes/warehouses (such as S3, Redshift) and SAP systems.
  • Automate ETL processes for data extraction, transformation, and loading across hybrid and multi-cloud environments.
  • Build and maintain real-time and batch data integration workflows between Azure, AWS, and on-premises sources.

Data Architecture & Infrastructure

  • Design and implement data lake and data warehouse solutions on the Azure platform
  • Establish data governance frameworks and ensure data quality across all pipelines
  • Implement security best practices for handling sensitive pharmaceutical data
  • Create and maintain data documentation and lineage tracking

Data Governance & Quality

  • Define and enforce data governance frameworks: data cataloging, lineage, quality, privacy, and compliance
  • Implement robust data validation, cleansing, and monitoring systems to ensure accuracy and reliability
  • Support security standards through effective data management practices.

Automation & Tooling

  • Develop data automation tools and reusable components using Python, PySpark, and other relevant frameworks and languages.
  • Enable end-to-end process automation for data ingestion, processing, and reporting.
  • Implement CI/CD processes for data solutions, including testing, monitoring, and alerting.

Analytics & Reporting Support

  • Collaborate with data scientists and analysts to support advanced analytics
  • Build data models that enable risk assessment and supply chain optimization
  • Develop APIs and data services to support front-end applications
  • Create monitoring and alerting systems for data pipeline health

Collaboration & Support

  • Partner with supply chain, analytics, and business stakeholders to understand business requirements and translate them into scalable technical solutions.
  • Collaborate with SAP functional and technical teams to optimize data extraction and synchronization.

Required Technical Qualifications

  • 5+ years of professional data engineering experience.
  • Data Integration: Proven track record integrating data from AWS services (S3, Redshift, Glue, etc.) into Azure or other cloud environments
  • Azure Data Services: Expert-level knowledge of Azure Data Factory, Azure Synapse Analytics, Azure Databricks, Azure Data Lake Storage, Azure SQL Database, and Apache Spark
  • Database Technologies: Strong knowledge of both relational (SQL Server, PostgreSQL) and NoSQL (Cosmos DB) databases
  • Programming Languages: Proficiency in Python, SQL, PySpark, and PowerShell for data automation and wrangling
  • Version Control: Proficiency with Git and Azure DevOps
  • SAP Integration: Hands-on experience with SAP data models and with integrating SAP data into Azure Data Lake

Preferred Technical Skills

  • Experience with API-based data integration for cloud and enterprise applications.
  • Experience with Infrastructure as Code (ARM templates, Terraform)
  • Familiarity with data quality tools, metadata management, and automated data lineage tracking.
  • Knowledge of containerization (Docker, Kubernetes) for data automation workflows.
  • Knowledge of machine learning pipelines and MLOps practices
  • Experience with data visualization tools such as Power BI

Professional Skills

  • Strong problem-solving and analytical thinking abilities
  • Excellent communication skills, with the ability to explain technical concepts to non-technical stakeholders
  • Experience with Agile development methodologies
  • Attention to detail and commitment to data quality

The Opportunity We Offer

  • Competitive salary commensurate with experience.
  • Professional development training and certifications.

Work Environment

  • Remote work arrangement with flexible hours
  • State-of-the-art technology and tools
  • Collaborative, innovation-driven culture
  • Access to cutting-edge pharmaceutical industry data and challenges
  • Opportunity to shape an innovative pharma analytics platform.

Application Instructions

Please submit your resume and a cover letter highlighting:

  • Experience in Azure, AWS data integration, and SAP data extraction.
  • Examples of data automation tools or frameworks you have developed.

Join us to unlock new possibilities in pharmaceutical supply chain data through advanced engineering and multi-cloud innovation.

Boston Insights is transforming pharmaceutical supply chains through innovative data solutions. Join us in ensuring that life-saving investigational drugs reach patients on time, every time.

