
Data Engineer – Cloud Integration
2 weeks ago
Company Overview
Boston Insights is an innovative startup creating competitive advantage for pharmaceutical companies by unlocking their clinical supply chain data and enabling end-to-end visibility. We strengthen risk resilience and agility to ensure an uninterrupted, on-time supply of investigational drugs to patients. Our mission is to transform how pharmaceutical companies manage their clinical supply chains through cutting-edge data solutions.
Position Overview
We are seeking a Data Engineer with 5+ years of experience, deep expertise in the Microsoft Azure technology stack, and a proven ability to integrate data from external data lakes, AWS data warehouses, and enterprise supply chain solutions such as SAP. The ideal candidate will also have strong experience in data governance and in building data automation tools using Python and related languages.
Key Responsibilities
Data Integration & Pipeline Development
- Design, build, and maintain scalable data pipelines using Azure Data Factory, Azure Synapse Analytics, and Azure Data Lake
- Lead the integration strategy for ingesting and harmonizing data from external sources, including AWS-based data lakes and warehouses (such as S3 and Redshift) and SAP systems.
- Automate ETL processes for data extraction, transformation, and loading across hybrid and multi-cloud environments.
- Build and maintain real-time and batch data integration workflows between Azure, AWS, and on-premises sources.
Data Architecture & Infrastructure
- Design and implement data lake and data warehouse solutions on the Azure platform
- Establish data governance frameworks and ensure data quality across all pipelines
- Implement security best practices for handling sensitive pharmaceutical data
- Create and maintain data documentation and lineage tracking
Data Governance & Quality
- Define and enforce data governance frameworks: data cataloging, lineage, quality, privacy, and compliance
- Implement robust data validation, cleansing, and monitoring systems to ensure accuracy and reliability
- Support security standards through effective data management practices.
Automation & Tooling
- Develop data automation tools and reusable components using Python, PySpark, and other relevant frameworks and languages.
- Enable end-to-end process automation for data ingestion, processing, and reporting.
- Implement CI/CD processes for data solutions, including testing, monitoring, and alerting.
Analytics & Reporting Support
- Collaborate with data scientists and analysts to support advanced analytics
- Build data models that enable risk assessment and supply chain optimization
- Develop APIs and data services to support front-end applications
- Create monitoring and alerting systems for data pipeline health
Collaboration & Support
- Partner with supply chain, analytics, and business stakeholders to understand business requirements and translate them into scalable technical solutions.
- Collaborate with SAP functional and technical teams to optimize data extraction and synchronization.
Required Technical Qualifications
- 5+ years of professional data engineering experience.
- Data Integration: Proven track record integrating data from AWS services (S3, Redshift, Glue, etc.) into Azure or other cloud environments
- Azure Data Services: Expert-level knowledge of Azure Data Factory, Azure Synapse Analytics, Databricks, Azure Data Lake Storage, Azure SQL Database, and Apache Spark
- Database Technologies: Strong knowledge of both relational (SQL Server, PostgreSQL) and NoSQL (Cosmos DB) databases
- Programming Languages: Proficiency in Python, SQL, PySpark and PowerShell for data automation and wrangling
- Version Control: Proficiency with Git and Azure DevOps
- Hands-on experience with SAP data models and with integrating SAP data into Azure Data Lake
Preferred Technical Skills
- Experience with API-based data integration for cloud and enterprise applications.
- Experience with Infrastructure as Code (ARM templates, Terraform)
- Familiarity with data quality tools, metadata management, and automated data lineage tracking.
- Knowledge of containerization (Docker, Kubernetes) for data automation workflows.
- Knowledge of machine learning pipelines and MLOps practices
- Experience with data visualization tools (Power BI)
Professional Skills
- Strong problem-solving and analytical thinking abilities
- Excellent communication skills, with the ability to explain technical concepts to non-technical stakeholders
- Experience with Agile development methodologies
- Attention to detail and commitment to data quality
The opportunity we offer
- Competitive salary commensurate with experience.
- Professional development training and certifications.
Work Environment
- Remote work arrangement with flexible hours
- State-of-the-art technology and tools
- Collaborative, innovation-driven culture
- Access to cutting-edge pharmaceutical industry data and challenges
- Opportunity to shape an innovative pharma analytics platform.
Application Instructions
Please submit your resume and a cover letter highlighting:
- Experience in Azure, AWS data integration, and SAP data extraction.
- Examples of data automation tools or frameworks you have developed.
Join us to unlock new possibilities in pharmaceutical supply chain data through advanced engineering and multi-cloud innovation.
Boston Insights is transforming pharmaceutical supply chains through innovative data solutions. Join us in ensuring that life-saving investigational drugs reach patients on time, every time.