Sr. Data Engineer

3 days ago


Anand, India Exigo Tech Full time

Exigo Tech is a Sydney-based Technology Solutions Provider that is focused on providing solutions on three major verticals; Infrastructure, Cloud, and Application to businesses across Australia. We help companies reach operational efficiencies by empowering them with technology solutions that drive their business processes. Exigo is looking for Full-time Sr. Data Engineer We are ISO 27001:2022 certified organization Visit our website: for more details…. LinkedIn: Click Here to know more : LIFE AT EXIGO TECH Roles and Responsibilities Install, configure, and manage Apache Spark (open-source) clusters on Ubuntu, including Spark master/worker nodes and Spark environment files. Configure and manage Spark UI and Spark History Server for monitoring jobs, analyzing DAGs, stages, tasks, and troubleshooting performance. Develop, optimize, and deploy PySpark ETL/ELT pipelines using DataFrame API, UDFs, window functions, caching, partitioning, and broadcasting. Deploy PySpark jobs using spark-submit in client/cluster mode with proper logging and error handling. Install, configure, and manage Apache Airflow including UI, scheduler, webserver, connections, and variables. Create, schedule, and monitor Airflow DAGs for PySpark jobs using SparkSubmitOperator, BashOperator, or PythonOperator. Configure and manage cron jobs for scheduling data processing tasks where needed. Install, configure, and optimize Trino (PrestoSQL) coordinator and worker nodes; configure catalogs suchas S3, MySQL, or PostgreSQL. Maintain Linux/Ubuntu servers including services, logs, environment variables, memory usage, and port conflict resolution. Design and implement scalable data architectures using Azure Data Services including ADF, Synapse, ADLS, Azure SQL, and Databricks. Develop, manage, and automate ETL/ELT pipelines using Azure Data Factory (Pipelines, Mapping Dataflows, Dataflows). Monitor, troubleshoot, and optimize data pipelines across Spark, Airflow, Trino, and Azure platforms. Work with structured, semi-structured, and unstructured data across multiple data sources and formats. Implement data analytics, transformation, backup, and recovery solutions. Perform data migration, upgrade, and modernization using Azure and database tools. Implement CI/CD pipelines for data solutions using Azure DevOps and Git. Ensure data quality, governance, lineage, metadata management, and security compliance across cloud and big data environments. Design and optimize data models using star and snowflake schemas; build data warehouses, Delta Lake, and Lakehouse systems. Develop and rebuild reports/dashboards using Power BI, Tableau, or similar tools. Collaborate with internal teams, clients, and business users to gather requirements and deliver high-quality data solutions. Provide documentation, runbooks, and operational guidance. Technical Skills: Apache Spark (Open Source) & PySpark - Must Apache Spark installation & cluster configuration (Ubuntu/Linux) Spark master/worker setup (standalone & cluster mode) Spark UI & History Server configuration and debugging PySpark development (ETL pipelines, UDFs, window functions, DataFrame API) Performance tuning (partitioning, caching, shuffles) spark-submit deployment with monitoring and logging 2. Apache Airflow & Job Orchestration - Must Airflow installation & configuration (UI, scheduler, webserver) Creating and scheduling DAGs (SparkSubmitOperator, BashOperator, PythonOperator) Retry logic, triggers, alerting, and log management Cron job scheduling & process automation 3. Trino (PrestoSQL) - Must Trino coordinator & worker node setup Catalog configuration (S3, RDBMS sources) Distributed SQL troubleshooting & performance optimization 4. Azure Data Services (nice to have) Azure Data Factory Azure Synapse Analytics Azure SQL / Cosmos DB Azure Data Lake Storage (Gen2) Azure Databricks (Delta, Notebooks, Jobs) Azure Event Hubs / Stream Analytics 5. Microsoft Fabric ( nice to have) Lakehouse Warehouse Dataflows Notebooks Pipelines 6. Programming & Querying Python PySpark SQL Scala 7. Data Modeling & Warehousing Star schema modeling Snowflake schema modeling Fact/dimension modeling Data warehouse & Lakehouse design Delta Lake / Lakehouse architectures 8. DevOps & CI/CD Git / GitHub / Azure Repos Azure DevOps pipelines (CI/CD) Automated deployment for Spark, Airflow, ADF, Databricks, Fabric 9. BI Tools (Nice to have) Power BI Tableau Report building, datasets, DAX 10. Linux/Ubuntu Server Knowledge Shell scripting Service management Logs & environment variables Soft Skills: Excellent problem solving and communication skills Able to work well in a team setting Excellent organizational and time management skills Taking end-to-end ownership Production support & timely delivery Self-driven, flexible and innovative Microsoft Certified: Azure Data Engineer Associate (DP-203 / DP -300) Knowledge of DevOps and CI/CD pipelines in Azure Education: BSc/BA in Computer Science, Engineering or a related field Work Location: Vadodara, Gujarat, India


  • Sr. Platform Engineer

    2 weeks ago


    Anand, India CME Group Full time

    Join our Technology (DevOps) team as a Sr. Platform Engineer. In this critical role, you'll leverage your expertise in CI/CD, container orchestration (Kubernetes), and infrastructure-as-code to engineer the next generation of scalable, secure, and resilient platforms that power global markets. What You’ll Get - A supportive environment fostering career...

  • Data Engineer

    3 weeks ago


    Anand, India Aceolution Full time

    Job Title: Data Engineer – Python Expert(Freelance Role) Location: Remote / Hybrid Employment Type: Contract/ Freelance Role Summary We are looking for a seasoned Senior Data Engineer to architect, build, and own the data pipelines that power our large language model (LLM) development. As a senior Individual Contributor (IC), you will be the team's expert...

  • Data Engineer

    3 days ago


    Anand, India MyRemoteTeam Inc Full time

    About Us MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent. We empower businesses by providing world-class software engineers, operations support, and infrastructure to help them grow faster and better. Position: Snowflake & Databricks Data Engineer / Data Science Engineer Experience: 9+ Years...


  • anand, India beBeeDataEngineering Full time

    Data Engineering OpportunityWe are seeking an experienced professional to lead our global engineering team in a data-driven project initiative. The ideal candidate will possess expertise in data engineering and have a strong background in managing large-scale data systems.Key responsibilities include designing, implementing, and maintaining data pipelines...

  • Cloud Data Engineer

    5 days ago


    anand, India beBeeDataEngineer Full time

    Cloud Data Engineer PositionWe are seeking a skilled Cloud Data Engineer to join our team. As a key member of the organization, you will play a critical role in designing and implementing data processing systems that drive business growth.Job Description:The ideal candidate will have experience with cloud-based data engineering platforms and technologies,...

  • Data Engineer – CDP

    2 weeks ago


    Anand, India Integers.Ai Full time

    Job Description: Data Engineer – CDPRole: Data Engineer – CDPJob Location: RemoteExperience: 4-8 yearsAbout the RoleWe are seeking an experienced Data Engineer with strong CDP expertise to join our team. The ideal candidate will have hands-on experience working with Customer Data Platforms—specifically Real-Time CDP and Salesforce CDP—along with a...

  • AWS Data Engineer

    3 weeks ago


    Anand, India Vista Applied Solutions Group Inc Full time

    Job Summary for AWS Data Engineer:We are seeking an experienced AWS Data Engineer with strong skills in Python and SQL to design, develop, and maintain cloud-based data pipelines and analytics solutions.Job Qualification and Responsibilities for AWS Data Engineer:3–8 years of Data Engineering experience.Strong Python programming.Advanced SQL (performance...

  • AWS Data Engineer

    3 weeks ago


    Anand, India Vista Applied Solutions Group Inc Full time

    Job Summary for AWS Data Engineer:We are seeking an experienced AWS Data Engineer with strong skills in Python and SQL to design, develop, and maintain cloud-based data pipelines and analytics solutions.Job Qualification and Responsibilities for AWS Data Engineer:3–8 years of Data Engineering experience.Strong Python programming.Advanced SQL (performance...


  • anand, India beBeeDataEngineering Full time

    Data Engineering LeadJoin our team as a data engineering lead and drive the design, build, and scale of robust data solutions on Microsoft Azure.Owning modern data pipelines and models that power analytics and reporting across the business requires hands-on expertise with SQL databases, Azure Data Lake Storage, Azure Data Factory (ADF), and Power...

  • Cloud Data Engineer

    3 days ago


    anand, India beBeeDataEngineer Full time

    Job DescriptionWe are seeking a skilled data engineer to drive the development and implementation of cloud-based data solutions using Google Cloud Platform (GCP). The ideal candidate will have hands-on experience in GCP, especially with BigQuery and Airflow, as well as strong SQL and Python skills.This role involves technical requirements gathering and...