AWS Data Engineer

1 week ago


gurugram, India Infogain Full time

AWS Data Engineer (Senior) with skills AWS - EKS, AWS - CloudFormation, AWS-Apps, AWS-Infra, AWS DBA, Python, Apache Hive, SQL for location Gurugram, India

Posted on: June 15, Share on Linkedin Share on Twitter Share on Facebook

ROLES & RESPONSIBILITIES

We are seeking a highly skilled and motivated Data Engineer to join our dynamic team. The ideal candidate will have extensive experience in ETL, Data Modelling, and Data Architecture. Proficiency in ETL optimization, designing, coding, and tuning big data processes using Scala is essential, along with hands-on experience in stream data processing using Spark, Kafka, and Spark Structured Streaming.

Additionally, the candidate should have extensive experience in building data platforms using a variety of technologies, including Scala, SQL/PLSQL, PostgreSQL, SQL Server, Teradata, Spark, Spark Structured Streaming, Kafka, Parquet/ORC, Data Modelling (Relational Dimensional E-R Modelling), ETL, RDS (PostgreSQL, MySQL), Splunk, DataDog, Airflow, Git, CI/CD Jenkins, JIRA, Confluence, IntelliJ Idea, Agile - Scrum/Kanban, On Call & Operations, Code Review, RCP Framework, Query book, Build, Deployment CI/CD & Release Process, Backstage, PagerDuty, and Spinnaker.

Key Responsibilities:

Hands-on experience on developing Data platform and its components Data Lake, cloud Datawarehouse, APIs, Batch and streaming data pipeline Experience with building data pipelines and applications to stream and process large datasets at low latency.

· Develop and maintain batch and stream processing data solutions using Apache Spark, Kafka, and Spark Structured Streaming.

· Work on orchestration using Airflow to automate and manage data workflows.

· Utilize project management tools like JIRA and Confluence to track progress and collaborate with the team.

· Develop data processing workflows utilizing Spark, SQL/PLSQL, and Scala to transform and cleanse raw data into a usable format.

· Implement data storage solutions leveraging Parquet/ORC formats on platforms such as PostgreSQL, SQL Server, Teradata, and RDS (PostgreSQL, MySQL).

· Optimize data storage and retrieval performance through efficient data modelling techniques, including Relational, Dimensional, and E-R modelling.

· Maintain data integrity and quality by implementing robust validation and error handling mechanisms within ETL processes.

· Automate deployment processes using CI/CD tools like Jenkins and Spinnaker to ensure reliable and consistent releases.

· Monitor and troubleshoot data pipelines using monitoring tools like DataDog and Splunk to identify performance bottlenecks and ensure system reliability.

· Participate in Agile development methodologies such as Scrum/Kanban, including sprint planning, daily stand-ups, and retrospective meetings.

· Conduct code reviews to ensure adherence to coding standards, best practices, and scalability considerations.

· Manage and maintain documentation using tools like Confluence to ensure clear and up-to-date documentation of data pipelines, schemas, and processes.

· Provide on-call support for production data pipelines, responding to incidents and resolving issues in a timely manner.

· Collaborate with cross-functional teams including developers, data scientists, and operations teams to address complex data engineering challenges.

· Stay updated on emerging technologies and industry trends to continuously improve data engineering processes and tools.

· Contribute to the development of reusable components and frameworks to streamline data engineering tasks across projects.

· Utilize version control systems like Git to manage codebase and collaborate effectively with team members.

· Leverage IDEs like IntelliJ IDEA for efficient development and debugging of data engineering code.

· Adhere to security best practices in handling sensitive data and implementing access controls within the data lake environment.

Good-to-Know Skills:

· Programming Languages: Python, Bash/Unix/Linux

· Big Data Technologies: Hive, Avro, Apache Iceberg, Delta Format

· Cloud Services: EC2, ECS, S3, SNS, SQS, CloudWatch

· Databases: DynamoDB, Redis

· Containerization and Orchestration: Docker, Kubernetes

· CI/CD Tools: Github Copilot

· Additional Skills: Maven, CLI/SDK

Nice-to-Have Skills:

· Networking: Subnets, Routes

· Big Data Technologies: Flink

Key Responsibilities:

Hands-on experience on developing Data platform and its components Data Lake, cloud Datawarehouse, APIs, Batch and streaming data pipeline Experience with building data pipelines and applications to stream and process large datasets at low latency.

· Develop and maintain batch and stream processing data solutions using Apache Spark, Kafka, and Spark Structured Streaming.

· Work on orchestration using Airflow to automate and manage data workflows.

· Utilize project management tools like JIRA and Confluence to track progress and collaborate with the team.

· Develop data processing workflows utilizing Spark, SQL/PLSQL, and Scala to transform and cleanse raw data into a usable format.

· Implement data storage solutions leveraging Parquet/ORC formats on platforms such as PostgreSQL, SQL Server, Teradata, and RDS (PostgreSQL, MySQL).

· Optimize data storage and retrieval performance through efficient data modelling techniques, including Relational, Dimensional, and E-R modelling.

· Maintain data integrity and quality by implementing robust validation and error handling mechanisms within ETL processes.

· Automate deployment processes using CI/CD tools like Jenkins and Spinnaker to ensure reliable and consistent releases.

· Monitor and troubleshoot data pipelines using monitoring tools like DataDog and Splunk to identify performance bottlenecks and ensure system reliability.

· Participate in Agile development methodologies such as Scrum/Kanban, including sprint planning, daily stand-ups, and retrospective meetings.

· Conduct code reviews to ensure adherence to coding standards, best practices, and scalability considerations.

· Manage and maintain documentation using tools like Confluence to ensure clear and up-to-date documentation of data pipelines, schemas, and processes.

· Provide on-call support for production data pipelines, responding to incidents and resolving issues in a timely manner.

· Collaborate with cross-functional teams including developers, data scientists, and operations teams to address complex data engineering challenges.

· Stay updated on emerging technologies and industry trends to continuously improve data engineering processes and tools.

· Contribute to the development of reusable components and frameworks to streamline data engineering tasks across projects.

· Utilize version control systems like Git to manage codebase and collaborate effectively with team members.

· Leverage IDEs like IntelliJ IDEA for efficient development and debugging of data engineering code.

· Adhere to security best practices in handling sensitive data and implementing access controls within the data lake environment.

Good-to-Know Skills:

· Programming Languages: Python, Bash/Unix/Linux

· Big Data Technologies: Hive, Avro, Apache Iceberg, Delta Format

· Cloud Services: EC2, ECS, S3, SNS, SQS, CloudWatch

· Databases: DynamoDB, Redis

· Containerization and Orchestration: Docker, Kubernetes

· CI/CD Tools: Github Copilot

· Additional Skills: Maven, CLI/SDK

Nice-to-Have Skills:

· Networking: Subnets, Routes

· Big Data Technologies: Flink

EXPERIENCE

6-8 Years

SKILLS

Primary Skill: Data Engineering Sub Skill(s): AWS - EKS, AWS - CloudFormation, AWS-Apps, AWS-Infra, AWS DBA Additional Skill(s): Python, Apache Hive, SQL

ABOUT THE COMPANY

Infogain is a human-centered digital platform and software engineering company based out of Silicon Valley. We engineer business outcomes for Fortune companies and digital natives in the technology, healthcare, insurance, travel, telecom, and retail & CPG industries using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. We accelerate experience-led transformation in the delivery of digital platforms. Infogain is also a Microsoft (NASDAQ: MSFT) Gold Partner and Azure Expert Managed Services Provider (MSP).

Infogain, an Apax Funds portfolio company, has offices in California, Washington, Texas, the UK, the UAE, and Singapore, with delivery centers in Seattle, Houston, Austin, Kraków, Noida, Gurgaon, Mumbai, Pune, and Bengaluru.


  • AWS Data Engineer

    3 weeks ago


    Gurugram, India Deloitte Full time

    As an AWS Data Engineer Consultant at Deloitte, you will be a key player in designing, implementing, and optimizing data solutions on the Amazon Web Services (AWS) cloud platform. This role involves working closely with clients from various industries, understanding their data needs, and delivering robust data engineering solutions. Your role will involve:...

  • AWS Data Engineer

    4 weeks ago


    Gurugram, India Deloitte Full time

    As an AWS Data Engineer Consultant at Deloitte, you will be a key player in designing, implementing, and optimizing data solutions on the Amazon Web Services (AWS) cloud platform. This role involves working closely with clients from various industries, understanding their data needs, and delivering robust data engineering solutions. Your role will...

  • AWS Data Engineer

    3 weeks ago


    Gurugram, India Deloitte Full time

    As an AWS Data Engineer Consultant at Deloitte, you will be a key player in designing, implementing, and optimizing data solutions on the Amazon Web Services (AWS) cloud platform. This role involves working closely with clients from various industries, understanding their data needs, and delivering robust data engineering solutions. Your role will...

  • AWS Data Engineer

    4 weeks ago


    Gurugram, India Deloitte Full time

    As an AWS Data Engineer Consultant at Deloitte, you will be a key player in designing, implementing, and optimizing data solutions on the Amazon Web Services (AWS) cloud platform. This role involves working closely with clients from various industries, understanding their data needs, and delivering robust data engineering solutions. Your role will...

  • AWS Data Engineer

    3 weeks ago


    Gurugram, India Deloitte Full time

    As an AWS Data Engineer Consultant at Deloitte, you will be a key player in designing, implementing, and optimizing data solutions on the Amazon Web Services (AWS) cloud platform. This role involves working closely with clients from various industries, understanding their data needs, and delivering robust data engineering solutions. Your role will...

  • Lead Data Engineer

    2 weeks ago


    Gurugram, India Pathways Consultant Full time

    Lead Data EngineerLocation : Pune, Gurgaon, Noida, BangaloreExperience : 7 -10 yearsNotice : Early joiners required (0 - 10 Days)Salary : 20 - 25 LPAShift : 12:00 PM-9:00 PM/ 12:30 PM-9:30 PMResponsibilities :- Should possess hands-on experience in working on some of the relational and non-relational databases (Oracle/SQL Server/DB2/PostgreSQL/MySQL/Golden...

  • Lead Data Engineer

    2 weeks ago


    gurugram, India Pathways Consultant Full time

    Lead Data EngineerLocation : Pune, Gurgaon, Noida, BangaloreExperience : 7 -10 yearsNotice : Early joiners required (0 - 10 Days)Salary : 20 - 25 LPAShift : 12:00 PM-9:00 PM/ 12:30 PM-9:30 PMResponsibilities :- Should possess hands-on experience in working on some of the relational and non-relational databases (Oracle/SQL Server/DB2/PostgreSQL/MySQL/Golden...

  • AWS Data Engineer

    1 month ago


    Gurugram, India Incedo Inc. Full time

    As a Technical Lead - at Incedo, you will be responsible for designing, deploying and maintaining cloud-based data platforms on the AWS platform. You will work with data scientists and business analysts to understand business requirements and design scalable, reliable and cost-effective solutions that meet those requirements.Roles & Responsibilities:•...

  • AWS Data Engineer

    1 month ago


    gurugram, India Incedo Inc. Full time

    As a Technical Lead - at Incedo, you will be responsible for designing, deploying and maintaining cloud-based data platforms on the AWS platform. You will work with data scientists and business analysts to understand business requirements and design scalable, reliable and cost-effective solutions that meet those requirements. Roles & Responsibilities: •...

  • AWS Data Engineer

    1 month ago


    Gurugram, India Incedo Inc. Full time

    As a Technical Lead - at Incedo, you will be responsible for designing, deploying and maintaining cloud-based data platforms on the AWS platform. You will work with data scientists and business analysts to understand business requirements and design scalable, reliable and cost-effective solutions that meet those requirements.Roles & Responsibilities:•...


  • gurugram, India HuQuo Consulting Pvt. Ltd. Full time

    Job Title: Lead Data EngineerJob SummaryLooking for a versatile professional to join our team in the role of Hybrid Data Engineer and Database Administrator. This position requires a blend of expertise in Data Engineering and Database Administration, with a strong foundation in Oracle DBA roles and a demonstrated transition into data engineering. The ideal...


  • Gurugram, India HuQuo Consulting Pvt. Ltd. Full time

    Job Title: Lead Data EngineerJob SummaryLooking for a versatile professional to join our team in the role of Hybrid Data Engineer and Database Administrator. This position requires a blend of expertise in Data Engineering and Database Administration, with a strong foundation in Oracle DBA roles and a demonstrated transition into data engineering. The ideal...

  • AWS Data Engineer

    4 weeks ago


    gurugram, India HuQuo Consulting Pvt. Ltd. Full time

    - Solid experience with AWS services such as Cloud Formation, S3, Athena, Glue, Glue Data Brew, EMR/Spark, RDS, Redshift, Data Sync, DMS, DynamoDB, Lambda, Step Functions, IAM, KMS, SM, Event Bridge, EC2, SQS, SNS, Lake Formation, Cloud Watch, Cloud Trail- Programming experience with Python, Shell scripting, and SQL- Responsible for building, test, QA & UAT...

  • AWS Data Engineer

    6 days ago


    gurugram, India HuQuo Consulting Pvt. Ltd. Full time

    - Solid experience with AWS services such as Cloud Formation, S3, Athena, Glue, Glue Data Brew, EMR/Spark, RDS, Redshift, Data Sync, DMS, DynamoDB, Lambda, Step Functions, IAM, KMS, SM, Event Bridge, EC2, SQS, SNS, Lake Formation, Cloud Watch, Cloud Trail- Programming experience with Python, Shell scripting, and SQL- Responsible for building, test, QA & UAT...

  • AWS Data Engineer

    2 weeks ago


    gurugram, India HuQuo Consulting Pvt. Ltd. Full time

    Position - AWS Data EngineerLocation - Pune/ Gurgaon/Hyderabad HybridExperience - 5+Job Description : - 5+ years of experience as a Data Engineer on the AWS Stack- AWS Solutions Architect or AWS Developer Certification required- Solid experience with AWS services such as Cloud Formation, S3, Athena, Glue, Glue Data Brew, EMR/Spark, RDS, Redshift, Data Sync,...

  • AWS Data Engineer

    2 months ago


    Gurugram, India True Tech Professionals Full time

    We are seeking a skilled Senior Data Engineer to join our dynamic team.- The ideal candidate should have a strong background in data engineering with expertise in Python, Pyspark, AWS Glue, Athena, Redshift, and SQL.- The successful candidate will play a key role in designing, developing, and maintaining our data infrastructure, ensuring optimal performance...

  • AWS Data Engineer

    3 weeks ago


    Gurugram, India True Tech Professionals Full time

    We are seeking a skilled Senior Data Engineer to join our dynamic team.- The ideal candidate should have a strong background in data engineering with expertise in Python, Pyspark, AWS Glue, Athena, Redshift, and SQL.- The successful candidate will play a key role in designing, developing, and maintaining our data infrastructure, ensuring optimal performance...


  • Gurugram, India AWS India - Telangana Full time

    Sales, Marketing and Global Services (SMGS)AWS Sales, Marketing, and Global Services (SMGS) is responsible for driving revenue, adoption, and growth from the largest and fastest growing small- and mid-market accounts to enterprise-level customers including public sector.At Amazon Web Services (AWS) India, we are changing the future of IT. Customer Solutions...


  • gurugram, India AWS India - Telangana Full time

    Sales, Marketing and Global Services (SMGS)AWS Sales, Marketing, and Global Services (SMGS) is responsible for driving revenue, adoption, and growth from the largest and fastest growing small- and mid-market accounts to enterprise-level customers including public sector.At Amazon Web Services (AWS) India, we are changing the future of IT. Customer Solutions...


  • Gurugram, India AWS India - Telangana Full time

    Sales, Marketing and Global Services (SMGS)AWS Sales, Marketing, and Global Services (SMGS) is responsible for driving revenue, adoption, and growth from the largest and fastest growing small- and mid-market accounts to enterprise-level customers including public sector.At Amazon Web Services (AWS) India, we are changing the future of IT. Customer Solutions...