AWS Data Engineer- Sagemaker

5 hours ago


Bangalore, India YASH Technologies Full time
AWS services including Glue, Pyspark, SQL, Databricks, Python Secondary skillset : Any ETL Tool, Github, DevOPs(CI-CD)

Python, PySpark , SQL, AWS with Designing, developing, testing and supporting data pipelines and applications
~ Strong understanding and hands-on experience with AWS services like EC2, S3, EMR, Glue, Redshift
~ Strong in developing and maintaining applications using Python and PySpark for data manipulation, transformation, and analysis
~ Design and implement robust ETL pipelines using PySpark, focusing on performance, scalability, and data quality
~ Lead and manage projects, including planning, execution, testing, and documentation and handling customer interacation as key point of contact
~ Translate business requirements into technical solutions using AWS cloud services and Python/PySpark
~ Deep understanding of Python and its data science libraries, along with PySpark for distributed data processing
~ Proficiency in PL/SQL, T-SQL for data querying, manipulation, and database interactions
~ Experience leading and mentoring teams in a technical environment and providing proposals on solutioning and designed based approach
~5+ years working experience in data integration and pipeline development.
~5+ years of Experience with AWS Cloud on data integration with a mix of Apache Spark, Glue, Kafka, Kinesis, and Lambda in S3 Redshift, RDS, MongoDB/DynamoDB ecosystems Databricks, Redshift experience is a major plus
~3+ years of experience using SQL in related development of data warehouse projects/applications (Oracle & amp; SQL Server)
~ Strong real-life experience in python development especially in PySpark in AWS Cloud environment
~ Strong SQL and NoSQL databases like MySQL, Postgres, DynamoDB, Elasticsearch Workflow management tools like Airflow

Good to Have : Snowflake, Palantir Foundry

  • Bangalore, India YASH Technologies Full time

    Primary skillsets : AWS Sagemaker, Power BI & Python Secondary skillset : Any ETL Tool, Github, Dev OPs(CI-CD) Mandatory Skill Set: Python, Py Spark , SQL, AWS with Designing, developing, testing and supporting data pipelines and applications Strong understanding and hands-on experience with AWS services like EC2, S3, EMR, Glue, Redshift Strong in...


  • Bangalore, India YASH Technologies Full time

    Primary skillsets : AWS Sagemaker, Power BI & Python Secondary skillset : Any ETL Tool, Github, DevOPs(CI-CD) Mandatory Skill Set: Python, PySpark , SQL, AWS with Designing, developing, testing and supporting data pipelines and applications Strong understanding and hands-on experience with AWS services like EC2, S3, EMR, Glue, Redshift Strong...


  • bangalore, India YASH Technologies Full time

    Primary skillsets : AWS Sagemaker, Power BI & PythonSecondary skillset : Any ETL Tool, Github, DevOPs(CI-CD)Mandatory Skill Set:Python, PySpark , SQL, AWS with Designing, developing, testing and supporting data pipelines and applicationsStrong understanding and hands-on experience with AWS services like EC2, S3, EMR, Glue, RedshiftStrong in developing and...


  • bangalore district, India YASH Technologies Full time

    Primary skillsets : AWS Sagemaker, Power BI & Python Secondary skillset : Any ETL Tool, Github, DevOPs(CI-CD) Mandatory Skill Set: Python, PySpark , SQL, AWS with Designing, developing, testing and supporting data pipelines and applications Strong understanding and hands-on experience with AWS services like EC2, S3, EMR, Glue, Redshift Strong in...


  • bangalore, India Digitrix Software LLP Full time

    Experience: 5 to 8 yearsJob description: Python AWS Data EngineerPython, AWS Python (core language skill) -- Backend, Pandas, PySpark (DataFrame API), interacting with AWS (e.g., boto3 for S3, Glue, Lambda)Data Processing: Spark (PySpark), Glue, EMR AWS Core Services: S3, Glue, Athena, Lambda, Step Functions, EMRContainerization: DockerOrchestration:...


  • bangalore, India Digitrix Software LLP Full time

    Experience : 5 to 8 years Job description: Python AWS Data Engineer Python, AWS Python (core language skill) -- Backend, Pandas, PySpark (DataFrame API), interacting with AWS (e.g., boto3 for S3, Glue, Lambda) Data Processing: Spark (PySpark), Glue, EMR AWS Core Services: S3, Glue, Athena, Lambda, Step Functions, EMR Containerization: Docker ...

  • Data Engineer

    2 weeks ago


    Bangalore, India Mphasis Full time

    Responsibilities Automate data quality checks and validation processes using SQL, Python, and data testing frameworks. Perform reconciliation, integrity, and transformation testing across data platforms. Work with AWS SageMaker Studio (Unified Studio) for validating ML/data workflows and integrations. Validate data flows on cloud platforms (AWS,...


  • Bangalore, India YASH Technologies Full time

    Primary skillsets : AWS services including Glue, Pyspark, SQL, Databricks, Python Secondary skillset : Any ETL Tool, Github, Dev OPs(CI-CD) Mandatory Skill Set: Python, Py Spark , SQL, AWS with Designing, developing, testing and supporting data pipelines and applications Strong understanding and hands-on experience with AWS services like EC2, S3,...

  • AWS Data Engineer

    6 hours ago


    bangalore, India Tata Consultancy Services Full time

    Experience: 5-10 Yrs Location - Bangalore,Chennai,Hyderabad,Pune,Kochi,Bhubaneshawar,Kolkata Key Skills AWS Lambda, Python, Boto3 ,Pyspark, Glue Must have Skills Strong experience in Python to package, deploy and monitor data science apps Knowledge in Python based automation Knowledge of Boto3 and related Python packages Working experience in AWS...

  • Data Engineer

    3 days ago


    bangalore, India Mphasis Full time

    Responsibilities Automate data quality checks and validation processes using SQL, Python, and data testing frameworks. Perform reconciliation, integrity, and transformation testing across data platforms. Work with AWS SageMaker Studio (Unified Studio) for validating ML/data workflows and integrations. Validate data flows on cloud platforms (AWS, Azure)....