Pyspark Developer

2 weeks ago


Bengaluru Chennai Pune, India Quess Corp Limited Full time ₹ 8,00,000 - ₹ 24,00,000 per year

ROLE SUMMARY

We are seeking a highly skilled PySpark Developer with hands-on experience in Databricks IT Systems Development unit in an offshore capacity. This role focuses on designing, building, and optimizing large-scale data pipelines and processing solutions on the Databricks Unified Analytics Platform. The ideal candidate will have expertise in big data frameworks, distributed computing, and cloud platforms, with a deep understanding of Databricks architecture. This is an excellent opportunity to work with cutting-edge technologies in a dynamic, fast-paced environment.

ROLE RESPONSIBILITIES

Data Engineering and Processing:

  • Develop and manage data pipelines using PySpark on Databricks.
  • Implement ETL/ELT processes to process structured and unstructured data at scale.
  • Optimize data pipelines for performance, scalability, and cost-efficiency in Databricks.

Databricks Platform Expertise:

  • Experience in Perform Design, Development & Deployment using Azure Services (Data Factory, Databricks, PySpark, SQL)
  • Develop and maintain scalable data pipelines and build new Data Source integrations to support increasing data volume and complexity.
  • Leverage the Databricks Lakehouse architecture for advanced analytics and machine learning workflows.
  • Manage Delta Lake for ACID transactions and data versioning.
  • Develop notebooks and workflows for end-to-end data solutions.

Cloud Platforms and Deployment:

  • Deploy and manage Databricks on Azure (e.g., Azure Databricks).
  • Use Databricks Jobs, Clusters, and Workflows to orchestrate data pipelines.
  • Optimize resource utilization and troubleshoot performance issues on the Databricks platform.

CI/CD and Testing:

  • Build and maintain CI/CD pipelines for Databricks workflows using tools like Azure DevOps, GitHub Actions, or Jenkins.
  • Write unit and integration tests for PySpark code using frameworks like Pytest or unittest.

Collaboration and Documentation:

  • Work closely with data scientists, data analysts, and IT teams to deliver robust data solutions.
  • Document Databricks workflows, configurations, and best practices for internal use.

TECHNICAL QUALIFICATIONS

Experience:

  • 4+ years of experience in data engineering or distributed systems development.
  • Strong programming skills in Python and PySpark.
  • Hands-on experience with Databricks and its ecosystem, including Delta Lake and Databricks SQL.
  • Knowledge of big data frameworks like Hadoop, Spark, and Kafka.

Databricks Expertise:

  • Proficiency in setting up and managing Databricks Workspaces, Clusters, and Jobs.
  • Familiarity with Databricks MLflow for machine learning workflows is a plus.

Cloud Platforms:

  • Expertise in deploying Databricks solutions Azure (e.g., Data Lake, Synapse).
  • Knowledge of Kubernetes for managing containerized workloads is advantageous.

Database Knowledge:

  • Experience with both SQL (e.g., PostgreSQL, SQL Server) and NoSQL databases (e.g., MongoDB, Cosmos DB).

GENERAL QUALIFICATIONS

  • Strong analytical and problem-solving skills.
  • Ability to manage multiple tasks in a high-intensity, deadline-driven environment.
  • Excellent communication and organizational skills.
  • Experience in regulated industries like insurance is a plus.

EDUCATION REQUIREMENTS

  • A Bachelors Degree in Computer Science, Data Engineering, or a related field is preferred.
  • Relevant certifications in Databricks, PySpark, or cloud platforms are highly desirable.Role & responsibilities

Preferred candidate profile


  • Pyspark Developer

    2 weeks ago


    Bengaluru, Chennai, Pune, India VHR Solutions Full time ₹ 5,00,000 - ₹ 12,00,000 per year

    Job Description:We are looking for a skilled PySpark Developer to join our data engineering team. The ideal candidate will have strong experience in building scalable data pipelines using Apache Spark (PySpark), integrating with big data platforms, and working with large datasets in distributed environments. You will work closely with data engineers,...

  • Pyspark Developer

    2 weeks ago


    Bengaluru, Chennai, Pune, India Tekskills Full time ₹ 12,50,000 - ₹ 25,00,000 per year

    JOB SUMMARY:Experience 6+years (relevant minimum 5+ years)Detailed job description - Skill Set:Candidate should have good hands on experience in Pyspark preferrable more than 7 years.Should have very good understanding of Agile development methodologies and excellent team handling experience.Mandatory Skills - Pyspark, Python, Agile

  • Pyspark Developer

    2 weeks ago


    Pune, Maharashtra, India Tech Mahindra Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Pyspark DeveloperRequirements:Mandatory: Primary skill: Pyspark, Data Engineering, Azure Data BricksGood Experience of Hadoop, Hive, and Cloudera/ Azure/GCP 3+ years of experience in the design and implementation of Big Data systems using PySpark, database migration, transformation and integration solutions for any Data warehousing project.Must have...

  • Pyspark developer

    4 weeks ago


    Bengaluru, Chennai, Pune, India MFX Infotech Private Limited Full time

    Job Description ROLE SUMMARY We are seeking a highly skilledPySpark Developerwith hands-on experience inDatabricksto join Sompo's IT Systems Development unit in an offshore capacity. This role focuses on designing, building, and optimizing large-scale data pipelines and processing solutions on theDatabricks Unified Analytics Platform. The ideal candidate...

  • Pyspark Developer

    3 days ago


    Chennai, Hyderabad, Pune, India Tata Consultancy Services Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    TCS is Hiring Location :Bangalore, Chennai, Kolkata, Hyderabad, PuneExp : 4-8 YrsFunctional Skills: Experience in Credit Risk/Regulatory risk domainTechnical Skills: Spark ,PySpark, Python, Hive, Scala, MapReduce, Unix shell scriptingGood to Have Skills: Exposure to Machine Learning TechniquesJob Description:4+ Years of experience with Developing/Fine tuning...

  • Pyspark Developer

    4 weeks ago


    Chennai, Tamil Nadu, India MP DOMINIC AND CO Full time

    Job Summary - Design develop and implement scalable data pipelines and streaming use cases using PySpark and Spark on a distributed computing platform - Possess strong programming skills in Spark streaming - Have familiarity with cloud platforms like GCP - Gain experience in big data technologies such as Hadoop Hive and HDFS - Perform ETL operations...

  • PySpark Developer

    4 weeks ago


    Hyderabad, Bengaluru, Chennai, India Coders Brain Technology Private Limited Full time

    Job Description ROLE RESPONSIBILITIES Data Engineering and Processing: Develop and manage data pipelines using PySpark on Databricks. Implement ETL/ELT processes to process structured and unstructured data at scale. Optimize data pipelines for performance, scalability, and cost-efficiency in Databricks. Databricks Platform Expertise: Experience in...

  • Pyspark Developer

    5 days ago


    Bengaluru, Chennai, Kolkata, India Tata Consultancy Services Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Job Description:5+ Years of experience with Developing/Fine tuning and implementing programs/applicationsUsing Python/PySpark/Scala on Big Data/Hadoop Platform.Roles and Responsibilities:a) Work with a Leading Banks Risk Management team on specific projects/requirements pertaining to risk Models inconsumer and wholesale bankingb) Enhance Machine Learning...

  • Pyspark Developer

    5 days ago


    Bengaluru, India Tata Consultancy Services Full time

    Greetings from TCS!! TCS is hiring for Pyspark Developer Required Skill Set: Pyspark, Python, SQL and relational databases, SparkSQL, Spark Scripting, UNIX Shell Scripting, ETL, Data Warehousing, CI/CD Desired Experience Range: 4 to 10 Years Job Location: Hyderabad, Bangalore, Chennai, Kolkata, Pune Must Have: Data Engineer, Python developer with specialty...

  • Pyspark Developer

    4 days ago


    Bengaluru, India Tata Consultancy Services Full time

    Greetings from TCS!! TCS is hiring for Pyspark Developer Required Skill Set: Pyspark, Python, SQL and relational databases, SparkSQL, Spark Scripting, UNIX Shell Scripting, ETL, Data Warehousing, CI/CD Desired Experience Range: 4 to 10 Years Job Location: Hyderabad, Bangalore, Chennai, Kolkata, Pune Must Have: Data Engineer, Python developer with specialty...