Python/PySpark Developer

2 weeks ago


Gurugram, India | DG Liger Consulting | Full time

We are looking for a skilled Python PySpark Developer with 3 to 4 years of experience in designing, developing, and maintaining big data solutions. The ideal candidate will have hands-on expertise in Python, PySpark, and data pipeline development, along with strong problem-solving skills and the ability to work in a collaborative environment.


Key Responsibilities:


- Develop, optimize, and maintain scalable data pipelines using PySpark and Python.


- Work with large-scale data sets for extraction, transformation, and loading (ETL).


- Collaborate with data engineers, analysts, and business stakeholders to deliver high-quality solutions.


- Tune and debug PySpark jobs to improve their performance and efficiency.


- Ensure data integrity, security, and compliance in all workflows.


- Write clean, maintainable, and reusable code with proper documentation.


- Troubleshoot issues and provide quick resolutions in production environments.


Required Skills & Qualifications:


- Bachelor's degree in Computer Science, IT, or a related field.


- 3 to 4 years of experience in Python and PySpark development.


- Strong understanding of Spark architecture and distributed computing concepts.


- Hands-on experience with ETL processes, data wrangling, and data modeling.


- Proficiency in SQL and experience working with relational databases (e.g., MySQL, PostgreSQL, Oracle).


- Familiarity with big data platforms like Hadoop, Databricks, or AWS EMR.


- Experience in performance tuning of Spark jobs.


- Good problem-solving skills and ability to work in an agile environment.


Good to Have (Preferred Skills):


- Experience with cloud platforms (AWS / Azure / GCP).


- Knowledge of Airflow or other workflow orchestration tools, and of Kafka or other streaming platforms.


- Exposure to Docker / Kubernetes for containerized deployments.


- Familiarity with CI/CD pipelines and Git version control.


(ref:hirist.tech)