PySpark & ETL Data Engineer

3 hours ago

Hyderabad, Telangana, India CirrusLabs Full time

We are
CirrusLabs
. Our vision is to become the world's most sought-after niche digital transformation company that helps customers realize value through innovation. Our mission is to co-create success with our customers, partners and community. Our goal is to enable employees to dream, grow and make things happen. We are committed to excellence. We are a dependable partner organization that delivers on commitments. We strive to maintain integrity with our employees and customers. Every action we take is driven by value. The core of who we are is through our well-knit teams and employees. You are the core of a values driven organization.

You have an entrepreneurial spirit. You enjoy working as a part of well-knit teams. You value the team over the individual. You welcome diversity at work and within the greater community. You aren't afraid to take risks. You appreciate a growth path with your leadership team that journeys how you can grow inside and outside of the organization. You thrive upon continuing education programs that your company sponsors to strengthen your skills and for you to become a thought leader ahead of the industry curve.

You are excited about creating change because your skills can help the greater good of every customer, industry and community. We are hiring a talented
Pyspark
to join our team. If you're excited to be part of a winning team, CirrusLabs (

) is a great place to grow your career.

Experience - 4-8 years

Location - Hyderabad/ Bengaluru

About the Role

CirrusLabs is seeking a skilled and experienced
PySpark Data Engineer (ETL Lead)
to join our growing data engineering team. As an ETL Lead, you will play a pivotal role in designing, developing, and maintaining robust data integration pipelines using PySpark and related technologies. You'll work closely with data architects, analysts, and stakeholders to transform raw data into high-quality, actionable insights, enabling data-driven decision-making across the organization.

This is an exciting opportunity for someone who is not only technically strong in PySpark and Python but also capable of leading data integration efforts for complex projects.

Key Responsibilities

Lead Data Integration Projects:
Manage the data integration and ETL activities for enterprise-level data projects.
Gather requirements from stakeholders and translate them into technical solutions.
Develop PySpark Pipelines:
Design and develop scalable and efficient
PySpark scripts
, both generic frameworks and custom solutions tailored to specific project requirements.
Implement end-to-end ETL processes to ingest, clean, transform, and load data.
Schedule and Automate ETL Processes:
Create scheduling processes to manage and run PySpark jobs reliably and efficiently.
Integrate ETL workflows into automation tools and CI/CD pipelines.
Optimize Data Processing:
Optimize PySpark jobs for performance and resource efficiency.
Monitor, troubleshoot, and resolve issues related to data processing and pipeline execution.
Data Transformation and Curation:
Transform raw data into consumable, curated data models suitable for reporting and analytics.
Ensure data quality, consistency, and reliability throughout all stages of the ETL process.
Collaboration and Best Practices:
Collaborate with data architects, analysts, and business stakeholders to define requirements and deliver solutions.
Contribute to the evolution of data engineering practices, frameworks, and standards.
Provide guidance and mentorship to junior engineers on PySpark and ETL best practices.
Documentation:
Develop and maintain technical documentation related to ETL processes, data flows, and solutions.

Required Skills and Qualifications

Experience:
5–8 years of professional experience in data engineering, ETL development, or related fields.
Proven experience leading data integration projects from design to deployment.
Technical Skills:
Strong hands-on experience with
PySpark
for building large-scale data pipelines.
Proficiency in
Python
, including writing efficient, reusable, and modular code.
Solid knowledge of
SQL
for data extraction, transformation, and analysis.
Strong understanding of Spark architecture, including execution plans, partitions, memory management, and optimization techniques.
Data Engineering Expertise:
Experience working on
data integration projects
, such as data warehousing, data lakes, or analytics solutions.
Familiarity with processing structured and semi-structured data formats (e.g., Parquet, Avro, JSON, CSV).
Ability to transform and harmonize data from raw to curated layers.

Additional Skills:

Familiarity with data pipeline orchestration tools (e.g., Airflow, Azkaban) is a plus.
Experience with cloud platforms (e.g., AWS, Azure, GCP) is desirable.
Strong analytical and problem-solving skills.
Excellent communication and collaboration skills.

Databricks + Pyspark

2 weeks ago

Hyderabad, Telangana, India Cognizant Full time ₹ 1,00,00,000 - ₹ 3,00,00,000 per year

Skills- Databricks+ PysparkExperience: 4 to 13 yearsLocation: AIA-PuneWe are looking for a highly skilled Data Engineer with expertise in PySpark and Databricks to design, build, and optimize scalable data pipelines for processing massive datasets.Key Responsibilities:Build & Optimize Pipelines: Develop high-throughput ETL workflows using PySpark on...
Senior Pyspark Data Engineer

7 days ago

Hyderabad, Telangana, India DATAECONOMY Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Job Title: PySpark Data EngineerExperience: 8 YearsLocation: HyderabadEmployment Type: Full-Time Job Summary: We are looking for a skilled and experienced PySpark Data Engineer to join our growing data engineering team. The ideal candidate will have 8 years of experience in designing and implementing data pipelines using PySpark, AWS Glue, and Apache...
Senior Pyspark Data Engineer

1 week ago

Hyderabad, Telangana, India DataEconomy Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Job InformationDate Opened10/13/2025Job TypeFull timeIndustryIT ServicesCityHyderabadState/ProvinceTelanganaCountryIndiaZip/Postal Code500081About UsAbout DATAECONOMY: We are a fast-growing data & analytics company headquartered in Dublin with offices inDublin, OH, Providence, RI, and an advanced technology center in Hyderabad,India. We are clearly...
Date Engineer(Python, Pyspark and AWS)

1 week ago

Hyderabad, Telangana, India Zorba AI Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Key Skills Required-Minimum 6 years of working experience in Python, Pyspark, AWS and SQL.Programming & Frameworks: Python, PySparkDatabases & Querying: Strong SQL (query optimization, joins, window functions)Big Data & Processing: PySpark for distributed data processing, ETL pipelinesCloud Platform: AWS (S3, Glue, Lambda, EMR, Redshift, Athena)Data...
Data Engineer – AWS Glue, Redshift,, PySpark

4 days ago

Hyderabad, Telangana, India Cyepro Solutions Full time ₹ 12,00,000 - ₹ 24,00,000 per year

Company Overview:Cyepro Solutions is at the forefront of innovation in customer relationship management, specializing in the automotive industry. We offer a platform that empowers dealerships, manufacturers, and service providers to enhance customer experiences and streamline operations. Headquartered in Hyderabad, Telangana, with a team of employees, Cyepro...
Data Engineer

4 days ago

Hyderabad, Telangana, India Zorba AI Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Key Skills Required-Minimum experience required 3 years in Data Engineer (Python,Pyspark, AWS, SQL)Programming & Frameworks: Python, PySparkDatabases & Querying: Strong SQL (query optimization, joins, window functions)Big Data & Processing: PySpark for distributed data processing, ETL pipelinesCloud Platform: AWS (S3, Glue, Lambda, EMR, Redshift, Athena)Data...
Etl Developer

2 weeks ago

Hyderabad, Telangana, India Mind Waveai Solutions Full time ₹ 6,00,000 - ₹ 18,00,000 per year

Job SummaryWe are seeking an experienced ETL Developer with 4+ years of expertise in building scalable data pipelines using AWS Glue, PySpark, and Python. The role involves migrating data from sources like S3 and SQL Server to PostgreSQL, ensuring high performance, data quality, and compliance within a modern AWS-based data ecosystem.Key Responsibilities:*...
Databricks Engineer

2 weeks ago

Hyderabad, Telangana, India NTT DATA Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Design, build, and maintain robust ETL/ELT data pipelines using Apache Spark on Databricks. Implement data Lakehouse architecture using Delta Lake for cost-effective data storage and analytics. Use Databricks Workflows for orchestrating batch and streaming pipelines. Develop and maintain CI/CD pipelines for data applications using tools such as Azure DevOps,...
Senior Data Engineer

2 weeks ago

Hyderabad, Telangana, India Enable Data Incorporated Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Experience Required: 8+YearsMode of work: RemoteSkills Required: Azure DataBricks, Eventhub, Kafka, Architecture, Azure Data Factory, Pyspark, Python, SQL, SparkNotice Period : Immediate Joiners/ Permanent/Contract role (Can join within September 29th 2025)Translate business rules into technical specifications and implement scalable data solutions.Manage a...
Lead Data Engineer

1 week ago

Hyderabad, Telangana, India Zorba AI Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Primary Job Title:Data Engineering LeadAbout The OpportunityWe are seeking a highly skilledLead Data Engineerwith strong expertise inPython, Pandas, PySpark, AWS, and SQLto design, build, and manage scalable data solutions. The ideal candidate will lead a team of data engineers, develop robust ETL pipelines, and collaborate with analytics, data science, and...

Americas

Europe

Asia / Oceania

Africa

PySpark & ETL Data Engineer