
5+ YoE
4 weeks ago
Cochin, Kerala, India
UST
Full time
Candidates ready to join immediately can share their details via email for quick processing.
CCTC | ECTC | Notice Period | Location Preference
Act fast for immediate attention
5+ years if Experience
Roles and Responsibilities
- Design, develop, and maintain scalable data pipelines using Spark (PySpark or Spark with Scala).
- Build data ingestion and transformation frameworks for structured and unstructured data sources.
- Collaborate with data analysts, data scientists, and business stakeholders to understand requirements and deliver reliable data solutions.
- Work with large volumes of data and ensure quality, integrity, and consistency.
- Optimize data workflows for performance, scalability, and cost efficiency on cloud platforms (AWS, Azure, or GCP).
- Implement data quality checks and automation for ETL/ELT pipelines.
- Monitor and troubleshoot data issues in production and perform root cause analysis.
- Document technical processes, system designs, and operational procedures.
Must-Have Skills
- 3+ years of experience as a Data Engineer or similar role.
- Hands-on experience with PySpark or Spark using Scala .
- Strong knowledge of SQL for data querying and transformation.
- Experience working with any cloud platform (AWS, Azure, or GCP).
- Solid understanding of data warehousing concepts and big data architecture.
- Experience with version control systems like Git.
Good-to-Have Skills
- Experience with data orchestration tools like Apache Airflow , Databricks Workflows , or similar.
- Knowledge of Delta Lake , HDFS , or Kafka .
- Familiarity with containerization tools (Docker/Kubernetes).
- Exposure to CI/CD practices and DevOps principles.
- Understanding of data governance, security, and compliance standards.
.