Pyspark Developer Lead

3 days ago


Mumbai, Maharashtra, India Hybrowlabs Technologies Full time ₹ 8,00,000 - ₹ 24,00,000 per year

Company Description

Hybrowlabs Technologies is dedicated to building software better, and faster. We explore every tool that hits the market to find the best stack of tools for software development. Our magical formula and stack of tools will accelerate your software development process. Contact us to learn more.

Role Description

  • Design and architect scalable, robust big data solutions using PySpark and related technologies
  • Lead the technical vision for data processing pipelines and analytics platforms
  • Create comprehensive solution architectures aligned with business requirements and technical constraints
  • Design integration patterns for connecting various data sources, APIs, and downstream systems
  • Establish and enforce coding standards, best practices, and design patterns across the team
  • Conduct architecture reviews and provide technical guidance on complex implementation challenges

Hands-On Development

  • Develop high-performance PySpark applications for large-scale data processing and transformation
  • Optimize existing PySpark jobs for performance, cost-efficiency, and scalability
  • Write efficient, maintainable, and well-documented code that serves as a reference for the team
  • Troubleshoot and resolve complex technical issues in production environments
  • Implement data quality frameworks and validation mechanisms
  • Build reusable components and libraries to accelerate development

Team Leadership & Mentorship

  • Provide technical mentorship and guidance to junior and mid-level developers
  • Conduct code reviews ensuring quality, performance, and adherence to standards
  • Foster a collaborative environment that encourages knowledge sharing and innovation
  • Lead technical discussions and facilitate problem-solving sessions
  • Guide the team in adopting new technologies and methodologies

Solution Design & Delivery

  • Collaborate with business analysts and stakeholders to translate requirements into technical solutions
  • Create detailed technical specifications and design documents
  • Estimate effort, identify risks, and plan technical deliverables
  • Drive proof-of-concepts (POCs) for evaluating new technologies and approaches
  • Ensure timely delivery of high-quality solutions meeting functional and non-functional requirements

Integration & Collaboration

  • Design and implement integration solutions with various data platforms (databases, data lakes, cloud storage)
  • Work closely with DevOps teams to establish CI/CD pipelines for data applications
  • Collaborate with data engineers, data scientists, and analytics teams to build end-to-end solutions
  • Interface with enterprise architects to ensure alignment with organizational standards

Required Technical SkillsCore Expertise

  • PySpark:
    2–3+ years of hands-on experience building production-grade applications
  • Python:
    Strong programming skills with deep understanding of Python ecosystems and libraries
  • Apache Spark:
    Comprehensive knowledge of Spark architecture, internals, and optimization techniques
  • Big Data Technologies:
    Experience with Hadoop ecosystem, HDFS, Hive, or similar platforms

Data Processing & Engineering

  • Expertise in designing and implementing ETL/ELT pipelines at scale
  • Strong SQL skills and experience with both relational and NoSQL databases
  • Proficiency in data modeling, schema design, and data warehouse concepts
  • Experience with data partitioning, bucketing, and optimization strategies
  • Knowledge of data quality frameworks and testing methodologies

Cloud & Infrastructure

  • Experience with cloud platforms (AWS, Azure, or GCP) and their big data services
  • Familiarity with distributed computing concepts and cluster management
  • Understanding of containerization (Docker) and orchestration (Kubernetes) is a plus
  • Knowledge of cloud-native data services (S3, Azure Data Lake, BigQuery, etc.)

Architecture & Design

  • Proven track record in designing scalable, resilient data architectures
  • Experience with microservices architecture and API design
  • Understanding of data governance, security, and compliance requirements
  • Familiarity with streaming technologies (Kafka, Spark Streaming) is advantageous

Tools & Frameworks

  • Version control systems (Git, Bitbucket, GitHub)
  • CI/CD tools (Jenkins, GitLab CI, Azure DevOps)
  • Workflow orchestration tools (Airflow, Databricks workflows)
  • Monitoring and logging tools (ELK stack, Splunk, CloudWatch)

Required QualificationsExperience

  • Total IT Experience:
    6+ years in software development and data engineering roles
  • PySpark Experience:
    Minimum 2–3 years of dedicated PySpark development
  • Leadership Experience:
    Demonstrated experience leading technical teams or projects
  • Solution Design:
    Proven experience in end-to-end solution design and architecture

Education

  • Bachelor's or Master's degree in Computer Science, Information Technology, Engineering, or related field
  • Relevant certifications (Databricks, AWS/Azure/GCP, or Spark certifications) are highly desirable

Desired Skills & AttributesTechnical

  • Experience with real-time/streaming data processing
  • Knowledge of machine learning pipelines and MLOps
  • Familiarity with modern data platforms (Databricks, Snowflake, Delta Lake)
  • Understanding of data mesh or data fabric architectures
  • Experience with infrastructure as code (Terraform, CloudFormation)

Soft Skills

  • Leadership:
    Ability to inspire and guide technical teams toward excellence
  • Communication:
    Excellent verbal and written communication skills for technical and non-technical audiences
  • Problem-Solving:
    Strong analytical thinking and creative problem-solving abilities
  • Collaboration:
    Proven ability to work effectively across multiple teams and stakeholders
  • Adaptability:
    Comfortable working in fast-paced, evolving environments
  • Ownership:
    Takes accountability for technical decisions and project outcomes

What You'll Work On

  • Designing next-generation data platforms and analytics solutions
  • Building scalable data pipelines processing terabytes of data daily
  • Architecting integrations across diverse enterprise systems
  • Optimizing existing systems for performance and cost-efficiency
  • Implementing best practices for data quality, governance, and security
  • Mentoring team members and elevating overall technical capabilities
  • Driving innovation through POCs and adoption of emerging technologies

Work Arrangement

  • Primary:
    Mumbai n Bangalore Locations with partial remote flexibility
  • Office Visits:
    Periodic visits to office for team collaboration, planning sessions, and stakeholder meetings (frequency to be determined based on project needs)
  • Flexibility:
    Results-oriented culture with focus on delivery and collaboration

  • Pyspark Developer

    2 weeks ago


    Mumbai, Maharashtra, India Artech Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Role & responsibilitiesYou will lead the effort to design, build, and configure applications, acting as the primary point of contact. Your typical day will involve collaborating with various teams to ensure that project goals are met, facilitating discussions to address challenges, and guiding your team through the development process. You will also be...


  • Mumbai, Maharashtra, India BNP Paribas Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Position Purpose The Senior Developer will be a part of the ISPL Mumbai IHC ETL projects team. The developer position will primarily work on Apache Spark(python), Spark SQL, ETL tools, Unix, Autosys and DBResponsibilitiesDirect Responsibilities Expertise on PySpark, database migration, transformation, and integration solutions for any Data warehousing...

  • Lead Data Analyst

    3 days ago


    Mumbai, Maharashtra, India Coffeee Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Role- Lead Data AnalystExperience- 8+ yearsLocation- Mumbai (onsite)Notice Period- 30 daysJob Responsibilities Gather, analyze, and interpret large datasets to deliver actionable businessinsights. Write efficientSQL and PySpark queriesto extract, transform, and validatedata. Work withUnity Catalog in Databricksfor secure data management...

  • Lead Data Analyst

    3 days ago


    Navi Mumbai, Maharashtra, India PineQ Lab Technology Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Job PurposeWe are seeking an experienced Data Analyst Lead to drive our reporting and analytics function. The ideal candidate will have a strong background in data analysis, SQL/PySpark, Power BI, and Databricks, with proven leadership skills to guide a team of analysts and work closely with business stakeholders to deliver actionable insights.Key Skills8+...

  • python developer

    5 days ago


    Mumbai, Maharashtra, India Capgemini Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Your Role  Python Developer As a Python developer you must have 2+ years in Python / Pyspark.- Strong programming experience, Python, Pyspark, Scala is preferred.- Experience in designing and implementing CI/CD, Build Management, and Development strategy.- Experience with SQL and SQL Analytical functions, experience participating in key business,...


  • Mumbai, Maharashtra, India, Maharashtra Tata Consultancy Services Full time

    Experience: 6-8yearsLocation: MumbaiJob description of Digital : PySparkExpertise and experience in Python and Pyspark (at least 4 years of experience) Experience with BI tools, SQL queries, and organizing and analyzing data Should have working knowledge and hands-on experience on the following Palantir Tools - Data Connection, Ontology, Code Workbook/Code...

  • Lead Data Analyst

    2 weeks ago


    Mumbai, Maharashtra, India, Maharashtra Coffeee.io Full time

    Role- Lead Data Analyst Experience- 8+ yearsLocation- Mumbai (onsite)Notice Period- 30 daysJob Responsibilities Gather, analyze, and interpret large datasets to deliver actionable businessinsights. Write efficient SQL and PySpark queries to extract, transform, and validatedata. Work with Unity Catalog in Databricks for secure data management...


  • Mumbai, Maharashtra, India Parth Developer Full time ₹ 55,000 - ₹ 12,00,000 per year

    Job Summary:We are seeking a seasoned and proactive Officer (Legal & Liaising) to lead our legal and regulatory functions. The role involves managing legal documentation, ensuring regulatory compliance, liaising with government authorities, and offering strategic legal counsel. You will play a critical role in ensuring seamless project approvals and legal...

  • python developer

    7 days ago


    Mumbai, Maharashtra, India Capgemini Engineering Full time ₹ 40,00,000 - ₹ 1,20,00,000 per year

    At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world's most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and...

  • Area Manager

    3 days ago


    Mumbai, Maharashtra, India Lead School Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    DepartmentLearning SystemsJob posted onSep 26, 2025Employee TypeProbationerExperience range (Years)3 years - 6 yearsABOUT THE ROLEThe Area Manager – Expansion plays a critical role in the growth marketing function of the Expansion team. This role is responsible for driving new customer acquisition by developing and executing data-driven marketing and sales...