PySpark (Apply Now)

3 weeks ago


Mumbai, India Teamware Solutions Full time

Job Description - Key Responsibilities: - PySpark Development: - Design, implement, and optimize PySpark solutions for large-scale data processing and analysis. - Develop data pipelines using Spark to handle data transformations, aggregations, and other complex operations efficiently. - Write and optimize Spark SQL queries for big data analytics and reporting. - Handle data extraction, transformation, and loading (ETL) processes from various sources into a unified data warehouse or data lake. - Data Pipeline Design & Optimization: - Build and maintain ETL pipelines using PySpark, ensuring high scalability and performance. - Implement batch and streaming processing to handle both real-time and historical data. - Optimize the performance of PySpark applications by applying best practices and techniques such as partitioning, caching, and broadcast joins. - Data Storage & Management: - Work with large datasets and integrate them into storage solutions such as HDFS, S3, Azure Blob Storage, or Google Cloud Storage. - Ensure efficient data storage, access, and retrieval through Spark and other tools (e.g., Parquet, ORC). - Maintain data quality, consistency, and integrity throughout the pipeline lifecycle. - Cloud Platforms & Big Data Frameworks: - Deploy Spark-based applications on cloud platforms such as AWS (Amazon EMR), Azure HDInsight, or Google Dataproc. - Work with cloud-native services such as AWS Lambda, S3, Google Cloud Storage, and Azure Data Lake to handle and process big data. - Leverage cloud data processing tools and frameworks to scale and optimize the PySpark jobs. - Collaboration & Integration: - Collaborate with cross-functional teams (data scientists, analysts, product managers) to understand business requirements and develop appropriate data solutions. - Integrate data from multiple sources and platforms (e.g., databases, external APIs, flat files) into a unified system. - Provide support for downstream applications and data consumers by ensuring timely and accurate delivery of data. - Performance Tuning & Troubleshooting: - Identify bottlenecks and optimize Spark jobs to improve performance. - Conduct performance tuning of both the cluster and individual Spark jobs, leveraging Spark's in-built tools for monitoring. - Troubleshoot and resolve issues related to data processing, application failures, and cluster resource utilization. - Documentation & Reporting: - Maintain clear and comprehensive documentation of data pipelines, architectures, and processes. - Create technical documentation to guide future enhancements and troubleshooting. - Provide regular updates on the status of ongoing projects and data processing tasks. - Continuous Improvement: - Stay up to date with the latest trends, technologies, and best practices in big data processing and PySpark. - Contribute to improving development processes, testing strategies, and code quality. - Share knowledge and provide mentoring to junior team members on PySpark best practices. - Required Qualifications: - 2-4 years of professional experience working with PySpark and big data technologies. - Strong expertise in Python programming with a focus on data processing and manipulation. - Hands-on experience with Apache Spark, particularly with PySpark for distributed computing. - Proficiency in Spark SQL for data querying and transformation. - Familiarity with cloud platforms like AWS, Azure, or Google Cloud, and experience with cloud-native big data tools. - Knowledge of ETL processes and tools. - Experience with data storage technologies like HDFS, S3, or Google Cloud Storage. - Knowledge of data formats such as Parquet, ORC, Avro, or JSON. - Experience with distributed computing and cluster management. - Familiarity with Linux/Unix and command-line operations. - Strong problem-solving skills and ability to troubleshoot data processing issues.


  • Apply Now! Draftsman

    4 weeks ago


    Mumbai, India Pratham Technologies Full time

    Job Description Job Description - Model wise complete drawing attachment to ERP - To support operations in terms of design document - To complete back-office work of design on time - To complete BOM entry in ERP - To do pdf drawing of AutoCAD,2D & 3D format of drawings - To give design input to materials in soft/hard format wherever needed - To entered part...

  • React Native Engineer

    4 weeks ago


    Mumbai, India Court Now Full time

    About Court Now Court Now is a live player-matching app that helps racket-sport players find hitting partners in real time. The TestFlight is live and already being used in NYC and Atlanta for tennis and pickleball. We’re now building our Mumbai team to push the product to beta and App Store launch. We’re a small, focused team passionate about building...

  • React Native Engineer

    4 weeks ago


    Mumbai, India Court Now Full time

    About Court Now Court Now is a live player-matching app that helps racket-sport players find hitting partners in real time. The TestFlight is live and already being used in NYC and Atlanta for tennis and pickleball. We’re now building our Mumbai team to push the product to beta and App Store launch. We’re a small, focused team passionate about building...

  • React Native Engineer

    2 weeks ago


    Mumbai, Maharashtra, India Court Now Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    About Court NowCourt Now is a live player-matching app that helps racket-sport players find hitting partners in real time. The TestFlight is live and already being used inNYC and Atlantafor tennis and pickleball. We're now building ourMumbai teamto push the product to beta and App Store launch.We're a small, focused team passionate about building products...


  • Mumbai, India Acme Services Full time

    Job Description - Data Scientists with the capability to perform independent statistical and machine learning research/ projects. Individuals should be able to break down business problems into smaller components and implement ML approaches to empower the end business decisions - Strong hands on skill in Python using libraries like NLTK/Spacy, skLearn,...


  • Mumbai, India Sweat Fit Wellness Full time

    Pilates Instructor (Full-Time / Part-Time) 📍Location: On-Site – Mumbai, India About Sweat Fit Wellness Sweat Fit Wellness is a next-gen group fitness brand built on energy, expertise, and community. Through Sweat Pilates, Sweat Bootcamp, and Sweat Online, we help people sweat with purpose and stay consistent on their fitness journey. Role Overview...

  • Marketing Manager

    4 days ago


    Mumbai, Maharashtra, India Pharma Now - Empowering Pharma Leadership Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    About Pharma NowPharma Now is a fast-growing global platform at the intersection of pharma, life sciences, and technology. We are more than just a media brand—we are building a global knowledge ecosystem through news, insights, buyer guides, events, and leadership content that powers the pharmaceutical industry of tomorrow.We are now looking for aMarketing...


  • Mumbai, India All 'Bout Communication Full time

    🚀 We’re Hiring: Business Development Consultant – Full-time | Mumbai 🚀 Are you a driven, target-oriented professional with a passion for business development and marketing? We’re looking for a full-time, office-based Business Development Consultant to join our Mumbai team! What You’ll Do: - Generate new business opportunities and leads - Build...


  • Mumbai, India Sweat Fit Wellness Full time

    🚀 We're Hiring: Digital Marketing Specialist Are you a creative and results-driven digital marketer with a passion for social media and content creation? Join our dynamic team to take our online presence to the next level! 🔹 Role Overview: We're looking for a Digital Marketing Specialist who can strategize, create, and manage compelling digital...


  • Mumbai, India eClerx Full time

    Urgent Hire: SEO Specialist Eclerx Services Ltd | Pune/Mumbai/Chandigarh | Full-Time We're looking for an Immediate Joiner! If you're an SEO specialist with a proven track record in global markets, we want you now. Critical Requirements: >Proven experience in Answer Engine Optimization (AEO), Geographic SEO (GEO), and Search Experience Optimization (SXO)....