
SPARK Data Onboarding Engineer
4 days ago
We are seeking a skilled PySpark Data Engineer to join our team and drive the development of robust data processing and transformation solutions within our data platform. You will be responsible for designing, implementing, and maintaining PySpark-based applications to handle complex data processing tasks, ensure data quality, and integrate with diverse data sources. The ideal candidate possesses strong PySpark development skills, experience with big data technologies, and the ability to work in a fast-paced, data-driven environment.
Key Responsibilities:Data Engineering Development:
- Design, develop, and test PySpark-based applications to process, transform, and analyze large-scale datasets from various sources, including relational databases, NoSQL databases, batch files, and real-time data streams.
- Implement efficient data transformation and aggregation using PySpark and relevant big data frameworks.
- Develop robust error handling and exception management mechanisms to ensure data integrity and system resilience within Spark jobs.
- Optimize PySpark jobs for performance, including partitioning, caching, and tuning of Spark configurations.
Data Analysis and Transformation:
- Collaborate with data analysts, data scientists, and data architects to understand data processing requirements and deliver high-quality data solutions.
- Analyze and interpret data structures, formats, and relationships to implement effective data transformations using PySpark.
- Work with distributed datasets in Spark, ensuring optimal performance for large-scale data processing and analytics.
Data Integration and ETL:
- Design and implement ETL (Extract, Transform, Load) processes to ingest and integrate data from various sources, ensuring consistency, accuracy, and performance.
- Integrate PySpark applications with data sources such as SQL databases, NoSQL databases, data lakes, and streaming platforms
Qualifications and Skills:
- Bachelors degreein Computer Science, Information Technology, or a related field.
- 5+ yearsof hands-on experience in big data development, preferably with exposure to data-intensive applications.
- Strong understanding ofdata processing principles, techniques, and best practices in a big data environment.
- Proficiency in PySpark, Apache Spark, and related big data technologiesfor data processing, analysis, and integration.
- Experience withETL developmentand data pipeline orchestration tools (e.g., Apache Airflow, Luigi).
- Strong analytical and problem-solving skills, with the ability to translate business requirements into technical solutions.
- Excellent communication and collaboration skills to work effectively with data analysts, data architects, and other team members.
Role:Data Science & Machine Learning - Other
Industry Type:IT Services & Consulting
Department:Data Science & Analytics
Employment Type:Full Time, Permanent
Role Category:Data Science & Machine Learning
Education
UG:Any Graduate
PG:Any Postgraduate
-
Pune, Maharashtra, India HSBC Full timeJob DescriptionJob descriptionSome careers shine brighter than others.If you're looking for a career that will help you stand out, join HSBC and fulfil your potential. Whether you want a career that could take you to the top, or simply take you in an exciting new direction, HSBC offers opportunities, support and rewards that will take you further.HSBC is one...
-
Data Onboarding Expert
3 days ago
Pune, Maharashtra, India beBeeSplunk Full time ₹ 20,00,000 - ₹ 25,00,000Splunk Data SpecialistWe are seeking a highly skilled Splunk data specialist to join our dynamic team. The ideal candidate will have a strong background in data onboarding, troubleshooting, and system administration.The selected candidate will be responsible for integrating data streams, feeds from network, infrastructure services, and mission-critical...
-
Senior Big Data Engineer
2 days ago
Pune, Maharashtra, India beBeeDataEngineer Full time ₹ 20,00,000 - ₹ 25,00,000Data Engineer Job OpportunityWe are seeking an experienced Data Engineer to join our team. The ideal candidate will have a strong background in data engineering, with experience working with big data technologies such as PySpark.This is a great opportunity for someone who wants to work on complex business logic and build scalable data pipelines. If you have...
-
Big Data Engineer
2 weeks ago
Pune, Maharashtra, India LION AND ELEPHANTS CONSULTANCY PRIVATE LIMITED Full timeJob Description : We are looking for skilled Big Data Engineers using Java Spark with 5-10 years of experience in Big Data / legacy platforms, who can join immediately. Desired candidates should have design, development and optimization of real-time & batch data pipelines experience in a Big Data environment at an enterprise scale application. You will work...
-
Data Engineer
3 weeks ago
Pune, Maharashtra, India Ikrux Solutions (Opc) Private Limited Full timeJob DescriptionJob Category:Data EngineerHadooppythonSparkJob Type:Full TimeJob Location:PuneWe are seeking a highly skilled and motivated Data Engineer with over 5 years of experience in big data technologies, specializing in Spark, Python, and Hadoop. The ideal candidate will have a strong track record of designing, building, and maintaining scalable data...
-
Senior Data Engineer
2 days ago
Pune, Maharashtra, India beBeeDataEngineer Full time ₹ 20,00,000 - ₹ 25,00,000Job Title: Senior Data EngineerAbout the RoleWe are seeking a highly skilled Senior Data Engineer to join our team. The successful candidate will be responsible for designing, developing, and maintaining large-scale data systems.Key ResponsibilitiesDesign and implement efficient ETL pipelines using Apache Spark or AWS GlueDevelop business use cases using...
-
Senior Data Engineer
6 days ago
Pune, Maharashtra, India SPIRO Full timePosition: Senior Data EngineerLocation: PuneExperience: 5+ yearsJob Summary:We are seeking an experienced Senior Data Engineer with a strong background in PySpark, Spark, and Big Data technologies. The ideal candidate will have over five years of hands-on experience designing, building, and optimizing large-scale data processing systems and pipelines.Key...
-
Article Assistant-Internal Audit
3 weeks ago
Pune, Maharashtra, India SPARK & ALLIANCE Full timeCompany Description SPARK & Alliance (SPARK) is a network approved by the Institute of Chartered Accountants of India, with a presence in 20 cities across India. With a team of over 45 Chartered Accountants and 350 other professional staff, SPARK is known for its ethical standards and timely execution. Our team provides specialized professional services...
-
Data Engineer
4 weeks ago
Pune, Maharashtra, India Impetus Full timeOpen Location - Indore, Noida, Gurgaon, Bangalore, Hyderabad, Pune Job Description 3-8 years' experience working on Data engineering & ETL/ELT processes, data warehousing, and data lake implementation with AWS services Hands on experience in designing and implementing solutions like creating/deploying jobs, Orchestrating the job/pipeline and infrastructure...
-
Data Engineer
2 weeks ago
Pune, Maharashtra, India Impetus Full timeOpen Location - Indore, Noida, Gurgaon, Bangalore, Hyderabad, PuneJob Description3-8 years' experience working on Data engineering & ETL/ELT processes, data warehousing, and data lake implementation with AWS servicesHands on experience in designing and implementing solutions like creating/deploying jobs, Orchestrating the job/pipeline and infrastructure...