
Big Data Engineer- Spark/Pyspark
4 weeks ago
Why We Work at Dun & Bradstreet
Dun & Bradstreet unlocks the power of data through analytics, creating a better tomorrow. Each day, we are finding new ways to strengthen our award-winning culture and accelerate creativity, innovation and growth. Our 6,000+ global team members are passionate about what we do. We are dedicated to helping clients turn uncertainty into confidence, risk into opportunity and potential into prosperity. Bold and diverse thinkers are always welcome. Come join us Learn more at dnb.com/careers.
About Us:
Our global community of colleagues bring a diverse range of experiences and perspectives to our work. You&aposll find us working from a corporate office or plugging in from a home desk, listening to our customers and collaborating on solutions. Our products and solutions are vital to businesses of every size, scope and industry. And at the heart of our work youll find our core values: to be data inspired, relentlessly curious and inherently generous. Our values are the constant touchstone of our community; they guide our behavior and anchor our decisions.
Designation: Senior Software Engineer Big Data
Location: Hyderabad
KEY RESPONSIBILITIES
- Design and Develop Data Pipelines:
- Architect, build, and deploy scalable and efficient data pipelines within our Big Data ecosystem using Apache Spark and Apache Airflow.
- Document new and existing pipelines and datasets to ensure clarity and maintainability.
- Data Architecture and Management:
- Demonstrate familiarity with data pipelines, data lakes, and modern data warehousing practices, including virtual data warehouses and push-down analytics.
- Design and implement distributed data processing solutions using technologies like Apache Spark and Hadoop.
- Programming and Scripting:
- Exhibit expert-level programming skills in Python, with the ability to write clean, efficient, and maintainable code.
- Cloud Infrastructure:
- Utilize cloud-based infrastructures (AWS/GCP) and their various services, including compute resources, databases, and data warehouses.
- Manage and optimize cloud-based data infrastructure, ensuring efficient data storage and retrieval.
- Workflow Orchestration:
- Develop and manage workflows using Apache Airflow for scheduling and orchestrating data processing jobs.
- Create and maintain Apache Airflow DAGs for workflow orchestration.
- Big Data Architecture:
- Possess strong knowledge of Big Data architecture, including cluster installation, configuration, monitoring, security, resource management, maintenance, and performance tuning.
- Innovation and Optimization:
- Create detailed designs and proof-of-concepts (POCs) to enable new workloads and technical capabilities on the platform.
- Collaborate with platform and infrastructure engineers to implement these capabilities in production.
- Manage workloads and optimize resource allocation and scheduling across multiple tenants to fulfill service level agreements (SLAs).
- Continuous Learning and Collaboration:
- Participate in planning activities and collaborate with data science teams to enhance platform skills and capabilities.
KEY Requirements
1. Minimum of 8 years hands-on experience with Big Data technologies e.g. Hadoop, Spark, Hive.
2. Minimum 3+ years of experience on Spark/Pyspark
3. Hands on experience with dataproc is a HUGE plus.
4. Minimum 6 years of experience in Cloud environments, preferably GCP
5. Any experience with NoSQL and Graph databases
6. Hands on experience with managing solutions deployed in the Cloud, preferably on AWS
7. Experience working in a Global company, working in a DevOps model is a plus
-
Big Data Engineer
3 weeks ago
Hyderabad, Telangana, India RandomTrees Full timeJob Title: Big Data Engineer Experience: 5–9 Years Location: Hyderabad-Hybrid Employment Type: Full-Time Job Summary: We are seeking a skilled Big Data Engineer with 5–9 years of experience in building and managing scalable data pipelines and analytics solutions. The ideal candidate will have strong expertise in Big Data, Hadoop, Apache Spark, SQL,...
-
PySpark Lead
4 weeks ago
Hyderabad, Telangana, India ValueMomentum Full timeJob Description- Design, develop, and maintain scalable data pipelines using PySpark and related big data technologies.- Work with large datasets and develop data models for consumption by data scientists and analysts.- Optimize Spark jobs for better performance and resource management.- Design and implement data integration workflows between various data...
-
Big Data Specialist
17 hours ago
Hyderabad, Telangana, India beBeeDataEngineer Full time ₹ 25,00,000 - ₹ 40,00,000Job Title: Big Data SpecialistAbout the Role:We are seeking an experienced Data Engineer to join our team.The successful candidate will be responsible for designing, developing, and maintaining large-scale data processing systems using Big Data technologies.Our ideal candidate will have a strong background in PySpark, with experience in building efficient...
-
Big Data Architect
3 days ago
Hyderabad, Telangana, India beBeeDataScientist Full time ₹ 12,00,000 - ₹ 20,00,000Job Title: Big Data ArchitectWe are seeking a highly skilled professional to lead the design and implementation of our big data infrastructure.Responsibilities:Develop and maintain high-performance big data applications using Python, PySpark, and Spark Dataframes.Design and implement scalable data architectures to process huge volumes of data.Collaborate...
-
Lead Data Engineer(pyspark)
4 days ago
Hyderabad, Telangana, India Careers at Tide Full time US$ 1,50,000 - US$ 2,00,000 per yearABOUT TIDEAt Tide, we are building a business management platform designed to save small businesses time and money. We provide our members with business accounts and related banking services, but also a comprehensive set of connected administrative solutions from invoicing to accounting.Launched in 2017, Tide is now used by over 1 million small businesses...
-
Data Engineer
4 weeks ago
Hyderabad, Telangana, India Zensar Technologies Full timeJob DescriptionLooking for Data engineer with Pyspark & AWS SkillsJD as provided below5+ years of overall IT experience, which includes hands on experience in Big Data technologies.Mandatory - Hands on experience in Python and PySpark.Build pySpark applications using Spark Dataframes in Python.Worked on optimizing spark jobs that processes huge volumes of...
-
Data Engineer
3 days ago
Hyderabad, Telangana, India Zensar Technologies Full timeHi All,We are looking for Data Engineers for Hyderabad location.Exp - 5+ YearsNotice - Immediate to 30 DaysJD:5+ years of overall IT experience, which includes hands on experience in Big Data technologies.• Mandatory - Hands on experience in Python and PySpark.• Build pySpark applications using Spark Dataframes in Python.• Worked on optimizing spark...
-
Python Pyspark
20 hours ago
Hyderabad, Telangana, India Capgemini Full time US$ 80,000 - US$ 1,20,000 per yearJob Description Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of...
-
Data Modeller and Spark Specialist
11 hours ago
Hyderabad, Telangana, India beBeeCosmos Full time ₹ 20,00,000 - ₹ 25,00,000Expert Data Modeler and Spark ArchitectWe are seeking a skilled Expert Data Modeler and Spark Architect to lead our team in developing innovative solutions using Cosmos and Spark.At least 10 years of experience working with Cosmos Data Modeling, Spark SDK for Cosmos, and Pyspark is required.Hands-on experience with Cosmos Partitioning/Indexing, RU...
-
Big Data Engineer
3 weeks ago
Hyderabad, Telangana, India TECHOAKS IT SOLUTIONS PRIVATE LIMITED Full timeFull-time with Info ServicesClient: Disney (Offshore)Job Description :Java with Big DataMust have skills :- 8+ years of total work experience is expected.- Java, Big Data, Spring, Kafka, AWS, Scala, and Spark: All these are needed to be strong.Roles & Responsibilities :- Design and Development: Architect, design, and develop high-performance, scalable, and...