Databricks + Pyspark

7 days ago


Andhra Pradesh, India Virtusa Full time

**Detailed Job Description for**Databricks + PySpark Developer**:

- Data Pipeline Development: Design, implement, and maintain scalable and efficient data pipelines using PySpark and Databricks for ETL processing of large volumes of data.
- Cloud Integration: Develop solutions leveraging Databricks on cloud platforms (AWS/Azure/GCP) to process and analyze data in a distributed computing environment.
- Data Modeling: Build robust data models, ensuring high-quality data integration and consistency across multiple data sources.
- Optimization: Optimize PySpark jobs for performance, ensuring the efficient use of resources and cost-effective execution.
- Collaborative Development: Work closely with data scientists, analysts, and other stakeholders to understand data requirements and deliver actionable insights.
- Automation & Monitoring: Implement monitoring solutions for data pipeline health, performance, and failure detection.
- Documentation & Best Practices: Maintain comprehensive documentation of architecture, design, and code. Ensure adherence to best practices for data engineering, version control, and CI/CD processes.
- Mentorship: Provide guidance to junior data engineers and help with the design and implementation of new features and components.

**Required Skills & Qualifications**:

- Experience: 6+ years of experience in data engineering or software engineering roles, with a strong focus on PySpark and Databricks.

**Technical Skills**:

- Proficient in PySpark for distributed data processing and ETL pipelines.
- Experience working with Databricks for running Apache Spark workloads in a cloud environment.
- Solid knowledge of SQL, data wrangling, and data manipulation.
- Experience with cloud platforms (AWS, Azure, or GCP) and their respective data storage services (S3, ADLS, BigQuery, etc.).
- Familiarity with data lakes, data warehouses, and NoSQL databases (e.g., MongoDB, Cassandra, HBase).
- Experience with orchestration tools like Apache Airflow, Azure Data Factory, or DBT.
- Familiarity with containerization (Docker, Kubernetes) and DevOps practices.
- Problem Solving: Strong ability to troubleshoot and debug issues related to distributed computing, performance bottlenecks, and data quality.
- Version Control: Proficient in Git based workflows and version control.
- Communication Skills: Excellent written and verbal communication skills, with the ability to explain complex technical concepts to both technical and non-technical stakeholders.
- Education: Bachelor or Master’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience).

**About Virtusa**

Teamwork, quality of life, professional and personal development: values that Virtusa is proud to embody. When you join us, you join a team of 27,000 people globally that cares about your growth — one that seeks to provide you with exciting projects, opportunities and work with state of the art technologies throughout your career with us.

Great minds, great potential: it all comes together at Virtusa. We value collaboration and the team environment of our company, and seek to provide great minds with a dynamic place to nurture new ideas and foster excellence.

Virtusa was founded on principles of equal opportunity for all, and so does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.


  • Azure Databricks

    1 week ago


    uttar pradesh, India Tata Consultancy Services Full time

    Greetings From TCS!!! "Opportunities don't happen, you create them" Job Title: Azure Databricks Exp Range: 4-10 Years Job Location: Noida or Bangalore or Chennai or Hyderabad or Kolkata Job Description: Required skill: Azure Databricks, PySpark and Python. Must have: 1.Azure Data Bricks (Python) 2.Pyspark 3. ADLS 4. complex problem solving skill Good to...

  • Pyspark Internship

    4 days ago


    Bhopal, Madhya Pradesh, India SAA Consultancy Full time

    **PySpark Internship** **Join SAA** **Synergistic AI Analytics** **We Are Hiring — PySpark Data Engineering Intern** **Location**: Bhopal, MP (**Work from Office**) **Internship Duration**: 3-6 Months **Experience**: Fresher / Final-year student / 0-2 Years Are you passionate about data and ready to build real data engineering projects in a modern...

  • Azure Databricks

    3 days ago


    uttar pradesh, India Tata Consultancy Services Full time

    Greetings From TCS!!!"Opportunities don't happen, you create them"Job Title: Azure DatabricksExp Range: 4-10 YearsJob Location: Noida or Bangalore or Chennai or Hyderabad or KolkataJob Description:Required skill: Azure Databricks, PySpark and Python.Must have: 1.Azure Data Bricks (Python)2.Pyspark3. ADLS4. complex problem solving skillGood to Have: 1.Azure...

  • Azure Databricks

    3 weeks ago


    Noida, Uttar Pradesh, India, Ghaziabad Tata Consultancy Services Full time

    Greetings From TCS!!!"Opportunities don't happen, you create them"Job Title: Azure DatabricksExp Range: 4-10 YearsJob Location: Noida or Bangalore or Chennai or Hyderabad or KolkataJob Description:Required skill: Azure Databricks, PySpark and Python.Must have: 1.Azure Data Bricks (Python)2.Pyspark3. ADLS4. complex problem solving skillGood to Have: 1.Azure...


  • Noida, Uttar Pradesh, India WNS Global Services Full time

    Company Description WNS Holdings Limited NYSE WNS is a leading Business Process Management BPM company We combine our deep industry knowledge with technology and analytics expertise to co-create innovative digital-led transformational solutions with clients across 10 industries We enable businesses in Travel Insurance Banking and Financial Services...

  • Databricks Engineer

    1 week ago


    uttar pradesh, India Tata Consultancy Services Full time

    Role: Databricks EngineerLocation: Hyd, Noida, Chennai, MumbaiExperience: 5 to 10 YearsJob Description: Working on a scrum team as a full stack engineer.Design and develop reusable software modules that meet customer requirements while upholding high standards of reliability, security, maintainability, and performance.Assist in defining product technical...

  • Databricks Engineer

    5 days ago


    uttar pradesh, India Tata Consultancy Services Full time

    Role: Databricks Engineer Location: Hyd, Noida, Chennai, Mumbai Experience: 5 to 10 Years Job Description: Working on a scrum team as a full stack engineer. Design and develop reusable software modules that meet customer requirements while upholding high standards of reliability, security, maintainability, and performance. Assist in defining product...


  • Andhra Pradesh, India Growel Softech Pvt. Ltd. Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    JD -7+ years of hands on experience in Python especially dealing with Pandas and Numpy Good hands-on experience in Spark PySpark and Spark SQLHands on experience in Databricks Unity Catalog Delta Lake Lake house Platform Medallion Architecture Azure Data Factory ADLS Experience in dealing with Parquet and JSON file format Knowledge in Snowflake.

  • AWS Data Engineer

    2 days ago


    Andhra Pradesh, India Inityinfotech Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    AWS Data EngineerExperience :- 5+ yearsLocation : RemoteJob DescriptionDesign, development, and implementation of performant ETL pipelines using python API (pySpark) of Apache Spark on AWS EMR.Writing reusable, testable, and efficient codeIntegration of data storage solutions in spark – especially with AWS S3 object storage. Performance tuning of pySpark...


  • Andhra Pradesh, India The Cigna Group Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    INTRODUCTION TO EVERNORTH:Evernorth Health Services India, established in Hyderabad in 2024, is an innovation hub for Evernorth Health Services, the pharmacy, care and benefits division of The Cigna Group. The innovation hub will support innovation-focused areas, such as generative AI, product development, process improvement, analytics, and software...