Databricks Engineer

2 weeks ago


Bengaluru, Karnataka, India DataZymes Full time

ABOUT US: 

Founded in 2016, DataZymes is a next-generation analytics and data science company driving technology and digital-led innovation for our clients, thus helping them get more value from their data and analytics investments. Our platforms are built on best-of-breed technologies, thus protecting current investments while providing clients more bang for their buck. As we are a premier partner for many Business Intelligence and Information Management companies, we also provide advisory and consulting services to clients helping them make the right decisions and put together a long-term roadmap.

Our mission at DataZymes is to scale analytics and enable healthcare organizations to achieve non-linear, long-term, and sustainable growth. In a short span, we have built a high-performance team in focused practice areas, built digital-enabled solutions, and are working with some marquee names in the US healthcare industry.

JOB LOCATION: Bangalore

QUALIFICATION REQUIRED: We are seeking a skilled and motivated Databricks Engineer to join our dynamic data team. The ideal candidate will have 3-5 years of robust experience in data engineering, with a strong focus on the Databricks ecosystem. You will be responsible for designing, developing, and maintaining scalable and reliable data pipelines using PySpark. A key part of this role involves leveraging the Databricks Lakehouse Platform and its advanced AI/ML features to unlock data-driven insights and power our business intelligence and machine learning initiatives.

EXPERIENCE REQUIRED: 3-5 years of hands-on experience

EMPLOYMENT TYPE: Full-Time

Key Responsibilities

  • Pipeline Development: Design, build, and maintain efficient and scalable ETL/ELT pipelines on the Databricks platform using PySpark, SQL, and Delta Live Tables (DLT).
  • Lakehouse Management: Implement and manage data solutions within the Databricks Lakehouse Platform, ensuring best practices for data storage, governance, and management using Delta Lake and Unity Catalog.
  • Code Optimization: Write high-quality, maintainable, and optimized PySpark code for large-scale data processing and transformation tasks.
  • AI & ML Integration: Collaborate with data scientists to productionize machine learning models. Utilize Databricks AI features such as the Feature Store, MLflow for model lifecycle management, and AutoML for accelerating model development.
  • Data Quality & Governance: Implement robust data quality checks and validation frameworks to ensure data accuracy, completeness, and reliability within Delta tables.
  • Performance Tuning: Monitor, troubleshoot, and optimize the performance of Databricks jobs, clusters, and SQL warehouses to ensure efficiency and cost-effectiveness.
  • Collaboration: Work closely with data analysts, data scientists, and business stakeholders to understand their data requirements and deliver effective solutions.
  • Documentation: Create and maintain comprehensive technical documentation for data pipelines, architectures, and processes.

Required Qualifications & Skills 

  • Experience: 3-5 years of hands-on experience in a data engineering role.
  • Databricks Expertise: Proven, in-depth experience with the Databricks platform, including Databricks Workflows, Notebooks, Clusters, and Delta Live Tables.
  • Programming Skills: Strong proficiency in Python and extensive hands-on experience with PySpark for data manipulation and processing.
  • Data Architecture: Solid understanding of modern data architectures, including the Lakehouse paradigm, Data Lakes, and Data Warehousing.
  • Delta Lake: Hands-on experience with Delta Lake, including schema evolution, ACID transactions, and time travel features.
  • SQL Proficiency: Excellent SQL skills and the ability to write complex queries for data analysis and transformation.
  • Databricks AI: Practical experience with Databricks AI/ML capabilities, particularly MLflow and the Feature Store.
  • Cloud Experience: Experience working with at least one major cloud provider (AWS, Azure, or GCP).
  • Problem-Solving: Strong analytical and problem-solving skills with the ability to debug complex data issues.
  • Communication: Excellent verbal and written communication skills.

Preferred Qualifications 

  • Databricks Certified Data Engineer Associate/Professional certification.
  • Experience with CI/CD tools (e.g., Jenkins, Azure DevOps, GitHub Actions) for data pipelines.
  • Familiarity with streaming technologies like Structured Streaming.
  • Knowledge of data governance tools and practices within Unity Catalog.

  • Engineering Manager

    1 week ago


    Bengaluru, Karnataka, India Databricks Full time US$ 1,50,000 - US$ 2,00,000 per year

    P-995 Companies are investing billions of dollars into developing and deploying AI and the data platforms that enable it. But do they know what is happening? At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical...


  • Bengaluru, Karnataka, India Databricks Full time US$ 1,50,000 - US$ 2,00,000 per year

    (P-1384) About the Team: At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data...


  • Bengaluru, Karnataka, India Databricks Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    (P-1384) About the Team: At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. Ingesting data into the Lakehouse is a strategic area of investment for Databricks and a key enabler for Data and AI...


  • Bengaluru, Karnataka, India Databricks Full time

    Job Description: FEQ426R153 As a Sr. Solutions Engineer (Analytics, AI, Big Data, Public Cloud), you will guide the technical evaluation phase in a hands-on environment throughout the sales process. You will be a technical advisor internally to the sales team, and work with the product team as an advocate of your customers in the field. You will help our...


  • Bengaluru, Karnataka, India Databricks Full time US$ 1,20,000 - US$ 2,00,000 per year

    FEQ426R153 As a Sr. Solutions Engineer (Analytics, AI, Big Data, Public Cloud), you will guide the technical evaluation phase in a hands-on environment throughout the sales process. You will be a technical advisor internally to the sales team, and work with the product team as an advocate of your customers in the field. You will help our customers to achieve...


  • Bengaluru, Karnataka, India Databricks Full time

    Job Description: FEQ125R56 As a Partner Solutions Architect (PSA) for India, you will work with Databricks' Consulting and System Integrator (C&SI) partners, teammates, and with the technical and sales team members who work directly with our customers. You will develop 'technical champions' within our top C&SI Partners, providing enablement on...


  • Bengaluru, Karnataka, India Databricks Full time US$ 1,50,000 - US$ 2,00,000 per year

    P-926 At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to...

  • Solutions Architect

    5 days ago


    Bengaluru, Karnataka, India Databricks Full time US$ 1,50,000 - US$ 2,00,000 per year

    FEQ426R138 As a Solutions Architect (Analytics, AI, Big Data, Public Cloud), you will guide the technical evaluation phase in a hands-on environment throughout the sales process. You will be a technical advisor internally to the sales team, and work with the product team as an advocate of your customers in the field. You will help our customers to achieve...


  • Bengaluru, Karnataka, India Databricks Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    CSQ127R23 At Databricks, an Incident Manager utilizes their technical experience and resourcefulness to lead urgent customer situations to resolution. Responsible for managing frequent, high-quality updates to all internal and external stakeholders, Incident Managers advocate with engineering and leadership, on behalf of their customers, to ensure that...