Solution Architect – Databricks

3 days ago


Pune, Maharashtra, India Codvo Full time US$ 1,25,000 - US$ 1,75,000 per year

Solution Architect – Databricks (Document AI & Knowledge Graph Focus) 

Location: India – Pune 

Company Overview

 We are a global empathy-led technology services company where software and people transformations go hand-in-hand.

Product innovation and mature software engineering are part of our core DNA. Our mission is to help our customers accelerate their digital journeys through a global, diverse, and empathetic talent pool following outcome-driven agile execution. Respect, Fairness, Growth, Agility, and Inclusiveness are the core values that we aspire to live by each day.

We continue to invest in our digital strategy, design, cloud engineering, data, and enterprise AI capabilities required to bring a truly integrated approach to solving our client's most ambitious digital journey challenges.

About the Role 

We are seeking a highly skilled Solution Architect with deep expertise in Databricks Lakehouse and proven experience operationalizing unstructured document AI pipelines in regulated industries. You will design and lead end-to-end architectures that turn complex, high-volume documents into governed, queryable knowledge graphs inside Databricks — enabling automation, compliance, and downstream AI applications.

 This role bridges Lakehouse architecture, AI/LLM-based extraction, and human-in-the-loop governance to deliver production-ready solutions that Databricks customers can trust.

Key Responsibilities 

  • Architecture & Design – Lead the design and implementation of in-Lakehouse pipelines for unstructured and structured data, leveraging Delta Live Tables, Unity Catalog, and MLflow.
  • Unstructured Data Processing – Architect solutions for ingestion, OCR, and LLM-based parsing of scanned PDFs, legal/medical records, and complex forms.
  • Confidence Scoring & HITL Workflows – Design confidence-tiered pipelines that auto-accept high-confidence results and route low-confidence extractions to review consoles, ensuring auditability and compliance.
  • Knowledge Graph Modeling – Translate extracted entities and relationships into graph-friendly Delta Gold tables, ready for analytics or export to graph databases.
  • Governance & Compliance – Define security, lineage, and classification rules in Unity Catalog; ensure all document transformations are fully traceable back to the source.
  • Integration & Ecosystem – Use Partner Connect and APIs to integrate Databricks outputs with downstream claims/case management, compliance dashboards, and BI tools.
  • Performance & Cost Optimization – Tune pipelines for scalability, performance, and cost efficiency in cloud environments (AWS, Azure, GCP).
  • Collaboration & Mentorship – Work with data engineers, ML engineers, and domain SMEs to translate business requirements into scalable architectures, mentoring teams on Databricks and document AI best practices.

Required Skills & Experience 

  • 5+ years in solution or data architecture, with at least 2+ years delivering Databricks-based solutions.
  • Proven hands-on experience with Unity Catalog, Delta Live Tables, Databricks SQL, and artner Connect integrations.
  • Expertise in designing Lakehouse architectures for structured and unstructured data.
  • Strong understanding of OCR integration patterns (AWS Textract, Azure Form Recognizer, Tesseract) and LLM-powered entity extraction (prompt design, schema mapping, validation).
  • Experience with confidence scoring and human-in-the-loop patterns for data quality and compliance.
  • Familiarity with knowledge graph concepts and relational-to-graph data modeling in Databricks.
  • Strong SQL skills and experience with distributed data processing (PySpark/SparkSQL).
  • Working knowledge of cloud data ecosystems (AWS, Azure, or GCP).
  • Excellent communication skills with the ability to bridge technical and business teams.

Preferred Qualifications 

  • Databricks certifications (e.g., Databricks Certified Solutions Architect, Data Engineer Professional).
  • Experience delivering document AI pipelines in regulated verticals (legal, insurance, healthcare).
  • Familiarity with data mesh or federated governance models.
  • Background in MLOps and continuous improvement for extraction models.


  • Pune, Maharashtra, India Persistent Systems Full time

    About Position:We are hiring for skilled Azure Databrick Architect with 12 to 17 Years of experience in python, sql.- Role: Azure Databricks Architect- Location: All Persistent Locations- Experience: 12 to 17 years- Job Type: Full Time EmploymentWhat You'll Do:- 12+ years of experience in data architecture, data engineering, or analytics.- 5+ years of...


  • Pune, Maharashtra, India Persistent Systems Full time

    About Position: We are hiring for skilled Azure Databrick Architect with 12 to 17 Years of experience in python, sql. Role: Azure Databricks Architect Location: All Persistent Locations Experience: 12 to 17 years Job Type: Full Time Employment What You'll Do: 12+ years of experience in data architecture, data engineering, or analytics. 5+ years of...


  • Pune, Maharashtra, India Persistent Systems Full time

    About Position: We are hiring for skilled Azure Databrick Architect with 12 to 17 Years of experience in python, sql. Role: Azure Databricks Architect Location: All Persistent Locations Experience: 12 to 17 years Job Type: Full Time Employment What You'll Do: 12+ years of experience in data architecture, data engineering, or analytics. 5+ years of...


  • Pune, Maharashtra, India beBeeData Full time ₹ 15,00,000 - ₹ 25,00,000

    Job TitleWe are seeking an experienced Data Engineer to design and implement scalable, secure, and efficient data solutions that meet client requirements.Develop data pipelines, architect data lakes, and implement data warehousing solutions using Databricks.Collaborate with data scientists and analysts to develop and deploy machine learning models and...


  • Pune, Maharashtra, India Codvo Full time

    Job Title : Big Data Architect / Databricks Architect.Company Overview :At Codvo, software and people transformations go hand-in-hand.We are a global empathy-led technology services company.Product innovation and mature software engineering are part of our core DNA.Respect, Fairness, Growth, Agility, and Inclusiveness are the core values that we aspire to...

  • Azure Databricks

    3 weeks ago


    Pune, Maharashtra, India Phygital Insights Private Limited Full time

    Job DescriptionDescriptionWe are seeking an experienced Azure Databricks professional to join our team in India. The ideal candidate will have a strong background in data engineering and be proficient in leveraging Azure Databricks to build robust data solutions.Responsibilities- Design and implement data pipelines using Azure Databricks to process large...


  • Pune, Maharashtra, India Ascendion Full time

    Job Title: Senior Data Engineer Experience : 3+ years Location: Gurgaon Skills: PySpark, SQL, Databricks, AWS. Role Summary: We are looking for 3–4 experienced Databricks Developers to support a fast-paced, high-impact data engineering initiative. The ideal candidates should have hands-on expertise in building scalable data pipelines using...


  • Pune, Maharashtra, India Ascendion Full time

    Job Title: Senior Data EngineerExperience : 3+ yearsLocation: Gurgaon /Pune / BangaloreSkills: PySpark, SQL, Databricks, AWS.Role Summary:We are looking for 3–4 experienced Databricks Developers to support a fast-paced, high-impact data engineering initiative. The ideal candidates should have hands-on expertise in building scalable data pipelines using...


  • Pune, Maharashtra, India beBeeDataEngineer Full time ₹ 15,00,000 - ₹ 25,00,000

    Job Title:Data EngineerAbout the Role:We are seeking a highly skilled Data Engineer to join our team. The ideal candidate will have hands-on experience with Databricks, including expertise in Databricks SQL, PySpark, and Spark SQL.Key Responsibilities:Migrate SQL Server Stored Procedures to Databricks Notebooks, leveraging PySpark and Spark SQL for complex...

  • Azure Databricks

    4 weeks ago


    Pune, Maharashtra, India People Prime Worldwide Full time

    Our client is a global technology company headquartered in Santa Clara, California. it focuses on helping organisations harness the power of data to drive digital transformation, enhance operational efficiency, and achieve sustainability. over 100 years of experience in operational technology (OT) and more than 60 years in IT to unlock the power of data from...