Lakehouse Core Engineer

15 hours ago

India Granica Full time ₹ 5,00,000 - ₹ 8,00,000 per year

Granica is redefining how enterprises prepare and optimize data at the most fundamental layer of the AI stack—where raw information becomes usable intelligence. Our technology operates deep in the data infrastructure layer, making data efficient, secure, and ready for scale.

We eliminate the hidden inefficiencies in modern data platforms—slashing storage and compute costs, accelerating pipelines, and boosting platform efficiency. The result: 60%+ lower storage costs, up to 60% lower compute spend, 3× faster data processing, and 20% overall efficiency gains.

Why It Matters

Massive data should fuel innovation, not drain budgets. We remove the bottlenecks holding AI and analytics back—making data lighter, faster, and smarter so teams can ship breakthroughs, not babysit storage and compute bills.

Who We Are

World renowned researchers in compression, information theory, and data systems
Elite engineers from Google, Pure Storage, Cohesity and top cloud teams
Enterprise sellers who turn ROI into seven‑figure wins.

Powered by World-Class Investors & Customers

$65M+ raised from NEA, Bain Capital, A* Capital, and operators behind Okta, Eventbrite, Tesla, and Databricks. Our platform already processes hundreds of petabytes for industry leaders

Our Mission: We're building the default data substrate for AI, and a generational company built to endure.

WHAT WE'RE LOOKING FOR

You've built systems where petabyte-scale performance, resilience, and clarity of design all matter. You thrive at the intersection of infrastructure engineering and applied research, and care deeply about both how something works and how well it works at scale. We're looking for someone with experience in:

Lakehouse and Transactional Data Systems

Proven expertise with formats like Delta Lake or Apache Iceberg, including ACID-compliant table design, schema evolution, and time-travel mechanics.
Columnar Storage Optimization

Deep knowledge of Parquet, including techniques like column ordering, dictionary encoding, bit-packing, bloom filters, and zone maps to reduce scan I/O and improve query efficiency.
Metadata and Indexing Systems

Experience building metadata-driven services—compaction, caching, pruning, and adaptive indexing that accelerate query planning and eliminate manual tuning.
Distributed Compute at Scale

Production-grade Spark/Scala pipeline development across object stores like S3, GCS, and ADLS, with an eye toward autoscaling, resilience, and observability.
Programming for Scale and Longevity

Strong coding skills in Java, Scala, or Go, with a focus on clean, testable code and a documented mindset that enables future engineers to build on your work, not rewrite it.
Resilient Systems and Observability

You've designed systems that survive chaos drills, avoid pager storms, and surface the right metrics to keep complex infrastructure calm and visible.
Latency as a Product Metric

You think in terms of human latency—how fast a dashboard feels to the analyst, not just the system. You take pride in chasing down every unnecessary millisecond.
Mentorship and Engineering Rigor

You publish your breakthroughs, mentor peers, and contribute to a culture of engineering excellence and continuous learning.

WHY JOIN GRANICA

If you've helped build the modern data stack at a large company—Databricks, Snowflake, Confluent, or similar—you already know how critical lakehouse infrastructure is to AI and analytics at scale. At Granica, you'll take that knowledge and apply it where it matters most…at the most fundamental layer in the data ecosystem.

Own the product, not just the feature. At Granica, you won't be optimizing edge cases or maintaining legacy systems. You'll architect and build foundational components that define how enterprises manage and optimize data for AI.
Move faster, go deeper. No multi-month review cycles or layers of abstraction—just high-agency engineering work where great ideas ship weekly. You'll work directly with the founding team, engage closely with design partners, and see your impact hit production fast.
Work on hard, meaningful problems. From transaction layer design in Delta and Iceberg, to petabyte-scale compaction and schema evolution, to adaptive indexing and cost-aware query planning—this is deep systems engineering at scale.
Join a team of expert builders. Our engineers have designed the core internals of cloud-scale data systems, and we maintain a culture of peer-driven learning, hands-on prototyping, and technical storytelling.
Core Differentiation: We'refocused on unlocking a deeper layer of AI infrastructure. By optimizing the way data is stored, processed, and retrieved, we make platforms like Snowflake and Databricks faster, more cost-efficient, and more AI-native. Our work sits at the most fundamental layer of the AI stack: where raw data becomes usable intelligence.
Be part of something early—without the chaos. Granica has already secured $65M+ from NEA, Bain Capital Ventures, A* Capital, and legendary operators from Okta, Tesla, and Databricks.
Grow with the company. You'll have the chance to grow into a technical leadership role, mentor future hires, and shape both the engineering culture and product direction as we scale.

Benefits:

Highly competitive compensation with uncapped commissions and meaningful equity
Immigration sponsorship and counseling
Premium health, dental, and vision coverage
Flexible remote work and unlimited PTO
Quarterly recharge days and annual team off-sites
Budget for learning, development, and conferences
Help build the foundational infrastructure for the AI era

Granica is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Data Engineer-Technical Lead

2 weeks ago

Bengaluru, India Sigmoid Full time

Job Description Kindly find the Job Description Below. Job Title:Technical Lead-Azure Data Engineer Location: Bangalore Years of Experience: 10+ years of experience Sigmoid works with a variety of clients from start-ups to fortune 500 companies. We are looking for a detailed oriented self-starter to assist our engineering and analytics teams in various roles...
Data Engineer

1 week ago

India Keasis Full time

We are seeking a Data Engineer with strong Apache NiFi expertise to design and implement pipelines that move and transform data from Cloudera (HDFS/Hive/Impala) into Apache Iceberg tables, with downstream integration into Snowflake and Databricks. The ideal candidate will have hands-on experience with modern data lakehouse architectures and will play a...
Data Engineer

3 weeks ago

Pune, India Zensar Technologies Full time

Job Description Opportunity for Data Engineer Location-Bangalore, Hyderabad, Pune Immediate Joiner to 15 days Key Responsibilities - Design and develop a data lakehouse solution using Apache Iceberg and Apache Spark - Enable high-performance Treasury analytics, integrating financial datasets and reporting engines - Work with AWS services to create...
Data Engineer

7 days ago

India Keasis Full time

We are seeking a Data Engineer with strong Apache NiFi expertise to design and implement pipelines that move and transform data from Cloudera (HDFS/Hive/Impala) into Apache Iceberg tables, with downstream integration into Snowflake and Databricks. The ideal candidate will have hands-on experience with modern data lakehouse architectures and will play a...
Data Engineer

1 week ago

Bengaluru, India Mastek Full time

Job Description Job Title: Databricks SQL Engineer (with Pharma/Life Sciences background) About the Role: We are seeking a highly skilled Databricks SQL Engineer with Pharma background to join our Data Engineering team. The ideal candidate will have strong expertise in Databricks, Pyspark, SQL, Spark SQL, Delta Lake, and data lakehouse architectures, and...
Azure Cloud Data Engineer

2 days ago

India Intelebee Full time

End-to-End Data Engineering We’re looking for a hands-on Cloud Data Engineer who’s an expert in Python, PySpark, and SQL — with proven experience building end-to-end data pipelines on Azure using Data Factory, Synapse, and Databricks. This role blends strong technical skills with sharp business understanding — ideal for someone who loves solving...
Senior engineer, data engineering

2 weeks ago

India SAIVA AI Full time

We are building the future of healthcare analytics. Join us to design, build, and scale robust data pipelines that power nationwide analytics and support our machine learning systems. Our goal: pipelines that are reliable, observable, and continuously improving in production. This is a fully remote role, open to candidates based in Europe or India, with...
Packet Core Engineer

3 weeks ago

India D2AI Labs Full time

We’re looking for an experienced Packet Core Engineer (Nokia/Ericsson) to join our team in India. If you have strong expertise in telecom network design, deployment, and operations , this role is for you! What you’ll doDesign and deploy Packet Core networks (EPC & 5GC) for large-scale telecom environments. Perform end-to-end integration,...
Senior Data Engineer

1 week ago

India SAIVA AI Full time

We are building the future of healthcare analytics. Join us to design, build, and scale robust data pipelines that power nationwide analytics and support our machine learning systems. Our goal: pipelines that are reliable, observable, and continuously improving in production. This is a fully remote role, open to candidates based in Europe or India, with...
Data Engineer

1 week ago

india, IN Keasis Full time

We are seeking a Data Engineer with strong Apache NiFi expertise to design and implement pipelines that move and transform data from Cloudera (HDFS/Hive/Impala) into Apache Iceberg tables, with downstream integration into Snowflake and Databricks. The ideal candidate will have hands-on experience with modern data lakehouse architectures and will play a...

Americas

Europe

Asia / Oceania

Africa

Lakehouse Core Engineer