Granica Logo

Granica

Software Engineer – Foundational Data Systems for AI

Reposted 17 Hours Ago
In-Office
Mountain View, CA
140K-200K Annually
Senior level
In-Office
Mountain View, CA
140K-200K Annually
Senior level
As a Lakehouse Core Engineer, you'll enhance data systems, optimize storage and compute, develop ACID transactions, and improve query performance, focusing on petabyte-scale efficiency.
The summary above was generated by AI

Granica is an AI research and systems company building the infrastructure for a new kind of intelligence: one that is structured, efficient, and deeply integrated with data.

Our systems operate at exabyte scale, processing petabytes of data each day for some of the world’s most prominent enterprises in finance, technology, and industry. These systems are already making a measurable difference in how global organizations use data to deploy AI safely and efficiently.

We believe that the next generation of enterprise AI will not come from larger models but from more efficient data systems. By advancing the frontier of how data is represented, stored, and transformed, we aim to make large-scale intelligence creation sustainable and adaptive.

Our long-term vision is Efficient Intelligence: AI that learns using fewer resources, generalizes from less data, and reasons through structure rather than scale. To reach that, we are first building the Foundational Data Systems that make structured AI possible.

The Mission

AI today is limited not only by model design but by the inefficiency of the data that feeds it. At scale, each redundant byte, each poorly organized dataset, and each inefficient data path slows progress and compounds into enormous cost, latency, and energy waste.

Granica’s mission is to remove that inefficiency. We combine new research in information theory, probabilistic modeling, and distributed systems to design self-optimizing data infrastructure: systems that continuously improve how information is represented and used by AI.

This engineering team partners closely with the Granica Research group led by Prof. Andrea Montanari (Stanford), bridging advances in information theory and learning efficiency with large-scale distributed systems. Together, we share a conviction that the next leap in AI will come from breakthroughs in efficient systems, not just larger models.

What You’ll Build
  • Global Metadata Substrate. Help design and implement the metadata substrate that supports time-travel, schema evolution, and atomic consistency across massive tabular datasets.

  • Adaptive Engines. Build components that reorganize data autonomously, learning from access patterns and workloads to maintain efficiency with minimal manual tuning.

  • Intelligent Data Layouts. Develop and refine bit-level encodings, compression, and layout strategies to extract maximum signal per byte read.

  • Autonomous Compute Pipelines. Contribute to distributed compute systems that scale predictively and adapt to dynamic load.

  • Research to Production. Translate new algorithms in compression and representation from research into production-grade implementations.

  • Latency as Intelligence. Design and optimize data paths to minimize time between question and insight, enabling faster learning for both models and humans.

What You Bring
  • Foundational understanding of distributed systems: partitioning, replication, and fault tolerance.

  • Experience or curiosity with columnar formats such as Parquet or ORC and low-level data encoding.

  • Familiarity with metadata-driven architectures or data query planning.

  • Exposure to or hands-on use of Spark, Flink, or similar distributed engines on cloud storage.

  • Proficiency in Java, Rust, Go, or C++ and commitment to clean, reliable code.

  • Curiosity about how compression, entropy, and representation shape system efficiency and learning.

  • A builder’s mindset—eager to learn, improve, and deliver features end-to-end with growing autonomy.

Bonus

  • Familiarity with Iceberg, Delta Lake, or Hudi.

  • Contributions to open-source projects or research in compression, indexing, or distributed systems.

  • Interest in how data representation influences AI training dynamics and reasoning efficiency.

Why Granica
  • Fundamental Research Meets Enterprise Impact. Work at the intersection of science and engineering, turning foundational research into deployed systems serving enterprise workloads at exabyte scale.

  • AI by Design. Build the infrastructure that defines how efficiently the world can create and apply intelligence.

  • Real Ownership. Design primitives that will underpin the next decade of AI infrastructure.

  • High-Trust Environment. Deep technical work, minimal bureaucracy, shared mission.

  • Enduring Horizon. Backed by NEA, Bain Capital, and various luminaries from tech and business. We are building a generational company for decades, not quarters or a product cycle.

Compensation & Benefits
  • Competitive salary, meaningful equity, and substantial bonus for top performers

  • Flexible time off plus comprehensive health coverage for you and your family

  • Support for research, publication, and deep technical exploration

Join us to build the foundational data systems that power the future of enterprise AI.
At Granica, you will shape the fundamental infrastructure that makes intelligence itself efficient, structured, and enduring.

Top Skills

Apache Iceberg
Delta Lake
Go
Java
Parquet
Scala
Spark

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account