Skip to content

Lightweight Grounding

Grounding LLMs with minimal overhead.

Research Results

  • Optimal weight: 95% LLM + 5% suffix array
  • Perplexity reduction: 70%
  • Overhead: Only 6.5% (2.66ms)

Implementation

See examples/lightweight_experiments.py.

This page is under construction.