Trending Repos Trending Builders Portfolio

Built by JustinZ and Jennie

A product by AttentionVC

Back to Trending

tonbistudio/

turboquant-pytorch

From-scratch PyTorch implementation of Google's TurboQuant (ICLR 2026) for LLM KV cache compression. 5x compression at 3-bit with 99.5% attention fidelity.

Python

Stars

210

+19 today+19 /wk+19 /mo

Forks

28

Issues

4

Watchers

210

Star History

Repository Info

CreatedMar 25, 2026

Last push1d ago