AttentionVCGitHubVC
Trending ReposTrending BuildersPortfolio
AttentionVC

Built by JustinZ and Jennie

AttentionVCA product by AttentionVC
Back to Trending
tonbistudio
tonbistudio/

turboquant-pytorch

From-scratch PyTorch implementation of Google's TurboQuant (ICLR 2026) for LLM KV cache compression. 5x compression at 3-bit with 99.5% attention fidelity.

Python
Stars

210

+19 today+19 /wk+19 /mo
Forks

28

Issues

4

Watchers

210

Star History

Repository Info

CreatedMar 25, 2026
Last push1d ago
Open on GitHub