🐙 GitHub Detail

dipampaul17/KVSplit

By dipampaul17

Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit keys & 4-bit values, reducing memory by 59% with <1% quality loss. Includes benchmarking, visualization, and one-command setup. Optimized for M1/M2/M3 Macs with Metal support.

GitHub Python Other Updated 07 Jun 2026

Open Source ↗ Find Similar 🔎 Submit to Directory ＋

Live Snapshot

⭐

Stars

361

🍴

Forks

📄

License

Other

🧩

Type

Python

📘

About this open-source project

Live information fetched from GitHub.

🌿

Default Branch

main

🐞

Open Issues

👀

Watchers

361

Project Details

Source GitHub

Owner dipampaul17

License Other

Updated 07 Jun 2026

Need help using this?

Golden Eagle IT Technologies can help with setup, customization, deployment, AI integration and monthly support.

Get Support →