EGreg
Implement ultra-high compression algorithms for KV caches to enable massive context windows on consumer hardware.
Suggested repo: kvc-ultra
"Run 1M context on your GPU."
Estimated effort: 100h