HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
1.
RIS-Kernel: Running 64k context LLMs on CPU via sparse attention
github.com/santosardr
2 points by santosardr, 22 days ago