HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Power Attention: Efficient CUDA Kernels for Symmetric Power Transformers
github.com/m-a-n-i-f-e-s-t
2 comments
a year ago
txus
6 points
2.
▲
PowerRetention: a drop-in replacement for FlashAttention in LLMs
github.com/m-a-n-i-f-e-s-t
2 comments
9 months ago
dvrp
2 points