HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
31.
▲
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
hazyresearch.stanford.edu
discuss
3 years ago
panabee
3 points
32.
▲
HyenaDNA: Learning from DNA with 1M token context
hazyresearch.stanford.edu
discuss
3 years ago
beefman
3 points
33.
▲
AI's Linux Moment: An Open-Source AI Model Love Note
hazyresearch.stanford.edu
discuss
3 years ago
tim_sw
3 points
34.
▲
Minions: The rise of small, on-device LMs
hazyresearch.stanford.edu
1 comment
a year ago
kiyanwang
2 points
35.
▲
Zoology 1: Measuring and Improving Recall in Efficient Language Models
hazyresearch.stanford.edu
1 comment
3 years ago
convexstrictly
2 points
36.
▲
Pixelated Butterfly
hazyresearch.stanford.edu
1 comment
3 years ago
sdenton4
2 points
37.
▲
Stuffing MLPs Full of Facts: A Generative Approach to Factual Recall
hazyresearch.stanford.edu
discuss
7 months ago
hessdalenlight
2 points
38.
▲
ThunderMittens for Your ThunderKittens
hazyresearch.stanford.edu
discuss
7 months ago
mpweiher
2 points
39.
▲
An Unserious Persons Take on Axiomatic Knowledge in the Era of Foundation Models
hazyresearch.stanford.edu
discuss
2 years ago
LionTurtle13
2 points
40.
▲
An Unserious Take on Axiomatic Knowledge in the Era of Foundation Models
hazyresearch.stanford.edu
discuss
2 years ago
jxmorris12
2 points
41.
▲
Linearizing LLMs with LoLCATs
hazyresearch.stanford.edu
discuss
2 years ago
jasondavies
2 points
42.
▲
Efficient language models as arithmetic circuits
hazyresearch.stanford.edu
discuss
2 years ago
colinprince
2 points
43.
▲
Combining Continuous-Time, Recurrent, and Convolutional Models
hazyresearch.stanford.edu
discuss
3 years ago
georgehill
2 points
44.
▲
HyenaDNA: Learning from DNA with 1M token context
hazyresearch.stanford.edu
discuss
3 years ago
thunderbong
2 points
45.
▲
Hyena Hierarchy: Towards Larger Convolutional Language Models
hazyresearch.stanford.edu
discuss
3 years ago
quantisan
2 points
46.
▲
From Deep to Long Learning?
hazyresearch.stanford.edu
discuss
3 years ago
sebg
2 points
47.
▲
Based: An Educational and Effective Sequence Mixer
hazyresearch.stanford.edu
1 comment
3 years ago
pama
1 points
48.
▲
ThunderKittens 2.0: even faster kernels for your GPUs
hazyresearch.stanford.edu
discuss
4 months ago
ecesena
1 points
49.
▲
Loads and Loads of Fluffy Kittens
hazyresearch.stanford.edu
discuss
7 months ago
todsacerdoti
1 points
50.
▲
Intelligence per Watt: A Study of Local Intelligence Efficiency
hazyresearch.stanford.edu
discuss
7 months ago
simonpure
1 points
51.
▲
Cartridges: Store long contexts in tiny caches with LLM self-study
hazyresearch.stanford.edu
discuss
a year ago
dvrp
1 points
52.
▲
Mind the Trust Gap: Fast, Private Local-to-Cloud LLM Chat
hazyresearch.stanford.edu
discuss
a year ago
tmoertel
1 points
53.
▲
Correcting and Improving LLM Predictions Without Labels
hazyresearch.stanford.edu
discuss
3 years ago
nihit-desai
1 points
54.
▲
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
hazyresearch.stanford.edu
discuss
3 years ago
todsacerdoti
1 points
55.
▲
H3: Language Modeling with State Space Models and (Almost) No Attention
hazyresearch.stanford.edu
discuss
3 years ago
anewhnaccount2
1 points
56.
▲
Hyena Hierarchy: Towards Larger Convolutional Language Models
hazyresearch.stanford.edu
discuss
3 years ago
pmoriarty
1 points
57.
▲
Chris Re: Is AI Rare or Everywhere?
hazyresearch.stanford.edu
discuss
3 years ago
tim_sw
1 points
58.
▲
Hyena Hierarchy: Towards Larger Convolutional Language Models
hazyresearch.stanford.edu
discuss
3 years ago
chriskanan
1 points
59.
▲
Can Longer Sequences Help Take the Next Leap in AI? · Hazy Research
hazyresearch.stanford.edu
discuss
4 years ago
bilsbie
1 points
60.
▲
HiPPO: Recurrent Memory with Optimal Polynomial Projections
hazyresearch.stanford.edu
discuss
5 years ago
0mp
1 points
More