HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Modded-NanoGPT: NanoGPT (124M) quality in 3.25B tokens
github.com/KellerJordan
11 comments
2 years ago
ocean_moist
81 points
2.
▲
Train to 94% on CIFAR-10 in 3.29 seconds on a single A100
github.com/KellerJordan
1 comment
2 years ago
kjjnot
3 points
3.
▲
modded-nanogpt: NanoGPT (124M) in 2 minutes
github.com/KellerJordan
discuss
4 months ago
tosh
2 points
4.
▲
Muon Optimizer
github.com/KellerJordan
discuss
2 years ago
pilooch
2 points