HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Non-determinism in GPT-4 is caused by Sparse MoE
152334h.github.io
181 comments
3 years ago
152334H
397 points
2.
▲
Calculating the cost of a Google DeepMind paper
152334h.github.io
150 comments
2 years ago
152334H
303 points
3.
▲
Knowing Enough About MoE to Explain Dropped Tokens in GPT-4
152334h.github.io
1 comment
3 years ago
152334H
3 points
4.
▲
Can AI agents write kernel exploits?
152334h.github.io
discuss
4 months ago
152334H
3 points
5.
▲
Why can't TorToiSe be fine-tuned?
152334h.github.io
discuss
3 years ago
152334H
1 point