Heykuki News
Top
New
Best
Ask
Show
Jobs
541.
Training LLMs to Reason in a Continuous Latent Space
arxiv.org
114 comments
2 years ago
omarsar
283 points
542.
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
arxiv.org
47 comments
3 years ago
jonbaer
283 points
543.
Understanding the Limitations of Mathematical Reasoning in LLMs
arxiv.org
266 comments
2 years ago
hnhn34
282 points
544.
Neural programmer better than Quicksort
arxiv.org
126 comments
6 years ago
nl
282 points
545.
Mixture-of-Depths: Dynamically allocating compute in transformers
arxiv.org
83 comments
2 years ago
milliondreams
281 points
546.
A Wait-Free Stack
arxiv.org
63 comments
10 years ago
EvgeniyZh
281 points
547.
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
arxiv.org
264 comments
2 years ago
hackerlight
280 points
548.
Cognitive Behaviors That Enable Self-Improving Reasoners
arxiv.org
103 comments
a year ago
delifue
279 points
549.
How popular media portrays the employability of older software developers
arxiv.org
393 comments
6 years ago
sbaltes
278 points
550.
Large language models lack deep insights or a theory of mind
arxiv.org
261 comments
3 years ago
mnode
277 points
551.
Scaling Transformer to 1M tokens and beyond with RMT
arxiv.org
132 comments
3 years ago
panabee
277 points
552.
The Modern Mathematics of Deep Learning
arxiv.org
70 comments
5 years ago
tims457
276 points
553.
Planting Undetectable Backdoors in Machine Learning Models
arxiv.org
59 comments
4 years ago
belter
275 points
554.
Recursively summarizing enables long-term dialogue memory in LLMs
arxiv.org
152 comments
3 years ago
PaulHoule
273 points
555.
Conway's Game of Life is omniperiodic
arxiv.org
100 comments
3 years ago
sohkamyung
272 points
556.
Game Theory: Open Access textbook
arxiv.org
54 comments
9 years ago
hocaoglv
271 points
557.
An Introduction to Probabilistic Programming
arxiv.org
14 comments
8 years ago
lainon
269 points
558.
A formula for the nth digit of 𝜋 and 𝜋^n
arxiv.org
133 comments
3 years ago
georgehill
268 points
559.
Do not rug on me: Zero-dimensional Scam Detection
arxiv.org
154 comments
4 years ago
churchill
267 points
560.
Mistral 7B
arxiv.org
123 comments
3 years ago
fgfm
267 points
561.
Llemma: An Open Language Model for Mathematics
arxiv.org
46 comments
3 years ago
AlphaWeaver
267 points
562.
Hardware Acceleration of LLMs: A comprehensive survey and comparison
arxiv.org
68 comments
2 years ago
matt_d
266 points
563.
Gamification affects software developers: Cautionary evidence from GitHub
arxiv.org
304 comments
4 years ago
edward
264 points
564.
On the Impact of Programming Languages on Code Quality
arxiv.org
133 comments
7 years ago
lelf
264 points
565.
Bytes are all you need: Transformers operating directly on file bytes
arxiv.org
96 comments
3 years ago
pmoriarty
263 points
566.
Fast Differentiable Sorting and Ranking
arxiv.org
41 comments
6 years ago
etaioinshrdlu
263 points
567.
HTML as an Accessible Format for Papers (2023)
info.arxiv.org
139 comments
7 months ago
el3ctron
262 points
568.
Chain of Thought empowers transformers to solve inherently serial problems
arxiv.org
184 comments
2 years ago
krackers
261 points
569.
Knowledge Graphs
arxiv.org
49 comments
6 years ago
fxru
261 points
570.
One pixel attack for deceiving deep neural networks
arxiv.org
129 comments
9 years ago
astdb
260 points