Search: arxiv.org | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

541.

Training LLMs to Reason in a Continuous Latent Space

2 years ago

283 points

542.

Graph of Thoughts: Solving Elaborate Problems with Large Language Models

3 years ago

283 points

543.

Understanding the Limitations of Mathematical Reasoning in LLMs

2 years ago

282 points

544.

Neural programmer better than Quicksort

6 years ago

282 points

545.

Mixture-of-Depths: Dynamically allocating compute in transformers

2 years ago

281 points

546.

A Wait-Free Stack

10 years ago

281 points

547.

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

2 years ago

280 points

548.

Cognitive Behaviors That Enable Self-Improving Reasoners

a year ago

279 points

549.

How popular media portrays the employability of older software developers

6 years ago

278 points

550.

Large language models lack deep insights or a theory of mind

3 years ago

277 points

551.

Scaling Transformer to 1M tokens and beyond with RMT

3 years ago

277 points

552.

The Modern Mathematics of Deep Learning

5 years ago

276 points

553.

Planting Undetectable Backdoors in Machine Learning Models

4 years ago

275 points

554.

Recursively summarizing enables long-term dialogue memory in LLMs

3 years ago

273 points

555.

Conway's Game of Life is omniperiodic

3 years ago

272 points

556.

Game Theory: Open Access textbook

9 years ago

271 points

557.

An Introduction to Probabilistic Programming

8 years ago

269 points

558.

A formula for the nth digit of 𝜋 and 𝜋^n

3 years ago

268 points

559.

Do not rug on me: Zero-dimensional Scam Detection

4 years ago

267 points

560.

3 years ago

267 points

561.

Llemma: An Open Language Model for Mathematics

3 years ago

267 points

562.

Hardware Acceleration of LLMs: A comprehensive survey and comparison

2 years ago

266 points

563.

Gamification affects software developers: Cautionary evidence from GitHub

4 years ago

264 points

564.

On the Impact of Programming Languages on Code Quality

7 years ago

264 points

565.

Bytes are all you need: Transformers operating directly on file bytes

3 years ago

263 points

566.

Fast Differentiable Sorting and Ranking

6 years ago

263 points

567.

HTML as an Accessible Format for Papers (2023)

7 months ago

262 points

568.

Chain of Thought empowers transformers to solve inherently serial problems

2 years ago

261 points

569.

Knowledge Graphs

6 years ago

261 points

570.

One pixel attack for deceiving deep neural networks

9 years ago

260 points