My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf]

Heykuki News

1 point

a month ago

2 comments

Threaded

Loading comments...

My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf] | Heykuki News