Evals in 2025: going beyond simple benchmarks to build models people can use

Heykuki News

80 points

9 months ago

8 comments

