HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
31.
▲
Measuring Time Horizon Using Claude Code and Codex
metr.org
discuss
4 months ago
mustaphah
1 points
32.
▲
METR releases Time Horizon 1.1 with 34% more tasks
metr.org
discuss
5 months ago
mustaphah
1 points
33.
▲
AI Doubling Time Horizon v1.1
metr.org
discuss
5 months ago
chriskanan
1 points
34.
▲
METR Clarifying limitations of time horizon
metr.org
discuss
5 months ago
alphabetatango
1 points
35.
▲
METR review of OpenAI's GPT-OSS fine-tuning safety methodology
metr.org
discuss
8 months ago
mustaphah
1 points
36.
▲
Measuring Impact of 2025 AI on Experienced Open-Source Developer Productivity [pdf]
metr.org
discuss
a year ago
sonabinu
1 points
37.
▲
Measuring Automated Kernel Engineering
metr.org
discuss
a year ago
gsky
1 points
38.
▲
Evaluating frontier AI R&D capabilities of LLM agents against human experts
metr.org
discuss
2 years ago
tedsanders
1 points