HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
MiniMax M3 vs. GLM 5.2: Codegen comparison across autonomous coding tasks
thinkwright.ai
19 comments
3 days ago
oceanwaves
55 points
2.
▲
Defensible Deep Research from Open-Weight Models
thinkwright.ai
discuss
8 days ago
oceanwaves
2 points
3.
▲
State of the Agent: Do coding agents know what they don't know?
thinkwright.ai
discuss
4 months ago
oceanwaves
2 points
4.
▲
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents
thinkwright.ai
discuss
4 months ago
oceanwaves
2 points
5.
▲
Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents
thinkwright.ai
1 comment
4 months ago
oceanwaves
1 points