HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
DeepSWE results are unreliable – 3/3 DSv4 "failed" tasks solved with same model
github.com/datacurve-ai
discuss
18 days ago
theanonymousone
3 points
2.
▲
DeepSWE Audit: DeepSeek-v4-pro results are unreliable
github.com/datacurve-ai
discuss
19 days ago
eunos
3 points