Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents

Heykuki News

2 points

4 months ago

No comments

Threaded

Loading comments...

Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents | Heykuki News