HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps
27 comments
a year ago
jeffreyip
117 points
2.
▲
Show HN: DeepTeam – Open-Source Red-Teaming Framework for LLM Security
github.com/confident-ai
discuss
a year ago
sidmurali23
4 points
3.
▲
Show HN: DeepTeam – Penetration Testing for LLMs
github.com/confident-ai
discuss
a year ago
jeffreyip
3 points
4.
▲
DeepTeam: Penetration Testing for LLMs
discuss
a year ago
jeffreyip
2 points
5.
▲
Show HN: Tag driven changelog generator (MDX) with optional LLM summaries
1 comment
5 months ago
dustfinger
1 points
6.
▲
DeepTeam: Open-Source Pennetration Testing for LLMs
discuss
a year ago
jeffreyip
1 points
7.
▲
Show HN: I implemented evals metrics for LLMs that runs locally on your machine
github.com/confident-ai
3 comments
3 years ago
3d27
22 points
8.
▲
Show HN: DeepEval – Evaluation and Unit Testing for LLMs
github.com/confident-ai
8 comments
3 years ago
jacky2wong
18 points
9.
▲
Show HN: DeepEval – Unit Testing for LLMs (Open Science)
github.com/confident-ai
discuss
3 years ago
jacky2wong
6 points
10.
▲
DeepEval – Neural Framework for Testing LLMs
github.com/confident-ai
discuss
3 years ago
jacky2wong
2 points
11.
▲
Unit Testing for Rag
github.com/confident-ai
discuss
3 years ago
jacky2wong
2 points
12.
▲
DeepEval CLI
github.com/confident-ai
discuss
3 years ago
jacky2wong
2 points
13.
▲
Has anyone ever used the Python framework "Deepeval"?
github.com/confident-ai
discuss
a year ago
willmarquis
1 points