HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
Ask HN: How to run Piper text-to-speech on a Mac (using Docker)?
discuss
2 years ago
dv35z
1 points
62.
▲
Which other AI search engines should we keep an eye on?
discuss
2 years ago
james_chu
1 points
63.
▲
Medical Question-Answer AI Model Evaluation Framework
github.com/chat-data-llc
2 comments
2 years ago
freexiaosu
4 points
64.
▲
ClojureScript gets a new REPL
github.com/clojure
discuss
15 years ago
bashwort
4 points
65.
▲
OpenAI cookbook: using GPT-4 as “reference-free” evaluator
github.com/openai
discuss
3 years ago
zostale
3 points
66.
▲
Evaluation of robotics data recording file formats
github.com/foxglove
discuss
4 years ago
ahamez
1 points
67.
▲
Full LLM training and evaluation toolkit
github.com/huggingface
6 comments
2 years ago
testerui
249 points
68.
▲
RouteLLM: A framework for serving and evaluating LLM routers
github.com/lm-sys
36 comments
2 years ago
djhu9
244 points
69.
▲
Show HN: PromptTools – open-source tools for evaluating LLMs and vector DBs
github.com/hegelai
24 comments
3 years ago
krawfy
211 points
70.
▲
Comptime – C# meta-programming with compile-time code generation and evaluation
github.com/sebastienros
66 comments
6 months ago
bj-rn
150 points
71.
▲
Apache HTTP Server: 'RewriteCond expr' always evaluates to true
github.com/apache
70 comments
a year ago
Bogdanp
136 points
72.
▲
Show HN: Faster LLM evaluation with Bayesian optimization
github.com/rentruewang
43 comments
2 years ago
renchuw
131 points
73.
▲
Evals: a framework for evaluating OpenAI models and a registry of benchmarks
github.com/openai
16 comments
3 years ago
tosh
123 points
74.
▲
Show HN: Ragas – Open-source library for evaluating RAG pipelines
github.com/explodinggradients
26 comments
2 years ago
shahules
121 points
75.
▲
LispE: Lisp Interpreter with Pattern Programming and Lazy Evaluation
github.com/naver
25 comments
5 months ago
PaulHoule
119 points
76.
▲
Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps
27 comments
a year ago
jeffreyip
117 points
77.
▲
A Fast Excel Formula Parser and Evaluator
github.com/LesterLyu
34 comments
4 years ago
EntICOnc
106 points
78.
▲
Show HN: Eole, a Lévy-optimal lambda calculus evaluator written in Rust
github.com/HerrmannM
9 comments
7 years ago
HerrmannM
106 points
79.
▲
Evaluation of Deep Learning Toolkits
github.com/zer0n
10 comments
10 years ago
marcelsalathe
94 points
80.
▲
Show HN: Lazy evaluation in Python
github.com/llllllllll
21 comments
11 years ago
joejev
88 points
81.
▲
Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents
github.com/google
24 comments
7 months ago
maxloh
86 points
82.
▲
Show HN: Opik, an open source LLM evaluation framework
github.com/comet-ml
15 comments
2 years ago
calebkaiser
86 points
83.
▲
PhaseLLM: Standardized Chat LLM API (Cohere, Claude, GPT) + Evaluation Framework
github.com/wgryc
3 comments
3 years ago
cl42
86 points
84.
▲
AutoMLPipeline – Create and evaluate machine learning pipeline architectures
github.com/IBM
13 comments
6 years ago
bwidlar
80 points
85.
▲
Cedar is an open source policy language and evaluation engine
github.com
17 comments
3 years ago
mooreds
72 points
86.
▲
Evaluate Markdown code blocks within Vim
github.com/gpanders
18 comments
2 years ago
pentestercrab
68 points
87.
▲
Show HN: LazyCode – C++14 composable, lazily evaluated map, filter, fold
github.com/SaadAttieh
5 comments
7 years ago
SaadAttieh
66 points
88.
▲
TensorFlow Model Analysis – A library for evaluating TensorFlow models
github.com/tensorflow
12 comments
8 years ago
wjarek
58 points
89.
▲
Show HN: A MCP server to evaluate Python code in WASM VM using RustPython
github.com/tuananh
13 comments
a year ago
tuananh
41 points
90.
▲
Show HN: Tonic Validate Metrics – an open-source RAG evaluation metrics package
github.com/TonicAI
17 comments
3 years ago
Ephil012
40 points
More