HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
181.
▲
ClojureScript gets a new REPL
github.com/clojure
discuss
15 years ago
bashwort
4 points
182.
▲
OpenAI cookbook: using GPT-4 as “reference-free” evaluator
github.com/openai
discuss
3 years ago
zostale
3 points
183.
▲
Test cases took my AI router from 82% to 98% accuracy
github.com/copycat-main
1 comment
3 months ago
a8hi
2 points
184.
▲
Benchmark GGUF models with a one line of code
github.com/NexaAI
discuss
2 years ago
mountainview
1 point
185.
▲
Benchmark GGUF models with a ONE line of code
github.com/NexaAI
discuss
2 years ago
jinqueeny
1 point
186.
▲
Evaluation of robotics data recording file formats
github.com/foxglove
discuss
4 years ago
ahamez
1 point
187.
▲
Full LLM training and evaluation toolkit
github.com/huggingface
6 comments
2 years ago
testerui
249 points
188.
▲
RouteLLM: A framework for serving and evaluating LLM routers
github.com/lm-sys
36 comments
2 years ago
djhu9
244 points
189.
▲
Show HN: PromptTools – open-source tools for evaluating LLMs and vector DBs
github.com/hegelai
24 comments
3 years ago
krawfy
211 points
190.
▲
Interactive GCC (igcc) is a read-eval-print loop (REPL) for C/C++
github.com/alexandru-dinu
69 comments
3 years ago
pr337h4m
170 points
191.
▲
Comptime – C# meta-programming with compile-time code generation and evaluation
github.com/sebastienros
66 comments
6 months ago
bj-rn
150 points
192.
▲
Code, Eval, Play, Loop – Common Lisp OpenGL Environment
github.com/cbaggers
18 comments
11 years ago
_zhqs
139 points
193.
▲
Apache HTTP Server: 'RewriteCond expr' always evaluates to true
github.com/apache
70 comments
a year ago
Bogdanp
136 points
194.
▲
Lave: eval in reverse
github.com/jed
37 comments
10 years ago
danso
133 points
195.
▲
Show HN: Faster LLM evaluation with Bayesian optimization
github.com/rentruewang
43 comments
2 years ago
renchuw
131 points
196.
▲
Show HN: Ragas – Open-source library for evaluating RAG pipelines
github.com/explodinggradients
26 comments
2 years ago
shahules
121 points
197.
▲
LispE: Lisp Interpreter with Pattern Programming and Lazy Evaluation
github.com/naver
25 comments
5 months ago
PaulHoule
119 points
198.
▲
Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps
27 comments
a year ago
jeffreyip
117 points
199.
▲
A Fast Excel Formula Parser and Evaluator
github.com/LesterLyu
34 comments
4 years ago
EntICOnc
106 points
200.
▲
Show HN: Eole, a Lévy-optimal lambda calculus evaluator written in Rust
github.com/HerrmannM
9 comments
7 years ago
HerrmannM
106 points
201.
▲
Evaluation of Deep Learning Toolkits
github.com/zer0n
10 comments
10 years ago
marcelsalathe
94 points
202.
▲
Show HN: Lazy evaluation in Python
github.com/llllllllll
21 comments
11 years ago
joejev
88 points
203.
▲
Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents
github.com/google
24 comments
7 months ago
maxloh
86 points
204.
▲
Show HN: Opik, an open source LLM evaluation framework
github.com/comet-ml
15 comments
2 years ago
calebkaiser
86 points
205.
▲
PhaseLLM: Standardized Chat LLM API (Cohere, Claude, GPT) + Evaluation Framework
github.com/wgryc
3 comments
3 years ago
cl42
86 points
206.
▲
AutoMLPipeline – Create and evaluate machine learning pipeline architectures
github.com/IBM
13 comments
6 years ago
bwidlar
80 points
207.
▲
Cedar is an open source policy language and evaluation engine
github.com
17 comments
3 years ago
mooreds
72 points
208.
▲
Show HN: Paramount – Human Evals of AI Customer Support
github.com/ask-fini
44 comments
2 years ago
hakimk
71 points
209.
▲
Evaluate Markdown code blocks within Vim
github.com/gpanders
18 comments
2 years ago
pentestercrab
68 points
210.
▲
Show HN: LazyCode – C++14 composable, lazily evaluated map, filter, fold
github.com/SaadAttieh
5 comments
7 years ago
SaadAttieh
66 points