HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
331.
▲
Show HN: Prompt-to-proof: reproducible LLM eval with hash-chained receipts
github.com/kju4q
discuss
10 months ago
Qendresahoti
3 points
332.
▲
Provider-agnostic, open-source evaluation infra for LLMs
github.com/groq
discuss
a year ago
nkko
3 points
333.
▲
Show HN: Zbench, RAG evals using chess Elo ratings
github.com/zeroentropy-ai
discuss
a year ago
ghita_
3 points
334.
▲
MCPvals, an eval library for MCP Servers
github.com/Kylejeong2
discuss
a year ago
gniting
3 points
335.
▲
A collection of resources about normalization-by-evaluation
github.com/etiams
discuss
a year ago
etiams
3 points
336.
▲
LLMRank – ranking LLMs using peer-based cross-evaluation and PageRank
github.com/marquisdepolis
discuss
a year ago
larsiusprime
3 points
337.
▲
Jexpr – Expression parser and evaluator for JavaScript
github.com/justinfagnani
discuss
2 years ago
brianzelip
3 points
338.
▲
Show HN: Open-Source Evaluation and Testing for Computer Vision Models
github.com/Giskard-AI
discuss
2 years ago
alexcombessie
3 points
339.
▲
Open-Source Evaluation and Testing Framework for Computer Vision Models
discuss
2 years ago
iamheinrich
3 points
340.
▲
Show HN: TEAMMATES: free tool for managing peer evaluations built by Students
github.com/TEAMMATES
discuss
2 years ago
kyawzazaw
3 points
341.
▲
Ragas: Open-source Evaluation framework for RAG pipelines
github.com/explodinggradients
discuss
3 years ago
pranay01
3 points
342.
▲
FastRepl – open-source evals for RAG, Agents
github.com/repllabs
discuss
3 years ago
ij23
3 points
343.
▲
An open platform for training, serving, and evaluating large language models
github.com/lm-sys
discuss
3 years ago
udev4096
3 points
344.
▲
LegalBench: To evaluate English large language models on legal reasoning
github.com/HazyResearch
discuss
3 years ago
pella
3 points
345.
▲
CCTV-Exposure – Evaluate potential privacy exposure to CCTV cameras
github.com/Fuziih
discuss
4 years ago
chris_overseas
3 points
346.
▲
Show HN: RexMex – A Recommender Systems Evaluation Metrics Library
github.com/AstraZeneca
discuss
4 years ago
ptgtemporal
3 points
347.
▲
Popsom: R package for the creation and evaluation of self-organizing maps
github.com/lutzhamel
discuss
5 years ago
teleforce
3 points
348.
▲
Show HN: GUI application for auto-evaluating your Golang code
github.com/nkoporec
discuss
5 years ago
nkoporec
3 points
349.
▲
Datasets and Evaluation Metrics for NLP
github.com/huggingface
discuss
6 years ago
dragonsh
3 points
350.
▲
Show HN: We evaluated the Portuguese ELMo models published to AllenNLP this week
github.com/ruanchaves
discuss
6 years ago
ruanchaves
3 points
351.
▲
Show HN: JS bindings for QuickJS, control over eval, inspired by Figma's plugins
github.com/justjake
discuss
6 years ago
jitl
3 points
352.
▲
Show HN: Hybrid Math Expression Evaluator
github.com/5anthosh
discuss
6 years ago
5anthosh
3 points
353.
▲
AshPy: TensorFlow 2.0 library for quick model prototyping, training, and eval
github.com/zurutech
discuss
7 years ago
me2too
3 points
354.
▲
Expr is package to evaluate expressions using bytecode virtual machine in Go
github.com/antonmedv
discuss
7 years ago
medv
3 points
355.
▲
Interactive Go Interpreter/debugger with REPL, Eval, Generics, Lisp-Like Macros
github.com/cosmos72
discuss
8 years ago
ansible
3 points
356.
▲
Show HN: Shell Automation with JavaScript and Lazy Evaluation
github.com/jeswin
discuss
8 years ago
jeswin
3 points
357.
▲
Show HN: Visualizing arithmetic and logical expression evaluation in Swift
github.com/mpangburn
discuss
8 years ago
mpangburn
3 points
358.
▲
Free video course on evaluating and planning A/B tests using R
github.com/WinVector
discuss
9 years ago
jmount
3 points
359.
▲
Partial Evaluation, Futamura Projection and Their Applications
gist.github.com
discuss
9 years ago
mirceasoaica
3 points
360.
▲
Show HN: Mathcat – An expression evaluating library and REPL in Go
github.com/soudy
discuss
10 years ago
soud
3 points