HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
241.
▲
Can ELO tournaments be used to evaluate LLMs and RAG?
github.com/zetaalphavector
1 comment
3 years ago
zavrel
9 points
242.
▲
Show HN: Evolve expressions that evaluate to a target number
github.com/yati-sagade
4 comments
11 years ago
yati
8 points
243.
▲
Rllab – framework for developing and evaluating reinforcement learning algorithms
github.com/rllab
2 comments
10 years ago
dementrock
8 points
244.
▲
Show HN: Code-Knack – A code evaluator on your web page
github.com/lyricat
discuss
7 years ago
lyricat
8 points
245.
▲
Show HN: REPIC.py – Read, Evaluate and Print in Comments
github.com/dpinney
3 comments
9 years ago
maliker
7 points
246.
▲
Tools for Evaluating and Exploiting Z-Wave Networks Using Software-Defined Radios
github.com/AFITWiSec
discuss
10 years ago
cinquemb
7 points
247.
▲
QUIC Performance evaluation
github.com/maufl
discuss
11 years ago
jgrahamc
7 points
248.
▲
Show HN: Achieves Perfect 100 Score Across 6 Leading AI Model Evaluations
github.com/onestardao
8 comments
a year ago
TXTOS
6 points
249.
▲
Show HN: Pipevals – a visual pipeline builder for evaluation-driven AI
github.com/pipevals
2 comments
3 months ago
tilt
6 points
250.
▲
Show HN: AceStep1.5, an on-device music model that beats Suno on common eval metric
github.com/ace-step
1 comment
5 months ago
DanielWen
6 points
251.
▲
Show HN: Rust-Lazy, safe, concurrent lazy evaluation
github.com/reem
discuss
12 years ago
jonreem
6 points
252.
▲
Fossier: A slop evaluator for GitHub PRs to prevent spam
github.com/PThorpe92
discuss
3 months ago
SchwKatze
6 points
253.
▲
Show HN: Performance evaluation of various Stable Diffusion models
github.com/fal-ai
discuss
3 years ago
treesciencebot
6 points
254.
▲
Sidewalk – fix WKWebview JavaScript evaluation memory leak
github.com/Danesz
discuss
5 years ago
d4n3sz
6 points
255.
▲
Show HN: Fcal – Extensive math expression evaluator library for JavaScript
github.com/5anthosh
discuss
6 years ago
5anthosh
6 points
256.
▲
The oracle-free fragment of Lamping's algorithm can evaluate all λ-terms
github.com/MaiaVictor
discuss
9 years ago
LightMachine
6 points
257.
▲
Ask HN: Evaluating Electron vs. Tauri for building a desktop app
github.com/firecamp-dev
3 comments
3 years ago
Nishchit14
5 points
258.
▲
Aviary simplifies OSS LLM eval and deployment
github.com/ray-project
3 comments
3 years ago
waleedk
5 points
259.
▲
Show HN: Infer – Use TensorFlow Models in Go to Evaluate Images
github.com/sjkaliski
2 comments
8 years ago
sjkaliski
5 points
260.
▲
Lazy.js – underscore with lazy evaluation
github.com/dtao
1 comment
12 years ago
fenguin
5 points
261.
▲
Show HN: New eval from SWE-bench team evaluates LMs based on goals not tickets
codeclash.ai
1 comment
8 months ago
lieret
5 points
262.
▲
I Just Released Alchemist v0.11.0 with Elixir Code Inline Evaluation – Emacs
github.com/tonini
discuss
12 years ago
samueltonini
5 points
263.
▲
Show HN: JSMS – evaluate JavaScript via SMS
github.com/gberger
discuss
12 years ago
gberger
5 points
264.
▲
Devectorize – A Julia Framework for De-vectorized Evaluation
github.com/lindahua
discuss
12 years ago
rcthompson
5 points
265.
▲
Lazy.js: Utility library for JavaScript with lazy evaluation
github.com/dtao
discuss
13 years ago
gorm
5 points
266.
▲
Promptfoo: Local LLM evals and red teaming
github.com/promptfoo
discuss
4 months ago
tin7in
5 points
267.
▲
RAG Chunk: CLI tool to parse, chunk, and evaluate Markdown documents for RAG
github.com/messkan
discuss
7 months ago
handfuloflight
5 points
268.
▲
Show HN: Relai-SDK – simulate → evaluate → optimize AI agents
github.com/relai-ai
discuss
8 months ago
sfeizi
5 points
269.
▲
Conjure: Interactive Evaluation for Neovim (Clojure, Fennel, Racket, Guile,)
github.com/Olical
discuss
5 years ago
tosh
5 points
270.
▲
A checklist for evaluating your supply chain security
github.com/cncf
discuss
5 years ago
mooreds
5 points