HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
121.
▲
LLM-eval-kit: Distributed LLM evaluation framework (v0.3.0)
github.com/benmeryem-tech
discuss
2 months ago
benmeryem_ai
1 point
122.
▲
CLI that grades website content quality – Stripe.com got an F
github.com/samuelrkestenbaum-dot
discuss
3 months ago
samkest419
1 point
123.
▲
Show HN: Filtering "Who's Hiring" with LLMs – native desktop app in Rust/egui
github.com/exlee
discuss
3 months ago
xlii
1 point
124.
▲
Show HN: LLM Evaluator for "Who is hiring" threads
github.com/exlee
discuss
4 months ago
xlii
1 point
125.
▲
Show HN: O(1) memory attention – 512K tokens in 3.85 GB (eval binary)
github.com/RegularJoe-CEO
discuss
5 months ago
luxiedge
1 point
126.
▲
Job postings evaluator against your resume (Chrome extension)
github.com/alikh31
discuss
5 months ago
alikhoramshahi
1 point
127.
▲
Policy Evaluation in Grid World
github.com/elliotvilhelm
discuss
2 years ago
monadicmonad
1 point
128.
▲
Tracking an LLM Evaluator Using Comet
github.com/dair-ai
discuss
3 years ago
omarsar
1 point
129.
▲
Propositional Logic Calculator
github.com/lion137
discuss
7 years ago
tu7001
1 point
130.
▲
Parsing Mitre EDR Evaluation Results
github.com/zshehri
discuss
7 years ago
based2
1 point
131.
▲
Go Expression Evaluation Comparison
github.com/antonmedv
discuss
7 years ago
zdw
1 point
132.
▲
Eval.js – A JavaScript interpreter written in JavaScript
github.com/marten-de-vries
discuss
10 years ago
hugs
1 point
133.
▲
Show HN: Fine-tuned Llama 3.2 3B to match 70B models for local transcripts
bilawal.net
8 comments
10 months ago
phantompeace
31 points
134.
▲
Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam)
sup.ai
24 comments
3 months ago
supai
26 points
135.
▲
Show HN: Cognee – Open-Source AI Memory Layer That Remembers Context
github.com/topoteretes
2 comments
a year ago
vasa_
9 points
136.
▲
Show HN: PromptOptimizer – Minimize LLM token complexity to save cost
github.com/vaibkumr
2 comments
3 years ago
vaibkumr
4 points
137.
▲
Show HN: See – searchable JSON compression, smaller than ZSTD (on our data)
github.com/kodomonocch1
1 comment
4 months ago
Tetsuro
3 points
138.
▲
Show HN: Legal Action Boundary Eval for agentic legal workflows
github.com/bigkan8
2 comments
2 months ago
kankouadio_vx
2 points
139.
▲
Show HN: BenchFlow – Open-Source Benchmark Hub and Eval Infra for AI Devs
docs.benchflow.ai
discuss
a year ago
www_xiangyi_li
1 point
140.
▲
Show HN: AI Product Hunter – GenAI reviews/scores "all" of Producthunt every day
ai-producthunt.com
discuss
2 years ago
tokiyaabe
1 point
141.
▲
Eval($_POST[cmd])
github.com
8 comments
11 years ago
brevis
12 points
142.
▲
Evaluating Technical Arguments
swanson.github.com
discuss
13 years ago
swanson
4 points
143.
▲
Engineering JavaScript's eval
brownplt.github.com
discuss
14 years ago
p4bl0
3 points
144.
▲
In Go, some evaluation orders in multi-value assignments are unspecified
github.com/go101
discuss
8 years ago
tapirl
3 points
145.
▲
Show HN: Dbt-LLM-evals – Monitor LLM quality in your data warehouse
github.com/paradime-io
1 comment
5 months ago
fdileta
2 points
146.
▲
Show HN: Synthetic Data Generation Using LangChain for IR and RAG Evaluation
github.com/mddunlap924
discuss
3 years ago
tdunlap607
2 points
147.
▲
Automated evaluation of coding round interviews
github.com/shekhargulati
discuss
9 years ago
java4all
2 points
148.
▲
Evaluating Technical Arguments
swanson.github.com
discuss
13 years ago
swanson
1 point
149.
▲
Show HN: Social proof works 2-7x better on AI shopping agents than humans
github.com/aaronbatchelder
discuss
4 months ago
aaronmb7
1 point
150.
▲
defer-import-eval: proposal for introducing a way to defer evaluate of a module
github.com/tc39
discuss
10 months ago
tilt
1 point