HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
31.
▲
Propositional Logic Calculator
github.com/lion137
discuss
7 years ago
tu7001
1 points
32.
▲
Go Expression Evaluation Comparison
github.com/antonmedv
discuss
7 years ago
zdw
1 points
33.
▲
Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam)
sup.ai
24 comments
3 months ago
supai
26 points
34.
▲
Show HN: PromptOptimizer – Minimize LLM token complexity to save cost
github.com/vaibkumr
2 comments
3 years ago
vaibkumr
4 points
35.
▲
Show HN: BenchFlow – Open-Source Benchmark Hub and Eval Infra for AI Devs
docs.benchflow.ai
discuss
a year ago
www_xiangyi_li
1 points
36.
▲
Show HN: AI Product Hunter – GenAI reviews/scores "all"of Producthunt everyday
ai-producthunt.com
discuss
2 years ago
tokiyaabe
1 points
37.
▲
Evaluating Technical Arguments
swanson.github.com
discuss
13 years ago
swanson
4 points
38.
▲
In Go, some evaluation orders in multi-value assignments are unspecified
github.com/go101
discuss
8 years ago
tapirl
3 points
39.
▲
Automated evaluation of coding round interviews
github.com/shekhargulati
discuss
9 years ago
java4all
2 points
40.
▲
Evaluating Technical Arguments
swanson.github.com
discuss
13 years ago
swanson
1 points
41.
▲
Evaluation of Covid-19 Models
github.com/youyanggu
discuss
6 years ago
sebg
1 points
42.
▲
Launch HN: Relari (YC W24) – Identify the root cause of problems in LLM apps
15 comments
2 years ago
antonap
106 points
43.
▲
Show HN: Skyvern 2.0 – open-source AI Browser Agent scoring 85.8% on WebVoyager
eval.skyvern.com
3 comments
a year ago
suchintan
9 points
44.
▲
Ask HN: Survey: Scripting languages for realtime applications
2 comments
9 years ago
schoetbi
2 points
45.
▲
Show HN: Dingo 1.9.0 released: With enhanced hallucination detection
github.com/MigoXLab
discuss
a year ago
e06084
2 points
46.
▲
An Evaluation of Location Encoding Systems (2018)
github.com/google
10 comments
5 years ago
Tomte
45 points
47.
▲
An Evaluation of Location Encoding Systems (2018)
github.com/google
discuss
7 years ago
Tomte
41 points
48.
▲
Evaluation of Location Encoding Systems (2021)
github.com/google
14 comments
3 years ago
tosh
32 points
49.
▲
Evaluation of Location Encoding Systems
github.com/google
discuss
4 years ago
sandebert
3 points
50.
▲
Anthropic's Prompt Evalutions Course
github.com/anthropics
discuss
2 years ago
thenameless7741
2 points
51.
▲
An Evaluation of Location Encoding Systems (2018)
github.com/google
discuss
4 years ago
Tomte
2 points
52.
▲
Google: Evaluation of Location Encoding Systems
github.com/google
discuss
9 years ago
espeed
2 points
53.
▲
Llama2 on Replicate faster than ChatGPT?
github.com/BerriAI
2 comments
3 years ago
ij23
1 points
54.
▲
Evaluation of Location Encoding Systems
github.com/google
discuss
8 years ago
CharlesDodgson
1 points
55.
▲
Show HN: High-performance GenAI engine now open source
github.com/arthur-ai
12 comments
a year ago
fryz
22 points
56.
▲
Show HN: Billion Cell Spreadsheets with Incremental Computation
xls.feldera.io
1 comment
a year ago
gz09
15 points
57.
▲
Show HN: Voicetest – open-source test harness for voice AI agents
discuss
4 months ago
pldpld
3 points
58.
▲
Sharing learnings from evaluating Million+ LLM responses
discuss
3 years ago
sourabh03agr
3 points
59.
▲
Show HN: Adventures in UTM – Busy Beaver in under 5–10 mins
2 comments
a year ago
polymetron
2 points
60.
▲
Show HN: Dingo – Automate Data Quality Checks Across Pre-Training and SFT Data
github.com/DataEval
discuss
a year ago
e06084
1 points
More