HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Judge0 – the most advanced open-source online code execution system in the world
github.com/judge0
discuss
2 years ago
digitalnalogika
5 points
2.
▲
Judge0 API Goes Freemium
github.com/judge0
discuss
6 years ago
_q35k
1 points
3.
▲
Ask HN: Help me improve my C-like language, C3
7 comments
6 years ago
Nuoji
12 points
4.
▲
Show HN: The actual registry price of 246 TLD's
github.com/judge2020
1 comment
8 years ago
judge2020
4 points
5.
▲
Show HN: Fast open-source autograding library written in Django
github.com/arthtyagi
discuss
6 years ago
arthtyagi
4 points
6.
▲
The real cost of TLDs
github.com/judge2020
1 comment
6 years ago
hexene
3 points
7.
▲
Show HN: Fast open-source autograding library written in Django
github.com/arthtyagi
discuss
6 years ago
arthtyagi
3 points
8.
▲
Show HN: Fast Open-source autograder for coding problems (Django)
github.com/arthtyagi
1 comment
6 years ago
arthtyagi
2 points
9.
▲
Show HN: Blazing fast open-source autograder for coding problems (Django)
github.com/arthtyagi
1 comment
6 years ago
arthtyagi
2 points
10.
▲
Show HN: Open-source autograder for coding problems (Django)
github.com/arthtyagi
discuss
6 years ago
arthtyagi
2 points
11.
▲
Show HN: Fast open-source autograding library written in Django
github.com/arthtyagi
discuss
6 years ago
arthtyagi
1 points
12.
▲
Show HN: The real registration cost of TLDs
github.com/judge2020
discuss
7 years ago
judge2020
1 points
13.
▲
Composo open-sources its LLM-as-Judge technique (83.6% on RewardBench 2)
github.com/composo-ai
discuss
3 months ago
mlukewizard
5 points
14.
▲
Awesome-LLM-Judges
github.com/haizelabs
discuss
a year ago
leonardtang
2 points
15.
▲
LLM Judges
github.com/haizelabs
discuss
a year ago
leonardtang
2 points
16.
▲
UVa Online Judge Solutions Repo (Work in Progress)
github.com/jcbages
discuss
9 years ago
jcbages
2 points
17.
▲
Show HN: Lightweight LLM-as-a-Judge Tool
github.com/frequena
discuss
10 months ago
frequena
2 points
18.
▲
Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost
app.uniclaw.ai
discuss
3 months ago
skysniper
2 points
19.
▲
The Divine Judgement: Enforce TypeScript Types at Runtime
github.com/Divine-Software
discuss
3 years ago
LeviticusMB
1 points
20.
▲
Ranking 1k ShowHN posts by estimated merit using an LLM judge and TrueSkill
github.com/kouhxp
2 comments
a month ago
mrkn1
7 points
21.
▲
Type-challenges: Collection of TypeScript type challenges with online judge
github.com/type-challenges
discuss
3 years ago
olalonde
4 points
22.
▲
LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers
github.com/dial481
3 comments
3 months ago
dial481
3 points
23.
▲
Show HN: Using AI to judge a drinking game – SplitTheG.dev
splittheg.dev
2 comments
a year ago
BitNibbleByte
3 points
24.
▲
Show HN: Signals – finding the most informative agent traces without LLM judges
arxiv.org
discuss
3 months ago
sparacha
3 points
25.
▲
Justice: Yet Another Online Judge
github.com
discuss
7 years ago
liumangchao
3 points
26.
▲
Show HN: Grading Notes for LLM-as-Judge
github.com/shabie
3 comments
2 years ago
shabie
2 points
27.
▲
Show HN: pg_roast – A Postgres extension that harshly judges your database
github.com/samirketema
1 comment
2 months ago
samirketema
2 points
28.
▲
Open-source LLM-as-judge eval suite with root cause analysis and failure mining
github.com/colingfly
1 comment
3 months ago
colinfly
2 points
29.
▲
Show HN: Yet Another Online Judge Implementation
github.com
1 comment
7 years ago
zsgsdesign
2 points
30.
▲
Codejudge: A lightweight online judge
github.com/sankha93
discuss
13 years ago
sankha93
2 points
More