HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
751.
▲
Dumb benchmarks of Sinatra-like libraries on Elixir, Ruby and Node.js
gist.github.com
6 comments
15 years ago
cookiestack
14 points
752.
▲
Koala: A benchmark suite for performance-oriented shell-optimization research
github.com/kbensh
3 comments
a year ago
matt_d
13 points
753.
▲
Fair Go vs. Elixir Benchmarks
github.com/antonputra
discuss
2 years ago
ahamez
13 points
754.
▲
Show HN: CivBench a long-horizon AI benchmark for multi-agent games
clashai.live
24 comments
4 months ago
mbh159
12 points
755.
▲
Open-source LLM cascading, up to 92% cost savings on benchmarks
github.com/lemony-ai
9 comments
6 months ago
saschabuehrle
12 points
756.
▲
An honest analysis of SpacetimeDB 2.0's insane benchmark results
gist.github.com
3 comments
4 months ago
brandonpollack2
12 points
757.
▲
Show HN: A benchmark + latency sim for LLM db queries: ClickHouse / Postgres
github.com/514-labs
3 comments
a year ago
oatsandsugar
12 points
758.
▲
Yahoo Cloud Serving Benchmark
wiki.github.com
discuss
16 years ago
helwr
12 points
759.
▲
Google/fuzzbench: Fuzzer benchmarking as a service
github.com/google
discuss
6 years ago
edward
11 points
760.
▲
A benchmark to compare synchronization techniques for multicore programming
github.com/gramoli
discuss
10 years ago
wsmith
11 points
761.
▲
HTTP benchmarking tool written in Crystal
github.com/Sdogruyol
discuss
11 years ago
sdogruyol
11 points
762.
▲
Show HN: Codex context bloat? 87% avg reduction on SWE-bench Verified traces
npmjs.com
2 comments
2 months ago
george_ciobanu
10 points
763.
▲
A mind-bending simulation of the movie Inception, in C and ASM.
github.com/karthick18
discuss
16 years ago
jlangenauer
10 points
764.
▲
Show HN: LLM Debate Benchmark
github.com/lechmazur
3 comments
3 months ago
zone411
9 points
765.
▲
Recursive grep written in Go benched against a C++ and Rust variant
github.com/bep
2 comments
2 months ago
bjornerik
9 points
766.
▲
LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models
github.com/lechmazur
discuss
3 months ago
zone411
9 points
767.
▲
You Do Not Need a Vector Database (For RAG): Benchmarking IR Methods
discuss
3 years ago
ylow
9 points
768.
▲
Trivial PHP string concatenation benchmarks, proving time better spent elsewhere.
github.com/magnetikonline
6 comments
12 years ago
magnetikonline
8 points
769.
▲
Real-world benchmarks
gist.github.com
2 comments
13 years ago
geelen
8 points
770.
▲
Show HN: Bazaar – a new LLM benchmark for economic reasoning under uncertainty
github.com/lechmazur
1 comment
a year ago
zone411
8 points
771.
▲
OpenChat_8192 Beats ChatGPT-3.5 on Vicuna GPT-4 Benchmark
1 comment
3 years ago
thibo_skabgia
8 points
772.
▲
Raspberry Pi httpd micro benchmark
gist.github.com
1 comment
10 years ago
mpg123
8 points
773.
▲
Show HN: LLM Creative Story‑Writing Benchmark V3
github.com/lechmazur
discuss
9 months ago
zone411
8 points
774.
▲
Show HN: LLM Divergent Thinking Creativity Benchmark
github.com/lechmazur
discuss
a year ago
zone411
8 points
775.
▲
Show HN: Iron Cushion, a CouchDB benchmark and load testing tool
github.com/mgp
discuss
14 years ago
shadowmatter
8 points
776.
▲
Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of Python
github.com/SWE-agent
4 comments
a year ago
lieret
7 points
777.
▲
A caffeine driven, simplistic approach to benchmarking Node.js code.
github.com/logicalparadox
3 comments
14 years ago
vesln
7 points
778.
▲
Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure
github.com/lechmazur
1 comment
a year ago
zone411
7 points
779.
▲
Show HN: LLM Deceptiveness and Gullibility Benchmark
github.com/lechmazur
1 comment
2 years ago
zone411
7 points
780.
▲
Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js
github.com/andrewvc
1 comment
14 years ago
andrewvc
7 points