Search: github.com/bendc | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

751.

Dumb benchmarks of Sinatra-like libraries on Elixir, Ruby and Node.js

gist.github.com

15 years ago

14 points

752.

Koala: A benchmark suite for performance-oriented shell-optimization research

github.com/kbensh

a year ago

13 points

753.

Fair Go vs. Elixir Benchmarks

github.com/antonputra

2 years ago

13 points

754.

Show HN: CivBench a long-horizon AI benchmark for multi-agent games

4 months ago

12 points

755.

Open-source LLM cascading, up to 92% cost savings on benchmarks

github.com/lemony-ai

6 months ago

12 points

756.

An honest analysis of SpacetimeDB 2.0's insane benchmark results

gist.github.com

4 months ago

brandonpollack2

12 points

757.

Show HN: A benchmark + latency sim for LLM db queries: ClickHouse / Postgres

github.com/514-labs

a year ago

12 points

758.

Yahoo Cloud Serving Benchmark

wiki.github.com

16 years ago

12 points

759.

Google/fuzzbench: Fuzzer benchmarking as a service

github.com/google

6 years ago

11 points

760.

A benchmark to compare synchronization techniques for multicore programming

github.com/gramoli

10 years ago

11 points

761.

HTTP benchmarking tool written in Crystal

github.com/Sdogruyol

11 years ago

11 points

762.

Show HN: Codex context bloat? 87% avg reduction on SWE-bench Verified traces

2 months ago

10 points

763.

A mind-bending simulation of the movie Inception, in C and ASM.

github.com/karthick18

16 years ago

10 points

764.

Show HN: LLM Debate Benchmark

github.com/lechmazur

3 months ago

9 points

765.

Recursive grep written in Go benched against a C++ and Rust variant

2 months ago

9 points

766.

LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models

github.com/lechmazur

3 months ago

9 points

767.

You Do Not Need a Vector Database (For RAG): Benchmarking IR Methods

3 years ago

9 points

768.

Trival PHP string concatenation benchmarks, proving time better spent elsewhere.

github.com/magnetikonline

12 years ago

8 points

769.

Real-world benchmarks

gist.github.com

13 years ago

8 points

770.

Show HN: Bazaar – a new LLM benchmark for economic reasoning under uncertainty

github.com/lechmazur

a year ago

8 points

771.

OpenChat_8192 Beats ChatGPT-3.5 on Vicuna GPT-4 Benchmark

3 years ago

8 points

772.

Raspberry Pi httpd micro benchmark

gist.github.com

10 years ago

8 points

773.

Show HN: LLM Creative Story‑Writing Benchmark V3

github.com/lechmazur

9 months ago

8 points

774.

Show HN: LLM Divergent Thinking Creativity Benchmark

github.com/lechmazur

a year ago

8 points

775.

Show HN: Iron Cushion, a CouchDB benchmark and load testing tool

14 years ago

8 points

776.

Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of python

github.com/SWE-agent

a year ago

7 points

777.

A caffeine driven, simplistic approach to benchmarking Node.js code.

github.com/logicalparadox

14 years ago

7 points

778.

Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure

github.com/lechmazur

a year ago

7 points

779.

Show HN: LLM Deceptiveness and Gullibility Benchmark

github.com/lechmazur

2 years ago

7 points

780.

Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js

github.com/andrewvc

14 years ago

7 points