Search: github.com/bendc | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

931.

Show HN: Benchmarking language models by playing text adventures

github.com/s-macke

3 years ago

2 points

932.

Web Content Compression Benchmark.zlib brotli zstd zlib_ng libdeflate igzip,...

github.com/powturbo

3 years ago

2 points

933.

Show HN: TurboBench: Dynamic/Static web content compression benchmark

github.com/powturbo

3 years ago

2 points

934.

Wa-SQLite (WASM SQLite) benchmark discussion

github.com/rhashimoto

3 years ago

2 points

935.

Is linear regression better than prophet? Zillow benchmark

github.com/Nixtla

4 years ago

2 points

936.

[benchmarks] MongoDB kicks MySQL's ass no matter the circumstances

15 years ago

2 points

937.

Server Benchmarks For: Elixir Ruby Nim Node Clojure Java Rust Python Go Crystal

github.com/costajob

10 years ago

2 points

938.

Gobenchdb: store go test bench data in a database

github.com/yhat

11 years ago

2 points

939.

Show HN: Proof of concept for using HTTP headers to benchmark latency

github.com/montanaflynn

12 years ago

2 points

940.

A benchmarking suite for PHP implementations running real-world apps

github.com/hhvm

12 years ago

2 points

941.

Readygo, a Ruby benchmarking tool by Gary Bernhardt

github.com/garybernhardt

12 years ago

2 points

942.

Scala Web Frameworks Benchmark

github.com/Versal

13 years ago

2 points

943.

Show HN: InferBench – Benchmark local LLM engines with one click

github.com/JoniMartin27

20 days ago

2 points

944.

BrowseComp-Plus: A More Fair and Transparent Benchmark of Deep-Research Agent

github.com/texttron

20 days ago

colonCapitalDee

2 points

945.

Show HN: AgentThreatBench – Benchmark for AI Agent Memory Security

github.com/OWASP

24 days ago

2 points

946.

Prompter – Compare and benchmark Ollama models side-by-side in your terminal

github.com/whonixnetworks

a month ago

2 points

947.

Show HN: 97% on SWE-bench Verified with subscription-token agents

github.com/kimjune01

a month ago

2 points

948.

Show HN: Verdict – model evals on your own data, not someone else's benchmark

github.com/aevyraai

2 months ago

2 points

949.

talkie-coder: From 1930 to SWE-bench

github.com/RicardoDominguez

2 months ago

2 points

950.

Open macro placement benchmark and $20k challenge (HRT-sponsored)

github.com/partcleda

3 months ago

2 points

951.

Show HN: WMB-100K – Open benchmark for AI memory systems at 100K turns

github.com/Irina1920

3 months ago

2 points

952.

Show HN: OpenClaw Arena – Benchmark models on real tasks, rank by perf and cost

3 months ago

2 points

953.

An open source benchmarking framework for IT automation

github.com/itbench-hub

3 months ago

2 points

954.

Mitata: Benchmark tooling that loves you

github.com/evanwashere

3 months ago

2 points

955.

Help me improving this benchmark for vector engines

github.com/M4iKZ

3 months ago

2 points

956.

Some critical issues with the SWE-bench-Pro environments

github.com/SWE-agent

3 months ago

2 points

957.

BetterKV – A multithreaded Rust Redis alternative, 10-30x faster in benchmarks

3 months ago

2 points

958.

Show HN: ModelSweep - Open-Source Benchmarking for Local LLMs

github.com/leonickson1

3 months ago

2 points

959.

FratBench – Social Calibration Benchmark (OAI Scores Dead Last) [pdf]

github.com/richar-wang

3 months ago

2 points

960.

TLAi+ Benchmarks for Evaluating LLMs

github.com/tlaplus

4 months ago

2 points