HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
781.
▲
Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of python
github.com/SWE-agent
4 comments
a year ago
lieret
7 points
782.
▲
A caffeine driven, simplistic approach to benchmarking Node.js code.
github.com/logicalparadox
3 comments
14 years ago
vesln
7 points
783.
▲
Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure
github.com/lechmazur
1 comment
a year ago
zone411
7 points
784.
▲
Show HN: LLM Deceptiveness and Gullibility Benchmark
github.com/lechmazur
1 comment
2 years ago
zone411
7 points
785.
▲
Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js
github.com/andrewvc
1 comment
14 years ago
andrewvc
7 points
786.
▲
Wrk – an HTTP benchmarking tool
github.com/wg
discuss
13 years ago
jnazario
7 points
787.
▲
Show HN: Get a report on your compliance to CIS Benchmarks (Azure and AWS)
github.com/4urcloud
discuss
2 years ago
adrien4urcloud
7 points
788.
▲
Show HN: Ben, your benchmarking assistant, written in Go
github.com/drish
discuss
8 years ago
drish
7 points
789.
▲
Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets
github.com/LinearBoost
5 comments
2 years ago
hamid9
6 points
790.
▲
Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?
3 comments
2 years ago
KuriousCat
6 points
791.
▲
Benchmark GGUF model with ONE line of code
github.com/NexaAI
1 comment
2 years ago
alanzhuly
6 points
792.
▲
New code-focused LLM needle in the haystack benchmark
github.com/HammingHQ
1 comment
2 years ago
sumanyusharma
6 points
793.
▲
Response to Google's Keras-PyTorch Benchmarks
gist.github.com
1 comment
2 years ago
mindcrime
6 points
794.
▲
Volley: benchmarking tool for measuring performance of server networking stacks
github.com/jonhoo
1 comment
10 years ago
0xmohit
6 points
795.
▲
Objective-C Benchmark Library
github.com/MattesGroeger
discuss
13 years ago
Smiller
6 points
796.
▲
Show HN: Buyout Game Benchmark: Multi-Agent Bargaining, Transfers, and Takeovers
github.com/lechmazur
discuss
3 months ago
zone411
6 points
797.
▲
Show HN: LLM Round‑Trip Translation Benchmark
github.com/lechmazur
discuss
9 months ago
zone411
6 points
798.
▲
Pact: Head-to-head negotiation benchmark for LLMs
github.com/lechmazur
discuss
10 months ago
zone411
6 points
799.
▲
Show HN: LLM Thematic Generalization Benchmark
github.com/lechmazur
discuss
a year ago
zone411
6 points
800.
▲
Azure Llama 3.1 Benchmarks
github.com/Azure
discuss
2 years ago
georgehill
6 points
801.
▲
Benchmark for Audio Feature Extraction Libraries
github.com/libAudioFlux
discuss
3 years ago
james0517
6 points
802.
▲
QuestDB 4.2.1 Release. SIMD performance gain 20%. Clickhouse benchmark
github.com/questdb
discuss
6 years ago
bluestreak
6 points
803.
▲
Benchmarking slow initial queries gRPC vs. REST on GCP
github.com/saasify-sh
discuss
6 years ago
transitivebs
6 points
804.
▲
Rust Regex Engine on JVM, via WebAssembly, Example and Benchmark
github.com/cretz
discuss
9 years ago
Lapz
6 points
805.
▲
Rust-learning: A bunch of links for learning Rust
github.com/ctjhoa
discuss
9 years ago
Tomte
6 points
806.
▲
C++ versus V8 versus luajit versus C benchmark – (hash) tables
gist.github.com
1 comment
12 years ago
tambourine_man
5 points
807.
▲
Show HN: New eval from SWE-bench team evalutes LMs based on goals not tickets
codeclash.ai
1 comment
8 months ago
lieret
5 points
808.
▲
Socket.io benchmarking tool – Akinji
1 comment
10 years ago
bordobereli
5 points
809.
▲
Wrk - a HTTP benchmarking tool
github.com/wg
discuss
13 years ago
shawndumas
5 points
810.
▲
CodSpeed CLI: Deterministic benchmarking for any executable
github.com/CodSpeedHQ
discuss
5 months ago
art049
5 points
More