Search: github.com/b1nc | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

781.

Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of python

github.com/SWE-agent

a year ago

7 points

782.

A caffeine driven, simplistic approach to benchmarking Node.js code.

github.com/logicalparadox

14 years ago

7 points

783.

Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure

github.com/lechmazur

a year ago

7 points

784.

Show HN: LLM Deceptiveness and Gullibility Benchmark

github.com/lechmazur

2 years ago

7 points

785.

Engulf, A Graphical HTTP Benchmarker written in Clojure + D3.js

github.com/andrewvc

14 years ago

7 points

786.

Wrk – an HTTP benchmarking tool

13 years ago

7 points

787.

Show HN: Get a report on your compliance to CIS Benchmarks (Azure and AWS)

github.com/4urcloud

2 years ago

7 points

788.

Show HN: Ben, your benchmarking assistant, written in Go

github.com/drish

8 years ago

7 points

789.

Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets

github.com/LinearBoost

2 years ago

6 points

790.

Ask HN: Are there any reliable benchmarks for Machine Learning Model Serving?

2 years ago

6 points

791.

Benchmark GGUF model with ONE line of code

github.com/NexaAI

2 years ago

6 points

792.

New code-focused LLM needle in the haystack benchmark

github.com/HammingHQ

2 years ago

6 points

793.

Response to Google's Keras-PyTorch Benchmarks

gist.github.com

2 years ago

6 points

794.

Volley: benchmarking tool for measuring performance of server networking stacks

github.com/jonhoo

10 years ago

6 points

795.

Objective-C Benchmark Library

github.com/MattesGroeger

13 years ago

6 points

796.

Show HN: Buyout Game Benchmark: Multi-Agent Bargaining, Transfers, and Takeovers

github.com/lechmazur

3 months ago

6 points

797.

Show HN: LLM Round‑Trip Translation Benchmark

github.com/lechmazur

9 months ago

6 points

798.

Pact: Head-to-head negotiation benchmark for LLMs

github.com/lechmazur

10 months ago

6 points

799.

Show HN: LLM Thematic Generalization Benchmark

github.com/lechmazur

a year ago

6 points

800.

Azure Llama 3.1 Benchmarks

github.com/Azure

2 years ago

6 points

801.

Benchmark for Audio Feature Extraction Libraries

github.com/libAudioFlux

3 years ago

6 points

802.

QuestDB 4.2.1 Release. SIMD performance gain 20%. Clickhouse benchmark

github.com/questdb

6 years ago

6 points

803.

Benchmarking slow initial queries gRPC vs. REST on GCP

github.com/saasify-sh

6 years ago

6 points

804.

Rust Regex Engine on JVM, via WebAssembly, Example and Benchmark

github.com/cretz

9 years ago

6 points

805.

Rust-learning: A bunch of links for learning Rust

github.com/ctjhoa

9 years ago

6 points

806.

C++ versus V8 versus luajit versus C benchmark – (hash) tables

gist.github.com

12 years ago

5 points

807.

Show HN: New eval from SWE-bench team evalutes LMs based on goals not tickets

8 months ago

5 points

808.

Socket.io benchmarking tool – Akinji

10 years ago

5 points

809.

Wrk - a HTTP benchmarking tool

13 years ago

5 points

810.

CodSpeed CLI: Deterministic benchmarking for any executable

github.com/CodSpeedHQ

5 months ago

5 points