Search: github.com/eval | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

241.

Can ELO tournaments be used to evaluate LLMs and RAG?

github.com/zetaalphavector

3 years ago

9 points

242.

Show HN: Evolve expressions that evaluate to a target number

github.com/yati-sagade

11 years ago

8 points

243.

Rllab – framework for developing and evaluating reinforcement learning algorithms

github.com/rllab

10 years ago

8 points

244.

Show HN: Code-Knack – A code evaluator on your web page

github.com/lyricat

7 years ago

8 points

245.

Show HN: REPIC.py – Read, Evaluate and Print in Comments

github.com/dpinney

9 years ago

7 points

246.

Tools for Evaluating and Exploiting Z-Wave Networks Using Software-Defined Radios

github.com/AFITWiSec

10 years ago

7 points

247.

QUIC Performance evaluation

github.com/maufl

11 years ago

7 points

248.

Show HN: Achieves Perfect 100 Score Across 6 Leading AI Model Evaluations

github.com/onestardao

a year ago

6 points

249.

Show HN: Pipevals – a visual pipeline builder for evaluation-driven AI

github.com/pipevals

3 months ago

6 points

250.

SHOW HN:AceStep1.5an on-device music model that beats Suno on common eval metric

github.com/ace-step

5 months ago

6 points

251.

Show HN: Rust-Lazy, safe, concurrent lazy evaluation

github.com/reem

12 years ago

6 points

252.

Fossier: A slop evaluator for GitHub PRs to prevent spams

github.com/PThorpe92

3 months ago

6 points

253.

Show HN: Performance evaluation of various Stable Diffusion models

github.com/fal-ai

3 years ago

6 points

254.

Sidewalk – fix WKWebview JavaScript evaluation memory leak

github.com/Danesz

5 years ago

6 points

255.

Show HN: Fcal – Extensive math expression evaluator library for JavaScript

github.com/5anthosh

6 years ago

6 points

256.

The oracle-free fragment of Lamping's algorithm can evaluate all λ-terms

github.com/MaiaVictor

9 years ago

6 points

257.

Ask HN: Evaluating Electron vs. Tauri for building a desktop app

github.com/firecamp-dev

3 years ago

5 points

258.

Aviary simplifies OSS LLM eval and deployment

github.com/ray-project

3 years ago

5 points

259.

Show HN: Infer – Use TensorFlow Models in Go to Evaluate Images

github.com/sjkaliski

8 years ago

5 points

260.

Lazy.js – underscore with lazy evaluation

github.com/dtao

12 years ago

5 points

261.

Show HN: New eval from SWE-bench team evalutes LMs based on goals not tickets

8 months ago

5 points

262.

I Just Released Alchemist v0.11.0 with Elixir Code Inline Evaluation – Emacs

github.com/tonini

12 years ago

5 points

263.

Show HN: JSMS – evaluate JavaScript via SMS

github.com/gberger

12 years ago

5 points

264.

Devectorize – A Julia Framework for De-vectorized Evaluation

github.com/lindahua

12 years ago

5 points

265.

Lazy.js: Utility library for JavaScript with lazy evaluation

github.com/dtao

13 years ago

5 points

266.

Promptfoo: Local LLM evals and red teaming

github.com/promptfoo

4 months ago

5 points

267.

RAG Chunk: CLI tool to parse, chunk, and evaluate Markdown documents for RAG

github.com/messkan

7 months ago

5 points

268.

Show HN: Relai-SDK – simulate → evaluate → optimize AI agents

github.com/relai-ai

8 months ago

5 points

269.

Conjure: Interactive Evaluation for Neovim (Clojure, Fennel, Racket, Guile,)

github.com/Olical

5 years ago

5 points

270.

A checklist for evaluating your supply chain security

github.com/cncf

5 years ago

5 points