Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

211.

A Case for Safe Eval

github.com/robert-j-webb

8 years ago

58 points

212.

TensorFlow Model Analysis – A library for evaluating TensorFlow models

github.com/tensorflow

8 years ago

58 points

213.

Show HN: A MCP server to evaluate Python code in WASM VM using RustPython

github.com/tuananh

a year ago

41 points

214.

Show HN: Tonic Validate Metrics – an open-source RAG evaluation metrics package

github.com/TonicAI

3 years ago

40 points

215.

Generic engine to evaluate logical circuits on homomorphic encryption

github.com/virtualsecureplatform

5 years ago

38 points

216.

Stop Evaluating LLMs on Vibes

github.com/truera

3 years ago

35 points

217.

Show HN: Create LLM graders and run evals in JavaScript with one file

github.com/bolt-foundry

a year ago

28 points

218.

Show HN: SumEval – Multi-language evaluation framework for text summarization

github.com/chakki-works

9 years ago

25 points

219.

λ-calculus evaluator

zaach.github.com

16 years ago

24 points

220.

Evaluate Scheme in Ruby's virtual machine

gist.github.com

14 years ago

24 points

221.

Numexpr: Fast numerical array expression evaluator for Python, NumPy, Pandas

github.com/pydata

a month ago

23 points

222.

Show HN: Phoenix OSS – Applying LLM Spans, Traces, and Evals for AI Insights

github.com/Arize-ai

3 years ago

23 points

223.

Show HN: I implemented evals metrics for LLMs that runs locally on your machine

github.com/confident-ai

3 years ago

22 points

224.

Utility to estimate tasks using PERT (Program evaluation and review technique)

github.com/arzzen

10 years ago

22 points

225.

Thorn in a HaizeStack test for evaluating long-context adversarial robustness

github.com/haizelabs

2 years ago

19 points

226.

Math.mk - GNUmake eval gone wild

github.com/adam-f

14 years ago

19 points

227.

Show HN: DeepEval – Evaluation and Unit Testing for LLMs

github.com/confident-ai

3 years ago

18 points

228.

Python Search – eval(raw_input())

12 years ago

17 points

229.

Show HN: Ragas – Open-source library for evals and testing RAG systems

github.com/explodinggradients

2 years ago

15 points

230.

Show HN: An Empirical Evaluation of Linear Probing Algorithms

github.com/senderista

7 years ago

14 points

231.

Show HN: Promptloop – create, run, and improve prompt evals from the terminal

github.com/Bella3202019

24 days ago

13 points

232.

Show HN: Evaluate LLM-based RAG Applications with automated test set generation

github.com/Giskard-AI

2 years ago

13 points

233.

Common Expression Language (CEL); lightweight expression evaluation

github.com/google

5 years ago

12 points

234.

How Erlang evaluates funs (i.e. lambdas)

gist.github.com

17 years ago

12 points

235.

Show HN: UpTrain (YC W23) – open-source tool to evaluate LLM response quality

demo.uptrain.ai

3 years ago

12 points

236.

Show HN: Open-source toolkit for ML model evaluation and active learning

github.com/encord-team

3 years ago

11 points

237.

Fexl – Highly robust functional evaluation

github.com/chkoreff

12 years ago

10 points

238.

Show HN: Kiln – AI Boilerplate with Evals, Fine-Tuning, Synthetic Data, and Git

github.com/Kiln-AI

a year ago

10 points

239.

Pixar just open sourced their high-performance subdivision evaluator

github.com/PixarAnimationStudios

14 years ago

10 points

240.

Show HN: C++ Mathematical Expression Parser and Evaluation Benchmark

github.com/ArashPartow

8 years ago

10 points