Search: github.com/evaluatly | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

61.

Ask HN: How to run Piper text-to-speech on a Mac (using Docker)?

2 years ago

1 point

62.

Which other AI search engines should we keep an eye on?

2 years ago

1 point

63.

Medical Question-Answer AI Model Evaluation Framework

github.com/chat-data-llc

2 years ago

4 points

64.

ClojureScript gets a new REPL

github.com/clojure

15 years ago

4 points

65.

OpenAI cookbook: using GPT-4 as “reference-free” evaluator

github.com/openai

3 years ago

3 points

66.

Evaluation of robotics data recording file formats

github.com/foxglove

4 years ago

1 point

67.

Full LLM training and evaluation toolkit

github.com/huggingface

2 years ago

249 points

68.

RouteLLM: A framework for serving and evaluating LLM routers

github.com/lm-sys

2 years ago

244 points

69.

Show HN: PromptTools – open-source tools for evaluating LLMs and vector DBs

github.com/hegelai

3 years ago

211 points

70.

Comptime – C# meta-programming with compile-time code generation and evaluation

github.com/sebastienros

6 months ago

150 points

71.

Apache HTTP Server: 'RewriteCond expr' always evaluates to true

github.com/apache

a year ago

136 points

72.

Show HN: Faster LLM evaluation with Bayesian optimization

github.com/rentruewang

2 years ago

131 points

73.

Evals: a framework for evaluating OpenAI models and a registry of benchmarks

github.com/openai

3 years ago

123 points

74.

Show HN: Ragas – Open-source library for evaluating RAG pipelines

github.com/explodinggradients

2 years ago

121 points

75.

LispE: Lisp Interpreter with Pattern Programming and Lazy Evaluation

github.com/naver

5 months ago

119 points

76.

Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps

a year ago

117 points

77.

A Fast Excel Formula Parser and Evaluator

github.com/LesterLyu

4 years ago

106 points

78.

Show HN: Eole, a Lévy-optimal lambda calculus evaluator written in Rust

github.com/HerrmannM

7 years ago

106 points

79.

Evaluation of Deep Learning Toolkits

github.com/zer0n

10 years ago

94 points

80.

Show HN: Lazy evaluation in Python

github.com/llllllllll

11 years ago

88 points

81.

Adk-go: code-first Go toolkit for building, evaluating, and deploying AI agents

github.com/google

7 months ago

86 points

82.

Show HN: Opik, an open source LLM evaluation framework

github.com/comet-ml

2 years ago

86 points

83.

PhaseLLM: Standardized Chat LLM API (Cohere, Claude, GPT) + Evaluation Framework

github.com/wgryc

3 years ago

86 points

84.

AutoMLPipeline – Create and evaluate machine learning pipeline architectures

6 years ago

80 points

85.

Cedar is an open source policy language and evaluation engine

3 years ago

72 points

86.

Evaluate Markdown code blocks within Vim

github.com/gpanders

2 years ago

68 points

87.

Show HN: LazyCode – C++14 composable, lazily evaluated map, filter, fold

github.com/SaadAttieh

7 years ago

66 points

88.

TensorFlow Model Analysis – A library for evaluating TensorFlow models

github.com/tensorflow

8 years ago

58 points

89.

Show HN: A MCP server to evaluate Python code in WASM VM using RustPython

github.com/tuananh

a year ago

41 points

90.

Show HN: Tonic Validate Metrics – an open-source RAG evaluation metrics package

github.com/TonicAI

3 years ago

40 points