Search: github.com/evaluatly | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

1.

Evaluatly is now open source and free

github.com/evaluatly

6 years ago

1 points

2.

Evals in 2025: going beyond simple benchmarks to build models people can use

github.com/huggingface

9 months ago

80 points

3.

LLM Evaluation Guidebook

github.com/huggingface

2 years ago

2 points

4.

HuggingFace/evaluate: A library for easily evaluating ML models and datasets

github.com/huggingface

4 years ago

2 points

5.

Why Neutralinojs Is Better? Comparing with Electron and Node Webkit

github.com/neutralinojs

8 years ago

2 points

6.

Triilman25/evaluation-machine-for-classification-models

github.com/triilman25

a year ago

1 points

7.

Show HN: Pixeebot – a GitHub App that fixes your Sonar findings (Java/Python)

2 years ago

10 points

8.

Show HN: Neuron – Cognitive Multi-Agent Architecture for Reasoning

10 months ago

8 points

9.

Show HN: Auto LLM Ranker – Describe a task in English and get ranked models

github.com/gauravvij

3 months ago

3 points

10.

Q Evaluation Harness: open-source evals for LLMs on q/kdb+

github.com/KxSystems

10 months ago

2 points

11.

Evaluate Selections in Sublime Text

github.com/jbrooksuk

13 years ago

2 points

12.

Evaluating Large Language Models Using LLM-as-a-Judge

github.com/aws-samples

2 years ago

2 points

13.

OpenFF – Automated estimation of physical properties

github.com/openforcefield

5 years ago

2 points

14.

Show HN: IR_evaluation – Information retrieval evaluation metrics in pure Python

github.com/plurch

a year ago

1 points

15.

Show HN: EleutherAI / Lm-Evaluation-Harness

github.com/EleutherAI

a month ago

1 points

16.

Language Model Evaluation Harness

github.com/EleutherAI

3 years ago

1 points

17.

Nextdoor's Cloud Security Posture Management (CSPM) Evaluation Matrix

github.com/Nextdoor

3 years ago

1 points

18.

Show HN: Little tool to evaluate your cryptocurrency trades on Poloniex

github.com/enricobacis

9 years ago

1 points

19.

Show HN: Freeact – A Lightweight Library for Code-Action Based Agents

github.com/gradion-ai

a year ago

122 points

20.

Deprecating A/B tests with offline policy evaluation

5 years ago

1 points

21.

Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;)

github.com/alignedai

4 years ago

8 points

22.

Show HN: TypeScript type-level math expression parser and evaluator

github.com/dqbd

3 years ago

3 points

23.

Show HN: CLI tool to analyze your Vector Embeddings!

github.com/dakshjain-1616

4 months ago

2 points

24.

Keyboard Layout Evaluation

github.com/bclnr

4 years ago

2 points

25.

Evaluation Code – GPT-5 on Multimodal Medical Reasoning

github.com/wangshansong1

10 months ago

2 points

26.

Show HN: Filtering "Who's Hiring" with LLMs – native desktop app in Rust/egui

github.com/exlee

3 months ago

1 points

27.

Show HN: LLM Evaluator for "Who is hiring" threads

github.com/exlee

4 months ago

1 points

28.

Job postings evaluator against your resume (Chrome extension)

github.com/alikh31

5 months ago

1 points

29.

Policy Evaluation in Grid World

github.com/elliotvilhelm

2 years ago

1 points

30.

Tracking an LLM Evaluator Using Comet

github.com/dair-ai

3 years ago

1 points