Search: github.com/eval | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

121.

LLM-eval-kit: Distributed LLM evaluation framework (v0.3.0)

github.com/benmeryem-tech

2 months ago

1 point

122.

CLI that grades website content quality – Stripe.com got an F

github.com/samuelrkestenbaum-dot

3 months ago

1 point

123.

Show HN: Filtering "Who's Hiring" with LLMs – native desktop app in Rust/egui

github.com/exlee

3 months ago

1 point

124.

Show HN: LLM Evaluator for "Who is hiring" threads

github.com/exlee

4 months ago

1 point

125.

Show HN: O(1) memory attention – 512K tokens in 3.85 GB (eval binary)

github.com/RegularJoe-CEO

5 months ago

1 point

126.

Job postings evaluator against your resume (Chrome extension)

github.com/alikh31

5 months ago

1 point

127.

Policy Evaluation in Grid World

github.com/elliotvilhelm

2 years ago

1 point

128.

Tracking an LLM Evaluator Using Comet

github.com/dair-ai

3 years ago

1 point

129.

Propositional Logic Calculator

github.com/lion137

7 years ago

1 point

130.

Parsing Mitre EDR Evaluation Results

github.com/zshehri

7 years ago

1 point

131.

Go Expression Evaluation Comparison

github.com/antonmedv

7 years ago

1 point

132.

Eval.js – A JavaScript interpreter written in JavaScript

github.com/marten-de-vries

10 years ago

1 point

133.

Show HN: Fine-tuned Llama 3.2 3B to match 70B models for local transcripts

10 months ago

31 points

134.

Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam)

3 months ago

26 points

135.

Show HN: Cognee – Open-Source AI Memory Layer That Remembers Context

github.com/topoteretes

a year ago

9 points

136.

Show HN: PromptOptimizer – Minimize LLM token complexity to save cost

github.com/vaibkumr

3 years ago

4 points

137.

Show HN: See – searchable JSON compression, smaller than ZSTD (on our data)

github.com/kodomonocch1

4 months ago

3 points

138.

Show HN: Legal Action Boundary Eval for agentic legal workflows

github.com/bigkan8

2 months ago

2 points

139.

Show HN: BenchFlow – Open-Source Benchmark Hub and Eval Infra for AI Devs

docs.benchflow.ai

a year ago

1 point

140.

Show HN: AI Product Hunter – GenAI reviews/scores "all" of Producthunt every day

ai-producthunt.com

2 years ago

1 point

141.

Eval($_POST[cmd])

11 years ago

12 points

142.

Evaluating Technical Arguments

swanson.github.com

13 years ago

4 points

143.

Engineering JavaScript's eval

brownplt.github.com

14 years ago

3 points

144.

In Go, some evaluation orders in multi-value assignments are unspecified

github.com/go101

8 years ago

3 points

145.

Show HN: Dbt-LLM-evals – Monitor LLM quality in your data warehouse

github.com/paradime-io

5 months ago

2 points

146.

Show HN: Synthetic Data Generation Using LangChain for IR and RAG Evaluation

github.com/mddunlap924

3 years ago

2 points

147.

Automated evaluation of coding round interviews

github.com/shekhargulati

9 years ago

2 points

148.

Evaluating Technical Arguments

swanson.github.com

13 years ago

1 point

149.

Show HN: Social proof works 2-7x better on AI shopping agents than humans

github.com/aaronbatchelder

4 months ago

1 point

150.

defer-import-eval: proposal for introducing a way to defer evaluation of a module

github.com/tc39

10 months ago

1 point