Search: github.com/eval | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

301.

Eval("quire".replace(/^/,"re"))(moduleName)

github.com/protobufjs

2 years ago

3 points

302.

Homomorphic Evaluation of the AES Circuit

github.com/shaih

11 years ago

3 points

303.

Show HN: Atlas – local-first memory that re-evaluates beliefs when facts change

github.com/RichSchefren

11 days ago

3 points

304.

RAG Eval Comparing Vertex/Bedrock/Azure/OpenAI

github.com/colon-md

a month ago

3 points

305.

Mcpbr: Stop guessing and evaluate your MCP server against standard benchmarks

github.com/greynewell

5 months ago

3 points

306.

Rogue: Open-source AI agent evaluation framework

github.com/qualifire-dev

8 months ago

3 points

307.

AWorld: Build, evaluate and train General Multi-Agent Assistance with ease

github.com/inclusionAI

10 months ago

3 points

308.

15 AI Coding Agents evaluated with the same prompt

github.com/The-Focus-AI

a year ago

3 points

309.

NoLiMa: Long-Context Evaluation Beyond Literal Matching

github.com/adobe-research

a year ago

3 points

310.

I built an ethical evaluation engine for scoring sys. alignment, not efficiency

github.com/luminaAnonima

a year ago

3 points

311.

A novel open-source framework for evaluating conversational agents

github.com/plurai-ai

a year ago

3 points

312.

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

github.com/microsoft

2 years ago

3 points

313.

Cedar – open-source policy language and evaluation engine

github.com/cedar-policy

3 years ago

3 points

314.

Show HN: Evaluate Deep Learning models directly in a database with PyNeuraLogic

github.com/LukasZahradnik

4 years ago

3 points

315.

Show HN: Wielder – Write and evaluate Clojure code in your Obsidian documents

github.com/victorb

4 years ago

3 points

316.

Show HN: Oyster, an interactive Perl eval server

github.com/gatlin

15 years ago

3 points

317.

Koila: Prevent PyTorch's out of memory error with lazy evaluation

github.com/rentruewang

5 years ago

3 points

318.

Simple Safe Sandboxed Extensible Expression Evaluator for Python

github.com/danthedeckie

8 years ago

3 points

319.

Show HN: ClojureCalc, a libreoffice Calc Add-In to evaluate clojure expressions

github.com/beothorn

11 years ago

3 points

320.

Rouge.js: Recall-Oriented Understudy for Gisting Evaluation Metric

github.com/kenlimmj

11 years ago

3 points

321.

Show HN: Synthetic corporate dataset generator for AI agent evaluation

github.com/aeriesec

12 days ago

3 points

322.

Cisco Foundry Security Spec: Open specification for agentic security evaluation

github.com/CiscoDevNet

a month ago

3 points

323.

Show HN: Nexa-gauge – Cache/cost-aware graph-based eval for LLM and RAG

github.com/harnexa

a month ago

3 points

324.

Show HN: FC-Eval – CLI to Benchmark Local or Cloud LLMs on Function Calling

github.com/gauravvij

3 months ago

3 points

325.

Show HN: Rhesis AI - Multimodal test cases for agentic evals

3 months ago

3 points

326.

Show HN: Auditi – open-source LLM tracing and evaluation platform

github.com/deduu

4 months ago

3 points

327.

Harbor – a framework for evaluating and optimizing agents and language models

github.com/laude-institute

7 months ago

3 points

328.

OpenBench: Provider-agnostic, open-source evaluation infrastructure for LLMs

github.com/groq

8 months ago

3 points

329.

Show HN: Evaluate your website usability in seconds

9 months ago

3 points

330.

LLM Evaluation via Rap Battles

github.com/vadim0x60

10 months ago

3 points