HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
151.
▲
Evaluation of Covid-19 Models
github.com/youyanggu
discuss
6 years ago
sebg
1 points
152.
▲
Show HN: I forced Claude to play Tetris in Emacs
imgur.com
3 comments
2 months ago
iLemming
13 points
153.
▲
Show HN: Skyvern 2.0 – open-source AI Browser Agent scoring 85.8% on WebVoyager
eval.skyvern.com
3 comments
a year ago
suchintan
9 points
154.
▲
Ask HN: Survey: Scripting languages for realtime applications
2 comments
9 years ago
schoetbi
2 points
155.
▲
The Car Wash Problem: A variable isolation study on prompt architecture
1 comment
4 months ago
midmost44
2 points
156.
▲
Show HN: Dingo 1.9.0 released: With enhanced hallucination detection
github.com/MigoXLab
discuss
a year ago
e06084
2 points
157.
▲
Unbounded Context with Memory
1 comment
2 years ago
codelion
1 points
158.
▲
An Evaluation of Location Encoding Systems (2018)
github.com/google
10 comments
5 years ago
Tomte
45 points
159.
▲
An Evaluation of Location Encoding Systems (2018)
github.com/google
discuss
7 years ago
Tomte
41 points
160.
▲
Evaluation of Location Encoding Systems (2021)
github.com/google
14 comments
3 years ago
tosh
32 points
161.
▲
Evaluate and Track Your LLM Experiments: Introducing TruLens for LLMs
github.com/truera
7 comments
3 years ago
shayaks
25 points
162.
▲
Evaluation of Location Encoding Systems
github.com/google
discuss
4 years ago
sandebert
3 points
163.
▲
Anthropic's Prompt Evalutions Course
github.com/anthropics
discuss
2 years ago
thenameless7741
2 points
164.
▲
An Evaluation of Location Encoding Systems (2018)
github.com/google
discuss
4 years ago
Tomte
2 points
165.
▲
AKS on HCI: Azure VM Eval
github.com/Azure
discuss
5 years ago
mad_vill
2 points
166.
▲
Google: Evaluation of Location Encoding Systems
github.com/google
discuss
9 years ago
espeed
2 points
167.
▲
Llama2 on Replicate faster than ChatGPT?
github.com/BerriAI
2 comments
3 years ago
ij23
1 points
168.
▲
We Built an Open-Source Text-to-Image Evaluation Library for Clip Models
github.com/encord-team
1 comment
2 years ago
Stephen_Oladele
1 points
169.
▲
Evaluation of Location Encoding Systems
github.com/google
discuss
8 years ago
CharlesDodgson
1 points
170.
▲
Show HN: High-performance GenAI engine now open source
github.com/arthur-ai
12 comments
a year ago
fryz
22 points
171.
▲
Show HN: Billion Cell Spreadsheets with Incremental Computation
xls.feldera.io
1 comment
a year ago
gz09
15 points
172.
▲
Show HN: Voicetest – open-source test harness for voice AI agents
discuss
4 months ago
pldpld
3 points
173.
▲
Sharing learnings from evaluating Million+ LLM responses
discuss
3 years ago
sourabh03agr
3 points
174.
▲
Show HN: Adventures in UTM – Busy Beaver in under 5–10 mins
2 comments
a year ago
polymetron
2 points
175.
▲
Show HN: Dingo – Automate Data Quality Checks Across Pre-Training and SFT Data
github.com/DataEval
discuss
a year ago
e06084
1 points
176.
▲
Ask HN: How to run Piper text-to-speech on a Mac (using Docker)?
discuss
2 years ago
dv35z
1 points
177.
▲
Which other AI search engines should we keep an eye on?
discuss
2 years ago
james_chu
1 points
178.
▲
You thought that “This should never happen was bad”? search – eval($_GET)
github.com
15 comments
10 years ago
callaars
23 points
179.
▲
Benchmark GGUF model with ONE line of code
github.com/NexaAI
1 comment
2 years ago
alanzhuly
6 points
180.
▲
Medical Question-Answer AI Model Evaluation Framework
github.com/chat-data-llc
2 comments
2 years ago
freexiaosu
4 points
More