HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
361.
▲
Show HN: Hyperparam: OSS tools for exploring datasets locally in the browser
hyperparam.app
21 comments
a year ago
platypii
77 points
362.
▲
Comma2k19 – A dataset of over 33 hours of commute in California's 280 highway
github.com/commaai
35 comments
8 years ago
pd0wm
70 points
363.
▲
Rɐbbit Dynamic Datascapes
github.com/ryrobes
16 comments
2 years ago
notarobot123
69 points
364.
▲
Gizzard - Twitter's open source framework for creating distributed datastores
github.com/twitter
9 comments
16 years ago
abraham
69 points
365.
▲
How to query data.gov json datasets with SQL: a case study
github.com/axibase
1 comment
10 years ago
rodionos
68 points
366.
▲
The Museum of Modern Art Research Dataset
github.com/MuseumofModernArt
15 comments
11 years ago
danso
61 points
367.
▲
Mozilla shuts project Iodide: Datascience documents in browsers
github.com/iodide-project
6 comments
6 years ago
ritwiksaikia
46 points
368.
▲
Chicago Crime Trends. Analyzing 3GB Dataset from Data.gov with SQL and Graphs
github.com/axibase
3 comments
9 years ago
rodionos
44 points
369.
▲
Dataset of Linus Torvalds' rants ranked by hate
github.com/corollari
17 comments
5 years ago
fctorial
42 points
370.
▲
ClickHouse Obfuscator – A tool for dataset anonymization
github.com/ClickHouse
3 comments
3 years ago
rrampage
39 points
371.
▲
DeepMind's machine-reading question/answer dataset
github.com/deepmind
3 comments
11 years ago
andrewtbham
37 points
372.
▲
Madlad-400: A Multilingual and Document-Level Large Audited Dataset
github.com/google-research
1 comment
3 years ago
the_bookmaker
37 points
373.
▲
A dataset of crimes committed in Buenos Aires
github.com/ramadis
4 comments
8 years ago
ramadis
34 points
374.
▲
Show HN: I used streaming to skip downloading my 45GB dataset
github.com/DagsHub
discuss
4 years ago
npRandom
31 points
375.
▲
Toxicity Dataset
github.com/surge-ai
32 comments
5 years ago
CarrieLab
25 points
376.
▲
Structured Etymology Dataset
github.com/droher
3 comments
a year ago
downboots
24 points
377.
▲
Washington Post publishes dataset of 52,000 criminal homicides
github.com/washingtonpost
2 comments
8 years ago
danso
24 points
378.
▲
I have trained StyleGAN2 from scratch with a dataset of female portraits
github.com/l4rz
20 comments
5 years ago
EvgeniyZh
20 points
379.
▲
VoxelCNN: Order-Aware Generative Modeling Using the 3D-Craft Dataset
github.com/facebookresearch
discuss
6 years ago
ingve
20 points
380.
▲
Show HN: I made this tool for navigating pandas datasets
github.com/man-group
discuss
6 years ago
leehcksource
20 points
381.
▲
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
github.com/MinishLab
6 comments
a year ago
Pringled
19 points
382.
▲
Show HN: Version code, models, & datasets together in GitHub
6 comments
3 years ago
skadamat
19 points
383.
▲
NLP: A new datasets and metrics library from Hugging Face
github.com/huggingface
discuss
6 years ago
julien_c
19 points
384.
▲
Show HN: Dataset of Linus Torvalds' rants sorted by hate
github.com/corollari
4 comments
7 years ago
corollari
17 points
385.
▲
GitHub: Awesome-reasoning, a curated list of datasets for reasoning AIs
github.com/neurallambda
discuss
2 years ago
neurallambda
17 points
386.
▲
A datastore library on Google App Engine for Clojure
github.com/making
discuss
16 years ago
va_coder
16 points
387.
▲
Datastax ripped us off
github.com/managedfusion
4 comments
13 years ago
Throwadev
15 points
388.
▲
ICLR 2026 – Institutional Affiliations Dataset and Analysis
github.com/DmytroLopushanskyy
2 comments
a month ago
stared
15 points
389.
▲
Show HN: HTTP-nu – Nushell-scriptable HTTP server with SSE / Datastar
github.com/cablehead
2 comments
4 months ago
ndyg
14 points
390.
▲
Easy way to load, create, version, query and visualize computer vision datasets
discuss
4 years ago
morpheusme
13 points
More