HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
151.
An MNIST-like fashion product dataset
github.com/zalandoresearch
21 comments
9 years ago
kashifr
220 points
152.
Qri: A global dataset version control system built on the distributed web
github.com/qri-io
42 comments
7 years ago
anewhnaccount2
204 points
153.
Visualizations for machine learning datasets
github.com/PAIR-code
7 comments
9 years ago
happy-go-lucky
178 points
154.
Finetuning of Falcon-7B LLM Using QLoRA on Mental Health Conversational Dataset
github.com/iamarunbrahma
108 comments
3 years ago
iamarunbrahma
160 points
155.
Hypersim, Photorealistic Synthetic Dataset for Indoor Scene Understanding
github.com/apple
20 comments
6 years ago
homarp
122 points
156.
Show HN: Dlt – Python library to automate the creation of datasets
colab.research.google.com
54 comments
3 years ago
MatthausK
114 points
157.
Driving dataset for car autopilot AI training
github.com/commaai
44 comments
10 years ago
EvgeniyZh
100 points
158.
Boston housing price dataset was removed from scikit-learn 1.2
github.com/scikit-learn
84 comments
3 years ago
ok123456
81 points
159.
RipTable – multi-threaded Python data analytics tools for numpy arrays/datasets
github.com/rtosholdings
14 comments
6 years ago
aldanor
79 points
160.
Show HN: Hyperparam: OSS tools for exploring datasets locally in the browser
hyperparam.app
21 comments
a year ago
platypii
77 points
161.
Comma2k19 – A dataset of over 33 hours of commute in California's 280 highway
github.com/commaai
35 comments
8 years ago
pd0wm
70 points
162.
How to query data.gov json datasets with SQL: a case study
github.com/axibase
1 comment
10 years ago
rodionos
68 points
163.
The Museum of Modern Art Research Dataset
github.com/MuseumofModernArt
15 comments
11 years ago
danso
61 points
164.
Chicago Crime Trends. Analyzing 3GB Dataset from Data.gov with SQL and Graphs
github.com/axibase
3 comments
9 years ago
rodionos
44 points
165.
Dataset of Linus Torvalds' rants ranked by hate
github.com/corollari
17 comments
5 years ago
fctorial
42 points
166.
ClickHouse Obfuscator – A tool for dataset anonymization
github.com/ClickHouse
3 comments
3 years ago
rrampage
39 points
167.
DeepMind's machine-reading question/answer dataset
github.com/deepmind
3 comments
11 years ago
andrewtbham
37 points
168.
Madlad-400: A Multilingual and Document-Level Large Audited Dataset
github.com/google-research
1 comment
3 years ago
the_bookmaker
37 points
169.
A dataset of crimes committed in Buenos Aires
github.com/ramadis
4 comments
8 years ago
ramadis
34 points
170.
Show HN: I used streaming to skip downloading my 45GB dataset
github.com/DagsHub
discuss
4 years ago
npRandom
31 points
171.
Toxicity Dataset
github.com/surge-ai
32 comments
5 years ago
CarrieLab
25 points
172.
Structured Etymology Dataset
github.com/droher
3 comments
a year ago
downboots
24 points
173.
Washington Post publishes dataset of 52,000 criminal homicides
github.com/washingtonpost
2 comments
8 years ago
danso
24 points
174.
I have trained StyleGAN2 from scratch with a dataset of female portraits
github.com/l4rz
20 comments
5 years ago
EvgeniyZh
20 points
175.
VoxelCNN: Order-Aware Generative Modeling Using the 3D-Craft Dataset
github.com/facebookresearch
discuss
6 years ago
ingve
20 points
176.
Show HN: I made this tool for navigating pandas datasets
github.com/man-group
discuss
6 years ago
leehcksource
20 points
177.
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
github.com/MinishLab
6 comments
a year ago
Pringled
19 points
178.
Show HN: Version code, models, & datasets together in GitHub
6 comments
3 years ago
skadamat
19 points
179.
NLP: A new datasets and metrics library from Hugging Face
github.com/huggingface
discuss
6 years ago
julien_c
19 points
180.
Show HN: Dataset of Linus Torvalds' rants sorted by hate
github.com/corollari
4 comments
7 years ago
corollari
17 points