HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
An analysis of 7,020,950 NFT transactions on the Ethereum blockchain [pdf]
github.com/bugout-dev
2 comments
5 years ago
zomglings
4 points
62.
▲
Public Real-Time Datasets and Sources
github.com/bytewax
discuss
3 years ago
skadamat
4 points
63.
▲
Show HN: UK Government Datasets
github.com/i-dot-ai
discuss
a year ago
crimsoneer
2 points
64.
▲
Satellite Image Time Series Datasets
github.com/corentin-dfg
discuss
3 years ago
sebg
2 points
65.
▲
Chinese Language Corpora for Sentiment Analysis
github.com/Lab41
discuss
8 years ago
ghosthamlet
1 points
66.
▲
Visualizations for machine learning datasets
github.com/PAIR-code
7 comments
9 years ago
happy-go-lucky
178 points
67.
▲
Show HN: Dlt – Python library to automate the creation of datasets
colab.research.google.com
54 comments
3 years ago
MatthausK
114 points
68.
▲
RipTable – multi-threaded Python data analytics tools for numpy arrays/datasets
github.com/rtosholdings
14 comments
6 years ago
aldanor
79 points
69.
▲
Show HN: Hyperparam: OSS tools for exploring datasets locally in the browser
hyperparam.app
21 comments
a year ago
platypii
77 points
70.
▲
Datasets for Reconstructing Visual Perception from Brain Data
github.com/seelikat
16 comments
4 months ago
katsee
62 points
71.
▲
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
github.com/MinishLab
6 comments
a year ago
Pringled
19 points
72.
▲
Show HN: Version code, models, & datasets together in GitHub
6 comments
3 years ago
skadamat
19 points
73.
▲
NLP: A new datasets and metrics library from Hugging Face
github.com/huggingface
discuss
6 years ago
julien_c
19 points
74.
▲
GitHub: Awesome-reasoning, a curated list of datasets for reasoning AIs
github.com/neurallambda
discuss
2 years ago
neurallambda
17 points
75.
▲
Easy way to load, create, version, query and visualize computer vision datasets
discuss
4 years ago
morpheusme
13 points
76.
▲
Show HN: Create datasets more simply and improve AI model with unstructured data
github.com/adansons
3 comments
4 years ago
KenichiHiguchi
12 points
77.
▲
Show HN: Training synthetic models on highly complex datasets
github.com/gretelai
2 comments
4 years ago
repeat_or
10 points
78.
▲
Show HN: React-like Declarative DSL for building synthetic LLM datasets
github.com/qforge-dev
discuss
8 months ago
arturwala
10 points
79.
▲
Kangas: Explore Multimedia Datasets at Scale
github.com/comet-ml
2 comments
4 years ago
dmoura
9 points
80.
▲
Open Thoughts: Curating the best reasoning datasets
github.com/open-thoughts
discuss
a year ago
madiator
8 points
81.
▲
Show HN: Automate Variable Selection for Research on Big Datasets (Open-Source)
github.com/MalikHarrisAhm
discuss
2 years ago
mha23
8 points
82.
▲
Our classifier outperforms CatBoost, XGBoost, LightGBM on 5 benchmark datasets
github.com/LinearBoost
5 comments
2 years ago
hamid9
6 points
83.
▲
DatasetGPT – an open-source command line tool for generating datasets with LLMs
github.com/radi-cho
1 comment
3 years ago
radicho123
6 points
84.
▲
Show HN: Xray: N-D labeled arrays and datasets in Python
github.com/xray
discuss
12 years ago
shoyer
6 points
85.
▲
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
github.com/MinishLab
discuss
a year ago
stephantul
6 points
86.
▲
The fastest command-line tools for querying large JSON datasets
github.com/dcmoura
discuss
4 years ago
zX41ZdbW
6 points
87.
▲
Resampling Unbalanced Datasets
github.com/fmfn
discuss
12 years ago
hrb1979
5 points
88.
▲
Show HN: Byte-Pair Encoding tokenizer for training LLMs on large datasets
github.com/jmaczan
discuss
2 years ago
yu3zhou4
5 points
89.
▲
DataDM – Search and analyze datasets with LLMs
github.com/approximatelabs
discuss
3 years ago
cle
5 points
90.
▲
Show HN: Create APIs for static datasets without writing a single line of code
github.com/roapi
discuss
5 years ago
houqp
5 points
More