HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
Ask HN: What are some public real-time data sources?
discuss
3 years ago
amath
1 points
62.
▲
An analysis of 7,020,950 NFT transactions on the Ethereum blockchain [pdf]
github.com/bugout-dev
2 comments
5 years ago
zomglings
4 points
63.
▲
Public Real-Time Datasets and Sources
github.com/bytewax
discuss
3 years ago
skadamat
4 points
64.
▲
Show HN: UK Government Datasets
github.com/i-dot-ai
discuss
a year ago
crimsoneer
2 points
65.
▲
An analysis of 7M NFT transactions on the Ethereum blockchain [pdf]
github.com/bugout-dev
discuss
5 years ago
mpaepper
1 points
66.
▲
Launch HN: Activeloop (YC S18) – Data lake for deep learning
24 comments
4 years ago
davidbuniat
64 points
67.
▲
Show HN: I built an open-source financial research terminal (SEC data and SQL)
terminal.tesseractanalytics.ai
discuss
7 days ago
tessbi
5 points
68.
▲
InfoSeek: The First Open-Source Framework for Deep Research Data Synthesis
1 comment
9 months ago
BAAIBeijing
2 points
69.
▲
Satellite Image Time Series Datasets
github.com/corentin-dfg
discuss
3 years ago
sebg
2 points
70.
▲
Chinese Language Corpora for Sentiment Analysis
github.com/Lab41
discuss
8 years ago
ghosthamlet
1 points
71.
▲
Visualizations for machine learning datasets
github.com/PAIR-code
7 comments
9 years ago
happy-go-lucky
178 points
72.
▲
Show HN: Dlt – Python library to automate the creation of datasets
colab.research.google.com
54 comments
3 years ago
MatthausK
114 points
73.
▲
RipTable – multi-threaded Python data analytics tools for numpy arrays/datasets
github.com/rtosholdings
14 comments
6 years ago
aldanor
79 points
74.
▲
Show HN: Hyperparam: OSS tools for exploring datasets locally in the browser
hyperparam.app
21 comments
a year ago
platypii
77 points
75.
▲
How to query data.gov json datasets with SQL: a case study
github.com/axibase
1 comment
10 years ago
rodionos
68 points
76.
▲
Datasets for Reconstructing Visual Perception from Brain Data
github.com/seelikat
16 comments
4 months ago
katsee
62 points
77.
▲
Show HN: I made this tool for navigating pandas datasets
github.com/man-group
discuss
6 years ago
leehcksource
20 points
78.
▲
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
github.com/MinishLab
6 comments
a year ago
Pringled
19 points
79.
▲
Show HN: Version code, models, & datasets together in GitHub
6 comments
3 years ago
skadamat
19 points
80.
▲
NLP: A new datasets and metrics library from Hugging Face
github.com/huggingface
discuss
6 years ago
julien_c
19 points
81.
▲
GitHub: Awesome-reasoning, a curated list of datasets for reasoning AIs
github.com/neurallambda
discuss
2 years ago
neurallambda
17 points
82.
▲
Datasetq: jq for Datasets; Polars-powered Parquet/JSON/CSV query lang/cli
github.com/datasetq
2 comments
6 months ago
djb-at-durable
15 points
83.
▲
Easy way to load, create, version, query and visualize computer vision datasets
discuss
4 years ago
morpheusme
13 points
84.
▲
Show HN: Create datasets more simply and improve AI model with unstructured data
github.com/adansons
3 comments
4 years ago
KenichiHiguchi
12 points
85.
▲
Show HN: Download HuggingFace Models/Datasets easily and super fast
github.com/bodaay
2 comments
3 years ago
qqqbodaayqqq
10 points
86.
▲
Show HN: Training synthetic models on highly complex datasets
github.com/gretelai
2 comments
4 years ago
repeat_or
10 points
87.
▲
Show HN: React-like Declarative DSL for building synthetic LLM datasets
github.com/qforge-dev
discuss
8 months ago
arturwala
10 points
88.
▲
Kangas: Explore Multimedia Datasets at Scale
github.com/comet-ml
2 comments
4 years ago
dmoura
9 points
89.
▲
Nvidia open sources the synthetic data framework used to build Nemotron datasets
1 comment
7 months ago
alexwatson405
8 points
90.
▲
Open Thoughts: Curating the best reasoning datasets
github.com/open-thoughts
discuss
a year ago
madiator
8 points
More