HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
121.
▲
Show HN: Text augmentation tool for biomedical texts
github.com/tznurmin
discuss
a year ago
tznurmin
2 points
122.
▲
Show HN: Dokimos – LLM evaluation framework for Java
github.com/dokimos-dev
discuss
6 months ago
fkapsahili
1 points
123.
▲
Show HN: Outrage – contact your local elected representatives in minutes (US)
outrage.gg
discuss
7 months ago
bitforger
1 points
124.
▲
Which other AI search engines should we keep an eye on?
discuss
2 years ago
james_chu
1 points
125.
▲
Ask HN: What are some public real-time data sources?
discuss
3 years ago
amath
1 points
126.
▲
An analysis of 7,020,950 NFT transactions on the Ethereum blockchain [pdf]
github.com/bugout-dev
2 comments
5 years ago
zomglings
4 points
127.
▲
Public Real-Time Datasets and Sources
github.com/bytewax
discuss
3 years ago
skadamat
4 points
128.
▲
Tech.ml.dataset – A Clojure high performance data processing system
github.com/techascent
discuss
5 years ago
simonpure
4 points
129.
▲
tech.ml.dataset: A Clojure high performance data processing system
github.com/techascent
discuss
2 years ago
wlkr
3 points
130.
▲
Aave V2 Health Factor Dataset
github.com/credprotocol
discuss
4 years ago
willwolf
3 points
131.
▲
Show HN: UK Government Datasets
github.com/i-dot-ai
discuss
a year ago
crimsoneer
2 points
132.
▲
tech.ml.dataset: A Clojure high performance data processing system
github.com/techascent
discuss
3 months ago
tosh
1 points
133.
▲
100K Fake US People Profiles Dataset
github.com/marko-simic
discuss
4 years ago
qa-guy
1 points
134.
▲
An analysis of 7M NFT transactions on the Ethereum blockchain [pdf]
github.com/bugout-dev
discuss
5 years ago
mpaepper
1 points
135.
▲
Launch HN: Activeloop (YC S18) – Data lake for deep learning
24 comments
4 years ago
davidbuniat
64 points
136.
▲
Ask HN: How are you extracting the best performance out of your RAG pipeline?
4 comments
2 years ago
imaravind
5 points
137.
▲
Show HN: I built an open-source financial research terminal (SEC data and SQL)
terminal.tesseractanalytics.ai
discuss
8 days ago
tessbi
5 points
138.
▲
Lip2Wav: Synthesize Speech Only from the Lip Movements
discuss
6 years ago
prajwalkr
4 points
139.
▲
Show HN: SJT- A lightweight structured JSON table format for APIs
1 comment
10 months ago
yukiakai
3 points
140.
▲
InfoSeek: The First Open-Source Framework for Deep Research Data Synthesis
1 comment
9 months ago
BAAIBeijing
2 points
141.
▲
Show HN: RandomForestGenerator – CSV to ML in the browser, but local
jonaraphael.github.io
discuss
5 months ago
jonaraphael
2 points
142.
▲
Measuring Compositional Generalization in ML Architectures
discuss
6 years ago
esdee
1 points
143.
▲
Free/Open Source Datasets
github.com/rasbt
discuss
11 years ago
rouma7
2 points
144.
▲
Satellite Image Time Series Datasets
github.com/corentin-dfg
discuss
3 years ago
sebg
2 points
145.
▲
Show HN: Simple Python script to split (DL)training data (CNNs mainly)
github.com/chinmayshah99
discuss
7 years ago
chinmays
2 points
146.
▲
Chinese Language Corpora for Sentiment Analysis
github.com/Lab41
discuss
8 years ago
ghosthamlet
1 points
147.
▲
Show HN: Open Prompts – dataset of 10M Stable Diffusion generations
github.com/krea-ai
71 comments
4 years ago
vipermu
279 points
148.
▲
Tell HN: Full Hacker News dataset now available on BigQuery
43 comments
11 years ago
minimaxir
238 points
149.
▲
Dat – Distributed Dataset Synchronization and Versioning
github.com/datproject
39 comments
9 years ago
ColinWright
229 points
150.
▲
A multimodal dataset with one trillion tokens
github.com/mlfoundations
52 comments
2 years ago
kulikalov
224 points
More