HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
91.
▲
Not_notMNIST: Generate your own datasets
1 comment
9 years ago
RafazZ
1 points
92.
▲
Synth-dataset-kit: Generate and audit synthetic datasets from seed data
github.com/KazKozDev
discuss
2 months ago
kazkozdev
1 points
93.
▲
GABRIEL – turn messy qualitative corpora into analysis-ready datasets
github.com/openai
discuss
5 months ago
michaelsbradley
1 points
94.
▲
Show HN: Vietnam Elections (open, source-linked datasets and site)
bamboo-filing-cabinet.github.io
discuss
5 months ago
vietthan
1 points
95.
▲
Fasttfidf: High-performance TF-IDF vectorization for large-scale text datasets
github.com/purijs
discuss
6 months ago
jspuri
1 points
96.
▲
Show HN: AI tool that walks citation graph and extracts data to create datasets
github.com/eamag
discuss
6 months ago
eamag
1 points
97.
▲
Training YOLO vision models on Kaggle datasets
github.com/mfranzon
discuss
7 months ago
walterbell
1 points
98.
▲
Show HN: Gaggle – A DuckDB extension for working with Kaggle datasets
discuss
8 months ago
habedi0
1 points
99.
▲
A toolkit for improving the quality of your LeRobot datasets
github.com/RoboticsData
discuss
8 months ago
machinelearning
1 points
100.
▲
A new RAG algorithm to self-heal damaged datasets and query them on a graph
github.com/iblameandrew
discuss
9 months ago
scraper02
1 points
101.
▲
Show HN: Tensorpack a CLI tool for semantic discovery across datasets
discuss
9 months ago
AyodeleFikayomi
1 points
102.
▲
Datasets Are All You Need (LLM Learns to Prompt from Data)
github.com/intellectronica
discuss
a year ago
intellectronica
1 points
103.
▲
Transform and optimize datasets for fast AI model training
github.com/Lightning-AI
discuss
2 years ago
shcheklein
1 points
104.
▲
Show HN: Data Contract CLI – Test your datasets
github.com/datacontract
discuss
2 years ago
aiobe
1 points
105.
▲
Access to public agricultural datasets for agricultural deep learning tasks
github.com/Project-AgML
discuss
3 years ago
protontypes
1 points
106.
▲
Auncel: Fast Approximate Vector Queries on Large Unstructured Datasets
github.com/pkusys
discuss
3 years ago
teleforce
1 points
107.
▲
Latrend – Framework for clustering longitudinal datasets in a standardized way
github.com/philips-software
discuss
5 years ago
JeroenKnoops1
1 points
108.
▲
Booksum – A Collection of Datasets for Long-Form Narrative Summarization
github.com/salesforce
discuss
5 years ago
simonpure
1 points
109.
▲
DVC: Git for Datasets and Models
github.com/iterative
discuss
5 years ago
optimalsolver
1 points
110.
▲
Making it easy to anonymize and balance datasets with just a few clicks
github.com/gretelai
discuss
6 years ago
alig90s
1 points
111.
▲
Covid19 Open Datasets
github.com/covid19-data
discuss
6 years ago
bachback
1 points
112.
▲
Library of deep learning models and datasets
github.com/tensorflow
discuss
8 years ago
manidoraisamy
1 points
113.
▲
Show HN: PyTorch NLP Deep Learning Tools (incl. loaders for 14 popular datasets)
github.com/PetrochukM
discuss
8 years ago
petrochukm
1 points
114.
▲
Generate training datasets for chatbots in a breeze
github.com/rodrigopivi
discuss
8 years ago
zazk
1 points
115.
▲
Fine-tune your own Llama 2 to replace GPT-3.5/4
181 comments
3 years ago
kcorbitt
955 points
116.
▲
Show HN: Dobb·E – towards home robots with an open-source platform
dobb-e.com
119 comments
3 years ago
MahiShafiullah
394 points
117.
▲
Show HN: Wordllama – Things you can do with the token embeddings of an LLM
github.com/dleemiller
36 comments
2 years ago
deepsquirrelnet
370 points
118.
▲
Show HN: Semantic Grep – A Word2Vec-powered search tool
github.com/arunsupe
57 comments
2 years ago
arunsupe
356 points
119.
▲
Show HN: ADS-B visualizer
adsb.exposed
76 comments
2 years ago
zX41ZdbW
339 points
120.
▲
Show HN: A highly opinionated, fully functional Obsidian vault
github.com/bramses
116 comments
4 years ago
_bramses
258 points
More