HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
211.
▲
Ask HN: How to handle large datasets in front end of data apps?
github.com/Kanaries
3 comments
3 years ago
loa_observer
1 points
212.
▲
Gen-Selective Pseudo Labeling, Based on Datasets and Serverless Inference API
github.com/louisbrulenaudet
2 comments
2 years ago
brulenaudet
1 points
213.
▲
OpenAI crowd sources LLM benchmarking datasets by offering advanced GPT-4 access
github.com/openai
2 comments
3 years ago
teaearlgraycold
1 points
214.
▲
Show HN: Self-hosted DCF workspace using Damodaran datasets, LLM narratives
1 comment
3 months ago
softcane
1 points
215.
▲
Show HN: RAG-corpus-profiler – A linter for RAG datasets (dedup, PII, quality)
github.com/aashirpersonal
1 comment
6 months ago
aashirpersonal
1 points
216.
▲
Show HN: React-obj-view – A virtualized object inspector for large datasets
github.com/vothanhdat
1 comment
7 months ago
datvo
1 points
217.
▲
Show HN: Wrote a small tool that turns PDFs and docs into fine-tuning datasets
github.com/Datalore-ai
1 comment
10 months ago
FineTuner42
1 points
218.
▲
Show HN: DataChain – Tool to create, curate, version AI datasets
github.com/iterative
1 comment
2 years ago
shcheklein
1 points
219.
▲
National Park Service Data Is Now Available on Big Query Public Datasets
github.com/tonymet
1 comment
2 years ago
tonymet
1 points
220.
▲
Face Alignment API: Simple API to align faces when creating datasets/scraping
github.com/botoxparty
1 comment
3 years ago
botoxparty
1 points
221.
▲
GeoCOCO: Transform GIS annotations into COCO datasets for use in deep learning
github.com/jaspersiebring
1 comment
3 years ago
qtieb
1 points
222.
▲
Collection of datasets to train your own multi-modal GPT-4/LLMs
github.com/yaodongC
1 comment
3 years ago
yaodong_lukas
1 points
223.
▲
PLS GIVE UR FEEDBACK: DPIPE Library to easily create TensorFlow datasets
github.com/aiporre
1 comment
6 years ago
arielin1
1 points
224.
▲
Not_notMNIST: Generate your own datasets
1 comment
9 years ago
RafazZ
1 points
225.
▲
Show HN: Cohort Visualizer - A handy tool for browsing cohort datasets
bslatkin.github.com
discuss
14 years ago
bslatkin
1 points
226.
▲
Synth-dataset-kit: Generate and audit synthetic datasets from seed data
github.com/KazKozDev
discuss
2 months ago
kazkozdev
1 points
227.
▲
GABRIEL – turn messy qualitative corpora into analysis-ready datasets
github.com/openai
discuss
5 months ago
michaelsbradley
1 points
228.
▲
Show HN: Vietnam Elections (open, source-linked datasets and site)
bamboo-filing-cabinet.github.io
discuss
5 months ago
vietthan
1 points
229.
▲
Fasttfidf: High-performance TF-IDF vectorization for large-scale text datasets
github.com/purijs
discuss
6 months ago
jspuri
1 points
230.
▲
Show HN: AI tool that walks citation graph and extracts data to create datasets
github.com/eamag
discuss
6 months ago
eamag
1 points
231.
▲
Training YOLO vision models on Kaggle datasets
github.com/mfranzon
discuss
7 months ago
walterbell
1 points
232.
▲
Show HN: Gaggle – A DuckDB extension for working with Kaggle datasets
discuss
8 months ago
habedi0
1 points
233.
▲
Show HN: Django PostgreSQL Anonymizer – prod → safe dev datasets (beta)
github.com/CuriousLearner
discuss
8 months ago
sanyam-khurana
1 points
234.
▲
A toolkit for improving the quality of your LeRobot datasets
github.com/RoboticsData
discuss
8 months ago
machinelearning
1 points
235.
▲
A new RAG algorithm to self-heal damaged datasets and query them on a graph
github.com/iblameandrew
discuss
9 months ago
scraper02
1 points
236.
▲
Show HN: Tensorpack a CLI tool for semantic discovery across datasets
discuss
9 months ago
AyodeleFikayomi
1 points
237.
▲
Procedural Reasoning Datasets
github.com/open-thought
discuss
a year ago
t55
1 points
238.
▲
Reasoning Gym – Procedural RL reasoning datasets
github.com/open-thought
discuss
a year ago
t55
1 points
239.
▲
Mochi Programming Language v0.6.0 – LINQ syntax for querying datasets
github.com/mochilang
discuss
a year ago
scapbi
1 points
240.
▲
Datasets Are All You Need (LLM Learns to Prompt from Data)
github.com/intellectronica
discuss
a year ago
intellectronica
1 points