HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Request
421.
Show HN: Terminal-Wrench, a dataset of 331 realistic hackable environments
github.com/few-sh
2 comments
2 months ago
neversupervised
6 points
422.
Show HN: FiftyOne – Explore, Analyze and Curate Visual Datasets
github.com/voxel51
1 comment
6 years ago
benjaminpkane
6 points
423.
Show HN: Open Covid-19 Dataset
github.com/open-covid-19
1 comment
6 years ago
omtinez
6 points
424.
Show HN: Redux and datascript, anyone?
github.com/hden
1 comment
10 years ago
hden
6 points
425.
Show HN: cql-builder – CQL generator for the Datastax Cassandra Python driver
github.com/jjengo
discuss
11 years ago
jjengo
6 points
426.
Show HN: Xray: N-D labeled arrays and datasets in Python
github.com/xray
discuss
12 years ago
shoyer
6 points
427.
Cockroachdb – A Scalable, Geo-Replicated, Transactional Datastore
github.com/cockroachdb
discuss
12 years ago
pandemicsyn
6 points
428.
Show HN: Generate fine-tuning dataset using deep research in terminal
github.com/Datalore-ai
discuss
a year ago
FineTuner42
6 points
429.
Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets
github.com/MinishLab
discuss
a year ago
stephantul
6 points
430.
Show HN: Interactively explore unstructured datasets from your dataframe
github.com/Renumics
discuss
3 years ago
sps44
6 points
431.
Kangas: Pandas for Multimedia Datasets
github.com/comet-ml
discuss
3 years ago
synergy20
6 points
432.
The fastest command-line tools for querying large JSON datasets
github.com/dcmoura
discuss
4 years ago
zX41ZdbW
6 points
433.
Lethe: A Basic Log-Structured Flash Datastore in Rust
github.com/oxidecomputer
discuss
4 years ago
hasheddan
6 points
434.
Video Classification Starter Code for Working with the YouTube-8M Dataset
github.com/google
discuss
10 years ago
tylerwhipple
6 points
435.
Resampling Unbalanced Datasets
github.com/fmfn
discuss
12 years ago
hrb1979
5 points
436.
Curated list of language modeling researches for code, plus related datasets
github.com/codefuse-ai
discuss
a year ago
Bluestein
5 points
437.
Show HN: Byte-Pair Encoding tokenizer for training LLMs on large datasets
github.com/jmaczan
discuss
2 years ago
yu3zhou4
5 points
438.
DataDM – Search and analyze datasets with LLMs
github.com/approximatelabs
discuss
3 years ago
cle
5 points
439.
DataDM: Open-source local-LLM code-interpreter with dataset search
github.com/approximatelabs
discuss
3 years ago
bluecoconut
5 points
440.
Show HN: Multiobjective Large-Scale Fashion Dataset with Distributional Shifts
github.com/st-tech
discuss
5 years ago
nanikano
5 points
441.
Show HN: H5records – simple large dataset for pytorch training
github.com/theblackcat102
discuss
5 years ago
polymorph1sm
5 points
442.
Show HN: Create APIs for static datasets without writing a single line of code
github.com/roapi
discuss
5 years ago
houqp
5 points
443.
Show HN: We made a dataset differ! (Free, Open source)
github.com/qri-io
discuss
7 years ago
rgardaphe
5 points
444.
Show HN: Qri, a free and open source distributed dataset versioning tool
discuss
8 years ago
rgardaphe
5 points
445.
Show HN: MNIST-Sequence – Generate dataset for sequences of handwritten digits
github.com/ankitaggarwal011
discuss
9 years ago
aaggarwal
5 points
446.
VisualNexus – Training Pipeline for Visual Dataset Segmentation and Labeling
github.com/kyegomez
3 comments
3 years ago
Reclaimer
4 points
447.
DeltaQL - a NodeJS datastore whose query results never get stale.
github.com/chrisdew
2 comments
14 years ago
chrisdew
4 points
448.
Addressing for PHP: Postal address management powered by Google's dataset
github.com/commerceguys
1 comment
12 years ago
robertDouglass
4 points
449.
Show HN: Transform Unstructured Data into Usable Datasets
github.com/wizenheimer
1 comment
2 years ago
wizenheimer
4 points
450.
Show HN: Cerebras-GPT-2.7B finetuned on Stanford Alpaca dataset
github.com/lxe
1 comment
3 years ago
lxe
4 points