HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
391.
▲
Hypersim: A Photorealistic Synthetic Dataset for Indoor Scene Understanding
github.com/apple
discuss
6 years ago
Anon84
2 points
392.
▲
Witch-Trials: Datasets and Code for “Witch Trials” (Leeson and Russ 2018)
github.com/JakeRuss
discuss
6 years ago
DyslexicAtheist
2 points
393.
▲
Sweetviz: Visualize and compare datasets, target values and associations
github.com/fbdesignpro
discuss
6 years ago
polm23
2 points
394.
▲
Datasets and Evaluation Metrics for NLP (True Open Source GPT Alternative)
github.com/huggingface
discuss
6 years ago
dragonsh
2 points
395.
▲
Datasets and evaluation metrics for natural language processing (NLP)
github.com/huggingface
discuss
6 years ago
dragonsh
2 points
396.
▲
Datasets and Evaluation Metrics for Natural Language Processing (NLP)
github.com/huggingface
discuss
6 years ago
dragonsh
2 points
397.
▲
Show HN: Covidify – coronavirus dataset and visualization generator
discuss
6 years ago
AaronWard
2 points
398.
▲
Show HN: A CLI tool for maintaining datasets in a centralized repository
github.com/ezhou7
discuss
7 years ago
nightrunner11
2 points
399.
▲
Library to scrape and clean web pages to create datasets
github.com/chiphuyen
discuss
7 years ago
khartig
2 points
400.
▲
Venmo Transaction Dataset
github.com/sa7mon
discuss
7 years ago
_salmon
2 points
401.
▲
Real numbers, data science and chaos: Fit any dataset with a single parameter
github.com/Ranlot
discuss
7 years ago
Ranlot
2 points
402.
▲
Lazynlp: Library to scrape and clean web pages to create datasets
github.com/chiphuyen
discuss
7 years ago
Osiris30
2 points
403.
▲
OpenWebText: Open Clone of OpenAI's GPT-2 WebText Dataset
github.com/jcpeterson
discuss
7 years ago
joshuacpeterson
2 points
404.
▲
Lazynlp: A library to scrape, clean, de-duplicate webpages to create datasets
github.com/chiphuyen
discuss
7 years ago
korym
2 points
405.
▲
DeepWeeds: A Multiclass Weed Species Image Dataset for Deep Learning
github.com/AlexOlsen
discuss
7 years ago
lainon
2 points
406.
▲
Show HN: Python Script to Generate Fake Datasets for Testing ML/DL Workflows
github.com/minimaxir
discuss
7 years ago
minimaxir
2 points
407.
▲
Open source tool for merging datasets
github.com/funkeinteraktiv
discuss
8 years ago
chrtze
2 points
408.
▲
Scrape multiple crypto currency data sets-write to single .csv
github.com/rootVIII
discuss
8 years ago
rootVIII
2 points
409.
▲
Analyzing League of Legends Dataset with Pandas and Python3
gist.github.com
discuss
8 years ago
kiyanwang
2 points
410.
▲
Tracking progress in NLP tasks and datasets
github.com/sebastianruder
discuss
8 years ago
neuhaus
2 points
411.
▲
The Data Linter: Lightweight, Automated Sanity Checking for ML Data Sets
github.com/brain-research
discuss
8 years ago
blopeur
2 points
412.
▲
Show HN: Simple Recommender System for MovieLens Dataset built with JavaScript
github.com/javascript-machine-learning
discuss
8 years ago
rwieruch
2 points
413.
▲
Chatito – Generate training datasets for slot filling chatbots in a breeze
github.com/rodrigopivi
discuss
9 years ago
prodrod
2 points
414.
▲
Starcraft AI Research Dataset
github.com/TorchCraft
discuss
9 years ago
jonbaer
2 points
415.
▲
StarData: A StarCraft AI Research Dataset
github.com/TorchCraft
discuss
9 years ago
indescions_2017
2 points
416.
▲
Using the Dataset API for TensorFlow Input Pipelines
github.com/tensorflow
discuss
9 years ago
mrry
2 points
417.
▲
FMA: A Dataset for Music Analysis
github.com/mdeff
discuss
9 years ago
sndean
2 points
418.
▲
FMA dataset: 106k songs, 1TB, 343 days of audio
github.com/mdeff
discuss
9 years ago
mdeff
2 points
419.
▲
Rambler&Co Released Benchmark of XGBoost, VW and Spark ML on 1TB Criteo Dataset
github.com/rambler-digital-solutions
discuss
9 years ago
pklemenkov
2 points
420.
▲
OpenRefine – assess the quality of datasets
github.com/OpenRefine
discuss
9 years ago
chirau
2 points