HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
361.
▲
Show HN: Search in HuggingFace Dataset from the URL
github.com/lightonai
discuss
2 years ago
raphaelty
2 points
362.
▲
DataChain: Prepare and curate datasets for AI/ML
github.com/iterative
discuss
2 years ago
shcheklein
2 points
363.
▲
DataChain: Prepare and curate datasets for AI/ML
github.com/iterative
discuss
2 years ago
shcheklein
2 points
364.
▲
Rill transforms data sets into opinionated dashboards using SQL. BI-as-code
github.com/rilldata
discuss
2 years ago
nateb2022
2 points
365.
▲
Roapi: Create APIs for slow moving datasets without writing code
github.com/roapi
discuss
2 years ago
sea-gold
2 points
366.
▲
Reladiff: High-performance diffing of large datasets across databases
github.com/erezsh
discuss
2 years ago
todsacerdoti
2 points
367.
▲
The largest dataset of LLM jailbreak prompts
github.com/verazuo
discuss
2 years ago
titaniumrain
2 points
368.
▲
Microsoft/MS-MARCO-Web-Search: A large-scale information-rich web dataset
github.com/microsoft
discuss
2 years ago
alexmolas
2 points
369.
▲
OpenForest – A catalogue of open access forest datasets
github.com/RolnickLab
discuss
2 years ago
Brajeshwar
2 points
370.
▲
Dataset to extract stock tickers from NL
github.com/rohanmahen
discuss
2 years ago
rohanmahen
2 points
371.
▲
Show HN: Lightly Insights – open-source dataset analysis
github.com/lightly-ai
discuss
3 years ago
isusmelj
2 points
372.
▲
Fabricator – OSS framework to generate datasets with LLMs
github.com/flairNLP
discuss
3 years ago
aantti
2 points
373.
▲
Framework to easily create LLM powered bots over any dataset
github.com/embedchain
discuss
3 years ago
ensocode
2 points
374.
▲
Show HN: A Python toolkit for working with parquet datasets on AWS
github.com/marwan116
discuss
3 years ago
ortamina
2 points
375.
▲
Processing large JSON datasets by streaming
github.com/kashifrazzaqui
discuss
3 years ago
kashif
2 points
376.
▲
RedPajama-Data: Code for preparing large datasets
github.com/togethercomputer
discuss
3 years ago
harrisonpowers
2 points
377.
▲
OpenFEMA Samples – Code, dataset, and analysis samples that utilize OpenFEMA API
github.com/FEMA
discuss
3 years ago
mindcrime
2 points
378.
▲
Open Source AI Image Classifier with Automatic Dataset Creator
github.com/serpapi
discuss
3 years ago
thefoolofdaath
2 points
379.
▲
Show HN: DescribeML is a VSCode language plugin to describe ML datasets
github.com/SOM-Research
discuss
4 years ago
softmodeling
2 points
380.
▲
Darmok and Jalad at Tanagra: Dataset and Model for English-Tamarian Translation
github.com/cognitiveailab
discuss
4 years ago
darwinwhy
2 points
381.
▲
SimilarVerbBank: Dataset of similar verbs formed with the Apriori algorithm
github.com/nlptechbook
discuss
4 years ago
jxireal
2 points
382.
▲
HuggingFace/evaluate: A library for easily evaluating ML models and datasets
github.com/huggingface
discuss
4 years ago
occamschainsaw
2 points
383.
▲
Open-source motion datasets collected by Bandai Namco Research
github.com/BandaiNamcoResearchInc
discuss
4 years ago
nikolay
2 points
384.
▲
Show HN: Bollywood Lyrics Dataset
github.com/hbdeshmukh
discuss
4 years ago
hdesh
2 points
385.
▲
Ivis: Dimensionality Reduction In Large Datasets Using Siamese Networks
github.com/beringresearch
discuss
5 years ago
optimalsolver
2 points
386.
▲
Show HN: H5records – large dataset format for deep learning
github.com/theblackcat102
discuss
5 years ago
polymorph1sm
2 points
387.
▲
PythonProgrammingPuzzles: A Dataset of Python Challenges for AI Research
github.com/microsoft
discuss
5 years ago
lnyan
2 points
388.
▲
Gretel-synthetics: open-source library to create synthetic datasets
github.com/gretelai
discuss
5 years ago
meowterspace42
2 points
389.
▲
AutoViz: Automatically visualize any dataset, any size with one line of code
github.com/AutoViML
discuss
5 years ago
optimalsolver
2 points
390.
▲
World Mortality Dataset – 2020 vs. past
github.com/akarlinsky
discuss
5 years ago
puttycat
2 points
More