HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
661.
▲
Fast and scalable dataset preparation and curation tool from Nvidia
github.com/NVIDIA
discuss
2 years ago
shcheklein
2 points
662.
▲
Show HN: Search in HuggingFace Dataset from the URL
github.com/lightonai
discuss
2 years ago
raphaelty
2 points
663.
▲
DataChain: Prepare and curate datasets for AI/ML
github.com/iterative
discuss
2 years ago
shcheklein
2 points
664.
▲
DataChain: Prepare and curate datasets for AI/ML
github.com/iterative
discuss
2 years ago
shcheklein
2 points
665.
▲
Roapi: Create APIs for slow moving datasets without writing code
github.com/roapi
discuss
2 years ago
sea-gold
2 points
666.
▲
Reladiff: High-performance diffing of large datasets across databases
github.com/erezsh
discuss
2 years ago
todsacerdoti
2 points
667.
▲
The largest dataset of LLM jailbreak prompts
github.com/verazuo
discuss
2 years ago
titaniumrain
2 points
668.
▲
Microsoft/MS-MARCO-Web-Search: A large-scale information-rich web dataset
github.com/microsoft
discuss
2 years ago
alexmolas
2 points
669.
▲
Infgen: A Deflate Stream Disassembler
github.com/madler
discuss
2 years ago
suhacker256
2 points
670.
▲
EOL DR / End-of-Life Disaster Response
github.com/potatoqualitee
discuss
2 years ago
ashurov
2 points
671.
▲
OpenForest – A catalogue of open access forest datasets
github.com/RolnickLab
discuss
2 years ago
Brajeshwar
2 points
672.
▲
Dataset to extract stock tickers from NL
github.com/rohanmahen
discuss
2 years ago
rohanmahen
2 points
673.
▲
udis86 – Disassembler Library for x86 and x86-64
github.com/vmt
discuss
2 years ago
peter_d_sherman
2 points
674.
▲
Gvasm: Assembler and disassembler designed specifically for GBA homebrew
github.com/velipso
discuss
2 years ago
generichuman
2 points
675.
▲
Show HN: Lightly Insights – open-source dataset analysis
github.com/lightly-ai
discuss
3 years ago
isusmelj
2 points
676.
▲
Fabricator – OSS framework to generate datasets with LLMs
github.com/flairNLP
discuss
3 years ago
aantti
2 points
677.
▲
Framework to easily create LLM powered bots over any dataset
github.com/embedchain
discuss
3 years ago
ensocode
2 points
678.
▲
Show HN: A Python toolkit for working with parquet datasets on AWS
github.com/marwan116
discuss
3 years ago
ortamina
2 points
679.
▲
Just in Time Datastructures
github.com/UBOdin
discuss
3 years ago
danny00
2 points
680.
▲
Processing large JSON datasets by streaming
github.com/kashifrazzaqui
discuss
3 years ago
kashif
2 points
681.
▲
RedPajama-Data: Code for preparing large datasets
github.com/togethercomputer
discuss
3 years ago
harrisonpowers
2 points
682.
▲
OpenFEMA Samples – Code, dataset, and analysis samples that utilize OpenFEMA API
github.com/FEMA
discuss
3 years ago
mindcrime
2 points
683.
▲
Benchmark of simple operations against common KV datastores with Python clients
github.com/alisaifee
discuss
3 years ago
indydevs
2 points
684.
▲
Open Source AI Image Classifier with Automatic Dataset Creator
github.com/serpapi
discuss
3 years ago
thefoolofdaath
2 points
685.
▲
Show HN: DescribeML is a VSCode language plugin to describe ML datasets
github.com/SOM-Research
discuss
4 years ago
softmodeling
2 points
686.
▲
Darmok and Jalad at Tanagra: Dataset and Model for English-Tamarian Translation
github.com/cognitiveailab
discuss
4 years ago
darwinwhy
2 points
687.
▲
SimilarVerbBank: Dataset of similar verbs formed with the Apriori algorithm
github.com/nlptechbook
discuss
4 years ago
jxireal
2 points
688.
▲
HuggingFace/evaluate: A library for easily evaluating ML models and datasets
github.com/huggingface
discuss
4 years ago
occamschainsaw
2 points
689.
▲
Open-source motion datasets collected by Bandai Namco Research
github.com/BandaiNamcoResearchInc
discuss
4 years ago
nikolay
2 points
690.
▲
Show HN: Bollywood Lyrics Dataset
github.com/hbdeshmukh
discuss
4 years ago
hdesh
2 points