HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Show HN: A lightweight open-source web analytics for webdevs
github.com/extractumio
discuss
3 years ago
instad
7 points
2.
▲
Article extraction benchmark: open-source libraries and commercial services
github.com/scrapinghub
10 comments
6 years ago
lopuhin
19 points
3.
▲
Scrape any website quickly with LLM (open-source)
github.com/trancethehuman
1 comment
3 years ago
hainghiem375
3 points
4.
▲
Show HN: Query Wikidata with DuckDB Instead of Sparql
github.com/piebro
discuss
6 months ago
piebro
2 points
5.
▲
Create a database of crawled HTML pages with RethinkDB and Python
github.com/lethain
discuss
14 years ago
dsr12
1 points
6.
▲
Show HN: Speech feature extraction package developed in Python
github.com/astorfi
discuss
9 years ago
irsina
5 points
7.
▲
Show HN: Contract Extraction Assistant – Fast Batch Extraction
github.com/Qleric-labs
discuss
8 months ago
Mo1756
2 points
8.
▲
Show HN: Contract Extraction Assistant – Local, open-source contract data tool
github.com/Qleric-labs
discuss
8 months ago
Mo1756
2 points
9.
▲
Show HN: Speech feature extraction package developed in Python
github.com/astorfi
discuss
9 years ago
irsina
1 points
10.
▲
Show HN: Yapit – PDF and webpage reader with TTS that doesn't suck
github.com/yapit-tts
1 comment
3 months ago
MaxWolf-01
5 points
11.
▲
Computer Vision Project: Fingerprint Minutiae Feature Extraction
github.com/Utkarsh-Deshmukh
discuss
5 years ago
d_utkarsh
3 points
12.
▲
Show HN: EmbedRank: Unsupervised Keyphrase Extraction Using Sentence Embeddings
github.com/swisscom
discuss
7 years ago
Wronskia
2 points
13.
▲
Journalism AI – Quotes extraction for modular journalism
github.com/JournalismAI-2021-Quotes
discuss
5 years ago
malshe
1 points
14.
▲
Tutorial: Extracting structured data from websites using Groq and Firecrawl
github.com/mendableai
discuss
2 years ago
nickca
3 points
15.
▲
DEDA – Tracking Dots Extraction, Decoding and Anonymisation Toolkit
github.com/dfd-tud
99 comments
a year ago
pavel_lishin
286 points
16.
▲
Show HN: Unblob – extraction suite for 30+ file formats
github.com/onekey-sec
42 comments
3 years ago
kissgyorgy
240 points
17.
▲
Zpdf: PDF text extraction in Zig
github.com/Lulzx
87 comments
6 months ago
lulzx
217 points
18.
▲
Show HN: Kreuzberg – Modern async Python library for document text extraction
github.com/Goldziher
75 comments
a year ago
nhirschfeld
197 points
19.
▲
Web Clipper Browser Extension with Automatic Content Extraction, Now Open Source
github.com/jhlyeung
25 comments
6 years ago
laybak
192 points
20.
▲
DeepDoctection: Document extraction and analysis using deep learning models
github.com/deepdoctection
62 comments
3 years ago
bpiche
191 points
21.
▲
Run structured extraction on documents/images locally with Ollama and Pydantic
github.com/vlm-run
29 comments
a year ago
EarlyOom
170 points
22.
▲
Tsfresh – Automatic extraction of relevant features from time series
github.com/blue-yonder
8 comments
10 years ago
restapi
167 points
23.
▲
Nvidia-Ingest: Multi-modal data extraction
github.com/NVIDIA
45 comments
a year ago
mihaid150
145 points
24.
▲
Heartleech: Automated OpenSSL private key extraction tool using Heartbleed
github.com/robertdavidgraham
76 comments
12 years ago
FredericJ
114 points
25.
▲
A library for audio feature extraction, regression, classification, segmentation
github.com/tyiannak
12 comments
5 years ago
nothrowaways
107 points
26.
▲
Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API
github.com/majcheradam
36 comments
5 months ago
adammajcher
99 points
27.
▲
Coq to Rust Program Extraction
github.com/pirapira
18 comments
10 years ago
kushti
99 points
28.
▲
DEDA – Tracking Dots Extraction, Decoding and Anonymisation Toolkit
github.com/dfd-tud
7 comments
8 years ago
adulau
83 points
29.
▲
RoboSat: feature extraction from aerial and satellite imagery
github.com/mapbox
15 comments
8 years ago
danieljh
80 points
30.
▲
Show HN: Movie Iris - Visualizing Films Through Color Extraction
github.com/LoSinCos
37 comments
2 years ago
losincos
78 points
More