HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
91.
▲
Camelot: PDF Table Extraction for Humans
github.com/socialcopsdev
discuss
8 years ago
jonbaer
3 points
92.
▲
Show HN: Camelot – PDF Table Extraction for Humans
github.com/socialcopsdev
discuss
8 years ago
vortex_ape
3 points
93.
▲
Using AWS Lambda for fast OCR text extraction (and non OCR too)
github.com/skylander86
discuss
9 years ago
skylander
3 points
94.
▲
Ing: Network session metadata extraction in Go
github.com/johnzachary
discuss
10 years ago
akadien
3 points
95.
▲
Snorkel: A lightweight platform for developing information extraction systems
github.com/HazyResearch
discuss
10 years ago
dsr12
3 points
96.
▲
Show HN: Knowledge Table – Explainable multi-document extraction
github.com/whyhow-ai
1 comment
2 years ago
tomsmoker
2 points
97.
▲
GPT-based ontological extraction tools, including SPIRES
github.com/monarch-initiative
1 comment
3 years ago
gardenfelder
2 points
98.
▲
DEDA – Tracking Dots Extraction, Decoding and Anonymisation Toolkit
github.com/dfd-tud
1 comment
4 years ago
d4a
2 points
99.
▲
Open Information Extraction (OIE) Resources
github.com/gkiril
1 comment
7 years ago
kgashteo
2 points
100.
▲
Node.js: fast EXIF extraction without loading whole file into memory
github.com/titarenko
1 comment
10 years ago
titarenko
2 points
101.
▲
Im-rodrigo/eatiht – html text extraction
github.com/im-rodrigo
discuss
12 years ago
rcarmo
2 points
102.
▲
Android (Stock) Browser: cross-domain cookie/response extraction module
github.com/rapid7
discuss
12 years ago
thefreeman
2 points
103.
▲
Show HN: Review-oriented DOCX extraction toolkit for Rust
github.com/artemnistuley
discuss
18 days ago
nistuley
2 points
104.
▲
SIE: Unified Inference Engine for Embeddings, Reranking, and Extraction
github.com/superlinked
discuss
25 days ago
modinfo
2 points
105.
▲
Kubernetes Secret Extraction via ArgoCD ServerSideDiff
github.com/argoproj
discuss
2 months ago
milkglass
2 points
106.
▲
XPath Extractor – Chrome Extension for Web Data Extraction
discuss
4 months ago
HFerrahoglu
2 points
107.
▲
Unstract: Open-source platform to ship document extraction APIs/MCPs in minutes
github.com/Zipstack
discuss
6 months ago
naren87
2 points
108.
▲
Unstract: Open-source platform to ship document extraction APIs/MCPs in minutes
github.com/Zipstack
discuss
8 months ago
naren87
2 points
109.
▲
Unjs/unpdf: PDF extraction and rendering across all JavaScript runtimes
github.com/unjs
discuss
9 months ago
Onavo
2 points
110.
▲
Unstract: Open-source platform to ship document extraction APIs/MCPs in minutes
github.com/Zipstack
discuss
9 months ago
naren87
2 points
111.
▲
Show HN: Open-source, cross platform document data extraction with no OCR
github.com/NanoNets
discuss
a year ago
prithiv10
2 points
112.
▲
DEDA – Tracking Dots Extraction, Decoding and Anonymisation Toolkit
github.com/dfd-tud
discuss
a year ago
mmh0000
2 points
113.
▲
Extractous – Fast Text Extraction for GenAI with Rust and Apache Tika
github.com/yobix-ai
discuss
2 years ago
dmezzetti
2 points
114.
▲
Protobuf-driven data extraction with language models
github.com/danielcorin
discuss
2 years ago
danielcorin
2 points
115.
▲
Dragnet: Just the facts – web page content extraction
github.com/dragnet-org
discuss
2 years ago
nateb2022
2 points
116.
▲
Instructor PHP – structured data extraction in PHP, powered by LLMs
github.com/cognesy
discuss
2 years ago
statusredaudio
2 points
117.
▲
Metafeature Extraction for Unstructured Data
github.com/superwise-ai
discuss
3 years ago
gardenfelder
2 points
118.
▲
Javascript build tool for NodeJS, features precision code extraction
github.com/hij1nx
discuss
14 years ago
jenhsun
2 points
119.
▲
Image color palette extraction with node-canvas for node.js
github.com/visionmedia
discuss
14 years ago
chrismealy
2 points
120.
▲
Hironex – automatic, unsupervised extraction of road networks from historic maps
github.com/johannesuhl
discuss
4 years ago
dhotson
2 points
More