Doc2text – Detect text blocks and OCR poorly scanned PDFs in bulkgithub.com/jlsutherland161 pointsjlsutherland10 years ago