1B+ words corpus of original texts and experimental post-OCR correction output (huggingface.co)
7 points | CharlesW | 2 years ago