Inactive

Commits : Listings

Analyzed 13 days ago, based on code collected 13 days ago.
May 30, 2023 — May 30, 2024
Commit Message | Date
better token splitting, add 'is_capitalized' and fix number finding | about 11 years ago
experiment with better hyphenation | about 11 years ago
be more nuanced about digits when munging hocr | about 11 years ago
first cut at code that cleans tesseract's hocr output | about 11 years ago
fewer light scans | about 11 years ago
fixed merge | about 11 years ago
shorten number of darkness runs | about 11 years ago
fixed merge | about 11 years ago
Merge branch 'master' of github.com:brobertson/rigaudon | about 11 years ago
report errors, don't die | about 11 years ago
allow pngs to be processed | about 11 years ago
include instructions for jp2 | about 11 years ago
info about building on Ubuntu 12.04 | over 11 years ago
turn on macron and under-dot finding | over 11 years ago
increase threshold; add t -> iota for Didot fonts | over 11 years ago
support regularizing java code that does directories at a time | over 11 years ago
avoid error on line 152 | over 11 years ago
simplify output name | over 11 years ago
make different side-by-side view, based on language of abbyy OCR output | over 11 years ago
include copyright and index link | over 11 years ago
remove more bogus characters | over 11 years ago
produce simplified name on output, so that this can be used for side-by-side view | over 11 years ago
revise tei dump to avoid wrong language | over 11 years ago
include ability to check if the archive document has a Latin-script abbyy OCR document | over 11 years ago
single command to process volume from archive, given volume id and classifier path | over 11 years ago
initial ci of script that produces sidebyside view | over 11 years ago
change temp. replacement characters for diacritics; tweak parameters | over 11 years ago
process all the way to side-by-side view | over 11 years ago
fix ArrayValue errors (due to len(array)*-1* test) and 'AttributeError on line.line_matches | over 11 years ago
make side-by-side view | over 11 years ago