Tesseract-Ocr
- ambiguous_words(1) generate sets of words Tesseract is likely to find ambiguous
- classifier_tester(1) for *legacy tesseract* engine.
- cntraining(1) character normalization training for Tesseract
- combine_lang_model(1) generate starter traineddata
- combine_tessdata(1) combine/extract/overwrite/list/compact Tesseract data
- dawg2wordlist(1) convert a Tesseract DAWG to a wordlist
- lstmeval(1) Evaluation program for LSTM-based networks.
- lstmtraining(1) Training program for LSTM-based networks.
- merge_unicharsets(1) Simple tool to merge two or more unicharsets.
- mftraining(1) feature training for Tesseract
- set_unicharset_properties(1) set properties about the unichars
- shapeclustering(1) shape clustering training for Tesseract
- tesseract(1) command-line OCR engine
- text2image(1) generate OCR training pages.
- unicharambigs(5) Tesseract unicharset ambiguities
- unicharset_extractor(1) Reads box or plain text files to extract the unicharset.
- unicharset(5) character properties file used by tesseract(1)
- wordlist2dawg(1) convert a wordlist to a DAWG for Tesseract