Tesseract-Ocr

ambiguous_words(1) generate sets of words Tesseract is likely to find ambiguous
classifier_tester(1) test a character classifier on training-format data (legacy Tesseract engine only)
cntraining(1) character normalization training for Tesseract
combine_lang_model(1) generate starter traineddata
combine_tessdata(1) combine/extract/overwrite/list/compact Tesseract data
dawg2wordlist(1) convert a Tesseract DAWG to a wordlist
lstmeval(1) evaluation program for LSTM-based networks
lstmtraining(1) training program for LSTM-based networks
merge_unicharsets(1) merge two or more unicharsets
mftraining(1) feature training for Tesseract
set_unicharset_properties(1) set properties of the unichars in a unicharset
shapeclustering(1) shape clustering training for Tesseract
tesseract(1) command-line OCR engine
text2image(1) generate OCR training pages
unicharambigs(5) Tesseract unicharset ambiguities
unicharset_extractor(1) read box or plain text files to extract the unicharset
unicharset(5) character properties file used by tesseract(1)
wordlist2dawg(1) convert a wordlist to a DAWG for Tesseract