Sunpinyin-Utils
- genpyt(1) generate the PINYIN lexicon
- getwordfreq(1) print word freq information from language model
- idngram_merge(1) merge idngram file into one
- ids2ngram(1) generate n-gram data file from ids file
- mmseg(1) maximum matching segment Chinese text.
- slmbuild(1) generate language model from idngram file
- slminfo(1) get information of a back-off language model
- slmpack(1) convert the ARPA format of SunPinyin back-off language model to its binary representation
- slmprune(1) prune the back-off language model to a reasonable size
- slmseg(1) maximum matching segment Chinese text.
- slmthread(1) threads the language model
- tslmendian(1) change the byte-order of sunpinyin's threaded back-off language model
- tslminfo(1) get information of a threaded back-off language model