opennlp(1)
NAME
opennlp - CLI interface to Apache OpenNLP
DESCRIPTION
OpenNLP 2.1.0.
Usage: opennlp TOOL where TOOL is one of:
Doccat
learned document categorizer
DoccatTrainer
trainer for the learnable document categorizer
DoccatEvaluator
Measures the performance of the Doccat model with the reference data
DoccatCrossValidator
K-fold cross validator for the learnable Document Categorizer
DoccatConverter
converts 20newsgroup data format to native OpenNLP format
LanguageDetector
learned language detector
LanguageDetectorTrainer
trainer for the learnable language detector
LanguageDetectorConverter
converts leipzig data format to native OpenNLP format
LanguageDetectorCrossValidator
K-fold cross validator for the learnable Language Detector
LanguageDetectorEvaluator
Measures the performance of the Language Detector model with the reference data
DictionaryBuilder
builds a new dictionary
SimpleTokenizer
character class tokenizer
TokenizerME
learnable tokenizer
TokenizerTrainer
trainer for the learnable tokenizer
TokenizerMEEvaluator
evaluator for the learnable tokenizer
TokenizerCrossValidator
K-fold cross validator for the learnable tokenizer
TokenizerConverter
converts foreign data formats (irishsentencebank,ad,pos,masc,conllx,namefinder,parse,conllu) to native OpenNLP format
DictionaryDetokenizer
rule based detokenizer
SentenceDetector
learnable sentence detector
SentenceDetectorTrainer
trainer for the learnable sentence detector
SentenceDetectorEvaluator
evaluator for the learnable sentence detector
SentenceDetectorCrossValidator
K-fold cross validator for the learnable sentence detector
SentenceDetectorConverter
converts foreign data formats (nkjp,irishsentencebank,ad,pos,masc,conllx,namefinder,parse,moses,conllu,letsmt) to native OpenNLP format
TokenNameFinder
learnable name finder
TokenNameFinderTrainer
trainer for the learnable name finder
TokenNameFinderEvaluator
Measures the performance of the NameFinder model with the reference data
TokenNameFinderCrossValidator
K-fold cross validator for the learnable Name Finder
TokenNameFinderConverter
converts foreign data formats (evalita,ad,conll03,bionlp2004,conll02,masc,muc6,ontonotes,brat) to native OpenNLP format
CensusDictionaryCreator
Converts 1990 US Census names into a dictionary
POSTagger
learnable part-of-speech tagger
POSTaggerTrainer
trains a model for the part-of-speech tagger
POSTaggerEvaluator
Measures the performance of the POS tagger model with the reference data
POSTaggerCrossValidator
K-fold cross validator for the learnable POS tagger
POSTaggerConverter
converts foreign data formats (ad,masc,conllx,parse,ontonotes,conllu) to native OpenNLP format
LemmatizerME
learnable lemmatizer
LemmatizerTrainerME
trainer for the learnable lemmatizer
LemmatizerEvaluator
Measures the performance of the Lemmatizer model with the reference data
ChunkerME
learnable chunker
ChunkerTrainerME
trainer for the learnable chunker
ChunkerEvaluator
Measures the performance of the Chunker model with the reference data
ChunkerCrossValidator
K-fold cross validator for the chunker
ChunkerConverter
converts ad data format to native OpenNLP format
Parser
performs full syntactic parsing
ParserTrainer
trains the learnable parser
ParserEvaluator
Measures the performance of the Parser model with the reference data
ParserConverter
converts foreign data formats (ontonotes,frenchtreebank) to native OpenNLP format
BuildModelUpdater
trains and updates the build model in a parser model
CheckModelUpdater
trains and updates the check model in a parser model
TaggerModelReplacer
replaces the tagger model in a parser model
EntityLinker
links an entity to an external data set
NGramLanguageModel
gives the probability and most probable next token(s) of a sequence of tokens in a language model
All tools print help when invoked with the help parameter.
Example: opennlp SimpleTokenizer help
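
The help invocation above can be looped over several tools at once. The following is a minimal sketch, not part of the opennlp distribution: the function name show_help and the TOOLS variable are hypothetical, and the script assumes the opennlp launcher is on PATH (if it is not, it only prints the command it would have run). It uses only the documented form, opennlp TOOL help.

```shell
#!/bin/sh
# Sketch: print the built-in help for a few tools from the list above.
# TOOLS and show_help are illustrative names, not part of opennlp.

TOOLS="SimpleTokenizer SentenceDetector POSTagger"

show_help() {
    # $1 is a tool name taken from the TOOL list in this page.
    if command -v opennlp >/dev/null 2>&1; then
        # Documented form: opennlp TOOL help
        opennlp "$1" help
    else
        # Fallback when the launcher is not installed: show the command only.
        echo "opennlp not found on PATH; would run: opennlp $1 help"
    fi
}

for tool in $TOOLS; do
    show_help "$tool"
done
```

Any name from the TOOL list can be added to TOOLS; the help text of each tool describes the parameters that tool accepts.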