opennlp(1)

CLI interface to Apache OpenNLP

Section 1 opennlp bookworm source

Description

OPENNLP

NAME

opennlp - CLI interface to Apache OpenNLP

DESCRIPTION

OpenNLP 2.1.0. Usage: opennlp TOOL where TOOL is one of:
Doccat

learned document categorizer

DoccatTrainer

trainer for the learnable document categorizer

DoccatEvaluator

Measures the performance of the Doccat model with the reference data

DoccatCrossValidator

K-fold cross validator for the learnable Document Categorizer

DoccatConverter

converts 20newsgroup data format to native OpenNLP format

LanguageDetector

learned language detector

LanguageDetectorTrainer

trainer for the learnable language detector

LanguageDetectorConverter

converts leipzig data format to native OpenNLP format

LanguageDetectorCrossValidator

K-fold cross validator for the learnable Language Detector

LanguageDetectorEvaluator

Measures the performance of the Language Detector model with the reference data

DictionaryBuilder

builds a new dictionary

SimpleTokenizer

character class tokenizer

TokenizerME

learnable tokenizer

TokenizerTrainer

trainer for the learnable tokenizer

TokenizerMEEvaluator

evaluator for the learnable tokenizer

TokenizerCrossValidator

K-fold cross validator for the learnable tokenizer

TokenizerConverter

converts foreign data formats (irishsentencebank,ad,pos,masc,conllx,namefinder,parse,conllu) to native OpenNLP format

DictionaryDetokenizer

rule based detokenizer

SentenceDetector

learnable sentence detector

SentenceDetectorTrainer

trainer for the learnable sentence detector

SentenceDetectorEvaluator

evaluator for the learnable sentence detector

SentenceDetectorCrossValidator

K-fold cross validator for the learnable sentence detector

SentenceDetectorConverter

converts foreign data formats (nkjp,irishsentencebank,ad,pos,masc,conllx,namefinder,parse,moses,conllu,letsmt) to native OpenNLP format

TokenNameFinder

learnable name finder

TokenNameFinderTrainer

trainer for the learnable name finder

TokenNameFinderEvaluator

Measures the performance of the NameFinder model with the reference data

TokenNameFinderCrossValidator

K-fold cross validator for the learnable Name Finder

TokenNameFinderConverter

converts foreign data formats (evalita,ad,conll03,bionlp2004,conll02,masc,muc6,ontonotes,brat) to native OpenNLP format

CensusDictionaryCreator

Converts 1990 US Census names into a dictionary

POSTagger

learnable part of speech tagger

POSTaggerTrainer

trains a model for the part-of-speech tagger

POSTaggerEvaluator

Measures the performance of the POS tagger model with the reference data

POSTaggerCrossValidator

K-fold cross validator for the learnable POS tagger

POSTaggerConverter

converts foreign data formats (ad,masc,conllx,parse,ontonotes,conllu) to native OpenNLP format

LemmatizerME

learnable lemmatizer

LemmatizerTrainerME

trainer for the learnable lemmatizer

LemmatizerEvaluator

Measures the performance of the Lemmatizer model with the reference data

ChunkerME

learnable chunker

ChunkerTrainerME

trainer for the learnable chunker

ChunkerEvaluator

Measures the performance of the Chunker model with the reference data

ChunkerCrossValidator

K-fold cross validator for the chunker

ChunkerConverter

converts ad data format to native OpenNLP format

Parser

performs full syntactic parsing

ParserTrainer

trains the learnable parser

ParserEvaluator

Measures the performance of the Parser model with the reference data

ParserConverter

converts foreign data formats (ontonotes,frenchtreebank) to native OpenNLP format

BuildModelUpdater

trains and updates the build model in a parser model

CheckModelUpdater

trains and updates the check model in a parser model

TaggerModelReplacer

replaces the tagger model in a parser model

EntityLinker

links an entity to an external data set

NGramLanguageModel

gives the probability and most probable next token(s) of a sequence of tokens in a language model

All tools print help when invoked with help parameter Example: opennlp SimpleTokenizer help