hfst-lookup(1)

=perform transducer lookup (apply)

Section 1 hfst bookworm source

Description

HFST-LOOKUP

NAME

hfst-lookup - =perform transducer lookup (apply)

SYNOPSIS

hfst-lookup [OPTIONS...] [INFILE]

DESCRIPTION

perform transducer lookup (apply) NOTE: hfst-lookup does lookup from left to right as opposed to xfst and foma

lookup which is carried out from right to left. In order to do lookup in a similar way as xfst and foma, use ’hfst-flookup’ instead.

Common options:

-h, --help

Print help message

-V, --version

Print version info

-v, --verbose

Print verbosely while processing

-q, --quiet

Only print fatal erros and requested output

-s, --silent

Alias of --quiet

Input/Output options:

-i, --input=INFILE

Read input transducer from INFILE

-o, --output=OUTFILE

Write output to OUTFILE

-p, --pipe-mode[=STREAM] Control input and output streams

Lookup options:

-I, --input-strings=SFILE

Read lookup strings from SFILE

-O, --output-format=OFORMAT

Use OFORMAT printing results sets

-e, --epsilon-format=EPS

Print epsilon as EPS

-F, --input-format=IFORMAT

Use IFORMAT parsing input

-x, --statistics

Print statistics

-X, --xfst=VARIABLE

Toggle xfst VARIABLE

-c, --cycles=INT

How many times to follow input epsilon cycles (only for non-lookup-optimized transducers)

-n, --max-number=INT

Maximum number of results printed for each input (only for lookup-optimized transducers)

-b, --beam=B

Output only analyses whose weight is within B from the best analysis

-t, --time-cutoff=S

Limit search after having used S seconds per input (only for lookup-optimized transducers)

-C, --cascade=CASCADE

How multiple transducers in input are handled

-P, --progress

Show neat progress bar if possible

If OUTFILE or INFILE is missing or -, standard streams will be used. Format of result depends on format of INFILE OFORMAT is one of {xerox,cg,apertium}, xerox being default IFORMAT is one of {text,spaced,apertium}, default being text, unless OFORMAT is apertium VARIABLEs relevant to lookup are {print-pairs,print-space, quote-special,show-flags,obey-flags} Input epsilon cycles are followed by default INT=5 times. Epsilon is printed by default as an empty string. B must be a non-negative float. S must be a non-negative float. The default, 0.0, indicates no cutoff. If the input contains several transducers, a set containing results from all transducers is printed for each input string.

CASCADE must be one of { union, priority-union, composition }. If not specified, defaults to {union}.

STREAM can be { input, output, both }. If not given, defaults to {both}. If input file is not specified with -I, input is read interactively line by line from the user. If you redirect input from a file, use --pipe-mode=input. --pipe-mode=output is ignored on non-windows platforms.

Todo:

Support --xfst=obey-flags for optimized lookup format. Support --cycles for optimized lookup format.

Known bugs:

’quote-special’ quotes spaces that come from ’print-space’

REPORTING BUGS

Report bugs to <hfst-bugs@helsinki.fi> or directly to our bug tracker at: <https://github.com/hfst/hfst/issues>

hfst-lookup home page: <https://github.com/hfst/hfst/wiki/HfstLookup>
General help using HFST software: <https://github.com/hfst/hfst/wiki>

COPYRIGHT

Copyright © 2017 University of Helsinki, License GPLv3: GNU GPL version 3 <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law.