Man page - apertium-deshtml(1)
APERTIUM-DESHTML (1) General Commands Manual APERTIUM-DESHTML (1)
NAME
apertium-deshtml - HTML format processor for Apertium
SYNOPSIS
apertium-deshtml [ -hino ] [ input_file [ output_file ]]
DESCRIPTION
This tool is part of the Apertium open-source machine translation toolbox: https://apertium.org/.
apertium-deshtml is an HTML format processor. It takes an HTML document as input and produces output suitable for processing with lt-proc (1), so data should be passed through it before being piped to lt-proc (1). HTML tags and other format information are enclosed in square brackets so that lt-proc (1) treats them as whitespace between words.
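As a rough illustration of the bracket convention described above, the following sketch wraps anything that looks like an HTML tag in square brackets using sed. It is only an approximation under simplifying assumptions: the function name wrap_tags is hypothetical, and the real apertium-deshtml also deals with escaping, entities, and sentence terminators, so its actual output will differ.

```shell
#!/bin/sh
# Hypothetical helper, NOT the real apertium-deshtml: it only shows the idea
# of enclosing format information (here, anything shaped like <...>) in
# square brackets so that a later stage can skip over it.
wrap_tags() {
    sed -E 's/<[^>]+>/[&]/g'
}

# The word itself is left untouched; only the tags are bracketed.
printf '%s\n' '<b>gener</b>' | wrap_tags
# prints: [<b>]gener[</b>]
```

The text between the bracketed spans is what a morphological analyser would actually see as words.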
OPTIONS
-h, --help
Display this help.
-i
Makes the addition of a trailing sentence terminator ('.') unconditional, often leading to duplicates.
-n
Suppresses the addition of a trailing sentence terminator.
-o
Inserts a "❡" (U+2761 CURVED STEM PARAGRAPH SIGN ORNAMENT) at the end of <h[1-6]> and <title> tags.
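The options above could be exercised with a small wrapper such as the one sketched here. The function name run_deshtml is hypothetical and the sketch assumes apertium-deshtml is installed on the PATH; it simply passes its arguments through and reports a missing tool instead of failing obscurely. No output is shown because it depends on the installed version.

```shell
#!/bin/sh
# Hypothetical wrapper: forwards its arguments to apertium-deshtml when the
# tool is available, and returns 127 with a message on stderr when it is not.
run_deshtml() {
    if ! command -v apertium-deshtml >/dev/null 2>&1; then
        echo "apertium-deshtml not found on PATH" >&2
        return 127
    fi
    apertium-deshtml "$@"
}

# Example invocations, using only the flags documented in OPTIONS:
#   printf '%s\n' '<b>gener</b>'   | run_deshtml -n   # no trailing sentence terminator
#   printf '%s\n' '<b>gener</b>'   | run_deshtml -i   # always add a terminator
#   printf '%s\n' '<h1>Title</h1>' | run_deshtml -o   # mark the end of headings with U+2761
```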
EXAMPLES
You could write the following to show how the word "gener" is analysed:
echo "<b>gener</b>" | apertium-deshtml | lt-proc ca-es.automorf.bin
SEE ALSO
apertium (1), apertium-desrtf (1), apertium-destxt (1), lt-proc (1)
COPYRIGHT
Copyright © 2005, 2006 Universitat d'Alacant / Universidad de Alicante. This is free software. You may redistribute copies of it under the terms of the GNU General Public License: https://www.gnu.org/licenses/gpl.html.
BUGS
Many... lurking in the dark and waiting for you!
Apertium March 21, 2006 APERTIUM-DESHTML (1)