HTML format processor for Apertium
This tool is part of the Apertium
open-source machine translation toolbox.
apertium-deshtml is an HTML format
processor. Data should be passed through this processor before being piped
The program takes input in the form of an HTML document and produces output
suitable for processing with
HTML tags and other format information are enclosed in brackets so that
treats them as whitespace between words.
You could write the following to show how the word “gener” is
- Display this help.
- Makes the addition of trailing sentence terminator
.’) unconditional, often leading
- Suppresses the addition of a trailing sentence terminator.
- Inserts a "❡" (U+2761 CURVED STEM PARAGRAPH SIGN
ORNAMENT) at the end of <h[1–6]> and <title> tags.
Copyright © 2005, 2006 Universitat d'Alacant / Universidad de Alicante.
This is free software. You may redistribute copies of it under the terms of
the GNU General
Many... lurking in the dark and waiting for you!
“<b>gener</b>” | apertium-deshtml | lt-proc