apertium-unformat
—
unformatted text extractor for Apertium
apertium-unformat |
[-f format]
[infile [outfile]] |
apertium
is the application that extract
unformatted text from documents.
-f
format
- Specifies the format of the input and output files which can have these
values:
txt
- (default value) Input and output files are in text format.
html
- Input and output files are in “html” format. This
“html” is the one acceptd by the vast majority of web
browsers.
rtf
- Input and output files are in “rtf” format. The accepted
“rtf” is the one generated by Microsoft WordPad and
Microsoft Office up to and including BOffice 97.
- infile
- Input file (stdin by default).
- outfile
- Output file (stdout by default).
Copyright © 2005, 2006 Universitat d'Alacant / Universidad
de Alicante. This is free software. You may redistribute copies of it under
the terms of the
GNU General Public License.
Many... lurking in the dark and waiting for you!