The type of text similarity measure. Valid values include:
|<B>--stoplistB>=FILE||The name of a file containing stop words. Under the ./sample directory, we give two formats of the stop words format, one word per line(stoplist.txt) and one word in the regular expression format per line(stoplist-nsp.regex). If you want to mix these two formats to make your own stop words file, it is also all right.|
|<B>--no-normalizeB>||Do not normalize scores. Normally, scores are normalized so that they range from 0 to 1. Using this option will give you a raw score instead.|
|<B>--stringB>||Input will be provided on the command line as strings, not files.|
|<B>--verboseB>||Show all the matches that are found between the files, their length and frequency, as well as precision, recall, F-measure, E-measure, Cosine, and the Dice Coefficient.|
|<B>--helpB>||Show a detailed help message.|
|<B>--versionB>||Show version information.|
Ted Pedersen, University of Minnesota, Duluth tpederse at d.umn.edu Jason Michelizzi Ying Liu, University of Minnesota, Twin Cities liux0395 at umn.edu
Last modified by: $Id: text_similarity.pl,v 184.108.40.206 2013/06/26 02:38:12 tpederse Exp $
--compfile is not working, seems to cause hang (tdp 3/21/08)
Copyright (C) 2004-2010, Jason Michelizzi, Ted Pedersen and Ying Liu
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
|perl v5.20.3||TEXT_SIMILARITY (1)||2013-06-26|