You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Nikos Giotis 66f974475f python/PyStemmer: Added (Snowball stemming algorithms). 9 years ago
..
PyStemmer.SlackBuild python/PyStemmer: Added (Snowball stemming algorithms). 9 years ago
PyStemmer.info python/PyStemmer: Added (Snowball stemming algorithms). 9 years ago
README python/PyStemmer: Added (Snowball stemming algorithms). 9 years ago
slack-desc python/PyStemmer: Added (Snowball stemming algorithms). 9 years ago

README

Snowball stemming algorithms, for information retrieval

Stemming algorithms

PyStemmer provides access to efficient algorithms for calculating a "stemmed"
form of a word. This is a form with most of the common morphological endings
removed; hopefully representing a common linguistic base form. This is most
useful in building search engines and information retrieval software;
for example, a search with stemming enabled should be able to find a document
containing "cycling" given the query "cycles".

PyStemmer provides algorithms for several (mainly european) languages, by
wrapping the libstemmer library from the Snowball project in a Python module.

It also provides access to the classic Porter stemming algorithm for english:
although this has been superceded by an improved algorithm, the original
algorithm may be of interest to information retrieval researchers wishing
to reproduce results of earlier experiments.