Anybody need a quick and dirty Python 2.5.2 implementation of the Porter German stemming algorithm?
I wrote one for the German S.L.U.T student project today: porterde.py
The only testing I did was running it on the complete list of examples on the page linked above (the ones with the pink background).
To use it, import porterde as a module and call porterde.stem(word). The function requires and returns unicode strings.
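Since the function only takes unicode strings, byte strings read from a file or the terminal have to be decoded first. Here is a rough sketch of a wrapper for that. The names to_unicode and stem_word are made up for this example and are not part of porterde.py, and the UTF-8 default is just an assumption about your input.

```python
# Hypothetical helpers, not part of porterde.py.
# type(u"") is the unicode text type on both Python 2 and Python 3.
text_type = type(u"")


def to_unicode(word, encoding="utf-8"):
    """Return word as a unicode string, decoding byte strings if needed."""
    if isinstance(word, text_type):
        return word
    # Assumption: anything that is not unicode is an encoded byte string.
    return word.decode(encoding)


def stem_word(word, stem, encoding="utf-8"):
    """Decode word if necessary, then hand it to the given stem function."""
    return stem(to_unicode(word, encoding))
```

With the module imported, stem_word(raw_word, porterde.stem) should then work for both unicode and UTF-8 encoded input. The stem function is passed in as an argument so the wrapper does not depend on any particular stemmer.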
Edit: I should have looked harder before jumping into this. There already is PyStemmer, which does the same thing but for eight languages. Thanks to Mathieu for pointing that out.