Dawson's Stemmer

The Dawson stemmer was developed by J.L. Dawson of the Literary and Linguistics Computing Centre at Cambridge University. It is a complex, linguistically targeted stemmer based closely on the Lovins stemmer, extending the suffix rule list to approximately 1200 suffixes. It keeps the longest-match, single-pass design of Lovins but drops the recoding rules, which were found to be unreliable. In their place it uses an extension of the partial matching procedure that is also defined in the Lovins paper.
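
The two ideas described above, longest-match single-pass suffix removal and partial matching of the resulting stems, can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration: the eight-entry suffix list stands in for the roughly 1200 suffixes of the real stemmer, and the stem-ending classes, the minimum stem length, and the minimum shared-prefix length are hypothetical values, not taken from Dawson's paper.

```python
# Toy suffix list: illustrative only, NOT Dawson's actual list of ~1200 suffixes.
SUFFIXES = ["ational", "ation", "ions", "ion", "ing", "ed", "es", "s"]

# Hypothetical minimum number of characters that must remain as the stem.
MIN_STEM_LEN = 2

# Hypothetical stem-ending classes: endings in the same set are treated
# as interchangeable when two stems are compared by partial matching.
ENDING_CLASSES = [{"b", "pt"}, {"d", "s"}]


def strip_suffix(word):
    """Remove the longest matching suffix in a single pass."""
    # Try longer suffixes first so the longest match wins.
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        stem_len = len(word) - len(suffix)
        if word.endswith(suffix) and stem_len >= MIN_STEM_LEN:
            return word[:stem_len]
    return word  # no suffix applies, so the word is its own stem


def common_prefix_len(a, b):
    """Count the leading characters that a and b share."""
    n = 0
    for ca, cb in zip(a, b):
        if ca != cb:
            break
        n += 1
    return n


def partial_match(stem_a, stem_b, min_common=4):
    """Treat two stems as equivalent if they share a long enough prefix
    and their remaining endings fall into the same ending class."""
    n = common_prefix_len(stem_a, stem_b)
    if n < min_common:
        return False
    rest_a, rest_b = stem_a[n:], stem_b[n:]
    if rest_a == rest_b:  # identical stems
        return True
    return any(rest_a in cls and rest_b in cls for cls in ENDING_CLASSES)


if __name__ == "__main__":
    a = strip_suffix("absorption")   # "absorpt"
    b = strip_suffix("absorbing")    # "absorb"
    print(a, b, partial_match(a, b))  # absorpt absorb True
```

In this sketch no stem is rewritten after stripping, which is what a recoding rule would do. Instead, "absorpt" and "absorb" are left as they are and conflated at comparison time, because they share the prefix "absor" and their endings "pt" and "b" belong to the same (hypothetical) class.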
J.L. Dawson, 1974: "Suffix removal for word conflation," Bulletin of the Association for Literary & Linguistic Computing, 2 (3), 33-46.