Single n-gram stemming

SIGIR, pp. 415-416, 2003.

Cited by: 149|Bibtex|Views14|Links
EI
Keywords:
retrieval accuracysingle n-gramperformance penaltyefficient language-neutral approachcharacter n-gram tokenizationMore(1+)

Abstract:

Stemming can improve retrieval accuracy, but stemmers are language-specific. Character n-gram tokenization achieves many of the benefits of stemming in a language independent way, but its use incurs a performance penalty. We demonstrate that selection of a single n-gram as a pseudo-stem for a word can be an effective and efficient languag...More

Code:

Data:

Your rating :
0

 

Tags
Comments