JHU Experiments in Monolingual Farsi Document Retrieval at CLEF 2009
CLEF (Working Notes), 2009.
farsi document retrieval
At CLEF 2009 JHU submitted runs in the ad hoc track for the monolingual Persian evaluation. Variants of character n-gram tokenization provided a 10% relative gain over unnormalized words. A run based on skip n-grams, which allow internal skipped letters, achieved a mean average precision of 0.4938. Using traditional 5-grams resulted in a ...More
Full Text (Upload PDF)
PPT (Upload PPT)