TDT-2002 Topic Tracking at Maryland: First Experiments with the Lemur Toolkit

msra(2003)

引用 25|浏览33
暂无评分
摘要
The University of Maryland submitted six topic tracking runs for the 2002 Topic Detection and Tracking evaluation. Two runs were produced using the Lemur language modeling toolkit, the remaining four were produced using an separate system coded in Perl. The Lemur runs outperformed the Perl runs on the required condition because term frequency information was better handled. Two of the Perl runs used native Arabic orthography with two-best translation based on a statistical lexicon, obtaining similar results to those ob- tained with the Arabic-to-English translations provided with the collection.
更多
查看译文
关键词
arabic language,statistical analysis,index terms,tracking,programming languages,technical report,english language
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要