Extracting Noun Phrases from Large-Scale Texts: A Hybrid Approach and Its Automatic Evaluation.

Meeting of the Association for Computational Linguistics (1994)

Abstract
Acquiring noun phrases from running text is useful for many applications, such as word grouping and terminology indexing. Previously reported work adopts either a purely probabilistic approach or a purely rule-based noun phrase grammar to tackle this problem. In this paper, we apply a probabilistic chunker to decide the implicit boundaries of constituents and use linguistic knowledge to extract the noun phrases with a finite-state mechanism. The test texts are taken from the SUSANNE Corpus, and the results are evaluated automatically by comparing them against the parse field of the SUSANNE Corpus. The results of this preliminary experiment are encouraging.
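
To make the second stage of the hybrid approach more concrete, the following is a minimal sketch of noun phrase extraction with a finite-state mechanism over chunks that have already been delimited. It is an illustration under stated assumptions, not the grammar used in the paper: the simplified tags (DT, JJ, NN), the pattern "optional determiner, any adjectives, one or more nouns", and the names TRANSITIONS and extract_noun_phrases are all hypothetical, and the chunk boundaries are taken as given input rather than computed by a probabilistic chunker.

```python
from typing import Dict, List, Tuple

Token = Tuple[str, str]  # (word, simplified POS tag); the tag set is an assumption

# Hypothetical finite-state transitions for the pattern DT? JJ* NN+.
# "NOUN" is the only accepting state.
TRANSITIONS: Dict[Tuple[str, str], str] = {
    ("START", "DT"): "DET",
    ("START", "JJ"): "ADJ",
    ("START", "NN"): "NOUN",
    ("DET", "JJ"): "ADJ",
    ("DET", "NN"): "NOUN",
    ("ADJ", "JJ"): "ADJ",
    ("ADJ", "NN"): "NOUN",
    ("NOUN", "NN"): "NOUN",
}


def extract_noun_phrases(chunk: List[Token]) -> List[str]:
    """Return the noun phrases found in one chunk by running the automaton."""
    phrases: List[str] = []
    state = "START"
    words: List[str] = []
    for word, tag in chunk:
        next_state = TRANSITIONS.get((state, tag))
        if next_state is None:
            # No transition: emit the phrase if we were in the accepting state,
            # then restart the automaton and retry the current token.
            if state == "NOUN":
                phrases.append(" ".join(words))
            state, words = "START", []
            next_state = TRANSITIONS.get((state, tag))
            if next_state is None:
                continue  # token cannot start a noun phrase; skip it
        state = next_state
        words.append(word)
    if state == "NOUN":
        phrases.append(" ".join(words))
    return phrases


# Chunk boundaries are assumed to come from an earlier chunking step.
chunks = [
    [("the", "DT"), ("probabilistic", "JJ"), ("chunker", "NN")],
    [("decides", "VB")],
    [("implicit", "JJ"), ("boundaries", "NN")],
]
noun_phrases = [extract_noun_phrases(c) for c in chunks]
```

Keeping the automaton as a transition table separates the linguistic knowledge (the table) from the scanning loop, so a richer pattern could be tried by editing the table alone.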
Keywords
automatic evaluation, preliminary experiment, hybrid approach, finite state mechanism, linguistic knowledge, parse field, SUSANNE Corpus, probabilistic chunker, noun phrase, implicit boundary, pure probabilistic approach, large-scale text, pure rule-based noun phrase, rule based, indexation