Indexing and Querying Linguistic Metadata and Document Content

Niraj Aswani, Valentin Tablan, Kalina Bontcheva, Hamish Cunningham

Recent Advances in Natural Language Processing IV, Current Issues in Linguistic Theory (2008)

Abstract

The need for efficient corpus indexing and querying arises frequently in both machine learning-based and human-engineered natural language processing systems. This paper presents the ANNIC system, which can index documents not only by content, but also by their linguistic annotations and features. It also enables users to formulate versatile queries mixing keywords and linguistic information. The result consists of the matching texts in the corpus, displayed within the context of linguistic annotations (not just text, as is customary for KWIC systems). The data is displayed in a graphical user interface, which facilitates its exploration and the discovery of new patterns, which can in turn be tested by launching new ANNIC queries.
