Knowledge-based multilingual document analysis

SEMANET '02: Proceedings of the 2002 Workshop on Building and Using Semantic Networks - Volume 11 (2002)

Abstract
The growing availability of multilingual resources such as EuroWordnet has recently inspired the development of large-scale linguistic technologies, e.g. multilingual IE and Q&A, that were considered infeasible until a few years ago. This paper presents a system for the categorisation and automatic authoring of news streams in different languages. The system adopts a knowledge-based approach to Information Extraction as a support for hyperlinking. Authoring across documents in different languages is triggered by Named Entity and event recognition. Events in texts are matched by discourse processing driven by a large-scale world model. This kind of multilingual analysis relies on a lexical knowledge base of nouns (i.e. the EuroWordnet Base Concepts) shared among the English, Spanish and Italian lexicons. The impact of the design choices on language independence, and the possibilities they open for automatic learning of the event hierarchy, are discussed.
Keywords
automatic learning, EuroWordnet Base Concepts, multilingual IE, multilingual resource, large-scale linguistic technology, event recognition, knowledge-based multilingual document analysis, automatic authoring, different language, multilingual analysis, event hierarchy, information extraction, noun, knowledge base