GUJSTER: A rule based stemmer using dictionary approach

2017 International Conference on Inventive Communication and Computational Technologies (ICICCT)（2017）

引用 6|浏览2

暂无评分

摘要

Human being normally uses natural language in form of written or spoken in their daily life. It is an entirely concept based on Artificial Intelligence, Computer Science and Linguistic resources. In Indian languages, Gujarati is an Indo — Aryan language with rich and high inflection. In these languages, several words have common root that's need to be reduced by using stemmer. Stemmer is used to reduce a word to its root without understanding of semantic meaning. In Natural Language Processing, it is initial step to identify the root or base of the word. A word can be simple or compound. For example, the word ‘***’ is simple word because it can't be decomposed. While, the word ‘***’ is a compound word, because the word is made up of two parts: root ‘***’ and suffix ‘***’. We have developed an algorithm for reducing affixes from the Gujarati words and get optimal output as compare to the previous hybrid algorithm. The GUJSTER (GUJarati STEmmeR) also check the stem word into online Gujarati Dictionary.

查看译文

关键词

Stemming,Morphology,Rule based Light Stemmer

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要