谷歌浏览器插件
订阅小程序
在清言上使用

Unsupervised Model for Detecting Plagiarism in Internet-based Handwritten Arabic Documents

Journal of organizational and end user computing(2020)

引用 39|浏览7
暂无评分
摘要
Due to the rapid increase of internet-based data, there is urgent need for a robust intelligent documents security mechanism. Although there are many attempts to build a plagiarism detection system in natural language documents, the unlimited variation and different writing styles of each character in Arabic documents make building such systems challenging. Based on its position in a word, the same Arabic letter can be written three different ways, which makes the handwritten character recognition a cumbersome process. This article proposes an intelligent unsupervised model to detect plagiarism in these documents called ASTAP. First, a handwritten Arabic character recognition system is proposed using the Grey Wolf Optimization (GWO) algorithm. Then, a modified Abstract Syntax Tree (AST) is used to match the contents of the Arabic documents to detect any similarity. Compared to the state-of-the-art methods, ASTAP improves the effectiveness of the plagiarism detection in terms of the matched similarity ratio, the precision ratio, and the processing time.
更多
查看译文
关键词
Abstract Syntax Tree,Gray Wolf Optimization,Handwritten Character Recognition,Hash value,Internet Data Security,Plagiarism Detection,Similarity Index,Unsupervised Documents Analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要