Token-based Code Clone Detection Technique in a Student's Programming Exercise

Broadband, Wireless Computing, Communication and Applications(2012)

引用 4|浏览1
暂无评分
摘要
The acts to submit the copied programs of other person make problems in the subject of the programming exercise in university curriculum. Teachers could not make accurate scores and evaluate the reached learning level of students. The code clone detection technique is to automatically detect the copied programs. Researches of the code clone detection technique have been proposed. The object of these researches, however, focused on the source code of industrial field. There are some problems to detect illicit copied codes of reports made by students. In this research, we developed the code clone detection algorithm focusing on the detection of illicit copied codes of submitted reports of students in a programming exercise. Our proposed algorithm is based on the comparison of tokens and can declare the illicit copied codes invalid. The features of illicit copied codes such as swapping the functions and program lines, renaming variable names, changing digits, comments and string constants and changing source codes using formatting tools are detected. We implemented the proposed algorithm and experimented to evaluate our system for the submitted subjects of 119 students. Compared to the human detection for small size of source codes of students in a programming exercise, our system found 32 codes as the illicit copy in 36 illicit copied codes among 14,042 combination detective rules with the threshold which are realized the recall=0.8. The miss detection finding as the copied code was 72 codes with precision=0.302.
更多
查看译文
关键词
combination detective rule,code clone detection technique,detection finding,programming exercise,token-based code clone detection,source code,illicit copy,accurate score,human detection,proposed algorithm,code clone detection algorithm,computer science education
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要