The impact of vocabulary normalization
Journal of Software: Evolution and Process (2015)
Abstract
Software development, evolution, and maintenance depend on ever-increasing tool support. Recent tools increasingly analyze the natural language found in source code, predominantly in identifiers and comments. However, when coders combine abbreviations and acronyms to form multi-word identifiers, they in essence invent new vocabulary, making the source code's vocabulary differ from that of other software artifacts. This vocabulary mismatch is a potential problem for many techniques imported from information retrieval and natural language processing, which implicitly assume the use of a single common vocabulary. Vocabulary normalization aims to bring the vocabulary of the source code in line with that of other artifacts.
Keywords
identifier expansion, vocabulary normalization, information retrieval-based tools, case study