Automatic Detection of Hybrid Human-Machine Text Boundaries

semanticscholar(2021)

引用 2|浏览9
暂无评分
摘要
Much effort has been spent in recent years on using machine learning to automatically generate text which passes as having been written by humans. Because of significant advances in this field, language models are able to generate very convincing text. However, a reader with a discerning eye can occasionally distinguish real text from fake. In this project we introduce a novel detection task format: detecting the boundary between human written prompt and machine generated continuation. We train various classifiers to predict not only the true boundary but the boundary as assessed by human annotators. We find that using pre-trained embeddings outperform perplexity and pairwise sentence coherence based methods.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要