A Survey on the Real Power of ChatGPT
arxiv(2024)
摘要
ChatGPT has changed the AI community and an active research line is the
performance evaluation of ChatGPT. A key challenge for the evaluation is that
ChatGPT is still closed-source and traditional benchmark datasets may have been
used by ChatGPT as the training data. In this paper, (i) we survey recent
studies which uncover the real performance levels of ChatGPT in seven
categories of NLP tasks, (ii) review the social implications and safety issues
of ChatGPT, and (iii) emphasize key challenges and opportunities for its
evaluation. We hope our survey can shed some light on its blackbox manner, so
that researchers are not misleaded by its surface generation.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要