Multilingual Simultaneous Sentence End and Punctuation Prediction (short paper).

Ricardo Rei,Fernando Batista,Nuno Miguel Guerreiro,Luísa Coheur

SwissText（2021）

引用 0|浏览6

暂无评分

摘要

This paper describes the model and its corresponding setup, proposed by the Unbabel & INESC-ID team for the 1st Shared Task on Sentence End and Punctuation Prediction in NLG Text (SEPP-NLG 2021). The shared task covers 4 languages (English, German, French and Italian) and includes two subtasks: subtask 1 – detecting the end of a sentence, and subtask 2 – predicting a range of punctuation marks. Our team proposes a single multilingual and multitask model that is able to produce suitable results for all the languages and subtasks involved. The results show that it is possible to achieve state-of-the-art results using one single multilingual model for both tasks and multiple languages. Using a single multilingual model to solve the task for multiple languages is of particular importance, since training a different model for each language is a cumbersome and time-consuming process. Finally, the code for the shared task is publicly available for reproducible purposes at https://github.com/Unbabel/ caption/tree/shared-task.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要