BERT-based Multi-Task Model for Country and Province Level MSA and Dialectal Arabic Identification

Abdellah El Mekki, Abdelkader El Mahdaouy, Kabil Essefar, Nabil El Mamoun, Ismail Berrada, Ahmed Khoumsi

Workshop on Arabic Natural Language Processing (WANLP), 2021

Abstract

Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) model that tackles both country-level and province-level MSA/DA identification. The MTL model consists of a shared Bidirectional Encoder Representations from Transformers (BERT) encoder, two task-specific attention layers, and two classifiers. Our key idea is to leverage both the task-discriminative and the inter-task shared features for country-level and province-level MSA/DA identification. The obtained results show that our MTL model outperforms single-task models on most subtasks.
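
The architecture described in the abstract (one shared encoder, two task-specific attention layers, two classifiers) can be illustrated with a minimal, dependency-free sketch. Everything below is an assumption made for illustration and is not the authors' implementation: `fake_shared_encoder` is a hypothetical stand-in for BERT that returns random token vectors, the attention layer is a simple dot-product pooling with one learned query per task, the label counts (3 countries, 5 provinces) are placeholders, and the joint loss is assumed to be an unweighted sum of the two cross-entropy terms.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(token_vecs, query):
    """Task-specific attention: weight each token by its dot product
    with a per-task query vector, then return the weighted average."""
    scores = [sum(h * q for h, q in zip(vec, query)) for vec in token_vecs]
    weights = softmax(scores)
    dim = len(token_vecs[0])
    return [sum(w * vec[i] for w, vec in zip(weights, token_vecs))
            for i in range(dim)]


def linear(vec, weight, bias):
    """Dense layer: one output logit per row of `weight`."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(weight, bias)]


def fake_shared_encoder(tokens, hidden=8, seed=1):
    """Hypothetical stand-in for the shared BERT encoder: one random
    vector per token. A real system would run a pretrained model here."""
    rng = random.Random(seed)
    return [[rng.uniform(-1, 1) for _ in range(hidden)] for _ in tokens]


class MultiTaskHead:
    """Two attention layers and two classifiers on top of shared features."""

    TASKS = ("country", "province")

    def __init__(self, hidden, n_countries, n_provinces, seed=0):
        rng = random.Random(seed)
        sizes = {"country": n_countries, "province": n_provinces}

        def rand_matrix(rows, cols):
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                    for _ in range(rows)]

        # One attention query and one classifier per task (assumed shapes).
        self.queries = {t: rand_matrix(1, hidden)[0] for t in self.TASKS}
        self.weights = {t: rand_matrix(sizes[t], hidden) for t in self.TASKS}
        self.biases = {t: [0.0] * sizes[t] for t in self.TASKS}

    def forward(self, token_vecs):
        """Return a probability distribution per task for one sentence."""
        out = {}
        for task in self.TASKS:
            pooled = attention_pool(token_vecs, self.queries[task])
            logits = linear(pooled, self.weights[task], self.biases[task])
            out[task] = softmax(logits)
        return out


def joint_loss(probs, gold):
    """Assumed objective: unweighted sum of per-task cross-entropies."""
    return -sum(math.log(probs[task][gold[task]]) for task in probs)


if __name__ == "__main__":
    features = fake_shared_encoder(["tok1", "tok2", "tok3"])
    model = MultiTaskHead(hidden=8, n_countries=3, n_provinces=5)
    predictions = model.forward(features)
    print(predictions["country"])
    print(joint_loss(predictions, {"country": 0, "province": 2}))
```

Because both heads read the same encoder output, gradients from the country and province losses would both update the shared encoder during training, while each attention layer and classifier stays specific to its task. That is one plausible reading of how the model combines inter-task shared features with task-discriminative ones.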
