Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies
LREC, pp. 5240-5250, 2020.
The paper presents a discussion on the main linguistic phenomena of user-generated texts found in web and social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework. Given on the one hand the increasing number of treebanks featuring user-generated content, and its somewha...More
Full Text (Upload PDF)
PPT (Upload PPT)