Detecting Offensive Content in Open-domain Conversations using Two Stage Semi-supervision
arXiv: Computation and Language, Volume abs/1811.12900, 2018.
As open-ended human-chatbot interaction becomes commonplace, sensitive content detection gains importance. In this work, we propose a two-stage semi-supervised approach to bootstrap large-scale data for automatic sensitive language detection from publicly available web resources. We explore various data selection methods including 1) usin...