Complying with privacy regulations such as GDPR requires organizations to find and classify sensitive and personal data in their datastores. First, data discovery tools are applied to identify the data. Then, data classification tools are applied to the discovered data. Organizations must classify the data into concrete categories in order to manage it appropriately. In this paper we focus on multi-value classification, where the classifier assigns a single category to a set of values that all belong to the same category. Traditional classifiers usually apply single-value classification methods to a multi-value data set. However, in many cases this results in an incorrect classification, for example when domain categories overlap. We address this scenario and provide two methods to overcome the problem.
