Chrome Extension
WeChat Mini Program
Use on ChatGLM

Automating Lexicon Generation: A Comprehensive Review of Alternative Approaches.

Procedia computer science(2023)

Cited 0|Views0
No score
Abstract
Lexicon-based approaches to Document Classification are widely used, but the manual construction of lexicons can be time-consuming and resource-intensive. In this paper, we propose methods for automating the generation of lexicons later used for Document Classification. We explored diverse methods for generating lexicons, including semantic matches, frequency-based approaches, machine learning algorithms, and large language model techniques. We, later, used these lexicons to classify documents based on their content. By comparing our different lexicons results on a same task, based on criteria such as scalability and the F1 score, we determine optimized use-case for those methods. We show that our automated approaches are effective and efficient, producing accurate classifications with minimal human intervention. Some approaches have the potential to streamline the document classification process, reducing the time and resources required for manual lexicon generation, it also gives optimized use-case for the different methods. Thereafter, we discussed the obtained results.
More
Translated text
Key words
lexicon generation,document classification,natural language processing (NLP),machine learning,transformers,semantic,topic modeling
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined