Exploring the Value of Pre-trained Language Models for Clinical Named Entity Recognition.

2023 IEEE International Conference on Big Data (BigData)

Abstract
The practice of fine-tuning Pre-trained Language Models (PLMs) from general or domain-specific data to a specific task with limited resources has gained popularity within the field of natural language processing (NLP). In this work, we revisit this assumption and carry out an investigation in clinical NLP, specifically Named Entity Recognition (NER) on drugs and their related attributes. We compare Transformer models trained from scratch to fine-tuned BERT-based Large Language Models (LLMs), namely BERT, BioBERT, and ClinicalBERT. Furthermore, we examine the impact of an additional Conditional Random Field (CRF) layer on such models to encourage contextual learning. We use n2c2-2018 shared task data for model development and evaluation. The experimental outcomes show that 1) CRF layers improved all language models; 2) under BIO-strict span-level evaluation using macro-average F1 score, although the fine-tuned LLMs achieved 0.83+ scores, the TransformerCRF model trained from scratch achieved 0.78+, demonstrating comparable performance at much lower cost, e.g. with 39.80% fewer training parameters; 3) under BIO-strict span-level evaluation using weighted-average F1 score, ClinicalBERT-CRF, BERT-CRF, and TransformerCRF exhibited smaller score differences, at 97.59%/97.44%/96.84% respectively; 4) efficient training by down-sampling for a better data distribution further reduced the training cost and data requirements while maintaining similar scores, i.e. around 0.02 points lower compared to using the full dataset. The TransformerCRF project is hosted at https://github.com/HECTA-UoM/TransformerCRF
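The abstract reports results under "BIO-strict" span-level evaluation: a predicted entity counts as correct only if its label and its exact token boundaries both match a gold entity. As a minimal sketch (not the authors' evaluation code; function names and the example tag sequences are illustrative), strict span extraction and per-label/macro-average F1 can be computed like this:

```python
from collections import defaultdict

def extract_spans(tags):
    """Extract (label, start, end) spans from a BIO tag sequence.

    Strict mode: a span must open with B-X and may only be continued
    by I-X tags of the same label X; stray I- tags are discarded.
    """
    spans = []
    start, label = None, None
    for i, tag in enumerate(list(tags) + ["O"]):  # sentinel flushes last span
        if tag.startswith("B-"):
            if label is not None:
                spans.append((label, start, i))
            label, start = tag[2:], i
        elif tag.startswith("I-") and label == tag[2:]:
            continue  # span continues with the same label
        else:
            if label is not None:
                spans.append((label, start, i))
            label, start = None, None
    return spans

def span_f1(gold_seqs, pred_seqs):
    """Strict span-level precision/recall/F1 per label, plus macro-average F1."""
    tp, fp, fn = defaultdict(int), defaultdict(int), defaultdict(int)
    for gold, pred in zip(gold_seqs, pred_seqs):
        g, p = set(extract_spans(gold)), set(extract_spans(pred))
        for lb, _, _ in p & g:
            tp[lb] += 1
        for lb, _, _ in p - g:
            fp[lb] += 1
        for lb, _, _ in g - p:
            fn[lb] += 1
    labels = set(tp) | set(fp) | set(fn)
    f1s = {}
    for lb in labels:
        prec = tp[lb] / (tp[lb] + fp[lb]) if tp[lb] + fp[lb] else 0.0
        rec = tp[lb] / (tp[lb] + fn[lb]) if tp[lb] + fn[lb] else 0.0
        f1s[lb] = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
    macro = sum(f1s.values()) / len(f1s) if f1s else 0.0
    return f1s, macro
```

A weighted-average variant, as also reported in the abstract, would weight each label's F1 by its gold span count instead of averaging uniformly, which is why the weighted scores of the three models sit much closer together than the macro scores.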
Keywords
Language Model, Named Entity Recognition, Pre-trained Language Models, Low Cost, F1 Score, Transformer Model, Training Efficiency, Conditional Random Field, Shared Task, Training Dataset, Validation Set, Information Extraction, Information And Communication Technologies, Confusion Matrix, Rating Scores, Text Mining, Specific Labeling, Recall Score, Precision Score, Domain Generalization, Biomedical Domain, Clinical Text, Biomedical Text, Adaptive Layer, Biomedical Data, Clinical Domain, Part Of Event, Label Distribution, Drug Name, Data Pre-processing