谷歌浏览器插件
订阅小程序
在清言上使用

PhenoBERT: A Combined Deep Learning Method for Automated Recognition of Human Phenotype Ontology

IEEE/ACM Transactions on Computational Biology and Bioinformatics(2023)

引用 4|浏览8
暂无评分
摘要
Automated recognition of Human Phenotype Ontology (HPO) terms fromclinical texts is of significant interest to the field of clinical data mining. In this study, we develop a combined deep learning method named PhenoBERT for this purpose. PhenoBERTuses BERT, currently the state-of-the-art NLP model, as its core model for evaluating whether a clinically relevant text segment (CTS) could be represented by an HPO term. However, to avoid unnecessary comparison of a CTS with each of similar to 14,000 HPOterms using BERT, we introduce a two-levels CNN module consisting of a series of CNN models organized at two levels in PhenoBERT. For a given CTS, the CNN module produces only a short list of candidate HPO terms for BERT to evaluate, significantly improving the computational efficiency. In addition, BERT is able to assign an ancestor HPOtermto a CTS when recognition of the direct HPO termis not successful, mimicking the process of HPO termassignment by human. In two benchmarks, PhenoBERToutperforms four traditional dictionary-based methods and two recently developed deep learning-based methods in two benchmark tests, and its advantage is more obvious when the recognition task is more challenging. As such, PhenoBERT is of great use for assisting in the mining of clinical text data.
更多
查看译文
关键词
BERT,biomedical ontologies,concept recognition,deep learning,human phenotype ontology,medical text mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要