DILIC: An AI based classifier to search for Drug-Induced Liver Injury literature

Sanjay Rathee,Meabh MacMahon,Anika Liu,Nicholas Katritsis,Gehad Youssef,Woochang Hwang,Lilly Wollman,Namshik Han

bioRxiv（2022）

引用 0|浏览9

暂无评分

摘要

Drug-Induced Liver Injury (DILI) is a class of Adverse Drug Reactions (ADR) which causes problems in both clinical and research settings. It is the most frequent cause of acute liver failure in the majority of western countries and is a major cause of attrition of novel drug candidates. Manual trawling of literature for is the main route of deriving information on DILI from research studies. This makes it an inefficient process prone to human error. Therefore, an automatized AI model capable of retrieving DILI-related papers from the huge ocean of literature could be invaluable for the drug discovery community. In this project, we built an artificial intelligence (AI) model combining the power of Natural Language Processing (NLP) and Machine Learning (ML) to address this problem. This model uses NLP to filter out meaningless text (e.g. stopwords) and uses customized functions to extract relevant keywords as singleton, pair, triplet and so on. These keywords are processed by apriori pattern mining algorithm to extract relevant patterns which are used to estimate initial weightings for a ML classifier. Along with pattern importance and frequency, an FDA-approved drug list mentioning DILI adds extra confidence in classification. The combined power of these methods build a DILI classifier (DILIC) with 94.91% cross-validation and 94.14% external validation accuracy. To make DILIC as accessible as possible, including to researchers without coding experience, an R Shiny App capable of classifing single or multiple entries for DILI is developed to enhance ease of user experience and made available at https://researchmind.co.uk/diliclassifier/).

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要