Machine learning detection of SARS-CoV-2 high-risk variants

bioRxiv (Cold Spring Harbor Laboratory), 2023

Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has evolved many high-risk variants, resulting in repeated waves of the COVID-19 pandemic over the past years. Accurate early warning of high-risk variants is therefore vital for epidemic prevention and control. Here we construct a machine learning model that predicts high-risk variants of SARS-CoV-2 using the LightGBM algorithm, based on several important haplotype network features. As demonstrated on a series of retrospective testing datasets, our model accurately predicts all variants of concern (VOCs) and most variants of interest (AUC = 0.96). Prediction based on the latest sequences shows that the newly emerging lineage BA.5 has the highest risk score and is spreading rapidly to become a major epidemic lineage in multiple countries, suggesting that BA.5 has great potential to become a VOC. In sum, our machine learning model can predict high-risk variants soon after their emergence, thus greatly improving public health preparedness against the evolving virus.

Competing Interest Statement
The authors have declared no competing interest.
Keywords
machine learning, detection, SARS-CoV, high-risk