Classification of DNA Sequences: Performance Evaluation of Multiple Machine Learning Methods
2022 IEEE 22nd International Conference on Nanotechnology (NANO)(2022)
摘要
Polymerase chain reaction (PCR) has long been the mainstay in genetic sequencing and identification. Irrespective of whether short read or long read technologies are adopted, PCR methods are generally time consuming and expensive. Recently, an all-electronic approach, the so-called Single Molecule Break Junction (SMBJ) method, has been proposed as a possible alternative to PCR. In this article, we evaluate the performance of four different classifier models on the current signatures of ten short strand sequences, including a pair that differs by a single mismatch. We find that a gradient boosted tree classifier model achieves impressive accuracies, ranging from approximately 96% for molecules differing by a single mismatch to 99.5% otherwise.
更多查看译文
关键词
DNA sequences,performance evaluation,multiple machine learning methods,polymerase chain reaction,mainstay,genetic sequencing,irrespective,short read,PCR methods,all-electronic approach,Single Molecule Break Junction method,SMBJ,possible alternative,different classifier models,current signatures,short strand sequences,differs,single mismatch,gradient boosted tree classifier model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要