Tsetlin Machine in DNA sequence classification

2023 INTERNATIONAL SYMPOSIUM ON THE TSETLIN MACHINE, ISTM(2023)

Abstract
The Tsetlin machine (TM) is a logic-based machine learning model with the crucial advantages of transparency and hardware-friendliness. In a TM, groups of Tsetlin Automata (TAs) produce Boolean expressions in the form of conjunctive clauses (AND-rules). In this work, we show that the DNA nucleotide sequence, coded using the four-letter alphabet A, C, G, T, is a perfect match for the Boolean logic-based TM. A coalesced integer-weighted TM is applied to the problem of prokaryote gene prediction, demonstrating beyond state-of-the-art performance on reference genomes. This paves the way for highly efficient bioinformatic analysis in a wide range of applications.
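To illustrate why a four-letter alphabet maps naturally onto Boolean inputs, the following is a minimal sketch, not the authors' implementation. It assumes a one-hot encoding (four bits per nucleotide), which the abstract does not specify, and hand-writes a single conjunctive clause rather than learning one with Tsetlin Automata. The names `one_hot`, `with_negations`, `clause_output`, and `START_ATG` are hypothetical and chosen only for this example.

```python
from typing import List, Sequence

NUCLEOTIDES = "ACGT"


def one_hot(seq: str) -> List[int]:
    """Encode each nucleotide as four Boolean features (assumed encoding)."""
    bits: List[int] = []
    for base in seq.upper():
        if base not in NUCLEOTIDES:
            raise ValueError(f"unexpected symbol: {base!r}")
        bits.extend(1 if base == n else 0 for n in NUCLEOTIDES)
    return bits


def with_negations(bits: Sequence[int]) -> List[int]:
    """Append the negation of every feature, giving the literal vector
    (features followed by their negations) that clauses select from."""
    return list(bits) + [1 - b for b in bits]


def clause_output(literals: Sequence[int], included: Sequence[int]) -> int:
    """A conjunctive clause is the AND of the literals it includes."""
    return int(all(literals[i] for i in included))


# Hypothetical hand-written clause: position 0 is A AND position 1 is T
# AND position 2 is G. Feature index = 4 * position + index in "ACGT".
START_ATG = [4 * 0 + 0, 4 * 1 + 3, 4 * 2 + 2]

for codon in ("ATG", "GTG", "ATT"):
    literals = with_negations(one_hot(codon))
    print(codon, clause_output(literals, START_ATG))
```

In a trained TM, the set of included literals in each clause is decided by the Tsetlin Automata, and many weighted clauses vote on the class; this sketch only shows the input representation and the evaluation of one AND-rule.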
Keywords
Tsetlin Machine, DNA, nucleotide, gene prediction, prokaryotes