Principles for Predicting RNA Secondary Structure Design Difficulty

Jeff Anderson-Lee, Eli Fisker, Vineet Kosaraju, Michelle Wu, Justin Kong, Jeehyung Lee, Minjae Lee, Mathew Zada, Adrien Treuille, Rhiju Das

Journal of Molecular Biology (2016)

Abstract
Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess “designability” of single RNA structures, as well as of switches for in vitro and in vivo applications.
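Two of the features named in the abstract, sequence length and mean stem length, can be read directly off a target structure. The sketch below is illustrative only and is not taken from the paper: it assumes the target is given in dot-bracket notation without pseudoknots, and it assumes a stem is a run of directly stacked base pairs (so a bulge or internal loop starts a new stem). The function names `parse_pairs`, `stem_lengths` and `simple_features` are hypothetical, and the paper's own feature definitions may differ.

```python
def parse_pairs(structure):
    """Return sorted (i, j) base pairs from a dot-bracket string."""
    stack, pairs = [], []
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure: extra ')'")
            pairs.append((stack.pop(), i))
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r}")
    if stack:
        raise ValueError("unbalanced structure: extra '('")
    return sorted(pairs)


def stem_lengths(structure):
    """Lengths of stems, where a stem is a run of directly stacked pairs.

    Assumption: pairs (i, j) and (i + 1, j - 1) belong to the same stem;
    any bulge or internal loop ends the stem.
    """
    pairs = parse_pairs(structure)
    pair_set = set(pairs)
    lengths = []
    for i, j in pairs:
        if (i - 1, j + 1) in pair_set:
            continue  # not the outermost pair of its stem
        n = 1
        while (i + n, j - n) in pair_set:
            n += 1
        lengths.append(n)
    return lengths


def simple_features(structure):
    """Sequence length, stem count and mean stem length of a target."""
    stems = stem_lengths(structure)
    mean = sum(stems) / len(stems) if stems else 0.0
    return {
        "length": len(structure),
        "num_stems": len(stems),
        "mean_stem_length": mean,
    }


if __name__ == "__main__":
    print(simple_features("((..((...))..))"))
```

Under these assumptions, a long target with a low mean stem length would be flagged as a candidate for a hard design problem; symmetry and motifs such as zigzags would need separate detection logic that is not sketched here.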
Keywords
RNA design, RNA secondary structure, inverse folding, benchmark, citizen science