Scheduling configuration memory error checks to improve the reliability of FPGA‐based systems

IET Computers & Digital Techniques(2018)

引用 1|浏览6
暂无评分
摘要
Field-programmable gate arrays are susceptible to radiation-induced single event upsets. These are commonly dealt with using triple modular redundancy (TMR) and module-based configuration memory error recovery (MER). By triplicating components and voting on their outputs, TMR helps localise configuration memory errors, and by reconfiguring faulty components, MER swiftly corrects them. However, the order in which TMR voters are checked inevitably impacts the overall system reliability. In this study, the authors outline an approach for computing the reliability of TMR-MER systems that consist of finitely many components. They demonstrate that system reliability is improved when the more vulnerable components are checked more frequently than when they are checked in round-robin order. They propose a genetic algorithm for finding a voter checking schedule that maximises the reliability of TMR-MER systems. Results indicate that the mean time to failure (MTTF) of these systems can be increased by up to 400% when variable-rate voter checking (VRVC) is used instead of round robin. They show that VRVC achieves 15-23% increase in MTTF with a 10x reduction in checking frequency to reduce system power. They also found that VRVC detects errors 44% faster on average than round robin.
更多
查看译文
关键词
fault tolerant computing, field programmable gate arrays, genetic algorithms, reconfigurable architectures, radiation hardening (electronics), scheduling configuration memory error checks, FPGA-based systems, field-programmable gate arrays, radiation-induced single event upsets, triple modular redundancy, configuration memory error recovery, localise configuration memory errors, faulty components, TMR voters, system reliability, TMR-MER systems, vulnerable components, voter checking schedule, variable-rate voter checking, system power
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要