Overview Of A Fpga-Based Overlay Processor

2019 CHINA SEMICONDUCTOR TECHNOLOGY INTERNATIONAL CONFERENCE (CSTIC)(2019)

引用 0|浏览23
暂无评分
摘要
This paper presents the overview of an overlay architecture on FPGA (OPU). It is applicable to general CNN acceleration with software like programmablility and fast compilation time. Good portability is guaranteed as only executable codes are reloaded for applicationswitch without FPGA re-synthesis. Our OPU instructions have complicated functions with variable runtimes but have a uniform length of 32 bits. This large granularity of instructions makes it easier to develop compiler and micro-architecture. Thus, OPU is a good candidate for domain specific processor without large volume. Experiments results show that OPU can achieve around 90 percent of runtime computing resource utilization (PE efficiency) among eight different network applications, which is 2% - 48% better compared with existing customized accelerators.
更多
查看译文
关键词
overlay architecture,compilation time,executable codes,OPU instructions,microarchitecture,domain specific processor,runtime computing resource utilization,FPGA-based overlay processor,network applications,compiler architecture,CNN acceleration
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要