Implementation of Yolo-v2 Image Recognition and Other Testbenches for a CNN Accelerator

Chan Kim, Kim Hyun Mi, Chun-Gi Lyuh, Yong Cheol Peter Cho, Jeongmin Yang, Jaehoon Chung, Shin Kyoung Seon, Han Jin Ho, Choi Min-Seok, In San Jeon, Kwon Young-Su

2019 IEEE 9th International Conference on Consumer Electronics (ICCE-Berlin) (2019)


Abstract

An implementation of Yolo-v2 image recognition and other testbenches for a deep learning accelerator is presented. This chip is the initial version of our ongoing effort to develop a higher-performance accelerator. The accelerator is based on a systolic array and can handle convolution and max-pooling layers either combined or separately, using 16-bit floating-point data. It also supports inner-product and LSTM layers. For demonstration, and as one of the design verification testbenches, we implemented Yolo-v2 image recognition for 80 object classes. We converted the Yolo-v2 software to a 16-bit floating-point version and used it in simulation and in FPGA experiments during chip development. Several other testbenches were designed and used to test various networks. This paper explains the accelerator's architecture and the Yolo-v2 implementation on our FPGA board, which includes a camera and an LCD, together with the software implementation. Analyses of the measured and estimated performance are also presented.
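To make the "convolution and max-pooling in a combined way using 16-bit floating-point data" idea concrete, the following is a minimal software sketch, not the accelerator's actual datapath. The function names (`to_fp16`, `conv2d_fp16`, `maxpool2x2`, `conv_then_pool`), the rounding after every multiply and every add, and the 2x2 stride-2 pooling window are all assumptions made for illustration; the abstract does not specify the rounding points, window sizes, or the systolic-array dataflow.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value.

    Uses the standard-library struct format 'e' (binary16). Values too
    large for half precision raise OverflowError.
    """
    return struct.unpack("<e", struct.pack("<e", x))[0]


def conv2d_fp16(image, kernel):
    """Valid (no padding, stride 1) 2-D convolution on nested lists.

    Assumption for illustration: the result of every multiply and every
    accumulate is rounded to fp16.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            acc = 0.0
            for i in range(kh):
                for j in range(kw):
                    prod = to_fp16(to_fp16(image[r + i][c + j]) * to_fp16(kernel[i][j]))
                    acc = to_fp16(acc + prod)
            row.append(acc)
        out.append(row)
    return out


def maxpool2x2(fmap):
    """2x2 max-pooling with stride 2; an odd trailing row or column is dropped."""
    return [
        [
            max(fmap[r][c], fmap[r][c + 1], fmap[r + 1][c], fmap[r + 1][c + 1])
            for c in range(0, len(fmap[0]) - 1, 2)
        ]
        for r in range(0, len(fmap) - 1, 2)
    ]


def conv_then_pool(image, kernel):
    """Convolution followed directly by max-pooling (the 'combined' case)."""
    return maxpool2x2(conv2d_fp16(image, kernel))


if __name__ == "__main__":
    # 5x5 ramp image and a 2x2 all-ones kernel, both hypothetical test data.
    image = [[float(5 * r + c) for c in range(5)] for r in range(5)]
    kernel = [[1.0, 1.0], [1.0, 1.0]]
    print(conv_then_pool(image, kernel))
```

Calling `conv2d_fp16` alone or `maxpool2x2` alone corresponds to running the two layer types separately. A reference model of this kind is one plausible way to compare a 16-bit software version against hardware simulation output, although the abstract does not describe how the authors' converted Yolo-v2 software is structured.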


Keywords

deep learning, CNN, accelerator, systolic array
