Chrome Extension
WeChat Mini Program
Use on ChatGLM

FPGA Based CNN Accelerator for High-Speed Biomedical Application

HIGH-SPEED BIOMEDICAL IMAGING AND SPECTROSCOPY VIII(2023)

Cited 0|Views1
No score
Abstract
The rise in visual dataset generation has necessitated the recent advancements in the field of Deep Neural Networks (DNNs). Application domains like biomedical imaging require a high level of precision which is suitably achieved using convolutional neural networks (CNNs) at the expense of increased computation, hardware, and power resources. The implementation of such CNN architectures is constrained by currently available resource limited embedded and application-specific integrated circuit (ASIC) systems. In this work, a field-programmable gate array ( FPGA) based hardware accelerator having a generalized architecture for convolution and fully connected (FC) layers has been presented that exploits a massive level of intra-layer parallelism. Compute intensive convolution layers are replaced by depthwise separable (DS) convolution layers that reduced the number of computations and memory access by 7.8x and 10x respectively for VGG8 network after detailed design space exploration. Furthermore, parallel computation of arithmetic tasks reduced the compute bound for the proposed architecture. Reduced precision data type for both input and weights resulted in overall reduction in latency and resource utilization. FPGA implementation results of the proposed CNN accelerator for classifiers trained on subsets of MedMNIST dataset depict a balance between high performance of 214.5 GOP/s for DS convolution layer and low resource utilization.
More
Translated text
Key words
Depthwise separable convolution,CNN accelerator,design space exploration,field programmable gate array (FPGA)
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined