Efficient ConvNet for real-time semantic segmentation

Eduardo Romera,José M. Álvarez,Luis M. Bergasa,Roberto Arroyo

2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017)（2017）

引用 97|浏览48

暂无评分

摘要

Semantic segmentation is a task that covers most of the perception needs of intelligent vehicles in an unified way. ConvNets excel at this task, as they can be trained end-to-end to accurately classify multiple object categories in an image at the pixel level. However, current approaches normally involve complex architectures that are expensive in terms of computational resources and are not feasible for ITS applications. In this paper, we propose a deep architecture that is able to run in real-time while providing accurate semantic segmentation. The core of our ConvNet is a novel layer that uses residual connections and factorized convolutions in order to remain highly efficient while still retaining remarkable performance. Our network is able to run at 83 FPS in a single Titan X, and at more than 7 FPS in a Jetson TX1 (embedded GPU). A comprehensive set of experiments demonstrates that our system, trained from scratch on the challenging Cityscapes dataset, achieves a classification performance that is among the state of the art, while being orders of magnitude faster to compute than other architectures that achieve top precision. This makes our model an ideal approach for scene understanding in intelligent vehicles applications.

查看译文

关键词

ConvNet,real-time semantic segmentation,pixel level,complex architectures,computational resources,ITS applications,residual connections,factorized convolutions,FPS,Titan X,Jetson TX1,embedded GPU,Cityscapes dataset,multiple object categories classification performance,intelligent vehicle applications

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要