RGB-D Scene Labeling with Multimodal Recurrent Neural Networks
CVPR Workshops, pp. 203-211, 2017.
Recurrent neural networks (RNNs) are able to capture context in an image by modeling long-range semantic dependencies among image units. However, existing methods only utilize RNNs to model dependencies of a single modality (e.g., RGB) for labeling. In this work we extend this single-modal RNNs to multimodal RNNs (MM-RNNs) and apply it to...More
PPT (Upload PPT)