Articulatory Information and Multiview Features for Large Vocabulary Continuous Speech Recognition
ICASSP, pp. 5634-5638, 2018.
This paper explores the use of multi-view features and their discriminative transforms in a convolutional deep neural network (CNN) architecture for a continuous large vocabulary speech recognition task. Mel-filterbank energies and perceptually motivated forced damped oscillator coefficient (DOC) features are used after feature-space maxi...