Deep Convolutional Neural Networks For Acoustic Modeling In Low Resource Languages

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2015)

引用 41|浏览81
暂无评分
摘要
Convolutional Neural Networks (CNNs) have demonstrated powerful acoustic modelling capabilities due to their ability to account for structural locality in the feature space; and in recent works CNNs have been shown to often outperform fully connected Deep Neural Networks (DNNs) on TIMIT and LVCSR. In this paper, we perform a detailed empirical study of CNNs under the low resource condition, wherein we only have 10 hours of training data. We find a two dimensional convolutional structure performs the best, and emphasize the importance to consider time and spectrum in modelling acoustic patterns. We report detailed error rates across a wide variety of model structures and show CNNs consistently outperform fully connected DNNs for this task.
更多
查看译文
关键词
Deep Neural Networks,Convolutional Neural Networks,Automatic Speech Recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要