基本信息
views: 341
Career Trajectory
Bio
Prof. Woodland’s research is in the area on speech and language technology with a major focus on developing all aspects of speech recognition systems.
His group has developed a number of techniques in that have been widely used in large vocabulary systems including standard methods for transform-based adaptation and discriminative sequence training. He has worked on the use of deep neural networks for both acoustic models and language models. His current work has a focus on the use and development of end-to-end trainable neural network systems. One area of interest is developing flexible systems that can adapt to a wide range of speakers, acoustic conditions, speaking style, language, task etc., with relatively limited training resources. This includes work on unsupervised training, active learning and self-supervised learning, the use of speech and text data for adapting models, as well as contextual speech recognition for biasing neural models. He is also interested in areas including speaker diarisation (who spoke when), emotion recognition from speech data, processing highly overlapped data, multimodal data (speech and video), optimisation techniques large for large sequence-to-sequence models models and confidence estimation.
He is well known for his work on the HTK large vocabulary speech recognition systems.
He has also worked on audio indexing, machine translation from speech, keyword spotting, auditory modelling and speech synthesis.
His group has developed a number of techniques in that have been widely used in large vocabulary systems including standard methods for transform-based adaptation and discriminative sequence training. He has worked on the use of deep neural networks for both acoustic models and language models. His current work has a focus on the use and development of end-to-end trainable neural network systems. One area of interest is developing flexible systems that can adapt to a wide range of speakers, acoustic conditions, speaking style, language, task etc., with relatively limited training resources. This includes work on unsupervised training, active learning and self-supervised learning, the use of speech and text data for adapting models, as well as contextual speech recognition for biasing neural models. He is also interested in areas including speaker diarisation (who spoke when), emotion recognition from speech data, processing highly overlapped data, multimodal data (speech and video), optimisation techniques large for large sequence-to-sequence models models and confidence estimation.
He is well known for his work on the HTK large vocabulary speech recognition systems.
He has also worked on audio indexing, machine translation from speech, keyword spotting, auditory modelling and speech synthesis.
Research Interests
Papers共 324 篇Author StatisticsCo-AuthorSimilar Experts
By YearBy Citation主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
IEEE International Conference on Acoustics, Speech, and Signal Processingpp.11836-11840, (2024)
Annual Meeting of the Association for Computational Linguisticspp.2078-2093, (2024)
Cited0Views0EIBibtex
0
0
CoRR (2024)
Cited0Views0EIBibtex
0
0
CoRR (2024)
Cited0Views0EIBibtex
0
0
Interspeech 2024pp.717-721, (2024)
CoRR (2024): 260-265
CoRR (2024)
Cited0Views0EIBibtex
0
0
Speech and Hearing (2024)
Annual Meeting of the Association for Computational Linguisticspp.8235-8251, (2024)
Cited0Views0EIBibtex
0
0
Load More
Author Statistics
#Papers: 334
#Citation: 19915
H-Index: 71
G-Index: 130
Sociability: 6
Diversity: 2
Activity: 26
Co-Author
Co-Institution
D-Core
- 合作者
- 学生
- 导师
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn