Divide-And-Warp Temporal Alignment Of Speech Signals Between Speakers: Validation Using Articulatory Data

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)(2017)

引用 22|浏览9
暂无评分
摘要
Meaningful comparisons between sets of speech-induced, dynamically evolving articulatory measurements require that the data be temporally aligned in a manner invariant to speech rate discrepancies. The best known approach to this problem is to apply dynamic time warping (DTW) to the corresponding audio signals. While the usefulness of DTW methods is well established in automatic speech recognition, they were never directly and quantitatively validated as a way of aligning and comparing signals from different speakers in a way that is useful for the study of speech as a biological process. This paper provides the first direct quantitative validation of such an audio-based temporal alignment algorithm, itself based on a new divide-and-warp strategy, using speaker invariant temporal landmarks from articulatory data. Results demonstrate that the proposed temporal alignment algorithm accurately brings these landmarks into correspondence between speakers (mean absolute delay of similar to 35ms).
更多
查看译文
关键词
Temporal alignment, dynamic time warping, between-speaker comparisons, validation, articulatory measurements
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要