Visual Comparison Of Speaker Groups

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5(2015)

引用 23|浏览28
暂无评分
摘要
We describe a generic tool for visualising differences between two groups of speakers who produce a given word sequence. We do this by first time-aligning all recordings and then aggregating time-varying information within each group. By that, we can display prototypical loudness and tempo contours, and also spectrograms, together with information on variability and group effect size over time. An optional user-supplied segmentation (just needed for one of the recordings) can be used to relate local differences to individual phonemes. The system is validated with a group of speakers with Parkinson's disease and an age-matched control group. It will be provided as an open source software package to the community.
更多
查看译文
关键词
paralinguistics, atypical speech, pathological speech, visualization, interpretation, acoustic features
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要