谷歌浏览器插件
订阅小程序
在清言上使用

Whisper-Based Transfer Learning for Alzheimer Disease Classification: Leveraging Speech Segments with Full Transcripts as Prompts.

IEEE International Conference on Acoustics, Speech, and Signal Processing(2024)

引用 0|浏览7
暂无评分
摘要
Alzheimer’s disease (AD) is a neurodegenerative disorder that can lead to speech impairments. Early diagnosis is crucial for effective treatment, and speech-based diagnosis is currently a hot research topic. In this study, we explore the feasibility of transfer learning for Alzheimer’s disease detection using the state-of-the-art multilingual speech recognition and translation model: Whisper. In order to address the limitation of Whisper’s narrow perspective caused by the restricted audio segment length during fine-tuning, we propose an innovative method to overcome this problem by using the full transcript as a prompt to assist in training speech segments. This approach results in a relative performance improvement of 9%-12% for models with a higher number of parameters. On the ADReSSo test set, the accuracy and F1 score achieved are 84.51% and 84.50% respectively, surpassing both the baseline system and commonly used speech recognition-language model cascade methods, demonstrating its effectiveness.
更多
查看译文
关键词
Whisper,prompt,transfer learning,Alzheimer’s Disease classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要