Vision-Based Assistance for Vocal Fold Identification in Laryngoscopy with Knowledge Distillation.

Thao Thi Phuong Dao, Minh-Khoi Pham, Mai-Khiem Tran, Chanh Cong Ha, Boi Ngoc Van, Bich Anh Tran, Minh-Triet Tran

Studies in Health Technology and Informatics (2024)

Abstract

Laryngoscopy images play a vital role in bringing together computer vision and otorhinolaryngology research. However, few studies offer laryngeal datasets for comparative evaluation. Hence, this study introduces a novel dataset focusing on vocal fold images. Additionally, we propose a lightweight network utilizing knowledge distillation: our student model achieves around 98.4% accuracy, comparable to the original EfficientNetB1, while reducing model weights by up to 88%. We also present an AI-assisted smartphone solution, enabling a portable and intelligent laryngoscopy system that helps laryngoscopists efficiently target vocal fold areas for observation and diagnosis. To sum up, our contributions include a laryngeal image dataset and a compressed version of the EfficientNetB1 model, suitable for handheld laryngoscopy devices.
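
The abstract does not state which distillation objective is used. As an illustration only, the sketch below assumes the common soft-target formulation: a weighted sum of hard-label cross-entropy and a temperature-scaled KL divergence between teacher and student outputs. The function names, the temperature of 4.0, and the weight alpha of 0.5 are hypothetical choices, not values taken from the paper.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    log_sum = peak + math.log(sum(math.exp(z - peak) for z in scaled))
    return [z - log_sum for z in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Soft-target distillation loss for one sample (assumed formulation).

    alpha weights the hard-label cross-entropy; (1 - alpha) weights the
    KL divergence between softened teacher and student distributions.
    """
    # Hard-label term: cross-entropy of the student at temperature 1.
    hard = -log_softmax(student_logits)[label]

    # Soft-label term: KL(teacher || student) on softened distributions.
    log_t = log_softmax(teacher_logits, temperature)
    log_s = log_softmax(student_logits, temperature)
    soft = sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_t, log_s))

    # The T^2 factor keeps the soft-term gradient scale comparable
    # across different temperatures.
    return alpha * hard + (1.0 - alpha) * temperature ** 2 * soft


# Two-class example (e.g. vocal fold visible vs. not visible); the logits
# here are made-up numbers for illustration.
teacher = [2.0, 0.5]
agreeing_student = [2.0, 0.5]
disagreeing_student = [0.5, 2.0]
loss_agree = distillation_loss(agreeing_student, teacher, label=0)
loss_disagree = distillation_loss(disagreeing_student, teacher, label=0)
```

In this sketch, a student whose logits match the teacher's incurs only the weighted hard-label term, while a student that disagrees with the teacher is penalized by both terms.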
