Extracting Visual Micro-Doppler Signatures From Human Lips Motion Using UoG Radar Sensing Data for Hearing Aid Applications

IEEE Sensors Journal (2023)

Abstract
This study proposes a secure and effective lips-reading system that can accurately detect lips movements, even when face masks are worn. The system utilizes radio frequency (RF) sensing and ultra-wideband (UWB) radar technology, which overcomes the challenges posed by traditional vision-based systems. By leveraging deep learning models, the system interprets lips and mouth movements and achieves an overall accuracy of 90% for both mask-on and mask-off scenarios. The study utilized a trusted dataset from the University of Glasgow (UoG), consisting of spectrograms of lips motions stating five vowels and a voiceless class from distinct participants. The cutting-edge deep learning algorithm, residual neural network (ResNet50), was used for the evaluation of the dataset and achieved an 87% accurate detection rate with a mask-on scenario, which is a 14% improvement compared to prior published work. The findings of this study contribute to the development of a robust lips-reading framework that can enhance communication accessibility in applications such as hearing aids, voice-controlled systems, biometrics, and more.
Keywords
Inceptionv3, lips-reading, radio frequency (RF) sensing, residual neural network (ResNet50), speech recognition, ultra-wideband (UWB) radar, visual geometry group (VGG16)
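
The abstract refers to spectrograms of lip motion but does not say how they are computed. As a rough illustration only, a micro-Doppler spectrogram is commonly obtained by applying a short-time Fourier transform (STFT) to the complex slow-time radar return. The sketch below assumes that approach. The function names, pulse repetition frequency, window length, and hop size are all hypothetical and are not taken from the paper or the UoG dataset.

```python
import cmath
import math


def stft_spectrogram(signal, window_len=32, hop=8):
    """Return a list of time frames; each frame holds power in dB per
    Doppler bin, with zero Doppler shifted to the centre of the frame.

    Assumption: `signal` is a complex slow-time sequence from one range bin.
    """
    # Hann window to reduce spectral leakage between Doppler bins.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (window_len - 1))
              for n in range(window_len)]
    half = window_len // 2
    frames = []
    for start in range(0, len(signal) - window_len + 1, hop):
        seg = [signal[start + n] * window[n] for n in range(window_len)]
        # Direct DFT (O(N^2)); adequate for a sketch, use an FFT in practice.
        spectrum = [
            sum(seg[n] * cmath.exp(-2j * math.pi * k * n / window_len)
                for n in range(window_len))
            for k in range(window_len)
        ]
        # Move negative Doppler bins in front of the positive ones.
        shifted = spectrum[half:] + spectrum[:half]
        frames.append([10 * math.log10(abs(c) ** 2 + 1e-12) for c in shifted])
    return frames


def doppler_axis(prf, window_len=32):
    """Doppler frequency in Hz for each bin of a centred frame."""
    half = window_len // 2
    return [(i - half) * prf / window_len for i in range(window_len)]


def synthetic_tone(prf=256.0, doppler_hz=32.0, num_samples=128):
    """Hypothetical test input: one scatterer with a constant Doppler shift."""
    return [cmath.exp(2j * math.pi * doppler_hz * n / prf)
            for n in range(num_samples)]


if __name__ == "__main__":
    frames = stft_spectrogram(synthetic_tone())
    axis = doppler_axis(prf=256.0)
    peak_bin = max(range(len(frames[0])), key=frames[0].__getitem__)
    print(len(frames), "frames; peak Doppler of first frame:", axis[peak_bin], "Hz")
```

In a pipeline like the one the abstract describes, each such time-Doppler image would be rendered and passed to an image classifier such as ResNet50. That step is omitted here because the abstract gives no preprocessing or training details.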