Privacy-sensitive audio features for conversational speech processing

ACM SIGMultimedia Records (2012)

Abstract
The work described in this thesis takes place in the context of capturing real-life audio for the analysis of spontaneous social interactions. Towards this goal, we wish to capture conversational and ambient sounds using portable audio recorders. Analysis of conversations can then proceed by modeling the speaker turns and durations produced by speaker diarization. However, a key obstacle to the ubiquitous capture of real-life audio is privacy. In particular, recording and storing raw audio would breach the privacy of people whose consent has not been explicitly obtained.
Keywords
raw audio, speaker turn, real-life audio, ubiquitous capture, speaker diarization, conversational speech processing, spontaneous social interaction, portable audio recorder, privacy-sensitive audio feature, key factor, speech recognition, social interaction, speech processing, residual, linear prediction