AUDIO ENHANCEMENT THROUGH SUPERVISED LATENT VARIABLE REPRESENTATION OF TARGET SPEECH AND NOISE

user-5f3206704c775e3a7964bd8b（2020）

引用 0|浏览4

暂无评分

摘要

Systems and methods for generating an enhanced audio signal comprise a trained neural network configured to receive an input audio signal and generate an enhanced target signal, the trained neural network comprising a pre-processing neural network configured to receive a segment of the input audio signal and output an audio classification, the pre-processing neural network including at least one hidden layer comprising an embedding vector, and a noise reduction neural network configured to receive the segment of the input audio signal, and the embedding vector and generate the enhanced target signal. The pre-processing neural network may comprise a target signal pre-processing neural network configured to output a target signal classification and comprising at least one hidden layer comprising a target embedding vector. The pre-processing neural network may comprise a noise pre-processing neural network configured output a noise classification and comprising at least one hidden layer comprising a noise embedding vector.

查看译文

关键词

Audio signal,Noise (signal processing),Artificial neural network,Noise reduction,Embedding,Pattern recognition,Representation (mathematics),Computer science,Latent variable,Artificial intelligence,Target signal

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要