AUDIO ENHANCEMENT THROUGH SUPERVISED LATENT VARIABLE REPRESENTATION OF TARGET SPEECH AND NOISE

user-5f3206704c775e3a7964bd8b(2020)

引用 0|浏览4
暂无评分
摘要
Systems and methods for generating an enhanced audio signal comprise a trained neural network configured to receive an input audio signal and generate an enhanced target signal, the trained neural network comprising a pre-processing neural network configured to receive a segment of the input audio signal and output an audio classification, the pre-processing neural network including at least one hidden layer comprising an embedding vector, and a noise reduction neural network configured to receive the segment of the input audio signal, and the embedding vector and generate the enhanced target signal. The pre-processing neural network may comprise a target signal pre-processing neural network configured to output a target signal classification and comprising at least one hidden layer comprising a target embedding vector. The pre-processing neural network may comprise a noise pre-processing neural network configured output a noise classification and comprising at least one hidden layer comprising a noise embedding vector.
更多
查看译文
关键词
Audio signal,Noise (signal processing),Artificial neural network,Noise reduction,Embedding,Pattern recognition,Representation (mathematics),Computer science,Latent variable,Artificial intelligence,Target signal
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要