Robust Deepfake Audio Detection via Bi-level Optimization

2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP(2023)

引用 0|浏览4
暂无评分
摘要
ASVspoof Challenges have been launched to motivate research on Deepfake audio detection due to its threats to society. However, the state-of-the-art detection models produce an unsatisfactory performance on the Speech Deepfake (DF) of the challenge. The DF subset includes spoofed audio from various sources, which can better reflect the robustness of the detector. In this paper, we propose a novel detection architecture to improve the robustness and generalization ability in two ways. The first way is aggregating both learned embeddings and hand-crafted features to obtain more generalizable representations for Deepfake audio. Our second contribution is formulating the training process a bi-level optimization problem to make use of the knowledge of different Deepfake generation methods. Evaluations of our proposed method provide the best detection output reported in the literature as a single system without the help of ensemble modeling and data augmentation.
更多
查看译文
关键词
Deepfake,Audio Deepfake Detection,Anti-Spoofing,ASVspoof2021
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要