ThaiSpoof: A Database for Spoof Detection in Thai Language

Kasorn Galajit, Thunpisit Kosolsriwiwat, Masashi Unoki, Candy Olivia Mawalim, Pakinee Aimmanee, Waree Kongprawechnon, Win Pa Pa, Anuwat Chaiwongyen, Teeradaj Racharak, Surasak Boonkla, Hayati Yassin, Jessada Karnjana

2023 18th International Joint Symposium on Artificial Intelligence and Natural Language Processing, ISAI-NLP (2023)

Abstract

Automatic speaker verification (ASV) is widely used in applications and security systems. However, these systems are vulnerable to various direct and indirect access attacks, which weaken their authentication capability. Research on spoofed speech detection helps to strengthen these systems. Unfortunately, spoofing detection has been studied in only a few languages, because it requires suitable datasets for each one. This paper focuses on a Thai language dataset for spoof detection. The dataset consists of genuine speech signals and various types of spoofed speech signals. The spoofed speech is generated using text-to-speech tools for the Thai language, synthesis tools, and tools for speech modification. To showcase the use of this dataset, we implement a simple spoof detection model based on a convolutional neural network (CNN) that takes linear frequency cepstral coefficients (LFCC) as its input. We trained, validated, and tested the model on our dataset, referred to as ThaiSpoof. The experimental results show that the model's accuracy is 93% and its equal error rate (EER) is 6.78%. These results indicate that the ThaiSpoof dataset has the potential to support studies of spoof detection.
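For readers unfamiliar with the EER metric reported above, the following is a minimal sketch of how an equal error rate can be computed from detector scores. It is not the authors' evaluation code. The function name `equal_error_rate`, the convention that higher scores mean "more likely genuine", and the toy scores are all assumptions made for illustration; the scores are not taken from ThaiSpoof.

```python
def equal_error_rate(genuine_scores, spoof_scores):
    """Return (eer, threshold), assuming higher scores mean 'more likely genuine'.

    The EER is the operating point where the false acceptance rate (FAR)
    equals the false rejection rate (FRR). With finitely many scores the two
    rates rarely match exactly, so this sketch picks the threshold where they
    are closest and reports their average.
    """
    # Every observed score is a candidate decision threshold.
    thresholds = sorted(set(genuine_scores) | set(spoof_scores))
    best = None
    for t in thresholds:
        # False rejection: a genuine trial scored below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False acceptance: a spoofed trial scored at or above the threshold.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy scores for illustration only (hypothetical, not from the paper).
genuine = [0.9, 0.8, 0.7, 0.4]
spoofed = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine, spoofed)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

In this toy case one genuine score (0.4) falls below the chosen threshold and one spoofed score (0.6) reaches it, so both error rates are 1 in 4. Unlike accuracy, the EER does not depend on a single fixed threshold or on the balance between genuine and spoofed trials, which is one reason spoof detection results are commonly reported with both metrics.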

Keywords

Thai database, spoof detection, automatic speaker verification, speech synthesis, speech modification
