Spoofed Speech Detection with Weighted Phase Features and Convolutional Networks
dc.contributor.author | Disken, Gokay | |
dc.date.accessioned | 2025-01-06T17:44:26Z | |
dc.date.available | 2025-01-06T17:44:26Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Detection of audio spoofing attacks has become vital for automatic speaker verification systems. Spoofing attacks can be carried out in several ways, such as speech synthesis, voice conversion, replay, and mimicry. Extracting discriminative features from speech data can improve the accuracy of detecting these attacks. In fact, a frame-wise weighted magnitude spectrum was recently found to be effective for detecting replay attacks. In this work, discriminative features are obtained in a similar fashion (frame-wise weighting); however, a cosine normalized phase spectrum is used, since phase-based features have shown decent performance for the given task. The extracted features are then fed to a convolutional neural network as input. In the experiments, the ASVspoof 2015 and 2017 databases are used to investigate the proposed system's spoof detection performance for synthetic and replay attacks, respectively. The results showed that the proposed approach achieved a 34.5% relative decrease in the average EER on the ASVspoof 2015 evaluation set, compared to the ordinary cosine normalized phase features. Furthermore, the proposed system outperformed the others at detecting the S10 attack type of the ASVspoof 2015 database. | |
dc.identifier.doi | 10.24425/aoa.2022.141648 | |
dc.identifier.endpage | 189 | |
dc.identifier.issn | 0137-5075 | |
dc.identifier.issn | 2300-262X | |
dc.identifier.issue | 2 | |
dc.identifier.scopus | 2-s2.0-85133479982 | |
dc.identifier.scopusquality | Q3 | |
dc.identifier.startpage | 181 | |
dc.identifier.uri | https://doi.org/10.24425/aoa.2022.141648 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14669/3060 | |
dc.identifier.volume | 47 | |
dc.identifier.wos | WOS:000813565900005 | |
dc.identifier.wosquality | Q4 | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.language.iso | en | |
dc.publisher | Polska Akad Nauk, Polish Acad Sciences, Inst Fundamental Tech Res Pas | |
dc.relation.ispartof | Archives of Acoustics | |
dc.relation.publicationcategory | Article - International Refereed Journal - Institutional Faculty Member | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.snmz | KA_20241211 | |
dc.subject | spoofing detection | |
dc.subject | cosine normalized cepstrum | |
dc.subject | convolutional neural networks | |
dc.title | Spoofed Speech Detection with Weighted Phase Features and Convolutional Networks | |
dc.type | Article |
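
The abstract mentions cosine normalized phase features and a relative decrease in EER. The following is a minimal sketch of both ideas, not the paper's implementation. It assumes the common definition of the cosine normalized phase as the cosine of the unwrapped phase spectrum of each windowed frame. The function names, frame length, hop size, and FFT size are hypothetical choices made for illustration, and the frame-wise weighting step is omitted because the abstract does not specify it.

```python
import numpy as np


def cosine_normalized_phase(signal, frame_len=400, hop=160, n_fft=512):
    """Return cos(unwrapped phase) per frame, shape (n_frames, n_fft // 2 + 1).

    Assumption: frame_len, hop, and n_fft are illustrative defaults,
    not values taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    # Pad very short inputs so that at least one full frame exists.
    if signal.size < frame_len:
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hamming(frame_len)
    feats = np.empty((n_frames, n_fft // 2 + 1))
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        spectrum = np.fft.rfft(frame, n=n_fft)
        # Unwrap the phase along frequency, then map it into [-1, 1].
        phase = np.unwrap(np.angle(spectrum))
        feats[i] = np.cos(phase)
    return feats


def relative_eer_decrease(baseline_eer, proposed_eer):
    """Relative decrease in EER, in percent, with respect to the baseline."""
    return 100.0 * (baseline_eer - proposed_eer) / baseline_eer
```

For example, with hypothetical EER values, `relative_eer_decrease(10.0, 5.0)` gives a 50% relative decrease. The per-frame feature matrix returned by `cosine_normalized_phase` has the two-dimensional layout that a convolutional network could take as input.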