Noise-Robust Spoofed Speech Detection Using Discriminative Autoencoder

dc.contributor.authorDişken, Gökay
dc.contributor.authorTüfekçi, Zekeriya
dc.date.accessioned2025-01-06T17:22:42Z
dc.date.available2025-01-06T17:22:42Z
dc.date.issued2023
dc.departmentAdana Alparslan Türkeş Bilim ve Teknoloji Üniversitesi
dc.description.abstractAudio spoof detection gained attention of the researchers recently, as it is vital to detect spoofed speech for automatic speaker recognition systems. Publicly available datasets also accelerated the studies in this area. Many different features and classifiers have been proposed to overcome the spoofed speech detection problem, and some of them achieved considerably high performances. However, under additive noise, the spoof detection performance drops rapidly. On the other hand, number of studies about robust spoofed speech detection is very limited. The problem becomes more interesting as the conventional speech enhancement methods reportedly performed worse than no enhancement. In this work, i-vectors are used for spoof detection, and discriminative denoising autoencoder (DAE) network is used to obtain enhanced (clean) i-vectors from their noisy counterparts. Once the enhanced i-vectors are obtained, they can be treated as normal i-vectors and can be scored/classified without any modifications in the classifier part. Data from ASVspoof 2015 challenge is used with five different additive noise types, following a similar configuration of previous studies. The DAE is trained with a multicondition manner, using both clean and corrupted i-vectors. Three different noise types at various signal-to-noise ratios are used to create corrupted i-vectors, and two different noise types are used only in the test stage to simulate unknown noise conditions. Experimental results showed that the proposed DAE approach is more effective than the conventional speech enhancement methods.
dc.identifier.doi10.18466/cbayarfbe.1132319
dc.identifier.endpage174
dc.identifier.issn1305-130X
dc.identifier.issn1305-1385
dc.identifier.issue2
dc.identifier.startpage167
dc.identifier.trdizinid1183729
dc.identifier.urihttps://doi.org/10.18466/cbayarfbe.1132319
dc.identifier.urihttps://search.trdizin.gov.tr/tr/yayin/detay/1183729
dc.identifier.urihttps://hdl.handle.net/20.500.14669/470
dc.identifier.volume19
dc.indekslendigikaynakTR-Dizin
dc.language.isoen
dc.relation.ispartofCelal Bayar Üniversitesi Fen Bilimleri Dergisi
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_20241211
dc.subjectDeep learning
dc.subjectdenoising autoencoder
dc.subjecti-vector
dc.subjectspoofing detection
dc.titleNoise-Robust Spoofed Speech Detection Using Discriminative Autoencoder
dc.typeArticle

Dosyalar