A Review on Feature Extraction for Speaker Recognition under Degraded Conditions

dc.authoridTufekci, Zekeriya/0000-0001-7835-2741
dc.authoridCevik, Ulus/0000-0002-0956-9725
dc.authoridSARIBULUT, LUTFU/0000-0002-6183-9550
dc.authoridDisken, Gokay/0000-0002-8680-0636
dc.contributor.authorDisken, Gokay
dc.contributor.authorTufekci, Zekeriya
dc.contributor.authorSaribulut, Lutfu
dc.contributor.authorCevik, Ulus
dc.date.accessioned2025-01-06T17:37:50Z
dc.date.available2025-01-06T17:37:50Z
dc.date.issued2017
dc.description.abstractSpeech is a signal that carries the speaker's emotion, individual characteristics, phonemic information, etc. Various methods have been proposed for speaker recognition by extracting such characteristics from a given utterance. Among them, short-term cepstral features are used extensively in speech and speaker recognition because of their low complexity and high performance in controlled environments. However, their performance decreases dramatically under degraded conditions such as channel mismatch, additive noise, and emotional variability. In this paper, a literature review on speaker-specific information extraction from speech is presented, covering recent studies that offer solutions to this problem. The studies are categorized into three groups according to their robustness against channel mismatch, additive noise, and other degradations such as vocal effort and emotion mismatch. For clearer presentation, they are also summarized in two tables according to their classification methods and the data sets used.
dc.identifier.doi10.1080/02564602.2016.1185976
dc.identifier.endpage332
dc.identifier.issn0256-4602
dc.identifier.issn0974-5971
dc.identifier.issue3
dc.identifier.scopus2-s2.0-85014739208
dc.identifier.scopusqualityQ2
dc.identifier.startpage321
dc.identifier.urihttps://doi.org/10.1080/02564602.2016.1185976
dc.identifier.urihttps://hdl.handle.net/20.500.14669/2387
dc.identifier.volume34
dc.identifier.wosWOS:000402715400010
dc.identifier.wosqualityQ3
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherTaylor & Francis Ltd
dc.relation.ispartofIETE Technical Review
dc.relation.publicationcategoryArticle - International Refereed Journal - Institutional Faculty Member
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_20241211
dc.subjectFeature extraction
dc.subjectIdentification
dc.subjectSpeaker recognition
dc.subjectVerification
dc.titleA Review on Feature Extraction for Speaker Recognition under Degraded Conditions
dc.typeReview Article
