RECOGNITION OF NON-SPEECH SOUNDS USING MEL-FREQUENCY CEPSTRUM COEFFICIENTS AND DYNAMIC TIME WARPING METHOD
Date
2015
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Access Rights
info:eu-repo/semantics/closedAccess
Abstract
As technology develops, speech recognition systems are taking up more space in our daily lives. The sounds in our environment, however, are not limited to pure speech, so it is important for cochlear implants, unmanned vehicles and security systems to be able to recognize other sounds as well. In this work, Mel-frequency cepstrum coefficients (MFCC), one of the most widely used feature extraction methods in speech recognition, are applied to various nature and animal sounds. Because the sounds differ in duration, their feature vectors differ in length; dynamic time warping (DTW), another method used in speech recognition, is therefore chosen to classify the feature vectors, since it can overcome these length differences. One reference recording and 10 test recordings were obtained from 10 different sound sources. The correct classification rate was 88%.
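The way dynamic time warping absorbs differences in sequence length can be illustrated with a minimal sketch. It is not the authors' implementation: the Euclidean frame distance, the function names (`frame_distance`, `dtw_distance`, `classify`) and the one-dimensional toy sequences are all assumptions made for illustration, and real input would be sequences of MFCC frames computed beforehand.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature frames of equal dimension (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Cumulative DTW cost between two sequences of feature frames.

    The sequences may have different lengths; each frame is a list of floats.
    """
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of the first i frames of seq_a
    # with the first j frames of seq_b
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            # A step may advance seq_a only, seq_b only, or both
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def classify(test_seq, references):
    """Return the label of the reference sequence closest to test_seq under DTW."""
    return min(references, key=lambda label: dtw_distance(test_seq, references[label]))


# Hypothetical toy data: one reference per class, one feature per frame.
references = {
    "rising": [[0.0], [1.0], [2.0], [3.0]],
    "falling": [[3.0], [2.0], [1.0], [0.0]],
}
# A time-stretched version of the "rising" pattern (7 frames instead of 4).
test_seq = [[0.0], [0.0], [1.0], [1.0], [2.0], [3.0], [3.0]]
label = classify(test_seq, references)
```

In this toy case the test sequence is assigned to the "rising" reference even though it has almost twice as many frames, because the warping path is allowed to repeat reference frames. Using a single reference per class mirrors the one-reference setup described in the abstract.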
Description
23rd Signal Processing and Communications Applications Conference (SIU) -- MAY 16-19, 2015 -- Inonu Univ, Malatya, TURKEY
Keywords
Sound recognition, Dynamic time warping, MFCC, Non-speech sound recognition
Source
2015 23rd Signal Processing and Communications Applications Conference (SIU)
WoS Q Value
N/A