year 9, Issue 1 (Journal of Acoustical Engineering Society of Iran 2021)                   مجله انجمن علوم صوتی ایران (مهندسی صوتیات سابق) 2021, 9(1): 28-39 | Back to browse issues page

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Mirbeygi M, Mahabadi A, Ranjbar A. Classification of noise-robust speech features in the speaker authentication system (Research Article). مجله انجمن علوم صوتی ایران (مهندسی صوتیات سابق) 2021; 9 (1) :28-39
URL: http://joasi.ir/article-1-192-en.html
Abstract:   (2657 Views)
Automatic speaker recognition has a wide range of applications in industrial and security systems and requires the extraction of speech signal features. The use of the feature matrix is ​​very important in real-time recognition of the speaker, and the presence of environmental and processing noise leads to a violation in the characteristics of the features and the production of recognition errors. Increasing the accuracy of recognition detection requires the noise removal process to correctly determine the energy characteristics, energy entropy, zero- crossing rate, spectral centroid, spectral spread, spectral entropy, spectral flux, and spectral roll off the signal. In designing real-time and reliable algorithms, there are critical processes of correct speech extraction, sensitivity detection, and measuring the robustness of signal parameters to eliminate noise and improve speech quality, which play a key role in improving the signal-to-noise ratio. In this paper, the classification of speech signal features for designing real-time and noise-robust speaker recognition algorithms in measuring its robustness are investigated. The proposed method of noise removal uses a binary mask with a robust feature and the experimental results of the experiments on the standard data show the rate of signal improvement to the noise of approximately 2 to 3 db. The feature matrix evaluation for the authentication system consists of mel frequency coefficient, linear prediction coefficient and, cepstrum coefficient, which has been evaluated by the Euclidean distance method in another experimental standard data set. Our proposed method achieves on overall 80% real-time recognition accuracy in noisy data set. 
Full-Text [PDF 29 kb]   (1011 Downloads)    
Type of Study: Research | Subject: Signal Processing
Received: 2020/09/10 | Accepted: 2021/07/22 | Published: 2021/09/11

Add your comments about this article : Your username or Email:
CAPTCHA

Send email to the article author


Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.