Accomplishments

Fast Speaker Recognition using Efficient Feature Extraction Techniques


  • Details
  • Share
Category
Conference
Authors
Conference Name
International Conference of Computer Science and Network (IJCSN)
Conference From
13-Feb-2013
Conference To
14-Feb-2013
Conference Venue
Online
  • Abstract

Digital processing of speech signal and voice recognition algorithm is very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. A direct analysis and synthesizing the complex voice signal is due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching are introduced to represent the voice signal. The extraction and matching process is implemented after the Pre Processing of signal. In this paper, speaker recognition is based on speech samples uttered by 4 different speakers. The speakers were made to utter different digits. The non-parametric method for modeling the human auditory perception system, Mel Frequency Cepstral Coefficients (MFCCs) is utilized as extraction technique. Since MFCC imitates the human hearing system, it provides better recognition rates than Linear Predictive Coefficients (LPC). For feature matching, the non linear sequence alignment known as Dynamic Time Warping (DTW) is used. Since voice signal tends to have different temporal rate, the alignment is important for correct recognition of speaker. This paper presents the viability of MFCC to extract features and DTW to compare the test patterns. It is found that the distance between feature vectors of two different speakers uttering a word is significantly higher than that of the distance between feature vectors of same speakers. Keywords: Mel filter banks, Hamming Window, Dynamic Time Warping (DTW).

© Somaiya 2025 / All rights reserved.
Get in Touch