Journal of Multimedia, Vol 2, No 6 (2007), 34-43, Nov 2007
doi:10.4304/jmm.2.6.34-43

Linear Discriminant Analysis F-Ratio for Optimization of TESPAR & MFCC Features for Speaker Recongnition

K. Anitha Sheela, K. Satya Prasad

Abstract


This paper deals with implementing an efficient optimization technique for designing an Automatic Speaker Recognition (ASR) System, which uses average F-ratio score of TESPAR(Time Encoded Signal Processing And Recognition) and MFCC(Mel frequency Cepstral Coefficients) features, to yield high recognition accuracy even in adverse noisy conditions. A new ranking scheme is also proposed in order to stabilize the rank of features in various noise levels by taking Arithmetic Mean of the F-Ratio scores obtained from various levels of Signal to Noise Ratio (SNR). The result is presented for a Text-Dependent ASR system with 20 speaker database. An RBF (Radial Basis Function) Neural Network is used for Recognition purpose. Also a comparative study has been performed for recognition accuracies of optimized MFCC and TESPAR features and we conclude that new proposed average F-Ratio technique has resulted in better accuracy compared to simple F-ratio in noisy environment and also we came to know that TESPAR features are more redundant compared to MFCC.



Keywords


ASR, F-Ratio, Average F-Ratio, TESPAR, RBF Neural Network, MFCC

References



Full Text: PDF


Journal of Multimedia (JMM, ISSN 1796-2048)

Copyright @ 2006-2012 by ACADEMY PUBLISHER – All rights reserved.