Evaluation of feature extraction methods for speaker verification

Lam, Chin-lung

Full metadata record

DC Field	Value	Language
dc.contributor	Multi-disciplinary Studies	en_US
dc.contributor	Department of Electronic Engineering	en_US
dc.creator	Lam, Chin-lung	-
dc.identifier.uri	https://theses.lib.polyu.edu.hk/handle/200/3169	-
dc.language	English	en_US
dc.publisher	Hong Kong Polytechnic University	-
dc.rights	All rights reserved	en_US
dc.title	Evaluation of feature extraction methods for speaker verification	en_US
dcterms.abstract	A common problem in text-independent speaker verification systems is that a mismatch between the training and testing conditions sacrifices much performance. A common example of this mismatch is that training is done on clean speech while testing is performed on noisy or channel-corrupted speech. Robust speech processing techniques attempt to maintain the performance of a speech processing system under such diverse conditions. Two strategies of robust speech processing techniques have emerged to mitigate the problems that arise due to channel effects and noise. The first strategy is normally carried out in the front-end feature extractors. The second strategy aims at making the classifier more robust by compensating the distortions between the template patterns and the unknown patterns during the classification stage. The conventional linear predictive (LP) cepstrum and the delta cepstrum are the commonly used features for speaker recognition systems. The linear predictive (LP) cepstrum derived from an all-pole transfer function is able to approximate the spectral envelope of the speech signals. A newly proposed feature, namely the Adaptive Component Weighted (ACW) cepstrum, has been found to be robust against channel variations and noise. The ACW cepstrum is derived from a pole-zero transfer function whose denominator is a pth order LP polynomial. This dissertation compares the LP cepstrum and ACW cepstrum for speaker verification. Experiments were carried out based on the NTIMIT corpus where the feature vectors were classified by radial basis function neural networks, Experimental results show that the ACW cepstrum is better than LP cepstrum in extracting speaker features from telephone speech.	en_US
dcterms.extent	vi, 57, vi leaves : ill. ; 30 cm	en_US
dcterms.isPartOf	PolyU Electronic Theses	en_US
dcterms.issued	1998	en_US
dcterms.educationalLevel	All Master	en_US
dcterms.educationalLevel	M.Sc.	en_US
dcterms.LCSH	Speech processing systems	en_US
dcterms.LCSH	Hong Kong Polytechnic University -- Dissertations	en_US
dcterms.accessRights	restricted access	en_US

Files in This Item:

File	Description	Size	Format
b14465334.pdf	For All Users (off-campus access for PolyU Staff & Students only)	2.24 MB	Adobe PDF	View/Open

Copyright Undertaking

As a bona fide Library user, I declare that:

I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show simple item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/3169