Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | Multi-disciplinary Studies | en_US |
dc.contributor | Department of Electronic Engineering | en_US |
dc.creator | Lam, Chin-lung | - |
dc.identifier.uri | https://theses.lib.polyu.edu.hk/handle/200/3169 | - |
dc.language | English | en_US |
dc.publisher | Hong Kong Polytechnic University | - |
dc.rights | All rights reserved | en_US |
dc.title | Evaluation of feature extraction methods for speaker verification | en_US |
dcterms.abstract | A common problem in text-independent speaker verification systems is that a mismatch between training and testing conditions substantially degrades performance. A typical example is training on clean speech while testing on noisy or channel-corrupted speech. Robust speech processing techniques attempt to maintain the performance of a speech processing system under such diverse conditions. Two strategies have emerged to mitigate the problems caused by channel effects and noise. The first is normally carried out in the front-end feature extractor. The second aims to make the classifier more robust by compensating for the distortions between the template patterns and the unknown patterns during the classification stage. The conventional linear predictive (LP) cepstrum and the delta cepstrum are commonly used features for speaker recognition systems. The LP cepstrum, derived from an all-pole transfer function, is able to approximate the spectral envelope of speech signals. A newly proposed feature, the Adaptive Component Weighted (ACW) cepstrum, has been found to be robust against channel variations and noise. The ACW cepstrum is derived from a pole-zero transfer function whose denominator is a pth-order LP polynomial. This dissertation compares the LP cepstrum and the ACW cepstrum for speaker verification. Experiments were carried out on the NTIMIT corpus, with the feature vectors classified by radial basis function neural networks. Experimental results show that the ACW cepstrum is better than the LP cepstrum at extracting speaker features from telephone speech. | en_US |
dcterms.extent | vi, 57, vi leaves : ill. ; 30 cm | en_US |
dcterms.isPartOf | PolyU Electronic Theses | en_US |
dcterms.issued | 1998 | en_US |
dcterms.educationalLevel | All Master | en_US |
dcterms.educationalLevel | M.Sc. | en_US |
dcterms.LCSH | Speech processing systems | en_US |
dcterms.LCSH | Hong Kong Polytechnic University -- Dissertations | en_US |
dcterms.accessRights | restricted access | en_US |
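Illustrative note (not part of the catalogue record or the dissertation): the abstract describes the LP cepstrum as derived from an all-pole transfer function. A minimal sketch of the standard recursion that converts predictor coefficients into cepstral coefficients is given below. It assumes the sign convention H(z) = 1 / (1 - sum_k a_k z^-k); the function name `lp_cepstrum` and its parameters are hypothetical, and the dissertation's own implementation may differ. The ACW cepstrum is not shown.

```python
def lp_cepstrum(a, n_ceps):
    """Cepstral coefficients c_1..c_n_ceps of an all-pole model.

    Assumed convention (an assumption, not taken from the dissertation):
        H(z) = 1 / (1 - sum_{k=1}^{p} a[k-1] * z**-k)
    Recursion:
        c_n = a_n + sum_{k=1}^{n-1} (k/n) * c_k * a_{n-k}   (a_m = 0 for m > p)
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] is unused; the gain term is ignored here
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        # Only terms with 1 <= n - k <= p contribute.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


# Single pole at z = 0.5: the cepstrum is 0.5**n / n.
print(lp_cepstrum([0.5], 3))
```

For a stable model with poles z_i, the recursion reproduces the closed form c_n = (1/n) * sum_i z_i**n, which gives a convenient sanity check on any implementation.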
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
b14465334.pdf | For All Users (off-campus access for PolyU Staff & Students only) | 2.24 MB | Adobe PDF |
Please use this identifier to cite or link to this item:
https://theses.lib.polyu.edu.hk/handle/200/3169