Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | Multi-disciplinary Studies | en_US |
dc.contributor | Department of Electronic and Information Engineering | en_US |
dc.creator | Cheung, Sai-hang | - |
dc.identifier.uri | https://theses.lib.polyu.edu.hk/handle/200/332 | - |
dc.language | English | en_US |
dc.publisher | Hong Kong Polytechnic University | - |
dc.rights | All rights reserved | en_US |
dc.title | Implementation of speech coding technique in MPEG/audio coding | en_US |
dcterms.abstract | In multimedia applications such as video transmission and storage, the ISO standard MPEG/audio has been used extensively. This algorithm was developed by the Motion Picture Experts Group (MPEG), as an ISO standard for the high fidelity compression of digital audio. Unlike vocal-tract-model coders specially tuned for speech signals, the MPEG/audio coder gets its compression without making assumptions about the nature of the audio source. Instead, the coder exploits the perceptual limitations of the human ear. Much of the compression results from the removal of perceptually irrelevant limitations of the audio signal. Removal of such parts results in inaudible distortions, thus MPEG/audio can compress any signal meant to be heard by human ear. In addition, for different level of resulting quality, the compressed bitstream can have one of several predefined fixed bit rates ranging from 32 to 192 kbit/sec per channel. While the coding quality of MPEG/audio is satisfactory, there is room to further reduce the bit rate of compressed signal. CELP-based approach has been very successful in telephone bandwidth speech coding, but is not suitable for coding non-speech signals because of the assumed signal production model. Low-delay Code-excited Linear Prediction (LD-CELP) is ITU/CCITT standard G.728 which is a 16 kbit/sec low-delay speech coder. It can achieve a high speech quality better than G.721 with a one-way coding delay less than 2ms. In this project, an approach is proposed to mixed speech/music coding, which uses a discriminator to separate music signals from speech, and codes them with the MPEG/audio coder and a LD-CELP speech coder, respectively. In testing for different audio clips, the system shows promising results. | en_US |
dcterms.extent | 51 leaves : ill. ; 30 cm | en_US |
dcterms.isPartOf | PolyU Electronic Theses | en_US |
dcterms.issued | 2000 | en_US |
dcterms.educationalLevel | All Master | en_US |
dcterms.educationalLevel | M.Sc. | en_US |
dcterms.LCSH | Speech processing systems | en_US |
dcterms.LCSH | Coding theory | en_US |
dcterms.LCSH | Hong Kong Polytechnic University -- Dissertations | en_US |
dcterms.accessRights | restricted access | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
b1517721x.pdf | For All Users (off-campus access for PolyU Staff & Students only) | 1.98 MB | Adobe PDF | View/Open |
Copyright Undertaking
As a bona fide Library user, I declare that:
- I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
- I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
- I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.
By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.
Please use this identifier to cite or link to this item:
https://theses.lib.polyu.edu.hk/handle/200/332