Implementation of speech coding technique in MPEG/audio coding

Cheung, Sai-hang

Full metadata record

DC Field	Value	Language
dc.contributor	Multi-disciplinary Studies	en_US
dc.contributor	Department of Electronic and Information Engineering	en_US
dc.creator	Cheung, Sai-hang	-
dc.identifier.uri	https://theses.lib.polyu.edu.hk/handle/200/332	-
dc.language	English	en_US
dc.publisher	Hong Kong Polytechnic University	-
dc.rights	All rights reserved	en_US
dc.title	Implementation of speech coding technique in MPEG/audio coding	en_US
dcterms.abstract	In multimedia applications such as video transmission and storage, the ISO standard MPEG/audio has been used extensively. This algorithm was developed by the Motion Picture Experts Group (MPEG), as an ISO standard for the high fidelity compression of digital audio. Unlike vocal-tract-model coders specially tuned for speech signals, the MPEG/audio coder gets its compression without making assumptions about the nature of the audio source. Instead, the coder exploits the perceptual limitations of the human ear. Much of the compression results from the removal of perceptually irrelevant limitations of the audio signal. Removal of such parts results in inaudible distortions, thus MPEG/audio can compress any signal meant to be heard by human ear. In addition, for different level of resulting quality, the compressed bitstream can have one of several predefined fixed bit rates ranging from 32 to 192 kbit/sec per channel. While the coding quality of MPEG/audio is satisfactory, there is room to further reduce the bit rate of compressed signal. CELP-based approach has been very successful in telephone bandwidth speech coding, but is not suitable for coding non-speech signals because of the assumed signal production model. Low-delay Code-excited Linear Prediction (LD-CELP) is ITU/CCITT standard G.728 which is a 16 kbit/sec low-delay speech coder. It can achieve a high speech quality better than G.721 with a one-way coding delay less than 2ms. In this project, an approach is proposed to mixed speech/music coding, which uses a discriminator to separate music signals from speech, and codes them with the MPEG/audio coder and a LD-CELP speech coder, respectively. In testing for different audio clips, the system shows promising results.	en_US
dcterms.extent	51 leaves : ill. ; 30 cm	en_US
dcterms.isPartOf	PolyU Electronic Theses	en_US
dcterms.issued	2000	en_US
dcterms.educationalLevel	All Master	en_US
dcterms.educationalLevel	M.Sc.	en_US
dcterms.LCSH	Speech processing systems	en_US
dcterms.LCSH	Coding theory	en_US
dcterms.LCSH	Hong Kong Polytechnic University -- Dissertations	en_US
dcterms.accessRights	restricted access	en_US

Files in This Item:

File	Description	Size	Format
b1517721x.pdf	For All Users (off-campus access for PolyU Staff & Students only)	1.98 MB	Adobe PDF	View/Open

Copyright Undertaking

As a bona fide Library user, I declare that:

I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show simple item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/332