Enhancing techniques for a standard conforming real-time video codec

Lai, Kam-cheong

Author:	Lai, Kam-cheong
Title:	Enhancing techniques for a standard conforming real-time video codec
Degree:	M.Phil.
Year:	2002
Subject:	Hong Kong Polytechnic University -- Dissertations Video compression Digital video MPEG (Video coding standard)
Department:	Department of Electronic and Information Engineering
Pages:	xiii, 110 leaves : ill. ; 30 cm
Language:	English
Abstract:	The ultimate goal of video coding is to achieve a balance between the spatial quality and the temporal quality subject to human judgment with the best coding efficiency and the least computational requirement. Since inter-operability is of paramount importance, several video coding standards have been established such as H.261, H.263, MPEG-1, MPEG-2, and MPEG-4. Among these standards, the most widespread coding structure takes the form of combing a motion-compensation with Discrete Cosine Transform (DCT). Since standards only specify the decoder's syntax, it provides flexibility for improvement. In this thesis, we present new standard- conforming video coding techniques, which can offer better coding efficiency. Efficient motion estimation is essential for achieving good quality video coding. With studying the distribution of the motion vectors obtained by the Full Search algorithm, we optimize the prediction process of the motion estimation by classifying the blocks into four different classes based on a newly proposed group of reference motion vector. Different search patterns and strategies are used for different classes. The proposed algorithm improves the computational requirement and coding efficiency when compared to other fast motion estimation algorithms. Besides, a Weighted Sum of Absolute Difference (WSAD) matching criterion function for motion estimation is proposed. WSAD is designed based on the characteristic of the DCT. WSAD enhances the coding efficiency while the computational overhead is very small. To achieve better video quality in very low bit rate situation, an object-based video coding scheme is proposed. Each frame of a video is segmented into three non-overlapping regions, namely background, non-facial foreground and facial region. Smaller quantization step (better quality) is used for the facial region while a larger quantization step is used for the background to improve the coding efficiency. However, the non-facial foreground is kept in normal coding quality to prevent degradation of important regions other than facial region such as hand movement. The proposed method is real-time ready. Since the video data encoded in block-based video codec has an inherently variable bit rate, a rate control algorithm is proposed in this thesis. As human judges the quality of the output, we adopt the information of human visual system to our rate control system. The proposed algorithm is able to adjust the operating point of the encoder automatically to a balance region between the spatial and temporal quality. Besides, experimental results show that the proposed rate control algorithm prevents motion jerkiness in condition of bandwidth fluctuation. Finally, we proposed a fast direct spatial filtering algorithm on DCT-based compressed video. As video signals are often compressed for transmission or storage, manipulation of compressed video signals is required. Filtering is a technique often used in these processes. Direct filtering on DCT domain can speedup the filtering process without going through the inverse transform of data from DCT domain to spatial domain. Our proposed method although performs spatial filter and inverse DCT transform at the same time, the computational requirement is less than a fast inverse DCT transform.
Rights:	All rights reserved
Access:	open access

Files in This Item:

File	Description	Size	Format
b16576688.pdf	For All Users	3.7 MB	Adobe PDF	View/Open

Copyright Undertaking

As a bona fide Library user, I declare that:

I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show full item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/2217