Full metadata record
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineeringen_US
dc.contributor.advisorLam, Kin-man (EIE)en_US
dc.creatorXiao, Jun-
dc.identifier.urihttps://theses.lib.polyu.edu.hk/handle/200/12023-
dc.languageEnglishen_US
dc.publisherHong Kong Polytechnic Universityen_US
dc.rightsAll rights reserveden_US
dc.titleMachine learning for image super-resolution in real-world applicationsen_US
dcterms.abstractSingle image super-resolution (SISR) is a fundamental task in computer vision, which aims to reconstruct high-resolution images from their low-resolution counterparts. In recent years, deep image SR methods have achieved remarkable performance, but there are still some challenging issues limiting their applications in real-world situations. In this thesis, we will investigate three challenging issues, including the degradation mismatch, high computational complexity, and the trade-off between the distortion and perceptual quality, for existing deep image SR models in real-world applications.en_US
dcterms.abstractIn Chapter 3, we will focus on the degradation mismatch between real-world and synthetic degradations. Two specific degradation models are considered, i.e., the mixture of multiple degradations and the multiple-focal-length degradation. A deep progressive network is proposed to address the mixture of degradations, which leverages prior information from small-scale images to restore corrupted contents of large-scale images. In addition, we propose a deep feature mixture model for the multiple-focal-length degradation. The proposed model can adaptively aggregate local contextual information extracted from different scales for reconstruction. Furthermore, we propose a novel loss function, based on the Laplacian pyramid, to improve the performance. Experiments show that our proposed methods significantly outperform other promising image SR models.en_US
dcterms.abstractThen, we will introduce two proposed deep lightweight image SR models for resource-constrained devices in Chapter 4. The first method adopts the proposed lightweight compression-and-extraction module, which avoids unnecessary costs for redundant features. The second lightweight model is based on our proposed convolutional kernel which is dynamically generated in each pixel location. In addition, it shares kernel weights across the channels for feature extraction. Experiments show that our proposed methods can significantly reduce the number of model parameters, while maintaining the performance, leading to a better trade-off between the performance and the model complexity than other lightweight models.en_US
dcterms.abstractNext, in Chapter 5, we will focus on the trade-off between distortion and perceptual quality for super-resolved images. To address this issue, we propose an image-fusion method, which provides a novel interpolation method in the Wasserstein space. This novel interpolation method can effectively balance the distortion and perceptual quality of the images by manipulating their high-frequency components in the wavelet domain. Experiments show that our proposed method significantly improves the visual quality, while maintaining the distortion quality, leading to a better performance than compared methods. In addition, we show that our proposed method can be applied to any deep image SR model and solved very efficiently with CPU only.en_US
dcterms.abstractFinally, we will propose an extremely low-latency online video SR model for online applications in Chapter 6. A novel kernel knowledge transfer method is proposed to enhance a simple, lightweight video SR model, in terms of the distortion quality and latency. Experiments show that our proposed method can achieve remarkable performance, compared with other promising methods. In addition, we will show that our proposed method can process the video sequence at a rate of up to 110 FPS, leading to the best trade-off between the performance and model complexity.en_US
dcterms.extent193 pages : color illustrationsen_US
dcterms.isPartOfPolyU Electronic Thesesen_US
dcterms.issued2022en_US
dcterms.educationalLevelPh.D.en_US
dcterms.educationalLevelAll Doctorateen_US
dcterms.LCSHMachine learningen_US
dcterms.LCSHImage processing -- Digital techniquesen_US
dcterms.LCSHHigh resolution imagingen_US
dcterms.LCSHHong Kong Polytechnic University -- Dissertationsen_US
dcterms.accessRightsopen accessen_US

Files in This Item:
File Description SizeFormat 
6472.pdfFor All Users70.6 MBAdobe PDFView/Open


Copyright Undertaking

As a bona fide Library user, I declare that:

  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show simple item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/12023