Author: Ji, Peng
Title: Multi-level structured output space domain adaptation for semantic segmentation
Degree: M.Sc.
Year: 2021
Subject: Machine learning
Computer algorithms
Hong Kong Polytechnic University -- Dissertations
Department: Department of Computing
Pages: 4, iv, 40 pages : color illustrations
Language: English
Abstract: Domain adaptation is essential for semantic image segmentation because manually labeling large data sets with pixel-level tags is costly and time-consuming. It is therefore of great interest to develop algorithms that make source-domain ground truth labels usable in the target domain. Because the source and target domains share spatial similarities in the structured outputs of semantic segmentation, we propose to employ adversarial learning in the output space so as to produce confident pseudo-labels that close the gap between the source and target domains. To further enhance the adapted model at different feature levels, we construct a multi-level adversarial network for efficient output space domain adaptation. In addition, in the scene segmentation task, similar features are related to each other regardless of the distance between them, and different semantic receptions are also interrelated. A better feature representation contributes to more accurate domain adaptation. To improve it further, we apply a self-attention mechanism that captures rich contextual dependencies before forwarding the input images to the segmentation network. In particular, we attach two types of attention modules to the extended convolutional neural network before the images are fed into the adaptation segmentation network. Moreover, we propose a simple yet effective pseudo-label selection strategy that generates trusted pseudo-labels for target instances, which bridges the distribution gap between the source and target domains. Extensive experiments and ablation studies are performed under various domain adaptation settings, including "GTA to Cityscapes" and "SYNTHIA to Cityscapes". The results show that the proposed method performs well compared to state-of-the-art methods in terms of precision and visual quality.
Rights: All rights reserved
Access: restricted access

Files in This Item:
File: 5820.pdf
Description: For All Users (off-campus access for PolyU Staff & Students only)
Size: 1.28 MB
Format: Adobe PDF


Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/11372