Author: Yang, Yu
Title: Feature representation for mining evolution patterns in dynamic data
Advisors: Cao, Jiannong (COMP)
Degree: Ph.D.
Year: 2021
Subject: Machine learning
Multivariate analysis -- Data processing
Hong Kong Polytechnic University -- Dissertations
Department: Department of Computing
Pages: xv, 135 pages : color illustrations
Language: English
Abstract: Feature representation is an encoding process that projects the raw data into a discriminative latent space so as to extract the characteristics, properties, attributes, and underlying patterns from the data and embed them as features for supporting effective machine learning. It is the heart of AI, powering it to develop and train intelligent algorithms by supplying the useful information and discriminative features extracted from large quantities of high-quality data.
In this thesis, I study an important yet overlooked problem of feature representation in dynamic data for capturing and embedding the evolution patterns, thus effectively facilitating the before-the-fact applications such as a prediction. Dynamic data refers to data that changes over time. It contains historical evolution patterns revealing how the data changed over time. Discovering and embedding these evolution patterns introduces additional and effective information to overcome data insufficiency issues in the before-the-fact applications, thereby leading to better performance.
Existing studies either is based on differential equations or employ data-driven methods. The former one suffers from the high sensitivity of noise and uncertainty, while the latter merely focus on synchronous and/or period evolution patterns, overlooking the complexity of dynamics. Therefore, both of them fail to achieve satisfactory prediction performance. I aim to capture and embed the evolution from data with complex dynamics. Multivariate, multi-timescale, and asynchronous dynamics arise naturally in the world. Although these dynamics are very difficult to be fully captured due to the high stochastic and uncertainty, discovering the evolution patterns from them and embedding into the representation can effectively facilitate a prediction.
To tackle the challenge of multi-variables, I devised a time-capturing dynamic graph embedding algorithm to learn the synchronous linkage evolution from the dynamic connection changes of every vertex over time. To deal with the challenge of synchronization, I propose a time-aware dynamic graph embedding algorithm to fully capture and embed the asynchronous structural evolutions in which the connections of vertices evolve at different times with variant evolution speed. Extensive experiments show that both algorithms achieved significant performance improvement over the state-of-the-art baselines in various graph mining applications. Lastly, a multi-timescale bag-of-regularity method is devised to extract students' learning regularity patterns from their multi-timescale dynamic learning behaviors, thereby achieving impressively high accuracy in early predicting academic at-risk students. I believe this thesis can serve as a solid step towards advanced knowledge discovery and representation in dynamic data.
Rights: All rights reserved
Access: open access

Files in This Item:
File Description SizeFormat 
6288.pdfFor All Users3.43 MBAdobe PDFView/Open


Copyright Undertaking

As a bona fide Library user, I declare that:

  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show full item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/11771