Health care predictive analytics using artificial intelligence techniques

Wang, Guanjin

Full metadata record

DC Field	Value	Language
dc.contributor	School of Nursing	en_US
dc.contributor.advisor	Choi, Kup Sze (SN)	-
dc.creator	Wang, Guanjin	-
dc.identifier.uri	https://theses.lib.polyu.edu.hk/handle/200/9647	-
dc.language	English	en_US
dc.publisher	Hong Kong Polytechnic University	-
dc.rights	All rights reserved	en_US
dc.title	Health care predictive analytics using artificial intelligence techniques	en_US
dcterms.abstract	In recent years, advances in Artifcial Intelligence (AI) are opening the door for intelligent health care data prediction and decision making. Machine learning, as an increasingly popular approach to AI, has been widely used to learn directly from data, adapt independently, and produce predictive outcomes, which support doctors when encountering complex health care predictive analytics. However, traditional machine learning methods are not always perfectly working in the health feld, intrinsically due to little consideration for characteristic problems within health care data. For example, the small sample size problem is common due to complex data collection procedures and privacy concerns. Missing data is also widely encountered since most data are collected as a second-product of patient-care activities instead of following systematic research protocols. The class imbalance is another inevitable problem in the medical data as the normal class usually predominates over the disease class. To solve aforementioned issues in health care predictive analytics, this study stands on the principles of machine learning and transfer learning to develop five advanced prediction models. The frst model is an output-based transfer least squares support vector machines (LS-SVMs) model which can leverage knowledge learned from the existing prediction model to facilitate the learning process on the target domain with insuffcient data. This model overcomes the small sample size problem and improves the health care data prediction by learning knowledge from the other domain. The second model is a novel additive LS-SVMs model which can directly make predictions on missing data by simultaneously evaluating the influences on the classifcation error made by missing features. Moreover, this model can generate explanatory information for health professionals to improve the future data collection process. The third model is a transfer-based additive LS-SVMs model which can deal with missing data from a transfer learning perspective. It leverages the model knowledge learned from the complete portion of the dataset to help the learning process on the whole dataset with missing data. The proposed model can provide supplementary information for health professionals to improve the data quality via data cleaning. The forth model is a deep transfer additive LS-SVMs model called DTA-LS-SVMs and its imbalanced version called iDTA-LS-SVMs to enhance the prediction performance on the balanced and imblanced datasets. Enlightened by the deep architecture and transfer learning, the model stacks multiple additive LS-SVMs based modules layer-by-layer and embeds model transfer between adjacent modules to guarantee their consistency. The ffth model is a deep cross-output transfer LS-SVMs model called DCOT-LS-SVMs and its imbalanced version called IDCOT-LS-SVMs to improve the prediction performance on the balanced and imbalanced datasets. The cross-output transfer is used to transfer the knowledge of outcomes from the previous module to the adjacent higher layer to achieve a better learning. Moreover, modules' parameters can be randomly assigned in the proposed model which signifcantly simplifes the learning process. The proposed models are verifed using the public UCI datasets. Moreover, case studies are conducted to validate and integrate the proposed models with real world applications, including bladder cancer prognosis, prostate cancer diagnosis, and predictions of elderly quality of life (QOL). The experimental results have demonstrated that these models can enhance the prediction performances while taking the characteristic problems within health data into account, thus exhibiting potential to be widely used in the real world applications in future.	en_US
dcterms.extent	xix, 183 pages : color illustrations	en_US
dcterms.isPartOf	PolyU Electronic Theses	en_US
dcterms.issued	2018	en_US
dcterms.educationalLevel	Ph.D.	en_US
dcterms.educationalLevel	All Doctorate	en_US
dcterms.LCSH	Hong Kong Polytechnic University -- Dissertations	en_US
dcterms.LCSH	Health services administration -- Statistical methods	en_US
dcterms.LCSH	Health services administration -- Decision making	en_US
dcterms.accessRights	open access	en_US

Files in This Item:

File	Description	Size	Format
991022164559103411.pdf	For All Users	2.01 MB	Adobe PDF	View/Open

Copyright Undertaking

As a bona fide Library user, I declare that:

I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show simple item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/9647