Full metadata record
DC FieldValueLanguage
dc.contributorDepartment of Logistics and Maritime Studiesen_US
dc.contributor.advisorWang, Shuaian Hans (LMS)en_US
dc.creatorYan, Ran-
dc.identifier.urihttps://theses.lib.polyu.edu.hk/handle/200/11666-
dc.languageEnglishen_US
dc.publisherHong Kong Polytechnic Universityen_US
dc.rightsAll rights reserveden_US
dc.titleData analytics for improving shipping efficiency : models, methods, and applicationsen_US
dcterms.abstractThis thesis aims to improve the effectiveness and efficiency of port state control (PSC) inspection, which is one of the most important international shipping policies, from the aspects of ship risk prediction and PSC inspector assignment and scheduling using data analytics and operations research models. In addition to a comprehensive summary and review of ship selection methods applied at ports over the world and proposed in existing literature, this thesis comprises three studies. In the first study, ship deficiency number, which is a ship risk indicator in the PSC inspection, is predicted using a state-of-the-art XGBoost model. The XGBoost model takes shipping domain knowledge regarding ship flag, recognized organization, and company performances into account to improve model accuracy and fairness. Based on the predictions, a PSC inspector scheduling model is proposed to help the ports optimally allocate inspection resources. According to the model structure, the concepts of inspection template and un-dominated inspection template are further proposed and incorporated in the optimization model to improve computation efficiency and model flexibility.en_US
dcterms.abstractIn the second study, three two-step approaches that match the inspection resources with the ships' deficiency conditions are proposed, aimed at identifying the most deficiencies of them. The three approaches combine prediction models with optimization models, and the optimization models are equivalent in all the approaches while the prediction models differ from each other regarding their prediction targets or structure. Specifically, the first approach predicts the number of deficiencies in each deficiency category for each ship and then develops an integer optimization model that assigns the inspectors to the ships to be inspected. The second approach predicts the number of deficiencies each inspector can identify for each ship and then applies an integer optimization model to assign the inspectors to the ships to be inspected. The third approach is a semi-"smart predict then optimize" (semi-SPO) method. It also predicts the number of deficiencies each inspector can identify for each ship and uses the same integer optimization model as the second approach. However, instead of minimizing the mean squared error as in the second approach, it adopts a loss function motivated by the structure of the optimization problem in the second approach. The prediction results are then input to PSC officer (PSCO) assignment models such that the PSCOs' expertise and the ships' deficiency conditions can be matched, and the inspection efficiency can be improved.en_US
dcterms.abstractIn the third study, a data-driven ship risk prediction framework using features the same as the current ship selection scheme is developed for high-risk ship identification and selection based on gradient boosting regression tree (GBRT). Like existing ship risk prediction models, the proposed framework is of black-box nature whose decision process and working mechanism are opaque. To improve model explainability, the explanation of the prediction of individual ships by the Shapley additive explanations (SHAP) method with the properties of local accuracy and consistency is provided. Furthermore, the local SHAP method is innovatively extended to a fully explainable near linear-form global surrogate model of the original black-box data-driven model by deriving feature coefficients and fitting curves of feature values and SHAP values. This demonstrates that the behaviour of black-box data-driven models can be as interpretable as white-box models while retaining their prediction accuracy.en_US
dcterms.extent13, 140 pages : color illustrationsen_US
dcterms.isPartOfPolyU Electronic Thesesen_US
dcterms.issued2022en_US
dcterms.educationalLevelPh.D.en_US
dcterms.educationalLevelAll Doctorateen_US
dcterms.LCSHShipping -- Data processingen_US
dcterms.LCSHShips -- Inspectionen_US
dcterms.LCSHScheduling -- Data processingen_US
dcterms.LCSHShipping -- Mathematical modelsen_US
dcterms.LCSHHong Kong Polytechnic University -- Dissertationsen_US
dcterms.accessRightsopen accessen_US

Files in This Item:
File Description SizeFormat 
6212.pdfFor All Users2.95 MBAdobe PDFView/Open


Copyright Undertaking

As a bona fide Library user, I declare that:

  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show simple item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/11666