Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Computing | en_US |
dc.creator | Zhang, Renxian | - |
dc.identifier.uri | https://theses.lib.polyu.edu.hk/handle/200/7142 | - |
dc.language | English | en_US |
dc.publisher | Hong Kong Polytechnic University | - |
dc.rights | All rights reserved | en_US |
dc.title | Coherence-targeted text summarization | en_US |
dcterms.abstract | For readers, coherence is no less important than informativeness for a summary. This paper is aimed to improve coherence in automatic text summaries by developing coherence models and related techniques. Different from most other efforts to improve summary coherence, my work treats coherence as an analyzable concept with multi-faceted and multi-disciplinary backgrounds. Specifically, I have explored the technical details of three kinds of coherence - shallow content-driven coherence, deep content-driven coherence, and cognitive model-driven coherence. Shallow content consists of words, phrases, sentences, and discourse units and their literal connections or co-occurrence patterns give rise to coherence. Experiments on single-document as well as multi-document news summarization show that coherence driven by words, entities, sentences, and events can help to better arrange selected summary sentences. Deep content is observed on a macro-text level, which is instantiated by news aspects and speech acts. Focusing on the relations among deep content units, I have applied coherence to both selecting and ordering summary sentences. Relying on human cognitive tendencies, cognitive model-driven coherence is understood as a necessary mechanism in text comprehension. The computational modeling of such coherence, coupled with proposition-level extractive summarization, works successfully for narrative text. To model coherence of different kinds, I have developed novel techniques that are suitable for different genres of text, including newswire, social media messages, and fairy tales. The extensive experimental results on benchmark or self-compiled datasets have validated the efficacy and robustness of the techniques in various circumstances. Among many of its contributions to the summarization community, my work shows that contrary to what is commonly held, coherence plays a pivotal, instead of ancillary, role in automatic summarization. As one of the few large-scale studies of coherence in summarization, my work is expected to herald a complete theory of coherence and more in-depth studies in coherence-targeted text summarization. | en_US |
dcterms.extent | xxi, 257 p. : ill. ; 30 cm. | en_US |
dcterms.isPartOf | PolyU Electronic Theses | en_US |
dcterms.issued | 2013 | en_US |
dcterms.educationalLevel | All Doctorate | en_US |
dcterms.educationalLevel | Ph.D. | en_US |
dcterms.LCSH | Computational linguistics | en_US |
dcterms.LCSH | Hong Kong Polytechnic University -- Dissertations | en_US |
dcterms.accessRights | open access | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
b26390814.pdf | For All Users | 3.66 MB | Adobe PDF | View/Open |
Copyright Undertaking
As a bona fide Library user, I declare that:
- I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
- I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
- I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.
By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.
Please use this identifier to cite or link to this item:
https://theses.lib.polyu.edu.hk/handle/200/7142