Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | Multi-disciplinary Studies | en_US |
dc.contributor | Department of Computing | en_US |
dc.creator | Chan, Shui-kuen | - |
dc.identifier.uri | https://theses.lib.polyu.edu.hk/handle/200/448 | - |
dc.language | English | en_US |
dc.publisher | Hong Kong Polytechnic University | - |
dc.rights | All rights reserved | en_US |
dc.title | A signature-based information retrieval system for office use | en_US |
dcterms.abstract | Office documents are produced everyday. Manual filing and searching become labor intensive and slow. Information retrieval offers the timely access of a large set of documents. In Hong Kong, documents are usually written in English and Chinese. This dissertation aims to extend the use of variable bit-block compression signature method, which is good to install for office use, to index and to search English/Chinese documents. In our proposed system, signatures for each documents are generated through a batch process including three sub-processes. First, a file list containing all accessible text files and their relevant information is generated. Secondly, by scanning the file list, a library file containing all terms in all accessible text files is created. Thirdly, by scanning the file list and the library file, a signature file containing all signatures for all accessible text files is generated. After signature generation, office staff can retrieve their relevant documents by submitting a query in the web page. The query is passed to a signature retrieval program through the Common Gateway Interface (CGI) specification. The signature retrieval program scans through the signature file and returns relevant documents through another web page. Besides, queries can be written as boolean expression based on conjunction and disjunction. Based on a small test queries (~= 30 cases), the average recall and precision are 1 and 0.933 respectively. | en_US |
dcterms.extent | iv, 98 leaves : ill. ; 30 cm | en_US |
dcterms.isPartOf | PolyU Electronic Theses | en_US |
dcterms.issued | 1998 | en_US |
dcterms.educationalLevel | All Master | en_US |
dcterms.educationalLevel | M.Sc. | en_US |
dcterms.LCSH | Text processing (Computer science) | en_US |
dcterms.LCSH | Information storage and retrieval systems | en_US |
dcterms.LCSH | Chinese language -- Data processing | en_US |
dcterms.LCSH | Electronic data processing | en_US |
dcterms.LCSH | Office practice -- Automation | en_US |
dcterms.LCSH | Hong Kong Polytechnic University -- Dissertations | en_US |
dcterms.accessRights | restricted access | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
b14369436.pdf | For All Users (off-campus access for PolyU Staff & Students only) | 2.59 MB | Adobe PDF | View/Open |
Copyright Undertaking
As a bona fide Library user, I declare that:
- I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
- I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
- I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.
By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.
Please use this identifier to cite or link to this item:
https://theses.lib.polyu.edu.hk/handle/200/448