Author: Chen, Chi-ming
Title: Document ranking for VBC signature-based information retrieval system
Degree: M.Sc.
Year: 1999
Subject: Information storage and retrieval systems; Electronic data processing; Office practice -- Automation; Hong Kong Polytechnic University -- Dissertations
Department: Multi-disciplinary Studies; Dept. of Computing
Pages: v, 130 leaves : ill. ; 30 cm
Language: English
Abstract: The objective of this project is to develop an English-Chinese information retrieval system with document ranking that is suitable for office use. The Variable Bit-Block Compression (VBC) method will be modified to generate the signature file. The signature approach is suitable for office use because new documents are created frequently but seldom deleted. Building on an existing information retrieval system that uses the VBC method, the aims of this project are: a. To modify and extend the VBC method. The standard VBC method cannot store the number of occurrences of a term in a document (i.e. the term frequency). Two modifications to the VBC method are proposed and will be evaluated to determine which produces a smaller signature file and gives better performance. b. To assign term frequencies to the signature file by scaling down the larger initial values with linear scale-down and log scale-down methods, and to evaluate which gives a more accurate representation of the original value. c. To present query results to users with documents displayed in decreasing order of relevance, and to evaluate the relevance of the documents to the users' suggested query strings. d. To improve the performance of the existing VBC information retrieval system through various means.
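Aims b and c can be illustrated with a minimal sketch. The abstract does not give the thesis's formulas, so everything below is an assumption made for illustration: the function names, the `levels` parameter (the number of distinct codes available for a stored term frequency), the rounding rule, and the scoring rule (sum of the stored codes for the query terms) are all hypothetical and are not taken from the dissertation.

```python
import math


def linear_scale_down(tf, max_tf, levels):
    """Map a raw term frequency in [1, max_tf] to a code in [1, levels], linearly.

    Hypothetical formula: the thesis's actual scaling rule is not given in the abstract.
    """
    if max_tf <= 1 or levels <= 1:
        return 1
    return 1 + round((tf - 1) * (levels - 1) / (max_tf - 1))


def log_scale_down(tf, max_tf, levels):
    """Map a raw term frequency in [1, max_tf] to a code in [1, levels], logarithmically.

    Hypothetical formula: low frequencies get finer resolution than high ones.
    """
    if max_tf <= 1 or levels <= 1:
        return 1
    return 1 + round(math.log(tf) * (levels - 1) / math.log(max_tf))


def rank_documents(doc_codes, query_terms):
    """Return document ids in decreasing order of an assumed relevance score.

    doc_codes maps a document id to {term: stored code}. The score is the sum
    of the stored codes for the query terms; ties are broken by document id.
    """
    scores = {
        doc_id: sum(codes.get(term, 0) for term in query_terms)
        for doc_id, codes in doc_codes.items()
    }
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


if __name__ == "__main__":
    max_tf, levels = 100, 4  # assumed: frequencies up to 100, four available codes
    for tf in (1, 5, 40, 100):
        print(tf, linear_scale_down(tf, max_tf, levels), log_scale_down(tf, max_tf, levels))

    docs = {
        "d1": {"office": 3, "file": 1},
        "d2": {"office": 1},
        "d3": {"office": 2, "file": 2},
    }
    print(rank_documents(docs, ["office", "file"]))
```

Under these assumptions the linear mapping places frequencies 1 and 5 in the same code, while the log mapping separates them. That is one way the two methods can differ in how accurately they represent the original value.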

Files in this item

Files Size Format
b14792837.pdf 3.660Mb PDF
