Author: Cheng, Ming Fung
Title: GPU accelerated hot term extraction from user generated content
Degree: M.Phil.
Year: 2012
Subject: Internet searching.
Information retrieval.
Information storage and retrieval systems -- Mathematical models.
Graphics processing units.
Hong Kong Polytechnic University -- Dissertations
Department: Department of Computing
Pages: 102 leaves : ill. (some col.) ; 30 cm.
Language: English
Abstract: This thesis aims at developing and investigating an efficient approach to hot term extraction. In the Web 2.0, the user generated content (UGC) is increased dramatically in different Consumer Generated Media (CGM) such as forums and blogs. People easily search their knowledge and opinions in CGM as well as generate Word Of Mouth (WOM) in different online channels. Facing the huge amount of data, it is not easy to find the useful information even using a search engine. Having a good hot term extraction algorithm can reveal hidden information to users and also provide an indicator in the search results, so that users can easily know which terms are popular in the search results. In this thesis, a GPU based hot term extraction algorithm is presented. Graphics Processing Units (GPUs) is designed for data-parallel computations. Comparing to running a single program with multiple data in CPU, GPU can have faster execution. The hot term is defined as a word that appears frequently in the search result. We assume that the greater the frequency of appearance of a term, the more the relevancy of the term to the users. As there are lots of terms in the searched results, processing them is time-consuming. The proposed GPU based hot term extraction algorithm can achieve a fast performance and works well in real-time applications.
Rights: All rights reserved
Access: open access

Files in This Item:
File Description SizeFormat 
b25301500.pdfFor All Users4.45 MBAdobe PDFView/Open


Copyright Undertaking

As a bona fide Library user, I declare that:

  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show full item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/6745