GPU accelerated hot term extraction from user generated content

Pao Yue-kong Library Electronic Theses Database

GPU accelerated hot term extraction from user generated content

 

Author: Cheng, Ming Fung
Title: GPU accelerated hot term extraction from user generated content
Degree: M.Phil.
Year: 2012
Subject: Internet searching.
Information retrieval.
Information storage and retrieval systems -- Mathematical models.
Graphics processing units.
Hong Kong Polytechnic University -- Dissertations
Department: Dept. of Computing
Pages: 102 leaves : ill. (some col.) ; 30 cm.
Language: English
InnoPac Record: http://library.polyu.edu.hk/record=b2530150
URI: http://theses.lib.polyu.edu.hk/handle/200/6745
Abstract: This thesis aims at developing and investigating an efficient approach to hot term extraction. In the Web 2.0, the user generated content (UGC) is increased dramatically in different Consumer Generated Media (CGM) such as forums and blogs. People easily search their knowledge and opinions in CGM as well as generate Word Of Mouth (WOM) in different online channels. Facing the huge amount of data, it is not easy to find the useful information even using a search engine. Having a good hot term extraction algorithm can reveal hidden information to users and also provide an indicator in the search results, so that users can easily know which terms are popular in the search results. In this thesis, a GPU based hot term extraction algorithm is presented. Graphics Processing Units (GPUs) is designed for data-parallel computations. Comparing to running a single program with multiple data in CPU, GPU can have faster execution. The hot term is defined as a word that appears frequently in the search result. We assume that the greater the frequency of appearance of a term, the more the relevancy of the term to the users. As there are lots of terms in the searched results, processing them is time-consuming. The proposed GPU based hot term extraction algorithm can achieve a fast performance and works well in real-time applications.

Files in this item

Files Size Format
b25301500.pdf 4.558Mb PDF
Copyright Undertaking
As a bona fide Library user, I declare that:
  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.
By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

     

Quick Search

Browse

More Information