In memory data management : from hardware to application

Tang, Bo

Full metadata record

DC Field	Value	Language
dc.contributor	Department of Computing	en_US
dc.contributor.advisor	Yiu, Man Lung (COMP)	-
dc.creator	Tang, Bo	-
dc.identifier.uri	https://theses.lib.polyu.edu.hk/handle/200/8986	-
dc.language	English	en_US
dc.publisher	Hong Kong Polytechnic University	-
dc.rights	All rights reserved	en_US
dc.title	In memory data management : from hardware to application	en_US
dcterms.abstract	With the availability of very large and inexpensive main memory, it is becoming practical to manage data in main memory and benefit from high-speed access. For instance, in-memory database management systems (e.g., SAP HANA and Oracle TimesTen) provide much higher performance than disk-oriented database management systems for relational data. In this thesis, we identify and address some unsolved issues in in-memory data management, from hardware to applications. First, we exploit the hardware aspect (e.g., CPU and memory) to accelerate distance computations (on data points), which are core subroutines in many applications, e.g., trajectory search, motif discovery and kNN classification. This involves two research problems: (1) how to exploit every CPU cycle for computation, and (2) how to exploit every bit of main memory for caching data points. Our work is orthogonal to existing pruning techniques and index structures on data points. Regarding (1), we unlock the potential of modern commodity CPUs (i.e., data parallelism, CPU caches, branch prediction). Regarding (2), we propose to cache compact approximate representations of data points in main memory in order to reduce the candidate refinement time in existing kNN search methods. For each research problem above, we evaluate the performance of our solutions on real datasets and show that our solutions are effective and scalable. Next, we focus on the application aspect and consider in-memory OLAP tools, which have been extensively used by enterprises to make better and faster decisions. Specifically, we make the first attempt towards automatically extracting top-k insights from in-memory OLAP cubes. This is useful for non-expert users and also reduces the manual effort of data analysts. It poses challenges in (i) the effectiveness of the extracted insights and (ii) the efficiency of top-k insight computation for in-memory data warehouses. We first propose a meaningful scoring function for insights to address (i). Then, we contribute a computation framework for top-k insights, together with a suite of computation optimization techniques to address (ii). Our experimental study on both real data and synthetic data verifies the effectiveness and efficiency of our proposed solution.	en_US
dcterms.extent	xxii, 174 pages : color illustrations	en_US
dcterms.isPartOf	PolyU Electronic Theses	en_US
dcterms.issued	2017	en_US
dcterms.educationalLevel	Ph.D.	en_US
dcterms.educationalLevel	All Doctorate	en_US
dcterms.LCSH	Hong Kong Polytechnic University -- Dissertations	en_US
dcterms.LCSH	Memory management (Computer science)	en_US
dcterms.LCSH	Database management	en_US
dcterms.accessRights	open access	en_US

Files in This Item:

File	Description	Size	Format
991021952843103411.pdf	For All Users	6.9 MB	Adobe PDF

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/8986