START : a system for flexible analysis of hundreds of genomic signal tracks in few lines of SQL-like queries

Pao Yue-kong Library Electronic Theses Database

START : a system for flexible analysis of hundreds of genomic signal tracks in few lines of SQL-like queries

 

Author: Zhang, Qiang
Title: START : a system for flexible analysis of hundreds of genomic signal tracks in few lines of SQL-like queries
Degree: Ph.D.
Year: 2016
Subject: Genomics -- Data processing
Cellular signal transduction.
Genetic regulation.
Hong Kong Polytechnic University -- Dissertations
Department: Dept. of Computing
Pages: xvi, 128 pages : color illustrations
Language: English
InnoPac Record: http://library.polyu.edu.hk/record=b2925598
URI: http://theses.lib.polyu.edu.hk/handle/200/8753
Abstract: A genomic signal track is a set of genomic intervals associated with values of various types, such as measurements from high-throughput experiments. Analysis of signal tracks requires complex computational methods, which often make the analysts focus too much on the detailed computational steps rather than on their biological questions. This thesis presents Signal Track Analytical Research Tool (START) and Signal Track Query Language (STQL) for easy analysis of signal tracks. STQL is an SQL-like declarative language, which means one only specifies what computations need to be done but not how these computations are to be carried out. STQL provides a rich set of constructs for manipulating genomic intervals and their values. To run STQL queries, we have developed the Signal Track Analytical Research Tool (START), a MapReduce-based system that includes a Web-based user interface and a back-end execution system. By running some typical analyses tasks, we show that the START+STQL solution is usually the simplest, and the parallel execution achieves significant speed-up with large data files.

Files in this item

Files Size Format
b29255983.pdf 992.1Kb PDF
Copyright Undertaking
As a bona fide Library user, I declare that:
  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.
By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

     

Quick Search

Browse

More Information