Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Electronic Engineering | en_US |
dc.creator | Ng, Sze-pan | - |
dc.identifier.uri | https://theses.lib.polyu.edu.hk/handle/200/3805 | - |
dc.language | English | en_US |
dc.publisher | Hong Kong Polytechnic University | - |
dc.rights | All rights reserved | en_US |
dc.title | Computerized audio music recognition | en_US |
dcterms.abstract | Music offers a challenging array of recognition problems which arise in the context of entering musical data into a computer. As an art form, music is represented by the presence of many relationships that can treated mathematically, including pitch, loudness, tempo and timbre. There are also many nonmathematical elements such as tension, emotion or legato playing style which attribute to the difference between notated and played music. These elements combine to make music recognition a rich field of study. This report presents a monophonic, continuous piano music recognition system based on Time-Delay Neural Network (TDNN). The system is an alternative approach to traditional knowledge based systems which are designed to solve problems whose solutions have been made explicit. It employs TDNN to learn the characteristics of piano tones and performs continuous music recognition task based on the developed internal representations of the lower level acoustic cues. The system currently achieves 98 percents accuracy on the recognition of continuous tones from C4 to C5# (i.e. totally 14 tones), outperforming the traditional knowledge based recognition system using energy threshold or pitch detection segmentation method. TDNN is also found to be better in identifying the tone boundaries, especially when tones are embedded in background noise or overlapped owing to legato playing style. Another side product of the study is the developed algorithm to cater for the local minimum problem in error back propagation training. This algorithm solves the problem by dynamically adjusting the number of hidden units in the network and finally comes to an optimal configuration. | en_US |
dcterms.extent | 1 v. (various pagings) : ill. ; 30 cm | en_US |
dcterms.isPartOf | PolyU Electronic Theses | en_US |
dcterms.issued | 1994 | en_US |
dcterms.educationalLevel | All Master | en_US |
dcterms.educationalLevel | M.Sc. | en_US |
dcterms.LCSH | Musical notation -- Data processing | en_US |
dcterms.LCSH | Computer sound processing | en_US |
dcterms.LCSH | Hong Kong Polytechnic -- Dissertations | en_US |
dcterms.accessRights | restricted access | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
b11628479.pdf | For All Users (off-campus access for PolyU Staff & Students only) | 2.59 MB | Adobe PDF | View/Open |
Copyright Undertaking
As a bona fide Library user, I declare that:
- I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
- I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
- I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.
By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.
Please use this identifier to cite or link to this item:
https://theses.lib.polyu.edu.hk/handle/200/3805