Improving Audio Fingerprinting for Music Retrieval

碩士 === 國立清華大學 === 資訊工程學系 === 101 === The goal of this research is to improve the current audio fingerprinting technique. Audio fingerprinting is a fast and convenient music retrieval method that allows a user to retrieve an intended song and related information by recording a portion of the song und...

Full description

Bibliographic Details
Main Authors:	Liao, Pei-Yu, 廖珮妤
Other Authors:	張智星
Format:	Others
Language:	zh-TW
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/28839860469970485880

id	ndltd-TW-101NTHU5392105
record_format	oai_dc
spelling	ndltd-TW-101NTHU53921052015-10-13T22:29:58Z http://ndltd.ncl.edu.tw/handle/28839860469970485880 Improving Audio Fingerprinting for Music Retrieval 用於音樂檢索的聲紋辨識改良 Liao, Pei-Yu 廖珮妤碩士國立清華大學資訊工程學系 101 The goal of this research is to improve the current audio fingerprinting technique. Audio fingerprinting is a fast and convenient music retrieval method that allows a user to retrieve an intended song and related information by recording a portion of the song under a noisy environment. In order to improve the recognition rate of our system, we classify the queried segment into one of the two classes: easy or difficult to find the intended song. The recognition mechanism is as follows. Before the queried segment is recognized, we adopt SVM as our classifier to classify the queried segment. Depending on its class, we conduct 4 or 8 times of landmark finding on this query and then perform the matching step as usual. The recognition rate by using our method is 84.18%, which is close to 84.28% by using 8 times of landmarks finding, and the matching time is also reduced by 2% of the time required by using 8 times of landmarks finding. In addition, we employ a verification mechanism using confidence measure in our audio fingerprinting system to determine if the query is in our database or not. If the confidence result is lower than a certain threshold, our system rejects this query. When we set the matched landmark count per second as 1.5, we can filter about 86% of queried segments which are not in our database. At last, if the matched landmark count of the user-defined duration of the queried segment is greater than the confidence threshold, our system returns the result directly. Otherwise, the system extends the duration of the queried segment for searching and matching. Therefore, we divide a query into two parts with equal length to conduct the experiment. To solve the problem of missing landmarks on edge between two parts, we overlap 15 frames towards the front for the second part of the query segment. And we also find the landmarks forwards only. This effectively solves the problem of finding duplicate landmarks of the former segment when finding landmarks bidirectionally. Comparing to the original method, this method achieves a 21% reduction in response time and a 2% improvement in recognition rate. Keywords: music retrieval, audio fingerprinting, landmark, SVM, confidence measure, segmental music query 張智星張俊盛 2013 學位論文 ; thesis 71 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立清華大學 === 資訊工程學系 === 101 === The goal of this research is to improve the current audio fingerprinting technique. Audio fingerprinting is a fast and convenient music retrieval method that allows a user to retrieve an intended song and related information by recording a portion of the song under a noisy environment. In order to improve the recognition rate of our system, we classify the queried segment into one of the two classes: easy or difficult to find the intended song. The recognition mechanism is as follows. Before the queried segment is recognized, we adopt SVM as our classifier to classify the queried segment. Depending on its class, we conduct 4 or 8 times of landmark finding on this query and then perform the matching step as usual. The recognition rate by using our method is 84.18%, which is close to 84.28% by using 8 times of landmarks finding, and the matching time is also reduced by 2% of the time required by using 8 times of landmarks finding. In addition, we employ a verification mechanism using confidence measure in our audio fingerprinting system to determine if the query is in our database or not. If the confidence result is lower than a certain threshold, our system rejects this query. When we set the matched landmark count per second as 1.5, we can filter about 86% of queried segments which are not in our database. At last, if the matched landmark count of the user-defined duration of the queried segment is greater than the confidence threshold, our system returns the result directly. Otherwise, the system extends the duration of the queried segment for searching and matching. Therefore, we divide a query into two parts with equal length to conduct the experiment. To solve the problem of missing landmarks on edge between two parts, we overlap 15 frames towards the front for the second part of the query segment. And we also find the landmarks forwards only. This effectively solves the problem of finding duplicate landmarks of the former segment when finding landmarks bidirectionally. Comparing to the original method, this method achieves a 21% reduction in response time and a 2% improvement in recognition rate. Keywords: music retrieval, audio fingerprinting, landmark, SVM, confidence measure, segmental music query
author2	張智星
author_facet	張智星 Liao, Pei-Yu 廖珮妤
author	Liao, Pei-Yu 廖珮妤
spellingShingle	Liao, Pei-Yu 廖珮妤 Improving Audio Fingerprinting for Music Retrieval
author_sort	Liao, Pei-Yu
title	Improving Audio Fingerprinting for Music Retrieval
title_short	Improving Audio Fingerprinting for Music Retrieval
title_full	Improving Audio Fingerprinting for Music Retrieval
title_fullStr	Improving Audio Fingerprinting for Music Retrieval
title_full_unstemmed	Improving Audio Fingerprinting for Music Retrieval
title_sort	improving audio fingerprinting for music retrieval
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/28839860469970485880
work_keys_str_mv	AT liaopeiyu improvingaudiofingerprintingformusicretrieval AT liàopèiyú improvingaudiofingerprintingformusicretrieval AT liaopeiyu yòngyúyīnlèjiǎnsuǒdeshēngwénbiànshígǎiliáng AT liàopèiyú yòngyúyīnlèjiǎnsuǒdeshēngwénbiànshígǎiliáng
_version_	1718077406466539520

Improving Audio Fingerprinting for Music Retrieval

Similar Items