政大機構典藏-National Chengchi University Institutional Repository(NCCUR):Item 140.119/78751

English | 正體中文 | 简体中文 | Post-Print筆數 : 27 | Items with full text/Total items : 111300/142216 (78%)
Visitors : 48305027 Online Users : 431

RC Version 6.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.

Scope

please add "double quotation mark" for query phrases to get precise results

please goto advance search for comprehansive author search

Adv. Search

Home ‧ Login ‧ Upload ‧ Help ‧ About ‧ Administer

Goto mobile version

政大機構典藏 > 理學院 > 應用數學系 > 學位論文 > Item 140.119/78751

Please use this identifier to cite or link to this item: https://nccur.lib.nccu.edu.tw/handle/140.119/78751

Title:	小波理論於曲風辨識上之應用 The Application of Wavelet Transform on Automatically Musical Genre Classification
Authors:	陳彥名
Contributors:	曾正男陳彥名
Keywords:	小波轉換線性判別分析離散餘弦轉換決策樹曲風辨識
Date:	2015
Issue Date:	2015-10-01 14:17:32 (UTC+8)
Abstract:	隨著科技的進步，網際網路已充斥在我們的生活之中。音樂也不再以硬體儲存的方式流傳（例如ＣＤ、黑膠唱片），而是轉變為數位音樂的方式，透由網路平台散播。許多數位音樂串流服務平台網站也如雨後春筍般誕生，例如iTunes、Spotify、Musicovery。加上文化水平的提升，音樂已是現代人生活之中，不可或缺的一部分。世界上的音樂難以計數，如何將音樂分門別類做好管理乃為現代商業應用的一個重要課題。因此，音樂曲風自動化辨識的技術確實為一個實用且難以迴避的課題。過去在曲風自動化辨識已有許多研究，但內容不外乎音訊處理、頻譜轉換、特徵擷取、特徵降維、監督式學習機。在相同的模式下提出各種改良，或是全新的特徵擷取…諸如此類，而辨識率也達到了七成以上。本篇論文採用不同於以往的做法，將訊號進行頻譜轉換後層層降維，所得之訊號搭配LDA與決策樹進行辨識，最後去比較與分析離散餘弦轉換與小波轉換在辨識率上的優劣。我們發現搭配小波轉換與混合LDA及決策樹的方法，可以將音樂曲風之分辨率達到八成五以上。目錄口試委員會審定書.................................................................................................................. i 致謝.......................................................................................................................................... ii 中文摘要.................................................................................................................................. iii Abstract .................................................................................................................................... iv 目錄.......................................................................................................................................... vi 表目錄...................................................................................................................................... viii 圖目錄...................................................................................................................................... ix 第一章緒論....................................................................................................................... 1 第一節研究背景與動機..................................................................................... 1 第二節研究目的................................................................................................. 2 第三節研究架構................................................................................................. 3 第二章文獻探討.............................................................................................................. 4 第一節前言......................................................................................................... 4 第二節預處理..................................................................................................... 5 第三節音樂特徵擷取......................................................................................... 7 一、梅爾倒頻譜係數（Mel Frequency Cepstral Coefficients, MFCC） ................................................................................................................. 8 二、雷尼熵值（Renyi Entropy, RE） .................................................. 9 三、頻譜質心（Spectral Centroid, SC）.............................................. 9 四、強度與音色（Intensity and Timbre） ........................................... 9 第四節建置分類器............................................................................................. 11 一、支持向量機（Support Vector Machines, SVM） ......................... 11 二、最近鄰居法（k-Nearest Neighbors algorithm, k-NN）................ 12 三、高斯混合模型（Gaussian Mixture Models, GMM）................... 13 第三章降維方法.............................................................................................................. 14 第一節小樣本分析............................................................................................. 14 第二節音訊分析與k-means 演算法................................................................. 16 第三節頻譜與降維............................................................................................. 17 第四節線性判別分析......................................................................................... 21 一、監督式維度縮減（Supervised Dimension Reduction）............... 21 二、LDA 公式推導............................................................................... 22 三、LDA 實驗結果............................................................................... 28 第四章實驗方法.............................................................................................................. 30 第一節挑選實驗音樂樣本................................................................................. 31 第二節音訊處理................................................................................................. 33 第三節維度縮減................................................................................................. 34 第四節隨機七三分配......................................................................................... 34 第五節線性判別分析之降維與預測................................................................. 35 第六節離散小波轉換......................................................................................... 36 第七節系統決策樹............................................................................................. 42 第八節混合系統................................................................................................. 44 一、Classical - Classical ........................................................................ 45 二、Classical - Electron ......................................................................... 45 三、Classical - Rock .............................................................................. 45 四、Classical - Pop................................................................................. 46 五、Classical - Vocal Pop ...................................................................... 46 六、其餘情境......................................................................................... 48 第九節混合系統的最終決策............................................................................. 50 第五章結論與未來展望............................................................................................... 54 參考文獻.................................................................................................................................. 55
Reference:	[1] D.Pye Content-Based Methods for the Management of Digital Music in Proc IEEE Conf. Acoustics, Speech, Signal Processing (ICASSP), pp. 2437-2400, 2000 [2] Dhanalakshmi, P. , Palanivel, S. , Ramalingam, V. Classification of audio signals using SVM and RBFNN. Expert Systems with Applications, 36(3), 6069-6075. doi: DOI 10.1016/j.eswa.2008.06.126, 2009 [3] Xu, C. S. , Maddage, N. C. , Shao, X. Automatic music classification and summarization. Ieee Transactions on Speech and Audio Processing, 13(3), 441-450. doi: Doi 10.1109/Tsa.2004.840939, 2005 [4] Ajmera, J. , McCowan, I. , Bourlard, H. Speech/music segmentation using entropy and dynamism features in a HMM classification framework. Speech Communication, 40(3), 351-363. doi: Doi 10.1016/S0167-6393(02)00087-0, 2003 [5] Shao, B. , Wang, D. D. , Li, T. , Ogihara, M. Music Recommendation Based on Acoustic Features and User Access Patterns. Ieee Transactions on Audio Speech and Language Processing, 17(8), 1602-1611. doi: Doi 10.1109/Tasl.2009.2020893, 2009 109(9):553--572, 1938. [6] Grey, J. M., Gordon, J. W. Perceptual effects of spectral modifications on musical timbres. Journal of the Acoustical Society of America 63 (5), 1493–1500, doi:10.1121/1.381843, 1978. [7] Duo-Fu Bao Supervised and Unsupervised Music Genre Classification NTUT Institute of Computer and Communication, 2008 [8] Shih-kai Chen Methodology of stage lighting control based on music emotion feeling. Department of Industrial Design, National Cheng Kung University 2015 [9] Cortes, C. Vapnik, V. Support-vector networks. Machine Learning 20 (3): 273. doi:10.1007/BF00994018, 1995 [10] Drucker, Harris; Burges, Christopher J. C.; Kaufman, Linda; Smola, Alexander J.; and Vapnik, Vladimir N. Support Vector Regression Machines. in Advances in Neural Information Processing Systems 9, NIPS 1996, 155–161, MIT Press. 1997 [11] Altman, N. S. An introduction to kernel and nearest-neighbor nonparametric regression. he American Statistician 46 (3): 175–185. doi:10.1080/00031305.1992.10475879. 1992 [12] Lie, L. , Liu, D. , Zhang, H. J. Automatic mood detection and tracking of music audio signals. Ieee Transactions on Audio Speech and Language Processing,9014(1), 5-18. doi: Doi 10.1109/Tsa.2005.860344. 2006 [13] Baum, L. E.; Petrie, T. Statistical Inference for Probabilistic Functions of Finite State Markov Chains. The Annals of Mathematical Statistics 37 (6): 1554–1563. doi:10.1214/aoms/1177699147. Retrieved 28 November 2011. 1966 [14] Nock, R. and Nielsen, F. On Weighting Clustering. IEEE Trans. on Pattern Analysis and Machine Intelligence, 28 (8), 1–13, 2006 [15] Steinhaus, H. Sur la division des corps matériels en parties. Bull. Acad. Polon. Sci. 4 (12): 801–804. MR 0090073. Zbl 0079.16403 (French). 1957 [16] MacQueen, J. B. Some Methods for classification and Analysis of Multivariate Observations. 1, Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability. University of California Press. 1967: pp. 281–297. 2009 [17] Lloyd, S. P. Least square quantization in PCM. ell Telephone Laboratories Paper. 1957. Published in journal much later: Lloyd., S. P. Least squares quantization in PCM. IEEE Transactions on Information Theory. 1982, 28 (2): 129–137. doi:10.1109/TIT.1982.1056489. 2009 [18] Jake Vanderplas Comparison of Manifold Learning methods http://scikit-learn.org/stable/auto\\_examples/manifold/plot\\_compare\\_methods.html [19] E-Course of NUTH https://ecourse.nutn.edu.tw/ [20] 曾正男一套提升凌波函數逼近能力與平滑度的方法國立中央大學民國85年 [21] Decision Trees Analysis http://www.mindtools.com/dectree.html? [22] 格式工廠 http://www.azofreeware.com/2008/10/formatfactory-155.html
Description:	碩士國立政治大學應用數學研究所 101751014
Source URI:	http://thesis.lib.nccu.edu.tw/record/#G0101751014
Data Type:	thesis
Appears in Collections:	[應用數學系] 學位論文

Files in This Item:

File	Size	Format
index.html	0Kb	HTML2	283	View/Open

All items in 政大典藏 are protected by copyright, with all rights reserved.

社群 sharing

著作權政策宣告 Copyright Announcement

1.本網站之數位內容為國立政治大學所收錄之機構典藏，無償提供學術研究與公眾教育等公益性使用，惟仍請適度，合理使用本網站之內容，以尊重著作權人之權益。商業上之利用，則請先取得著作權人之授權。
The digital content of this website is part of National Chengchi University Institutional Repository. It provides free access to academic research and public education for non-commercial use. Please utilize it in a proper and reasonable manner and respect the rights of copyright owners. For commercial use, please obtain authorization from the copyright owner in advance.

2.本網站之製作，已盡力防止侵害著作權人之權益，如仍發現本網站之數位內容有侵害著作權人權益情事者，請權利人通知本網站維護人員(nccur@nccu.edu.tw)，維護人員將立即採取移除該數位著作等補救措施。
NCCU Institutional Repository is made to protect the interests of copyright owners. If you believe that any material on the website infringes copyright, please contact our staff(nccur@nccu.edu.tw). We will remove the work from the repository and investigate your claim.

DSpace Software Copyright © 2002-2004 MIT & Hewlett-Packard / Enhanced by NTU Library IR team Copyright © - Feedback