Journal of Multimedia, Vol 7, No 2 (2012), 145-158, Apr 2012
doi:10.4304/jmm.7.2.145-158

An Efficient Identification Methodology for Improved Access to Music Heritage Collections

Nicola Montecchio, Emanuele Di Buccio, Nicola Orio

Abstract


A comprehensive methodology for automatic music identification is presented. The main   application of the proposed approach is to provide tools to enrich and validate the   descriptors of recordings digitized by a sound archive institution. Experimentation has been carried out on three different datasets, including a collection   of digitized vinyl discs, although the methodology is not linked to a particular   recording carrier.  Automatic identification allows a music digital library to retrieve metadata about music works even if the information was incomplete or missing at the time   of the acquisition. Automatic segmentation of digitized material is obtained as a byproduct of identification, allowing the music digital library to grant access to individual tracks, even if  discs are digitized using a single file for a complete disc side. Results show that the approach is both efficient and effective.



Keywords


Digital Libraries; Feature Extraction and Representation; Audio Processing

References


 

[1] N. Orio, R. Miotto, N. Montecchio, D. Rizo, O. Lartillot,and M. Schedl, “Musiclef: a benchmark activity in multimodal music information retrieval,” in Proceedings of the International Society for Music Information Retrieval conference, Miami, FL, USA, 2011.

[2] P. Cano, E. Batlle, T. Kalker, and J. Haitsma, “A review of audio fingerprinting,” Journal of VLSI Signal Processing, vol. 41, pp. 271–284, 2005.
http://dx.doi.org/10.1007/s11265-005-4151-3

[3] A. Wang, “The Shazam music recognition service,” Communications of the ACM, vol. 49, no. 8, pp. 44–48, 2006.
http://dx.doi.org/10.1145/1145287.1145312

[4] T. Fujishima, “Realtime chord recognition of musical sound: a system using common LISP music,” in Proceedings of the International Computer Music Conference. Beijing, China, 1999.

[5] F. Kurth and M. Mueller, “Efficient index-based audio matching,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 2, pp. 382–395, 2008.
http://dx.doi.org/10.1109/TASL.2007.911552

[6] N. Hu, R. Dannenberg, and G. Tzanetakis, “Polyphonic audio matching and alignment for music retrieval,” in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, 2003.

[7] P. H. J. Serrà, E. Gòmez and X. Serra, “Chroma binary similarity and local alignment applied to cover song identification,” IEEE Transactions on Audio, Speech, andLanguage Processing, vol. 16, no. 6, pp. 1138–1151, 2008.

[8] D. Ellis and G. Poliner, “Identifying ‘cover songs’ with chroma features and dynamic programming beat tracking,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, HI, USA, 2007.

[9] S. Downie, “The music information retrieval evaluation exchange (2005–2007): A window into music informationretrieval research,” Acoustical Science and Technology, vol. 29, no. 4, pp. 247–255, 2008.
http://dx.doi.org/10.1250/ast.29.247

[10] R. Miotto and N. Orio, “A music identification system based on chroma indexing and statistical modeling,” in Proceedings of the International Conference on Music Information Retrieval, Philadelphia, PA, USA, 2008.

[11] R. Miotto and N. Orio, “A methodology for the segmentation and identification of music works,” in Proceedings of the International Conference on Music Information Retrieval, Vienna, Austria, 2007.

[12] S. Dixon and G. Widmer, “MATCH: a music alignment tool chest,” in Proceedings of the International Conference on Music Information Retrieval, London, UK, 2005.

[13] J. Haitsma and T. Kalker, “A highly robust audio fingerprinting system with an efficient search strategy,” Journal of New Music Research, vol. 32, no. 2, pp. 211–221, 2003.
http://dx.doi.org/10.1076/jnmr.32.2.211.16746

[14] A. Gionis, P. Indyk, and R. Motwani, “Similarity search in high dimensions via hashing,” in Proceedings of the International Conference on Very Large Data Bases, Edinburgh,UK, 1999.

[15] Y. Yu, J. S. Downie, F. M¨orchen, L. Chen, and K. Joe, “Using exact locality sensitive mapping to group and detect audio-based cover songs,” in Proceedings of the IEEE International Symposium on Multimedia, Washington, DC,USA, 2008.
http://dx.doi.org/10.1109/ISM.2008.18

[16] M. Slaney and M. Casey, “Locality-sensitive hashing for finding nearest neighbors [lecture notes],” Signal Processing Magazine, IEEE, vol. 25, no. 2, pp. 128–131, 2008.
http://dx.doi.org/10.1109/MSP.2007.914237

[17] A. J. Ellis, The history of musical pitch. Amsterdam: F.A.M. Knuf, 1963.

[18] K. Dressler and S. Streich, “Tuning frequency estimation using circular statistics,” in Proceedings of the International Conference on Music Information Retrieval, Vienna, Austria, 2007.

[19] A. Lerch, “On the requirement of automatic tuning frequency estimation,” in Proceedings of the International Conference on Music Information Retrieval, Victoria, Canada, 2006.

[20] O. Lartillot, “A comprehensive and modular framework for audio content extraction, aimed at research, pedagogy, and digital library management,” in Proceedings of the Audio Engineering Society convention, London, UK, 2011.

[21] N. Montecchio and A. Cont, “Accelerating the mixing phase in studio recording production by automatic audio alignment,” in Proceedings of the International Society for Music Information Retrieval conference, Miami, FL, USA, 2011.

[22] M. S. Arulampalam, S. Maskell, and N. Gordon, “A Tutorial on Particle Filters for Online Nonlinear/Non-GaussianBayesian Tracking,” IEEE Transactions on Signal Processing, vol. 50, pp. 174–188, 2002.
http://dx.doi.org/10.1109/78.978374

[23] R. Douc, O. Cappe, and E. Moulines, “Comparison of Resampling Schemes for Particle Filtering,” in Proceedings of the International Symposium on Image and Signal Processing and Analysis, Zagreb, Croatia, 2005.

[24] S. Ravuri and D. P. Ellis, “The hydra system of unstructured cover song detection,” Ext. Abstract for the MIREX Audio Cover Song Identification task submission, Kobe, Japan, 2009.

[25] J. Serrà, M. Zanin, and R. G. Andrzejak, “Cover song retrieval by cross recurrence quantification and unsupervisedset detection,” Ext. Abstract for the MIREX Audio Cover Song Identification task submission, Kobe, Japan, 2009.

[26] T. E. Ahonen, “Cover song identification based on data compression,” Ext. Abstract for the MIREX Audio Cover Song Identification task submission, Kobe, Japan, 2009.

[27] B. Martin, P. Hanna, M. Robine, J. Allali, and P. Ferraro,“String matching cover song detection algorithm,” Ext.Abstract for the MIREX Audio Cover Song Identification task submission, Utrecht, Netherlands, 2010.

[28] E. Di Buccio, N. Montecchio, and N. Orio, “Applying text-based IR techniques to cover song identification,” Ext.Abstract for the MIREX Audio Cover Song Identification task submission, Utrecht, Netherlands, 2010.


Full Text: PDF


Journal of Multimedia (JMM, ISSN 1796-2048)

Copyright @ 2006-2013 by ACADEMY PUBLISHER – All rights reserved.