바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

A Comparative Analysis of Content-based Music Retrieval Systems

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2013, v.30 no.3, pp.23-48
https://doi.org/10.3743/KOSIM.2013.30.3.023

  • Downloaded
  • Viewed

Abstract

This study compared and analyzed 15 CBMR (Content-based Music Retrieval) systems accessible on the web in terms of DB size and type, query type, access point, input and output type, and search functions, with reviewing features of music information and techniques used for transforming or transcribing of music sources, extracting and segmenting melodies, extracting and indexing features of music, and matching algorithms for CBMR systems. Application of text information retrieval techniques such as inverted indexing, N-gram indexing, Boolean search, truncation, keyword and phrase search, normalization, filtering, browsing, exact matching, similarity measure using edit distance, sorting, etc. to enhancing the CBMR; effort for increasing DB size and usability; and problems in extracting melodies, deleting stop notes in queries, and using solfege as pitch information were found as the results of analysis.

keywords
CBMR(content-based music retrieval), QBH(query by humming), QBN(query by music notation), QBC(query by contour), 내용기반 음악 검색시스템, 선율검색, 노래검색. 악보검색, 내용기반 음악정보

Reference

1.

구경이. (2003). 주제 선율 색인을 이용한 내용 기반 음악정보 검색 시스템. 데이터베이스 연구, 19(3), 34-45.

2.

김무정. (2011). Query By Humming 응용을 위한 Midi 파일에서의 자동 멜로디 트랙 선택방법 (405-408). 한국정보과학회 한국컴퓨터종합학술대회 논문집.

3.

노정순. (2011). 정보검색 : 이론과 실제:글누리.

4.

박만수. (2006). 실제 잡음 환경에 강인한 오디오 핑거프린팅 기법. Telecommunications Review, 16(3), 435-446.

5.

유진희. (2007). 허밍 질의 처리 시스템의 성능 향상을 위한 효율적인 빈번 멜로디 인덱싱 방법. 정보과학회논문지 : 데이타베이스, 34(4), 283-303.

6.

최윤재. (2009). 음악의 특성에 따른 피아노 솔로 음악으로 부터의 멜로디 추출. 정보과학회 컴퓨팅의 실제 논문지, 15(12), 923-927.

7.

Arifi, V.. (2003). Automatic synchronization of music data in score-, MIDI-, and PCM-format (-). Proceedings of ISMIR 2003.

8.

Bainbridge, D.. (2004). Music information retrieval research and its context at the University of Waikato. Journal of the American Society for Information Science and Technology, 55(12), 1092-1099.

9.

Bandera, C. de la, Barbancho, A. M.. (2011). Humming method for content-based music information retrieval (49-54). Proceedings of ISMIR 2011.

10.

Cano, P.. (2005). A review of audio fingerprinting. Journal of VLSI Signal Processing, 41, 271-284.

11.

Cartwright, M. B.. (2011). Making searchable melodies: Human versus machine (-). Proceedings of Human Computation.

12.

Chai, W.. (2002). Melody retrieval on the web. Proceedings of ACM/SPIE Conference on Multimedia Computing and Networking, , 226-.

13.

Chandrasekhar, V.. (2011). Survey and evaluation of audio fingerprinting schemes for mobile query-by-example applications (801-806). Proceedings of ISMIR 2011.

14.

Chen, R.. (2012). Chord recognition using durationexplicit hidden Markov models (445-450). Proceedings of ISMIR 2012.

15.

Cheng, H. T.. (2008). Automatic chord recognition for music classification and retrieval (1505-1508). Proceedings of the IEEE International Conference on Multimedia and Expo 2008.

16.

Chew, E.. (2008). Challenging uncertainty in query by humming systems: A fingerprinting approach. IEEE Transactions on Audio, Speech, and Language Processing, 16(2), 359-371.

17.

Cilibrasi, R.. (2004). Algorithmic clustering of music based on string compression. Computer Music Journal, 28(4), 49-67.

18.

Dannenberg, R.. (2007). A Comparative evaluation of search techniques for query by humming using the MUSART testbed. JASIST, 58(5), 587-701.

19.

David, G.. (2003). Pitch extraction and fundamental frequency: history and current techniques. .

20.

Duggan, B.. (2009). Compensating for expressiveness in queries to a content based music information retrieval system (33-36). Proceedings of the International Computer Music Conference(ICMC 2009).

21.

Doraisamy, S.. (2002). Robust polyphonic music retrieval with n-grams. Journal of Intelligent Information Systems, 21(1), 53-70.

22.

Downie, S.. (1999). Evaluating a simple approach to music information retrieval: Conceiving melodic N-grams as text.

23.

Ghias, A.. (1995). Query by humming : Musical information retrieval in an audio database (231-236). Proceedings of the 3rd Annual ACM International Conference on Multimedia.

24.

Goto, M.. (2004). A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals. Speech Communication, 43(4), 311-329.

25.

Hanna, P.. (2007). On optimizing the editing algorithms for evaluating similarity between monophonic musical sequences. Journal of New Music Research, 36(4), 267-279.

26.

Hug, A.. (2010). Crowdsourcing a real-world on-line query by humming system (-). Proceedings of the 7th Sound and Music Computing Conference.

27.

Kan, M.. (2008). LyricAlly: Automatic synchronization of textual lyrics to acoustic music signals. IEEE Transaction on Audio, Speech, and Language Processing, 16(2), 338-349.

28.

Kline, R. L.. (2003). Approximate matching algorithms for music information retrieval using vocal input (130-139). Proceedings of the Eleventh ACM International Conference on Multimedia 2003.

29.

Kornstädt, A.. (1998). Themefinder : A web-based melodic search tool. Computing in Musicology, 11, 231-236.

30.

Lee, K.. (2008). Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio. IEEE Transactions on Audio, Speech, and Language Processing, 26(2), 291-301.

31.

이윤주. (2006). 내용기반 음악정보 검색시스템을 위한 이용자 중심의 질의 인터페이스 설계에 관한 연구. 정보관리학회지, 23(2), 5-19.

32.

Lemström, K.. (2007). On comparing edit distance and geometric frameworks in content-based retrieval of symbolically encoded polyphonic music. Musicae Scientiae. Discussion Forum, 4, 135-152.

33.

Lemström, K.. (2003). Transposition invariant pattern matching for multi-track strings. Nordic Journal of Computing, 10, 185-205.

34.

McNab, R. J.. (1996). Towards the digital music library : tune retrieval from acoustic input (11-18). Proceedings of the 1st ACM International Conference on Digital Libraries.

35.

McNab, R. J.. (1997). The New Zealand Digital Library MELody inDEX. D-Lib Magazine, 3(5), 4-15.

36.

Melucci, M.. (2004). Combining melody processing and information retrieval techniques : Methodology, evaluation and system implementation. Journal of the American Society for Information Science and Technology, 55(12), 1058-1066.

37.

Mesaros, A.. (2010). Automatic recognition of lyrics in singing. Journal on Audio, Speech, and Music Processing, , 546047-.

38.

Nam, G. P.. (1187). A new query-by-humming system based on the score level fusion of two classifiers. International Journal of Communication Systems, 25(6), 717-733.

39.

Papadopoulos, H.. (2012). Modeling chord and key structure with Markov logic (127-132). Proceedings of ISMIR 2012.

40.

Pardo, B.. (2004). Name that tune : A pilot study in finding a melody from a sung query. Journal of the American Society for Information Science and Technology, 55(4), 283-300.

41.

Prechelt, M.. (2001). An interface for melody input. ACM Transactions on Computer- Human Interaction, 6(2), 133-149.

42.

Rho, S.. (2008). MUSEMBLE: A novel music retrieval system with automatic voice query transcription and reformulation. The Journal of Systems and Software, 81(7), 1065-1080.

43.

Roger, B.. (2004). Understanding search performance in query-by humming systems (85-89). Proceedings of ISMIR 2004.

44.

Sheh, A.. (2003). Chord segmentation and recognition using EM-trained hidden Markov models (-). Proceedings of ISMIR 2003.

45.

Shiffrin, J.. (2002). HMM-based musical query retrieval (295-300). Proceedings of the 2nd ACM/IEEE-CS Joint Conference on Digital Libraries.

46.

Tripathy, A.. (2009). Query by humming system. International Journal of Recent Trends in Engineering, 2(5), 373-379.

47.

Turetsky, R. J.. (2003). Ground truth transcriptions of real music from forcealligned MIDI syntheses (445-448). Proceeding of ISMIR.

48.

Typke, R.. (2007). Music retrieval based on melodic similarity.

49.

Typke, R.. (2004). Searching notated polyphonic music using transportation distances (128-135). Proceedings of the 12th Annual ACM International Conference on Multimedia.

50.

Typke, R.. (2005). A survey of music information retrieval systems (153-160). Proceedings of ISMIR 2005.

51.

Viro, V.. (2011). Peachnote : Music score search and analysis platform (359-362). Proceedings of ISMIR 2011.

52.

Wan, C.. (2006). Content-based audio retrieval with relevance feedback. Pattern Recognition Letters, 27(2), 85-92.

53.

Wang, A.. (2003). An industrial-strength audio search algorithm (-). Proceedings of the 4th International Conference on Music Information.

54.

Wang, A.. (2006). The Shazam music recognition service. Communications of the ACM, 49(8), 44-48.

55.

Wang, C.. (2006). N-gram inverted index structures on music data for theme mining and content-based information retrieval. Pattern Recognition Letters, 27(5), 492-503.

56.

Wold, E.. (1996). Content-based classification, search, and retrieval of audio. IEEE Multimedia, 3(3), 27-36.

57.

Zhu, Y.. (2003). Warping indexes with envelops transforms for query by humming (181-192). Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data (SIGMOD '03).

Journal of the Korean Society for Information Management