Measuring structural similarity in music

Research output: Contribution to journalArticle

Abstract

This paper presents a novel method for measuring the structural similarity between music recordings. It uses recurrence plot analysis to characterize patterns of repetition in the feature sequence, and the normalized compression distance, a practical approximation of the joint Kolmogorov complexity, to measure the pairwise similarity between the plots. By measuring the distance between intermediate representations of signal structure, the proposed method departs from common approaches to music structure analysis which assume a block-based model of music, and thus concentrate on segmenting and clustering sections. The approach ensures that global structure is consistently and robustly characterized in the presence of tempo, instrumentation, and key changes, while the used metric provides a simple to compute, versatile and robust alternative to common approaches in music similarity research. Finally, experimental results demonstrate success at characterizing similarity, while contributing an optimal parameterization of the proposed approach.

Original languageEnglish (US)
Article number5711645
Pages (from-to)2013-2025
Number of pages13
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume19
Issue number7
DOIs
StatePublished - 2011

Fingerprint

music
Parameterization
plots
parameterization
repetition
recording
approximation

Keywords

  • Audio signal processing
  • computer audition
  • music information retrieval (MIR)
  • music structure analysis
  • sound similarity

ASJC Scopus subject areas

  • Acoustics and Ultrasonics
  • Electrical and Electronic Engineering

Cite this

Measuring structural similarity in music. / Bello, Juan P.

In: IEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 7, 5711645, 2011, p. 2013-2025.

Research output: Contribution to journalArticle

@article{d48089bf6a1944adb48fd3ab8da22277,
title = "Measuring structural similarity in music",
abstract = "This paper presents a novel method for measuring the structural similarity between music recordings. It uses recurrence plot analysis to characterize patterns of repetition in the feature sequence, and the normalized compression distance, a practical approximation of the joint Kolmogorov complexity, to measure the pairwise similarity between the plots. By measuring the distance between intermediate representations of signal structure, the proposed method departs from common approaches to music structure analysis which assume a block-based model of music, and thus concentrate on segmenting and clustering sections. The approach ensures that global structure is consistently and robustly characterized in the presence of tempo, instrumentation, and key changes, while the used metric provides a simple to compute, versatile and robust alternative to common approaches in music similarity research. Finally, experimental results demonstrate success at characterizing similarity, while contributing an optimal parameterization of the proposed approach.",
keywords = "Audio signal processing, computer audition, music information retrieval (MIR), music structure analysis, sound similarity",
author = "Bello, {Juan P.}",
year = "2011",
doi = "10.1109/TASL.2011.2108287",
language = "English (US)",
volume = "19",
pages = "2013--2025",
journal = "IEEE Transactions on Speech and Audio Processing",
issn = "1558-7916",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "7",

}

TY - JOUR

T1 - Measuring structural similarity in music

AU - Bello, Juan P.

PY - 2011

Y1 - 2011

N2 - This paper presents a novel method for measuring the structural similarity between music recordings. It uses recurrence plot analysis to characterize patterns of repetition in the feature sequence, and the normalized compression distance, a practical approximation of the joint Kolmogorov complexity, to measure the pairwise similarity between the plots. By measuring the distance between intermediate representations of signal structure, the proposed method departs from common approaches to music structure analysis which assume a block-based model of music, and thus concentrate on segmenting and clustering sections. The approach ensures that global structure is consistently and robustly characterized in the presence of tempo, instrumentation, and key changes, while the used metric provides a simple to compute, versatile and robust alternative to common approaches in music similarity research. Finally, experimental results demonstrate success at characterizing similarity, while contributing an optimal parameterization of the proposed approach.

AB - This paper presents a novel method for measuring the structural similarity between music recordings. It uses recurrence plot analysis to characterize patterns of repetition in the feature sequence, and the normalized compression distance, a practical approximation of the joint Kolmogorov complexity, to measure the pairwise similarity between the plots. By measuring the distance between intermediate representations of signal structure, the proposed method departs from common approaches to music structure analysis which assume a block-based model of music, and thus concentrate on segmenting and clustering sections. The approach ensures that global structure is consistently and robustly characterized in the presence of tempo, instrumentation, and key changes, while the used metric provides a simple to compute, versatile and robust alternative to common approaches in music similarity research. Finally, experimental results demonstrate success at characterizing similarity, while contributing an optimal parameterization of the proposed approach.

KW - Audio signal processing

KW - computer audition

KW - music information retrieval (MIR)

KW - music structure analysis

KW - sound similarity

UR - http://www.scopus.com/inward/record.url?scp=79960681282&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79960681282&partnerID=8YFLogxK

U2 - 10.1109/TASL.2011.2108287

DO - 10.1109/TASL.2011.2108287

M3 - Article

AN - SCOPUS:79960681282

VL - 19

SP - 2013

EP - 2025

JO - IEEE Transactions on Speech and Audio Processing

JF - IEEE Transactions on Speech and Audio Processing

SN - 1558-7916

IS - 7

M1 - 5711645

ER -