Music segment similarity using 2D-fourier magnitude coefficients

Oriol Nieto, Juan Pablo Bello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Music segmentation is the task of automatically identifying the different segments of a piece. In this work we present a novel approach to cluster the musical segments based on their acoustic similarity by using 2D-Fourier Magnitude Coefficients (2D-FMCs). These coefficients, computed from a chroma representation, significantly simplify the problem of clustering the different segments since they are key transposition and phase shift invariant. We explore various strategies to obtain the 2D-FMC patches that represent entire segments and apply k-means to label them. Finally, we discuss possible ways of estimating k and compare our competitive results with the current state of the art.

Original languageEnglish (US)
Title of host publication2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages664-668
Number of pages5
ISBN (Print)9781479928927
DOIs
StatePublished - 2014
Event2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 - Florence, Italy
Duration: May 4 2014May 9 2014

Other

Other2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
CountryItaly
CityFlorence
Period5/4/145/9/14

Fingerprint

Phase shift
Labels
Acoustics

Keywords

  • 2D-Fourier Transform
  • Clustering
  • Music Segmentation

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Nieto, O., & Bello, J. P. (2014). Music segment similarity using 2D-fourier magnitude coefficients. In 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 (pp. 664-668). [6853679] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2014.6853679

Music segment similarity using 2D-fourier magnitude coefficients. / Nieto, Oriol; Bello, Juan Pablo.

2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014. Institute of Electrical and Electronics Engineers Inc., 2014. p. 664-668 6853679.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Nieto, O & Bello, JP 2014, Music segment similarity using 2D-fourier magnitude coefficients. in 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014., 6853679, Institute of Electrical and Electronics Engineers Inc., pp. 664-668, 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014, Florence, Italy, 5/4/14. https://doi.org/10.1109/ICASSP.2014.6853679
Nieto O, Bello JP. Music segment similarity using 2D-fourier magnitude coefficients. In 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014. Institute of Electrical and Electronics Engineers Inc. 2014. p. 664-668. 6853679 https://doi.org/10.1109/ICASSP.2014.6853679
Nieto, Oriol ; Bello, Juan Pablo. / Music segment similarity using 2D-fourier magnitude coefficients. 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014. Institute of Electrical and Electronics Engineers Inc., 2014. pp. 664-668
@inproceedings{f862aec73dd14d019d52ab4b348fdda3,
title = "Music segment similarity using 2D-fourier magnitude coefficients",
abstract = "Music segmentation is the task of automatically identifying the different segments of a piece. In this work we present a novel approach to cluster the musical segments based on their acoustic similarity by using 2D-Fourier Magnitude Coefficients (2D-FMCs). These coefficients, computed from a chroma representation, significantly simplify the problem of clustering the different segments since they are key transposition and phase shift invariant. We explore various strategies to obtain the 2D-FMC patches that represent entire segments and apply k-means to label them. Finally, we discuss possible ways of estimating k and compare our competitive results with the current state of the art.",
keywords = "2D-Fourier Transform, Clustering, Music Segmentation",
author = "Oriol Nieto and Bello, {Juan Pablo}",
year = "2014",
doi = "10.1109/ICASSP.2014.6853679",
language = "English (US)",
isbn = "9781479928927",
pages = "664--668",
booktitle = "2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Music segment similarity using 2D-fourier magnitude coefficients

AU - Nieto, Oriol

AU - Bello, Juan Pablo

PY - 2014

Y1 - 2014

N2 - Music segmentation is the task of automatically identifying the different segments of a piece. In this work we present a novel approach to cluster the musical segments based on their acoustic similarity by using 2D-Fourier Magnitude Coefficients (2D-FMCs). These coefficients, computed from a chroma representation, significantly simplify the problem of clustering the different segments since they are key transposition and phase shift invariant. We explore various strategies to obtain the 2D-FMC patches that represent entire segments and apply k-means to label them. Finally, we discuss possible ways of estimating k and compare our competitive results with the current state of the art.

AB - Music segmentation is the task of automatically identifying the different segments of a piece. In this work we present a novel approach to cluster the musical segments based on their acoustic similarity by using 2D-Fourier Magnitude Coefficients (2D-FMCs). These coefficients, computed from a chroma representation, significantly simplify the problem of clustering the different segments since they are key transposition and phase shift invariant. We explore various strategies to obtain the 2D-FMC patches that represent entire segments and apply k-means to label them. Finally, we discuss possible ways of estimating k and compare our competitive results with the current state of the art.

KW - 2D-Fourier Transform

KW - Clustering

KW - Music Segmentation

UR - http://www.scopus.com/inward/record.url?scp=84905251530&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84905251530&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2014.6853679

DO - 10.1109/ICASSP.2014.6853679

M3 - Conference contribution

SN - 9781479928927

SP - 664

EP - 668

BT - 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014

PB - Institute of Electrical and Electronics Engineers Inc.

ER -