Introduction to the Special Section on Sound Scene and Event Analysis

G. Richard, T. Virtanen, Juan Bello, N. Ono, H. Glotin

Research output: Contribution to journalReview article

Abstract

The papers in this special section are devoted to the growing field of acoustic scene classification and acoustic event recognition. Machine listening systems still have difficulties to reach the ability of human listeners in the analysis of realistic acoustic scenes. If sustained research efforts have been made for decades in speech recognition, speaker identification and to a lesser extent in music information retrieval, the analysis of other types of sounds, such as environmental sounds, is the subject of growing interest from the community and is targeting an ever increasing set of audio categories. This problem appears to be particularly challenging due to the large variety of potential sound sources in the scene, which may in addition have highly different acoustic characteristics, especially in bioacoustics. Furthermore, in realistic environments, multiple sources are often present simultaneously, and in reverberant conditions.

Original languageEnglish (US)
Article number7933043
Pages (from-to)1169-1171
Number of pages3
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume25
Issue number6
DOIs
StatePublished - Jun 1 2017

Fingerprint

scene analysis
Acoustics
acoustics
Acoustic waves
event
Bioacoustics
Music Information Retrieval
Speaker Identification
Aptitude
Information Storage and Retrieval
Music
Speech Recognition
Information retrieval
Speech recognition
information retrieval
listener
bioacoustics
music
speech recognition
Sound

ASJC Scopus subject areas

  • Signal Processing
  • Media Technology
  • Instrumentation
  • Acoustics and Ultrasonics
  • Linguistics and Language
  • Speech and Hearing
  • Electrical and Electronic Engineering

Cite this

Introduction to the Special Section on Sound Scene and Event Analysis. / Richard, G.; Virtanen, T.; Bello, Juan; Ono, N.; Glotin, H.

In: IEEE/ACM Transactions on Audio Speech and Language Processing, Vol. 25, No. 6, 7933043, 01.06.2017, p. 1169-1171.

Research output: Contribution to journalReview article

Richard, G. ; Virtanen, T. ; Bello, Juan ; Ono, N. ; Glotin, H. / Introduction to the Special Section on Sound Scene and Event Analysis. In: IEEE/ACM Transactions on Audio Speech and Language Processing. 2017 ; Vol. 25, No. 6. pp. 1169-1171.
@article{9f050d9dbb404952aca761524b663c85,
title = "Introduction to the Special Section on Sound Scene and Event Analysis",
abstract = "The papers in this special section are devoted to the growing field of acoustic scene classification and acoustic event recognition. Machine listening systems still have difficulties to reach the ability of human listeners in the analysis of realistic acoustic scenes. If sustained research efforts have been made for decades in speech recognition, speaker identification and to a lesser extent in music information retrieval, the analysis of other types of sounds, such as environmental sounds, is the subject of growing interest from the community and is targeting an ever increasing set of audio categories. This problem appears to be particularly challenging due to the large variety of potential sound sources in the scene, which may in addition have highly different acoustic characteristics, especially in bioacoustics. Furthermore, in realistic environments, multiple sources are often present simultaneously, and in reverberant conditions.",
author = "G. Richard and T. Virtanen and Juan Bello and N. Ono and H. Glotin",
year = "2017",
month = "6",
day = "1",
doi = "10.1109/TASLP.2017.2699334",
language = "English (US)",
volume = "25",
pages = "1169--1171",
journal = "IEEE/ACM Transactions on Speech and Language Processing",
issn = "2329-9290",
publisher = "IEEE Advancing Technology for Humanity",
number = "6",

}

TY - JOUR

T1 - Introduction to the Special Section on Sound Scene and Event Analysis

AU - Richard, G.

AU - Virtanen, T.

AU - Bello, Juan

AU - Ono, N.

AU - Glotin, H.

PY - 2017/6/1

Y1 - 2017/6/1

N2 - The papers in this special section are devoted to the growing field of acoustic scene classification and acoustic event recognition. Machine listening systems still have difficulties to reach the ability of human listeners in the analysis of realistic acoustic scenes. If sustained research efforts have been made for decades in speech recognition, speaker identification and to a lesser extent in music information retrieval, the analysis of other types of sounds, such as environmental sounds, is the subject of growing interest from the community and is targeting an ever increasing set of audio categories. This problem appears to be particularly challenging due to the large variety of potential sound sources in the scene, which may in addition have highly different acoustic characteristics, especially in bioacoustics. Furthermore, in realistic environments, multiple sources are often present simultaneously, and in reverberant conditions.

AB - The papers in this special section are devoted to the growing field of acoustic scene classification and acoustic event recognition. Machine listening systems still have difficulties to reach the ability of human listeners in the analysis of realistic acoustic scenes. If sustained research efforts have been made for decades in speech recognition, speaker identification and to a lesser extent in music information retrieval, the analysis of other types of sounds, such as environmental sounds, is the subject of growing interest from the community and is targeting an ever increasing set of audio categories. This problem appears to be particularly challenging due to the large variety of potential sound sources in the scene, which may in addition have highly different acoustic characteristics, especially in bioacoustics. Furthermore, in realistic environments, multiple sources are often present simultaneously, and in reverberant conditions.

UR - http://www.scopus.com/inward/record.url?scp=85028340233&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85028340233&partnerID=8YFLogxK

U2 - 10.1109/TASLP.2017.2699334

DO - 10.1109/TASLP.2017.2699334

M3 - Review article

AN - SCOPUS:85028340233

VL - 25

SP - 1169

EP - 1171

JO - IEEE/ACM Transactions on Speech and Language Processing

JF - IEEE/ACM Transactions on Speech and Language Processing

SN - 2329-9290

IS - 6

M1 - 7933043

ER -