Audio indexing and retrieval

Zhu Liu, Yao Wang

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

With the booming of Internet and the rapid growth of digital storage capability, we are exposed to virtually unlimited volume of multimedia information, including movies, video clips, image animation, still images, music, speech, text, etc. Manually searching useful or interesting pieces of information is exactly the same as finding a needle in a haystack. Automatically generating semantically meaningful index for a large volume of documents is critical for efficient and effective multimedia information retrieval. During the last decade, multimedia content index and retrieval, as a new yet fast growing field, has attracted tremendous interest from researchers worldwide. MPEG-7, formally known as Multimedia Content Description Interface, is an internationally collaborated effort that addresses such a challenge. This chapter will focus on the recent progress of audio indexing and retrieval, a major component within the broad scope. Everyday, we are immersed in ubiquitous audio. Very often, we want to search a piece of audio that either we know or heard before, for example, a specific song or a conference recording, or we are not aware of, for example, a piece of symphony or one speech of President Kennedy. With the support of audio indexing and retrieval services, such a task is an enjoyable experience, otherwise, really a headache. In this chapter, we will survey the state of the art in the area of audio content indexing and retrieval, address audio indexing algorithms and query methods, illustrate several representative audio retrieval systems, and briefly introduce MPEG-7 audio standard.

Original languageEnglish (US)
Title of host publicationHandbook of Video Databases
Subtitle of host publicationDesign and Applications
PublisherCRC Press
Pages483-509
Number of pages27
ISBN (Electronic)9780203489864
ISBN (Print)9780849370069
StatePublished - Jan 1 2003

Fingerprint

Digital storage
Animation
Information retrieval
Internet

ASJC Scopus subject areas

  • Computer Science(all)
  • Engineering(all)

Cite this

Liu, Z., & Wang, Y. (2003). Audio indexing and retrieval. In Handbook of Video Databases: Design and Applications (pp. 483-509). CRC Press.

Audio indexing and retrieval. / Liu, Zhu; Wang, Yao.

Handbook of Video Databases: Design and Applications. CRC Press, 2003. p. 483-509.

Research output: Chapter in Book/Report/Conference proceedingChapter

Liu, Z & Wang, Y 2003, Audio indexing and retrieval. in Handbook of Video Databases: Design and Applications. CRC Press, pp. 483-509.
Liu Z, Wang Y. Audio indexing and retrieval. In Handbook of Video Databases: Design and Applications. CRC Press. 2003. p. 483-509
Liu, Zhu ; Wang, Yao. / Audio indexing and retrieval. Handbook of Video Databases: Design and Applications. CRC Press, 2003. pp. 483-509
@inbook{bb5c56122cfd42a4a4c1808813b4ded9,
title = "Audio indexing and retrieval",
abstract = "With the booming of Internet and the rapid growth of digital storage capability, we are exposed to virtually unlimited volume of multimedia information, including movies, video clips, image animation, still images, music, speech, text, etc. Manually searching useful or interesting pieces of information is exactly the same as finding a needle in a haystack. Automatically generating semantically meaningful index for a large volume of documents is critical for efficient and effective multimedia information retrieval. During the last decade, multimedia content index and retrieval, as a new yet fast growing field, has attracted tremendous interest from researchers worldwide. MPEG-7, formally known as Multimedia Content Description Interface, is an internationally collaborated effort that addresses such a challenge. This chapter will focus on the recent progress of audio indexing and retrieval, a major component within the broad scope. Everyday, we are immersed in ubiquitous audio. Very often, we want to search a piece of audio that either we know or heard before, for example, a specific song or a conference recording, or we are not aware of, for example, a piece of symphony or one speech of President Kennedy. With the support of audio indexing and retrieval services, such a task is an enjoyable experience, otherwise, really a headache. In this chapter, we will survey the state of the art in the area of audio content indexing and retrieval, address audio indexing algorithms and query methods, illustrate several representative audio retrieval systems, and briefly introduce MPEG-7 audio standard.",
author = "Zhu Liu and Yao Wang",
year = "2003",
month = "1",
day = "1",
language = "English (US)",
isbn = "9780849370069",
pages = "483--509",
booktitle = "Handbook of Video Databases",
publisher = "CRC Press",

}

TY - CHAP

T1 - Audio indexing and retrieval

AU - Liu, Zhu

AU - Wang, Yao

PY - 2003/1/1

Y1 - 2003/1/1

N2 - With the booming of Internet and the rapid growth of digital storage capability, we are exposed to virtually unlimited volume of multimedia information, including movies, video clips, image animation, still images, music, speech, text, etc. Manually searching useful or interesting pieces of information is exactly the same as finding a needle in a haystack. Automatically generating semantically meaningful index for a large volume of documents is critical for efficient and effective multimedia information retrieval. During the last decade, multimedia content index and retrieval, as a new yet fast growing field, has attracted tremendous interest from researchers worldwide. MPEG-7, formally known as Multimedia Content Description Interface, is an internationally collaborated effort that addresses such a challenge. This chapter will focus on the recent progress of audio indexing and retrieval, a major component within the broad scope. Everyday, we are immersed in ubiquitous audio. Very often, we want to search a piece of audio that either we know or heard before, for example, a specific song or a conference recording, or we are not aware of, for example, a piece of symphony or one speech of President Kennedy. With the support of audio indexing and retrieval services, such a task is an enjoyable experience, otherwise, really a headache. In this chapter, we will survey the state of the art in the area of audio content indexing and retrieval, address audio indexing algorithms and query methods, illustrate several representative audio retrieval systems, and briefly introduce MPEG-7 audio standard.

AB - With the booming of Internet and the rapid growth of digital storage capability, we are exposed to virtually unlimited volume of multimedia information, including movies, video clips, image animation, still images, music, speech, text, etc. Manually searching useful or interesting pieces of information is exactly the same as finding a needle in a haystack. Automatically generating semantically meaningful index for a large volume of documents is critical for efficient and effective multimedia information retrieval. During the last decade, multimedia content index and retrieval, as a new yet fast growing field, has attracted tremendous interest from researchers worldwide. MPEG-7, formally known as Multimedia Content Description Interface, is an internationally collaborated effort that addresses such a challenge. This chapter will focus on the recent progress of audio indexing and retrieval, a major component within the broad scope. Everyday, we are immersed in ubiquitous audio. Very often, we want to search a piece of audio that either we know or heard before, for example, a specific song or a conference recording, or we are not aware of, for example, a piece of symphony or one speech of President Kennedy. With the support of audio indexing and retrieval services, such a task is an enjoyable experience, otherwise, really a headache. In this chapter, we will survey the state of the art in the area of audio content indexing and retrieval, address audio indexing algorithms and query methods, illustrate several representative audio retrieval systems, and briefly introduce MPEG-7 audio standard.

UR - http://www.scopus.com/inward/record.url?scp=85055825715&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055825715&partnerID=8YFLogxK

M3 - Chapter

SN - 9780849370069

SP - 483

EP - 509

BT - Handbook of Video Databases

PB - CRC Press

ER -