Major cast detection in video using both audio and visual information

Z. Liu, Yao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Major casts, for example, the anchor persons or reporters in news broadcast programs and principle characters in movies play an important role in video, and their occurrences provide good indices for organizing and presenting video content. This paper describes a new approach for automatically generating the list of major casts in a video sequence based on multiple modalities, specifically, both speaker and face information. A list of major casts is created and ordered by the accumulative temporal and spatial presence of corresponding casts. Preliminary simulation results show that the detected major casts are meaningful and the proposed approach is promising.

Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Pages1413-1416
Number of pages4
Volume3
StatePublished - 2001
Event2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing - Salt Lake, UT, United States
Duration: May 7 2001May 11 2001

Other

Other2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing
CountryUnited States
CitySalt Lake, UT
Period5/7/015/11/01

Fingerprint

Anchors
casts
lists
news
organizing
occurrences
simulation

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics

Cite this

Liu, Z., & Wang, Y. (2001). Major cast detection in video using both audio and visual information. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 3, pp. 1413-1416)

Major cast detection in video using both audio and visual information. / Liu, Z.; Wang, Yao.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 3 2001. p. 1413-1416.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Liu, Z & Wang, Y 2001, Major cast detection in video using both audio and visual information. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. 3, pp. 1413-1416, 2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing, Salt Lake, UT, United States, 5/7/01.
Liu Z, Wang Y. Major cast detection in video using both audio and visual information. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 3. 2001. p. 1413-1416
Liu, Z. ; Wang, Yao. / Major cast detection in video using both audio and visual information. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 3 2001. pp. 1413-1416
@inproceedings{3e4e556d61e64d07b7319fc2546f0e21,
title = "Major cast detection in video using both audio and visual information",
abstract = "Major casts, for example, the anchor persons or reporters in news broadcast programs and principle characters in movies play an important role in video, and their occurrences provide good indices for organizing and presenting video content. This paper describes a new approach for automatically generating the list of major casts in a video sequence based on multiple modalities, specifically, both speaker and face information. A list of major casts is created and ordered by the accumulative temporal and spatial presence of corresponding casts. Preliminary simulation results show that the detected major casts are meaningful and the proposed approach is promising.",
author = "Z. Liu and Yao Wang",
year = "2001",
language = "English (US)",
volume = "3",
pages = "1413--1416",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

}

TY - GEN

T1 - Major cast detection in video using both audio and visual information

AU - Liu, Z.

AU - Wang, Yao

PY - 2001

Y1 - 2001

N2 - Major casts, for example, the anchor persons or reporters in news broadcast programs and principle characters in movies play an important role in video, and their occurrences provide good indices for organizing and presenting video content. This paper describes a new approach for automatically generating the list of major casts in a video sequence based on multiple modalities, specifically, both speaker and face information. A list of major casts is created and ordered by the accumulative temporal and spatial presence of corresponding casts. Preliminary simulation results show that the detected major casts are meaningful and the proposed approach is promising.

AB - Major casts, for example, the anchor persons or reporters in news broadcast programs and principle characters in movies play an important role in video, and their occurrences provide good indices for organizing and presenting video content. This paper describes a new approach for automatically generating the list of major casts in a video sequence based on multiple modalities, specifically, both speaker and face information. A list of major casts is created and ordered by the accumulative temporal and spatial presence of corresponding casts. Preliminary simulation results show that the detected major casts are meaningful and the proposed approach is promising.

UR - http://www.scopus.com/inward/record.url?scp=0034841928&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034841928&partnerID=8YFLogxK

M3 - Conference contribution

VL - 3

SP - 1413

EP - 1416

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

ER -