Lip synchronization in talking head video utilizing speech information

Tsuhan Chen, H. P. Graf, Homer H. Chen, Wu Chou, Barry G. Haskell, Eric D. Petajan, Yao Wang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and demonstrate speech-assisted coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.

Original language: English (US)
Title of host publication: Proceedings of SPIE - The International Society for Optical Engineering
Publisher: Society of Photo-Optical Instrumentation Engineers
Pages: 1690-1701
Number of pages: 12
Volume: 2501
Edition: 3/-
ISBN (Print): 0819418587
State: Published - 1995
Event: Visual Communications and Image Processing '95 - Taipei, Taiwan
Duration: May 24, 1995 to May 26, 1995

Fingerprint

talking
synchronism
Synchronization
Visual communication
Speech analysis
telephony
Image processing
Demonstrations
coding
communication

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Condensed Matter Physics

Cite this

Chen, T., Graf, H. P., Chen, H. H., Chou, W., Haskell, B. G., Petajan, E. D., & Wang, Y. (1995). Lip synchronization in talking head video utilizing speech information. In Proceedings of SPIE - The International Society for Optical Engineering (3/- ed., Vol. 2501, pp. 1690-1701). Society of Photo-Optical Instrumentation Engineers.

@inproceedings{f5b79a2460bc435bbcaa7236b7f43c1f,
title = "Lip synchronization in talking head video utilizing speech information",
abstract = "We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and demonstrate speech-assisted coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.",
author = "Tsuhan Chen and Graf, {H. P.} and Chen, {Homer H.} and Wu Chou and Haskell, {Barry G.} and Petajan, {Eric D.} and Yao Wang",
year = "1995",
language = "English (US)",
isbn = "0819418587",
volume = "2501",
pages = "1690--1701",
booktitle = "Proceedings of SPIE - The International Society for Optical Engineering",
publisher = "Society of Photo-Optical Instrumentation Engineers",
edition = "3/-",
}

TY - GEN
T1 - Lip synchronization in talking head video utilizing speech information
AU - Chen, Tsuhan
AU - Graf, H. P.
AU - Chen, Homer H.
AU - Chou, Wu
AU - Haskell, Barry G.
AU - Petajan, Eric D.
AU - Wang, Yao
PY - 1995
Y1 - 1995
AB - We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and demonstrate speech-assisted coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.
UR - http://www.scopus.com/inward/record.url?scp=0029222903&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029222903&partnerID=8YFLogxK
M3 - Conference contribution
SN - 0819418587
VL - 2501
SP - 1690
EP - 1701
BT - Proceedings of SPIE - The International Society for Optical Engineering
PB - Society of Photo-Optical Instrumentation Engineers
ER -