Lip synchronization in talking head video utilizing speech information

Tsuhan Chen, H. P. Graf, Homer H. Chen, Wu Chou, Barry G. Haskell, Eric D. Petajan, Yao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and demonstrate speech-assisted coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.

Original languageEnglish (US)
Title of host publicationProceedings of SPIE - The International Society for Optical Engineering
PublisherSociety of Photo-Optical Instrumentation Engineers
Pages1690-1701
Number of pages12
Edition3/-
ISBN (Print)0819418587
StatePublished - Jan 1 1995
EventVisual Communications and Image Processing '95 - Taipei, Taiwan
Duration: May 24 1995May 26 1995

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Number3/-
Volume2501
ISSN (Print)0277-786X

Other

OtherVisual Communications and Image Processing '95
CityTaipei, Taiwan
Period5/24/955/26/95

    Fingerprint

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Cite this

Chen, T., Graf, H. P., Chen, H. H., Chou, W., Haskell, B. G., Petajan, E. D., & Wang, Y. (1995). Lip synchronization in talking head video utilizing speech information. In Proceedings of SPIE - The International Society for Optical Engineering (3/- ed., pp. 1690-1701). (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 2501 , No. 3/-). Society of Photo-Optical Instrumentation Engineers.