Speech-assisted lip synchronization in audio-visual communications

Tsuhan Chen, Hans Peter Graf, Barry Haskell, Eric Petajan, Yao Wang, Homer Chen, Wu Chou

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.

Original languageEnglish (US)
Title of host publicationIEEE International Conference on Image Processing
Editors Anon
PublisherIEEE
Pages579-582
Number of pages4
Volume2
StatePublished - 1996
EventProceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3) - Washington, DC, USA
Duration: Oct 23 1995Oct 26 1995

Other

OtherProceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3)
CityWashington, DC, USA
Period10/23/9510/26/95

Fingerprint

Visual communication
Synchronization
Speech analysis
Image processing
Demonstrations

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Cite this

Chen, T., Graf, H. P., Haskell, B., Petajan, E., Wang, Y., Chen, H., & Chou, W. (1996). Speech-assisted lip synchronization in audio-visual communications. In Anon (Ed.), IEEE International Conference on Image Processing (Vol. 2, pp. 579-582). IEEE.

Speech-assisted lip synchronization in audio-visual communications. / Chen, Tsuhan; Graf, Hans Peter; Haskell, Barry; Petajan, Eric; Wang, Yao; Chen, Homer; Chou, Wu.

IEEE International Conference on Image Processing. ed. / Anon. Vol. 2 IEEE, 1996. p. 579-582.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chen, T, Graf, HP, Haskell, B, Petajan, E, Wang, Y, Chen, H & Chou, W 1996, Speech-assisted lip synchronization in audio-visual communications. in Anon (ed.), IEEE International Conference on Image Processing. vol. 2, IEEE, pp. 579-582, Proceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3), Washington, DC, USA, 10/23/95.
Chen T, Graf HP, Haskell B, Petajan E, Wang Y, Chen H et al. Speech-assisted lip synchronization in audio-visual communications. In Anon, editor, IEEE International Conference on Image Processing. Vol. 2. IEEE. 1996. p. 579-582
Chen, Tsuhan ; Graf, Hans Peter ; Haskell, Barry ; Petajan, Eric ; Wang, Yao ; Chen, Homer ; Chou, Wu. / Speech-assisted lip synchronization in audio-visual communications. IEEE International Conference on Image Processing. editor / Anon. Vol. 2 IEEE, 1996. pp. 579-582
@inproceedings{046ab77878f14643b40d9a0a93e4e0aa,
title = "Speech-assisted lip synchronization in audio-visual communications",
abstract = "We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.",
author = "Tsuhan Chen and Graf, {Hans Peter} and Barry Haskell and Eric Petajan and Yao Wang and Homer Chen and Wu Chou",
year = "1996",
language = "English (US)",
volume = "2",
pages = "579--582",
editor = "Anon",
booktitle = "IEEE International Conference on Image Processing",
publisher = "IEEE",

}

TY - GEN

T1 - Speech-assisted lip synchronization in audio-visual communications

AU - Chen, Tsuhan

AU - Graf, Hans Peter

AU - Haskell, Barry

AU - Petajan, Eric

AU - Wang, Yao

AU - Chen, Homer

AU - Chou, Wu

PY - 1996

Y1 - 1996

N2 - We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.

AB - We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.

UR - http://www.scopus.com/inward/record.url?scp=0029755830&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0029755830&partnerID=8YFLogxK

M3 - Conference contribution

VL - 2

SP - 579

EP - 582

BT - IEEE International Conference on Image Processing

A2 - Anon, null

PB - IEEE

ER -