Multiple levels of linguistic and paralinguistic features contribute to voice recognition

Jean Mary Zarate, Xing Tian, Kevin J P Woods, David Poeppel

Research output: Contribution to journalArticle

Abstract

Voice or speaker recognition is critical in a wide variety of social contexts. In this study, we investigated the contributions of acoustic, phonological, lexical, and semantic information toward voice recognition. Native English speaking participants were trained to recognize five speakers in five conditions: non-speech, Mandarin, German, pseudo-English, and English. We showed that voice recognition significantly improved as more information became available, from purely acoustic features in non-speech to additional phonological information varying in familiarity. Moreover, we found that the recognition performance is transferable between training and testing in phonologically familiar conditions (German, pseudo-English, and English), but not in unfamiliar (Mandarin) or non-speech conditions. These results provide evidence suggesting that bottom-up acoustic analysis and top-down influence from phonological processing collaboratively govern voice recognition.

Original languageEnglish (US)
Article number11475
JournalScientific Reports
Volume5
DOIs
StatePublished - Jun 19 2015

Fingerprint

Linguistics
Acoustics
Semantics
Recognition (Psychology)

ASJC Scopus subject areas

  • General

Cite this

Multiple levels of linguistic and paralinguistic features contribute to voice recognition. / Zarate, Jean Mary; Tian, Xing; Woods, Kevin J P; Poeppel, David.

In: Scientific Reports, Vol. 5, 11475, 19.06.2015.

Research output: Contribution to journalArticle

@article{12da1b8b90c44e6a97e9e4e5539ee9a3,
title = "Multiple levels of linguistic and paralinguistic features contribute to voice recognition",
abstract = "Voice or speaker recognition is critical in a wide variety of social contexts. In this study, we investigated the contributions of acoustic, phonological, lexical, and semantic information toward voice recognition. Native English speaking participants were trained to recognize five speakers in five conditions: non-speech, Mandarin, German, pseudo-English, and English. We showed that voice recognition significantly improved as more information became available, from purely acoustic features in non-speech to additional phonological information varying in familiarity. Moreover, we found that the recognition performance is transferable between training and testing in phonologically familiar conditions (German, pseudo-English, and English), but not in unfamiliar (Mandarin) or non-speech conditions. These results provide evidence suggesting that bottom-up acoustic analysis and top-down influence from phonological processing collaboratively govern voice recognition.",
author = "Zarate, {Jean Mary} and Xing Tian and Woods, {Kevin J P} and David Poeppel",
year = "2015",
month = "6",
day = "19",
doi = "10.1038/srep11475",
language = "English (US)",
volume = "5",
journal = "Scientific Reports",
issn = "2045-2322",
publisher = "Nature Publishing Group",

}

TY - JOUR

T1 - Multiple levels of linguistic and paralinguistic features contribute to voice recognition

AU - Zarate, Jean Mary

AU - Tian, Xing

AU - Woods, Kevin J P

AU - Poeppel, David

PY - 2015/6/19

Y1 - 2015/6/19

N2 - Voice or speaker recognition is critical in a wide variety of social contexts. In this study, we investigated the contributions of acoustic, phonological, lexical, and semantic information toward voice recognition. Native English speaking participants were trained to recognize five speakers in five conditions: non-speech, Mandarin, German, pseudo-English, and English. We showed that voice recognition significantly improved as more information became available, from purely acoustic features in non-speech to additional phonological information varying in familiarity. Moreover, we found that the recognition performance is transferable between training and testing in phonologically familiar conditions (German, pseudo-English, and English), but not in unfamiliar (Mandarin) or non-speech conditions. These results provide evidence suggesting that bottom-up acoustic analysis and top-down influence from phonological processing collaboratively govern voice recognition.

AB - Voice or speaker recognition is critical in a wide variety of social contexts. In this study, we investigated the contributions of acoustic, phonological, lexical, and semantic information toward voice recognition. Native English speaking participants were trained to recognize five speakers in five conditions: non-speech, Mandarin, German, pseudo-English, and English. We showed that voice recognition significantly improved as more information became available, from purely acoustic features in non-speech to additional phonological information varying in familiarity. Moreover, we found that the recognition performance is transferable between training and testing in phonologically familiar conditions (German, pseudo-English, and English), but not in unfamiliar (Mandarin) or non-speech conditions. These results provide evidence suggesting that bottom-up acoustic analysis and top-down influence from phonological processing collaboratively govern voice recognition.

UR - http://www.scopus.com/inward/record.url?scp=84934910277&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84934910277&partnerID=8YFLogxK

U2 - 10.1038/srep11475

DO - 10.1038/srep11475

M3 - Article

C2 - 26088739

AN - SCOPUS:84934910277

VL - 5

JO - Scientific Reports

JF - Scientific Reports

SN - 2045-2322

M1 - 11475

ER -