Dynamics of self-monitoring and error detection in speech production: Evidence from mental imagery and MEG

Xing Tian, David Poeppel

Research output: Contribution to journalArticle

Abstract

A critical subroutine of self-monitoring during speech production is to detect any deviance between expected and actual auditory feedback. Here we investigated the associated neural dynamics using MEG recording in mental-imagery-of-speech paradigms. Participants covertly articulated the vowel /a/; their own (individually recorded) speech was played back, with parametric manipulation using four levels of pitch shift, crossed with four levels of onset delay. A nonmonotonic function was observed in early auditory responses when the onset delay was shorter than 100 msec: Suppression was observed for normal playback, but enhancement for pitch-shifted playback; however, the magnitude of enhancement decreased at the largest level of pitch shift that was out of pitch range for normal conversion, as suggested in two behavioral experiments. No difference was observed among different types of playback when the onset delay was longer than 100 msec. These results suggest that the prediction suppresses the response to normal feedback, which mediates source monitoring. When auditory feedback does not match the prediction, an “error term” is generated, which underlies deviance detection. We argue that, based on the observed nonmonotonic function, a frequency window (addressing spectral difference) and a time window (constraining temporal difference) jointly regulate the comparison between prediction and feedback in speech.

Original languageEnglish (US)
Pages (from-to)352-364
Number of pages13
JournalJournal of Cognitive Neuroscience
Volume27
Issue number2
DOIs
StatePublished - Mar 1 2014

Fingerprint

Imagery (Psychotherapy)
monitoring
deviant behavior
evidence
suppression
manipulation
recording
Reference Values
paradigm
Speech Production
Mental Imagery
Prediction
Self-monitoring
Onset
Error Detection
experiment
Deviance
Auditory Feedback
Enhancement

ASJC Scopus subject areas

  • Cognitive Neuroscience
  • Language and Linguistics
  • Linguistics and Language

Cite this

Dynamics of self-monitoring and error detection in speech production : Evidence from mental imagery and MEG. / Tian, Xing; Poeppel, David.

In: Journal of Cognitive Neuroscience, Vol. 27, No. 2, 01.03.2014, p. 352-364.

Research output: Contribution to journalArticle

@article{1c203357bacb4801a24ccecbb8528881,
title = "Dynamics of self-monitoring and error detection in speech production: Evidence from mental imagery and MEG",
abstract = "A critical subroutine of self-monitoring during speech production is to detect any deviance between expected and actual auditory feedback. Here we investigated the associated neural dynamics using MEG recording in mental-imagery-of-speech paradigms. Participants covertly articulated the vowel /a/; their own (individually recorded) speech was played back, with parametric manipulation using four levels of pitch shift, crossed with four levels of onset delay. A nonmonotonic function was observed in early auditory responses when the onset delay was shorter than 100 msec: Suppression was observed for normal playback, but enhancement for pitch-shifted playback; however, the magnitude of enhancement decreased at the largest level of pitch shift that was out of pitch range for normal conversion, as suggested in two behavioral experiments. No difference was observed among different types of playback when the onset delay was longer than 100 msec. These results suggest that the prediction suppresses the response to normal feedback, which mediates source monitoring. When auditory feedback does not match the prediction, an “error term” is generated, which underlies deviance detection. We argue that, based on the observed nonmonotonic function, a frequency window (addressing spectral difference) and a time window (constraining temporal difference) jointly regulate the comparison between prediction and feedback in speech.",
author = "Xing Tian and David Poeppel",
year = "2014",
month = "3",
day = "1",
doi = "10.1162/jocn_a_00692",
language = "English (US)",
volume = "27",
pages = "352--364",
journal = "Journal of Cognitive Neuroscience",
issn = "0898-929X",
publisher = "MIT Press Journals",
number = "2",

}

TY - JOUR

T1 - Dynamics of self-monitoring and error detection in speech production

T2 - Evidence from mental imagery and MEG

AU - Tian, Xing

AU - Poeppel, David

PY - 2014/3/1

Y1 - 2014/3/1

N2 - A critical subroutine of self-monitoring during speech production is to detect any deviance between expected and actual auditory feedback. Here we investigated the associated neural dynamics using MEG recording in mental-imagery-of-speech paradigms. Participants covertly articulated the vowel /a/; their own (individually recorded) speech was played back, with parametric manipulation using four levels of pitch shift, crossed with four levels of onset delay. A nonmonotonic function was observed in early auditory responses when the onset delay was shorter than 100 msec: Suppression was observed for normal playback, but enhancement for pitch-shifted playback; however, the magnitude of enhancement decreased at the largest level of pitch shift that was out of pitch range for normal conversion, as suggested in two behavioral experiments. No difference was observed among different types of playback when the onset delay was longer than 100 msec. These results suggest that the prediction suppresses the response to normal feedback, which mediates source monitoring. When auditory feedback does not match the prediction, an “error term” is generated, which underlies deviance detection. We argue that, based on the observed nonmonotonic function, a frequency window (addressing spectral difference) and a time window (constraining temporal difference) jointly regulate the comparison between prediction and feedback in speech.

AB - A critical subroutine of self-monitoring during speech production is to detect any deviance between expected and actual auditory feedback. Here we investigated the associated neural dynamics using MEG recording in mental-imagery-of-speech paradigms. Participants covertly articulated the vowel /a/; their own (individually recorded) speech was played back, with parametric manipulation using four levels of pitch shift, crossed with four levels of onset delay. A nonmonotonic function was observed in early auditory responses when the onset delay was shorter than 100 msec: Suppression was observed for normal playback, but enhancement for pitch-shifted playback; however, the magnitude of enhancement decreased at the largest level of pitch shift that was out of pitch range for normal conversion, as suggested in two behavioral experiments. No difference was observed among different types of playback when the onset delay was longer than 100 msec. These results suggest that the prediction suppresses the response to normal feedback, which mediates source monitoring. When auditory feedback does not match the prediction, an “error term” is generated, which underlies deviance detection. We argue that, based on the observed nonmonotonic function, a frequency window (addressing spectral difference) and a time window (constraining temporal difference) jointly regulate the comparison between prediction and feedback in speech.

UR - http://www.scopus.com/inward/record.url?scp=84920044054&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84920044054&partnerID=8YFLogxK

U2 - 10.1162/jocn_a_00692

DO - 10.1162/jocn_a_00692

M3 - Article

C2 - 25061925

AN - SCOPUS:84920044054

VL - 27

SP - 352

EP - 364

JO - Journal of Cognitive Neuroscience

JF - Journal of Cognitive Neuroscience

SN - 0898-929X

IS - 2

ER -