Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech

Temporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese spee...

Full description

Bibliographic Details
Main Authors: Yoshitaka Nakajima, Mizuki Matsuda, Kazuo Ueda, Gerard B. Remijn
Format: Article
Language:English
Published: Frontiers Media S.A. 2018-04-01
Series:Frontiers in Human Neuroscience
Subjects:
Online Access:http://journal.frontiersin.org/article/10.3389/fnhum.2018.00149/full
id doaj-8f82a9df7b9b4f7a9dd68489454af2d1
record_format Article
spelling doaj-8f82a9df7b9b4f7a9dd68489454af2d12020-11-25T02:38:15ZengFrontiers Media S.A.Frontiers in Human Neuroscience1662-51612018-04-011210.3389/fnhum.2018.00149333579Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic SpeechYoshitaka Nakajima0Mizuki Matsuda1Kazuo Ueda2Gerard B. Remijn3Department of Human Science, Faculty of Design/Research Center for Applied Perceptual Science, Kyushu University, Fukuoka, JapanNihon Kohden Corporation, Tokyo, JapanDepartment of Human Science, Faculty of Design/Research Center for Applied Perceptual Science, Kyushu University, Fukuoka, JapanDepartment of Human Science, Faculty of Design/Research Center for Applied Perceptual Science, Kyushu University, Fukuoka, JapanTemporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese speech with a systematically degraded temporal resolution, so-called “mosaic speech”: speech mosaicized in the coordinates of time and frequency. The results of two experiments show that mosaic speech cut into short static segments was almost perfectly intelligible with a temporal resolution of 40 ms or finer. Intelligibility dropped for a temporal resolution of 80 ms, but was still around 50%-correct level. The data are in line with previous results showing that speech signals separated into short temporal segments of <100 ms can be remarkably robust in terms of linguistic-content perception against drastic manipulations in each segment, such as partial signal omission or temporal reversal. The human perceptual system thus can extract meaning from unexpectedly rough temporal information in speech. The process resembles that of the visual system stringing together static movie frames of ~40 ms into vivid motion.http://journal.frontiersin.org/article/10.3389/fnhum.2018.00149/fullspeechspectro-temporal resolutionintelligibilitymosaicmovie frames
collection DOAJ
language English
format Article
sources DOAJ
author Yoshitaka Nakajima
Mizuki Matsuda
Kazuo Ueda
Gerard B. Remijn
spellingShingle Yoshitaka Nakajima
Mizuki Matsuda
Kazuo Ueda
Gerard B. Remijn
Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
Frontiers in Human Neuroscience
speech
spectro-temporal resolution
intelligibility
mosaic
movie frames
author_facet Yoshitaka Nakajima
Mizuki Matsuda
Kazuo Ueda
Gerard B. Remijn
author_sort Yoshitaka Nakajima
title Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_short Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_full Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_fullStr Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_full_unstemmed Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_sort temporal resolution needed for auditory communication: measurement with mosaic speech
publisher Frontiers Media S.A.
series Frontiers in Human Neuroscience
issn 1662-5161
publishDate 2018-04-01
description Temporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese speech with a systematically degraded temporal resolution, so-called “mosaic speech”: speech mosaicized in the coordinates of time and frequency. The results of two experiments show that mosaic speech cut into short static segments was almost perfectly intelligible with a temporal resolution of 40 ms or finer. Intelligibility dropped for a temporal resolution of 80 ms, but was still around 50%-correct level. The data are in line with previous results showing that speech signals separated into short temporal segments of <100 ms can be remarkably robust in terms of linguistic-content perception against drastic manipulations in each segment, such as partial signal omission or temporal reversal. The human perceptual system thus can extract meaning from unexpectedly rough temporal information in speech. The process resembles that of the visual system stringing together static movie frames of ~40 ms into vivid motion.
topic speech
spectro-temporal resolution
intelligibility
mosaic
movie frames
url http://journal.frontiersin.org/article/10.3389/fnhum.2018.00149/full
work_keys_str_mv AT yoshitakanakajima temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech
AT mizukimatsuda temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech
AT kazuoueda temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech
AT gerardbremijn temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech
_version_ 1724791914316693504