Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech

Temporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese spee...

Full description

Bibliographic Details
Main Authors:	Yoshitaka Nakajima, Mizuki Matsuda, Kazuo Ueda, Gerard B. Remijn
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2018-04-01
Series:	Frontiers in Human Neuroscience
Subjects:	speech spectro-temporal resolution intelligibility mosaic movie frames
Online Access:	http://journal.frontiersin.org/article/10.3389/fnhum.2018.00149/full

id	doaj-8f82a9df7b9b4f7a9dd68489454af2d1
record_format	Article
spelling	doaj-8f82a9df7b9b4f7a9dd68489454af2d12020-11-25T02:38:15ZengFrontiers Media S.A.Frontiers in Human Neuroscience1662-51612018-04-011210.3389/fnhum.2018.00149333579Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic SpeechYoshitaka Nakajima0Mizuki Matsuda1Kazuo Ueda2Gerard B. Remijn3Department of Human Science, Faculty of Design/Research Center for Applied Perceptual Science, Kyushu University, Fukuoka, JapanNihon Kohden Corporation, Tokyo, JapanDepartment of Human Science, Faculty of Design/Research Center for Applied Perceptual Science, Kyushu University, Fukuoka, JapanDepartment of Human Science, Faculty of Design/Research Center for Applied Perceptual Science, Kyushu University, Fukuoka, JapanTemporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese speech with a systematically degraded temporal resolution, so-called “mosaic speech”: speech mosaicized in the coordinates of time and frequency. The results of two experiments show that mosaic speech cut into short static segments was almost perfectly intelligible with a temporal resolution of 40 ms or finer. Intelligibility dropped for a temporal resolution of 80 ms, but was still around 50%-correct level. The data are in line with previous results showing that speech signals separated into short temporal segments of <100 ms can be remarkably robust in terms of linguistic-content perception against drastic manipulations in each segment, such as partial signal omission or temporal reversal. The human perceptual system thus can extract meaning from unexpectedly rough temporal information in speech. The process resembles that of the visual system stringing together static movie frames of ~40 ms into vivid motion.http://journal.frontiersin.org/article/10.3389/fnhum.2018.00149/fullspeechspectro-temporal resolutionintelligibilitymosaicmovie frames
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yoshitaka Nakajima Mizuki Matsuda Kazuo Ueda Gerard B. Remijn
spellingShingle	Yoshitaka Nakajima Mizuki Matsuda Kazuo Ueda Gerard B. Remijn Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech Frontiers in Human Neuroscience speech spectro-temporal resolution intelligibility mosaic movie frames
author_facet	Yoshitaka Nakajima Mizuki Matsuda Kazuo Ueda Gerard B. Remijn
author_sort	Yoshitaka Nakajima
title	Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_short	Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_full	Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_fullStr	Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_full_unstemmed	Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech
title_sort	temporal resolution needed for auditory communication: measurement with mosaic speech
publisher	Frontiers Media S.A.
series	Frontiers in Human Neuroscience
issn	1662-5161
publishDate	2018-04-01
description	Temporal resolution needed for Japanese speech communication was measured. A new experimental paradigm that can reflect the spectro-temporal resolution necessary for healthy listeners to perceive speech is introduced. As a first step, we report listeners' intelligibility scores of Japanese speech with a systematically degraded temporal resolution, so-called “mosaic speech”: speech mosaicized in the coordinates of time and frequency. The results of two experiments show that mosaic speech cut into short static segments was almost perfectly intelligible with a temporal resolution of 40 ms or finer. Intelligibility dropped for a temporal resolution of 80 ms, but was still around 50%-correct level. The data are in line with previous results showing that speech signals separated into short temporal segments of <100 ms can be remarkably robust in terms of linguistic-content perception against drastic manipulations in each segment, such as partial signal omission or temporal reversal. The human perceptual system thus can extract meaning from unexpectedly rough temporal information in speech. The process resembles that of the visual system stringing together static movie frames of ~40 ms into vivid motion.
topic	speech spectro-temporal resolution intelligibility mosaic movie frames
url	http://journal.frontiersin.org/article/10.3389/fnhum.2018.00149/full
work_keys_str_mv	AT yoshitakanakajima temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech AT mizukimatsuda temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech AT kazuoueda temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech AT gerardbremijn temporalresolutionneededforauditorycommunicationmeasurementwithmosaicspeech
_version_	1724791914316693504

Temporal Resolution Needed for Auditory Communication: Measurement With Mosaic Speech

Similar Items