Statistics of natural reverberation enable perceptual separation of sound and space

In everyday listening, sound reaches our ears directly from a source as well as indirectly via reflections known as reverberation. Reverberation profoundly distorts the sound from a source, yet humans can both identify sound sources and distinguish environments from the resulting sound, via mechanis...

Full description

Bibliographic Details
Main Authors: Traer, James A (Contributor), McDermott, Joshua H. (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences (Contributor)
Format: Article
Language:English
Published: National Academy of Sciences (U.S.), 2017-11-14T18:33:26Z.
Subjects:
Online Access:Get fulltext
LEADER 02497 am a22001813u 4500
001 112183
042 |a dc 
100 1 0 |a Traer, James A  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences  |e contributor 
100 1 0 |a Traer, James A  |e contributor 
100 1 0 |a McDermott, Joshua H.  |e contributor 
700 1 0 |a McDermott, Joshua H.  |e author 
245 0 0 |a Statistics of natural reverberation enable perceptual separation of sound and space 
260 |b National Academy of Sciences (U.S.),   |c 2017-11-14T18:33:26Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/112183 
520 |a In everyday listening, sound reaches our ears directly from a source as well as indirectly via reflections known as reverberation. Reverberation profoundly distorts the sound from a source, yet humans can both identify sound sources and distinguish environments from the resulting sound, via mechanisms that remain unclear. The core computational challenge is that the acoustic signatures of the source and environment are combined in a single signal received by the ear. Here we ask whether our recognition of sound sources and spaces reflects an ability to separate their effects and whether any such separation is enabled by statistical regularities of real-world reverberation. To first determine whether such statistical regularities exist, we measured impulse responses (IRs) of 271 spaces sampled from the distribution encountered by humans during daily life. The sampled spaces were diverse, but their IRs were tightly constrained, exhibiting exponential decay at frequency-dependent rates: Mid frequencies reverberated longest whereas higher and lower frequencies decayed more rapidly, presumably due to absorptive properties of materials and air. To test whether humans leverage these regularities, we manipulated IR decay characteristics in simulated reverberant audio. Listeners could discriminate sound sources and environments from these signals, but their abilities degraded when reverberation characteristics deviated from those of real-world environments. Subjectively, atypical IRs were mistaken for sound sources. The results suggest the brain separates sound into contributions from the source and the environment, constrained by a prior on natural reverberation. This separation process may contribute to robust recognition while providing information about spaces around us. 
655 7 |a Article 
773 |t Proceedings of the National Academy of Sciences