A New Approach to Model Pitch Perception Using Sparse Coding.

Our acoustical environment abounds with repetitive sounds, some of which are related to pitch perception. It is still unknown how the auditory system, in processing these sounds, relates a physical stimulus and its percept. Since, in mammals, all auditory stimuli are conveyed into the nervous system...

Full description

Bibliographic Details
Main Authors: Oded Barzelay, Miriam Furst, Omri Barak
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5308863?pdf=render
id doaj-23c44983e2844362852ad5efeff2db13
record_format Article
spelling doaj-23c44983e2844362852ad5efeff2db132020-11-24T21:49:06ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-01-01131e100533810.1371/journal.pcbi.1005338A New Approach to Model Pitch Perception Using Sparse Coding.Oded BarzelayMiriam FurstOmri BarakOur acoustical environment abounds with repetitive sounds, some of which are related to pitch perception. It is still unknown how the auditory system, in processing these sounds, relates a physical stimulus and its percept. Since, in mammals, all auditory stimuli are conveyed into the nervous system through the auditory nerve (AN) fibers, a model should explain the perception of pitch as a function of this particular input. However, pitch perception is invariant to certain features of the physical stimulus. For example, a missing fundamental stimulus with resolved or unresolved harmonics, or a low and high-level amplitude stimulus with the same spectral content-these all give rise to the same percept of pitch. In contrast, the AN representations for these different stimuli are not invariant to these effects. In fact, due to saturation and non-linearity of both cochlear and inner hair cells responses, these differences are enhanced by the AN fibers. Thus there is a difficulty in explaining how pitch percept arises from the activity of the AN fibers. We introduce a novel approach for extracting pitch cues from the AN population activity for a given arbitrary stimulus. The method is based on a technique known as sparse coding (SC). It is the representation of pitch cues by a few spatiotemporal atoms (templates) from among a large set of possible ones (a dictionary). The amount of activity of each atom is represented by a non-zero coefficient, analogous to an active neuron. Such a technique has been successfully applied to other modalities, particularly vision. The model is composed of a cochlear model, an SC processing unit, and a harmonic sieve. We show that the model copes with different pitch phenomena: extracting resolved and non-resolved harmonics, missing fundamental pitches, stimuli with both high and low amplitudes, iterated rippled noises, and recorded musical instruments.http://europepmc.org/articles/PMC5308863?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Oded Barzelay
Miriam Furst
Omri Barak
spellingShingle Oded Barzelay
Miriam Furst
Omri Barak
A New Approach to Model Pitch Perception Using Sparse Coding.
PLoS Computational Biology
author_facet Oded Barzelay
Miriam Furst
Omri Barak
author_sort Oded Barzelay
title A New Approach to Model Pitch Perception Using Sparse Coding.
title_short A New Approach to Model Pitch Perception Using Sparse Coding.
title_full A New Approach to Model Pitch Perception Using Sparse Coding.
title_fullStr A New Approach to Model Pitch Perception Using Sparse Coding.
title_full_unstemmed A New Approach to Model Pitch Perception Using Sparse Coding.
title_sort new approach to model pitch perception using sparse coding.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2017-01-01
description Our acoustical environment abounds with repetitive sounds, some of which are related to pitch perception. It is still unknown how the auditory system, in processing these sounds, relates a physical stimulus and its percept. Since, in mammals, all auditory stimuli are conveyed into the nervous system through the auditory nerve (AN) fibers, a model should explain the perception of pitch as a function of this particular input. However, pitch perception is invariant to certain features of the physical stimulus. For example, a missing fundamental stimulus with resolved or unresolved harmonics, or a low and high-level amplitude stimulus with the same spectral content-these all give rise to the same percept of pitch. In contrast, the AN representations for these different stimuli are not invariant to these effects. In fact, due to saturation and non-linearity of both cochlear and inner hair cells responses, these differences are enhanced by the AN fibers. Thus there is a difficulty in explaining how pitch percept arises from the activity of the AN fibers. We introduce a novel approach for extracting pitch cues from the AN population activity for a given arbitrary stimulus. The method is based on a technique known as sparse coding (SC). It is the representation of pitch cues by a few spatiotemporal atoms (templates) from among a large set of possible ones (a dictionary). The amount of activity of each atom is represented by a non-zero coefficient, analogous to an active neuron. Such a technique has been successfully applied to other modalities, particularly vision. The model is composed of a cochlear model, an SC processing unit, and a harmonic sieve. We show that the model copes with different pitch phenomena: extracting resolved and non-resolved harmonics, missing fundamental pitches, stimuli with both high and low amplitudes, iterated rippled noises, and recorded musical instruments.
url http://europepmc.org/articles/PMC5308863?pdf=render
work_keys_str_mv AT odedbarzelay anewapproachtomodelpitchperceptionusingsparsecoding
AT miriamfurst anewapproachtomodelpitchperceptionusingsparsecoding
AT omribarak anewapproachtomodelpitchperceptionusingsparsecoding
AT odedbarzelay newapproachtomodelpitchperceptionusingsparsecoding
AT miriamfurst newapproachtomodelpitchperceptionusingsparsecoding
AT omribarak newapproachtomodelpitchperceptionusingsparsecoding
_version_ 1725889597496360960