Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis

abstract: Auditory scene analysis (ASA) is the process through which listeners parse and organize their acoustic environment into relevant auditory objects. ASA functions by exploiting natural regularities in the structure of auditory information. The current study investigates spectral envelope and...

Full description

Bibliographic Details
Other Authors: Patten, Kristopher Jakob (Author)
Format: Doctoral Thesis
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/2286/R.I.46351
id ndltd-asu.edu-item-46351
record_format oai_dc
spelling ndltd-asu.edu-item-463512018-06-22T03:09:08Z Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis abstract: Auditory scene analysis (ASA) is the process through which listeners parse and organize their acoustic environment into relevant auditory objects. ASA functions by exploiting natural regularities in the structure of auditory information. The current study investigates spectral envelope and its contribution to the perception of changes in pitch and loudness. Experiment 1 constructs a perceptual continuum of twelve f0- and intensity-matched vowel phonemes (i.e. a pure timbre manipulation) and reveals spectral envelope as a primary organizational dimension. The extremes of this dimension are i (as in “bee”) and Ʌ (“bun”). Experiment 2 measures the strength of the relationship between produced f0 and the previously observed phonetic-pitch continuum at three different levels of phonemic constraint. Scat performances and, to a lesser extent, recorded interviews were found to exhibit changes in accordance with the natural regularity; specifically, f0 changes were correlated with the phoneme pitch-height continuum. The more constrained case of lyrical singing did not exhibit the natural regularity. Experiment 3 investigates participant ratings of pitch and loudness as stimuli vary in f0, intensity, and the phonetic-pitch continuum. Psychophysical functions derived from the results reveal that moving from i to Ʌ is equivalent to a .38 semitone decrease in f0 and a .75 dB decrease in intensity. Experiment 4 examines the potentially functional aspect of the pitch, loudness, and spectral envelope relationship. Detection thresholds of stimuli in which all three dimensions change congruently (f0 increase, intensity increase, Ʌ to i) or incongruently (no f0 change, intensity increase, i to Ʌ) are compared using an objective version of the method of limits. Congruent changes did not provide a detection benefit over incongruent changes; however, when the contribution of phoneme change was removed, congruent changes did offer a slight detection benefit, as in previous research. While this relationship does not offer a detection benefit at threshold, there is a natural regularity for humans to produce phonemes at higher f0s according to their relative position on the pitch height continuum. Likewise, humans have a bias to detect pitch and loudness changes in phoneme sweeps in accordance with the natural regularity. Dissertation/Thesis Patten, Kristopher Jakob (Author) McBeath, Michael K (Advisor) Amazeen, Eric L (Committee member) Glenberg, Arthur W (Committee member) Zhou, Yi (Committee member) Arizona State University (Publisher) Cognitive psychology Audiology Psychology audition auditory scene analysis multidimensional scaling perception spectral envelope timbre eng 55 pages Doctoral Dissertation Psychology 2017 Doctoral Dissertation http://hdl.handle.net/2286/R.I.46351 http://rightsstatements.org/vocab/InC/1.0/ All Rights Reserved 2017
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Cognitive psychology
Audiology
Psychology
audition
auditory scene analysis
multidimensional scaling
perception
spectral envelope
timbre
spellingShingle Cognitive psychology
Audiology
Psychology
audition
auditory scene analysis
multidimensional scaling
perception
spectral envelope
timbre
Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis
description abstract: Auditory scene analysis (ASA) is the process through which listeners parse and organize their acoustic environment into relevant auditory objects. ASA functions by exploiting natural regularities in the structure of auditory information. The current study investigates spectral envelope and its contribution to the perception of changes in pitch and loudness. Experiment 1 constructs a perceptual continuum of twelve f0- and intensity-matched vowel phonemes (i.e. a pure timbre manipulation) and reveals spectral envelope as a primary organizational dimension. The extremes of this dimension are i (as in “bee”) and Ʌ (“bun”). Experiment 2 measures the strength of the relationship between produced f0 and the previously observed phonetic-pitch continuum at three different levels of phonemic constraint. Scat performances and, to a lesser extent, recorded interviews were found to exhibit changes in accordance with the natural regularity; specifically, f0 changes were correlated with the phoneme pitch-height continuum. The more constrained case of lyrical singing did not exhibit the natural regularity. Experiment 3 investigates participant ratings of pitch and loudness as stimuli vary in f0, intensity, and the phonetic-pitch continuum. Psychophysical functions derived from the results reveal that moving from i to Ʌ is equivalent to a .38 semitone decrease in f0 and a .75 dB decrease in intensity. Experiment 4 examines the potentially functional aspect of the pitch, loudness, and spectral envelope relationship. Detection thresholds of stimuli in which all three dimensions change congruently (f0 increase, intensity increase, Ʌ to i) or incongruently (no f0 change, intensity increase, i to Ʌ) are compared using an objective version of the method of limits. Congruent changes did not provide a detection benefit over incongruent changes; however, when the contribution of phoneme change was removed, congruent changes did offer a slight detection benefit, as in previous research. While this relationship does not offer a detection benefit at threshold, there is a natural regularity for humans to produce phonemes at higher f0s according to their relative position on the pitch height continuum. Likewise, humans have a bias to detect pitch and loudness changes in phoneme sweeps in accordance with the natural regularity. === Dissertation/Thesis === Doctoral Dissertation Psychology 2017
author2 Patten, Kristopher Jakob (Author)
author_facet Patten, Kristopher Jakob (Author)
title Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis
title_short Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis
title_full Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis
title_fullStr Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis
title_full_unstemmed Natural Correlations of Spectral Envelope and their Contribution to Auditory Scene Analysis
title_sort natural correlations of spectral envelope and their contribution to auditory scene analysis
publishDate 2017
url http://hdl.handle.net/2286/R.I.46351
_version_ 1718701667612884992