Computer methods for voice analysis

This thesis consists of five articles and a summary. Thethesis deals with methods for measuring properties of thevoice. The methods are all computer-based, but utilisedifferent approaches for measuring different aspects of thevoice. Paper I introduces the Visual Sort and Rate (VSR) method forpercept...

Full description

Bibliographic Details
Main Author: Granqvist, Svante
Format: Doctoral Thesis
Language:English
Published: KTH, Tal, musik och hörsel 2003
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-3485
http://nbn-resolving.de/urn:isbn:91-7283-461-7
id ndltd-UPSALLA1-oai-DiVA.org-kth-3485
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kth-34852013-01-08T13:06:14ZComputer methods for voice analysisengGranqvist, SvanteKTH, Tal, musik och hörselStockholm : KTH2003voice analysisperceptual analysisfundamental frequencycorrelogramaperiodicityFourier analysishigh-speed imaginglaryngoscopyvocal fold vibrationvoice accumulation.TECHNOLOGYTEKNIKVETENSKAPThis thesis consists of five articles and a summary. Thethesis deals with methods for measuring properties of thevoice. The methods are all computer-based, but utilisedifferent approaches for measuring different aspects of thevoice. Paper I introduces the Visual Sort and Rate (VSR) method forperceptual rating of voice quality. The method is based on theVisual Analogue Scale (VAS), but simultaneously shows allstimuli as icons along the VAS on the computer screen. As thelistener places similar-sounding stimuli close to each otherduring the rating process, comparing stimuli becomeseasier. Paper II introduces the correlogram. Fundamental frequencyF0 sometimes cannot be strictly defined, particularly forperturbed voice signals. The method displays multipleconsecutive correlation functions in a grey scale image. Thus,the correlogram avoids selecting a single F0 value. Rather itpresents an unbiased image of periodicity, allowing theinvestigator to select among several candidates, ifappropriate. PaperIII introduces a method for detection of phonation tobe utilised in voice accumulators. The method uses twomicrophones attached near the subject’s ears. Phase andamplitude relations of the microphone signals are used to forma phonation detector. The output of the method can be used tomeasure phonation time, speaking time and fundamental frequencyof the subject, as well as sound pressure level of both thesubject’s voicing and the ambient sounds. Paper IV introduces a method for Fourier analysis ofhigh-speed laryngoscopic imaging. The data from the consecutiveimages are re-arranged to form time-series that reflect thetime-variation of light intensity in each pixel. Each of thesetime series is then analysed by means of Fouriertransformation, such that a spectrum for each pixel isobtained. Several ways of displaying these spectra aredemonstrated. Paper V examines a test set-up for simultaneous recording ofairflow, intra-oral pressure, electro-glottography, audio andhigh-speed imaging. Data are analysed with particular focus onsynchronisation between glottal area and inverse filteredairflow. Several methodological aspects are also examined, suchas the difficulties in synchronising high-speed imaging datawith the other signals. QC 20100609Doctoral thesis, comprehensive summaryinfo:eu-repo/semantics/doctoralThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-3485urn:isbn:91-7283-461-7Trita-TMH, ; 2003:2application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic voice analysis
perceptual analysis
fundamental frequency
correlogram
aperiodicity
Fourier analysis
high-speed imaging
laryngoscopy
vocal fold vibration
voice accumulation.
TECHNOLOGY
TEKNIKVETENSKAP
spellingShingle voice analysis
perceptual analysis
fundamental frequency
correlogram
aperiodicity
Fourier analysis
high-speed imaging
laryngoscopy
vocal fold vibration
voice accumulation.
TECHNOLOGY
TEKNIKVETENSKAP
Granqvist, Svante
Computer methods for voice analysis
description This thesis consists of five articles and a summary. Thethesis deals with methods for measuring properties of thevoice. The methods are all computer-based, but utilisedifferent approaches for measuring different aspects of thevoice. Paper I introduces the Visual Sort and Rate (VSR) method forperceptual rating of voice quality. The method is based on theVisual Analogue Scale (VAS), but simultaneously shows allstimuli as icons along the VAS on the computer screen. As thelistener places similar-sounding stimuli close to each otherduring the rating process, comparing stimuli becomeseasier. Paper II introduces the correlogram. Fundamental frequencyF0 sometimes cannot be strictly defined, particularly forperturbed voice signals. The method displays multipleconsecutive correlation functions in a grey scale image. Thus,the correlogram avoids selecting a single F0 value. Rather itpresents an unbiased image of periodicity, allowing theinvestigator to select among several candidates, ifappropriate. PaperIII introduces a method for detection of phonation tobe utilised in voice accumulators. The method uses twomicrophones attached near the subject’s ears. Phase andamplitude relations of the microphone signals are used to forma phonation detector. The output of the method can be used tomeasure phonation time, speaking time and fundamental frequencyof the subject, as well as sound pressure level of both thesubject’s voicing and the ambient sounds. Paper IV introduces a method for Fourier analysis ofhigh-speed laryngoscopic imaging. The data from the consecutiveimages are re-arranged to form time-series that reflect thetime-variation of light intensity in each pixel. Each of thesetime series is then analysed by means of Fouriertransformation, such that a spectrum for each pixel isobtained. Several ways of displaying these spectra aredemonstrated. Paper V examines a test set-up for simultaneous recording ofairflow, intra-oral pressure, electro-glottography, audio andhigh-speed imaging. Data are analysed with particular focus onsynchronisation between glottal area and inverse filteredairflow. Several methodological aspects are also examined, suchas the difficulties in synchronising high-speed imaging datawith the other signals. === QC 20100609
author Granqvist, Svante
author_facet Granqvist, Svante
author_sort Granqvist, Svante
title Computer methods for voice analysis
title_short Computer methods for voice analysis
title_full Computer methods for voice analysis
title_fullStr Computer methods for voice analysis
title_full_unstemmed Computer methods for voice analysis
title_sort computer methods for voice analysis
publisher KTH, Tal, musik och hörsel
publishDate 2003
url http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-3485
http://nbn-resolving.de/urn:isbn:91-7283-461-7
work_keys_str_mv AT granqvistsvante computermethodsforvoiceanalysis
_version_ 1716508935999979520