Extracting and Modifying the Spatial Information in Stereo Audio

碩士 === 大同大學 === 資訊工程學系(所) === 94 === In this thesis, the method to extract the spatial information and single representing source of original sound field in stereo, and then synthesis them as demanded are proposed. The objective is to synthesize appropriate sound field corresponding to vary listenin...

Full description

Bibliographic Details
Main Authors: Hui-Yu Tseng, 曾惠虞
Other Authors: Chia-Ming Chang
Format: Others
Language:en_US
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/27468502775850926782
Description
Summary:碩士 === 大同大學 === 資訊工程學系(所) === 94 === In this thesis, the method to extract the spatial information and single representing source of original sound field in stereo, and then synthesis them as demanded are proposed. The objective is to synthesize appropriate sound field corresponding to vary listening condition. The discussed situation is focused on multi-sources playing the same melody by the same music instrument aligned in line. Since each source plays the same melody, the same music scale would be played on the sector in time. Human perception is insensitive to the phase of audio. So we might assume that the magnitudes of spectrogram of each source is similar even their waveforms are different. Therefore, the signal received by microphone could be treated as the summation of one spectrogram with shifts in time and attenuation. It is similar to an image corrupted by a motion blur function. Thus, the concept of image-restoration may be applied to extract the spatial information and single representing source by which the property of time-frequency components of each original source could be represented. The sound field similar to original sound field can be synthesized using the extracted single representing source and the obtained spatial information. Also the spatial information can be modified to synthesize the different sound field for different playback conditions in pleasure. The simulation is performed to confirm the method in this thesis. And the result shows that the concept of image distortion/restoration process with sound spectrogram could be applied to the spatial information extraction and sound field resynthesis. There will be certain compression effects with applying the concept of decomposing and re-synthesizing in this thesis with multi-channel processing in the future.