Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation

As an essential biological feature of human beings, voiceprint is increasingly used in medical research and diagnosis, especially in identifying Parkinson's Disease (PD). This paper proposes a Spectrogram Deep Convolutional Generative Adversarial Network (S-DCGAN) for sample augmentation to ove...

Full description

Bibliographic Details
Main Authors: Zhi-Jing Xu, Rong-Fei Wang, Juan Wang, Da-Hai Yu
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9257451/
id doaj-67bc142d939e46adbb6a884153b8a9b9
record_format Article
spelling doaj-67bc142d939e46adbb6a884153b8a9b92021-03-30T04:17:49ZengIEEEIEEE Access2169-35362020-01-01820688820690010.1109/ACCESS.2020.30377759257451Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample AugmentationZhi-Jing Xu0https://orcid.org/0000-0002-1182-4537Rong-Fei Wang1https://orcid.org/0000-0002-1845-0283Juan Wang2https://orcid.org/0000-0002-7944-3685Da-Hai Yu3https://orcid.org/0000-0002-4663-6165College of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaShanghai Experimental School, Shanghai, ChinaAs an essential biological feature of human beings, voiceprint is increasingly used in medical research and diagnosis, especially in identifying Parkinson's Disease (PD). This paper proposes a Spectrogram Deep Convolutional Generative Adversarial Network (S-DCGAN) for sample augmentation to overcome the limited amount of existing patient voiceprint datasets and samples. S-DCGAN generates a high-resolution spectrogram by increasing network layers, adding the Spectral Normalization (SN) method, and combining feature matching strategy. The high-similarity and low-distortion spectrogram are selected in light of Structural Similarity Index (SSIM) values and Peak Signal to Noise Ratio (PSNR) to augment the samples. Fréchet Inception Distance (FID) and GAN-train result show the generalization ability of the generated data. We construct the ResNet50 model with a Global Average Pooling(GAP) layer to extract the voiceprint features and classify them effectively to improve recognition accuracy. The GAP suppresses the over-fitting problem and optimizes quickly. Finally, on the Sakar dataset, comparative experiments were conducted on different models and classification methods. Results show that the S-DCGAN-ResNet50 hybrid model can achieve the highest voiceprint recognition accuracy of 91.25% and specificity of 92.5%, which can distinguish between PD patients and healthy people more precisely compared with DCGAN-ResNet50. It augments the application environment of voiceprint recognition in the medical field and makes it universal in different datasets.https://ieeexplore.ieee.org/document/9257451/Parkinson’s diseaseResNet50S-DCGANsample augumentationspectrogram
collection DOAJ
language English
format Article
sources DOAJ
author Zhi-Jing Xu
Rong-Fei Wang
Juan Wang
Da-Hai Yu
spellingShingle Zhi-Jing Xu
Rong-Fei Wang
Juan Wang
Da-Hai Yu
Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation
IEEE Access
Parkinson’s disease
ResNet50
S-DCGAN
sample augumentation
spectrogram
author_facet Zhi-Jing Xu
Rong-Fei Wang
Juan Wang
Da-Hai Yu
author_sort Zhi-Jing Xu
title Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation
title_short Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation
title_full Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation
title_fullStr Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation
title_full_unstemmed Parkinson’s Disease Detection Based on Spectrogram-Deep Convolutional Generative Adversarial Network Sample Augmentation
title_sort parkinson’s disease detection based on spectrogram-deep convolutional generative adversarial network sample augmentation
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description As an essential biological feature of human beings, voiceprint is increasingly used in medical research and diagnosis, especially in identifying Parkinson's Disease (PD). This paper proposes a Spectrogram Deep Convolutional Generative Adversarial Network (S-DCGAN) for sample augmentation to overcome the limited amount of existing patient voiceprint datasets and samples. S-DCGAN generates a high-resolution spectrogram by increasing network layers, adding the Spectral Normalization (SN) method, and combining feature matching strategy. The high-similarity and low-distortion spectrogram are selected in light of Structural Similarity Index (SSIM) values and Peak Signal to Noise Ratio (PSNR) to augment the samples. Fréchet Inception Distance (FID) and GAN-train result show the generalization ability of the generated data. We construct the ResNet50 model with a Global Average Pooling(GAP) layer to extract the voiceprint features and classify them effectively to improve recognition accuracy. The GAP suppresses the over-fitting problem and optimizes quickly. Finally, on the Sakar dataset, comparative experiments were conducted on different models and classification methods. Results show that the S-DCGAN-ResNet50 hybrid model can achieve the highest voiceprint recognition accuracy of 91.25% and specificity of 92.5%, which can distinguish between PD patients and healthy people more precisely compared with DCGAN-ResNet50. It augments the application environment of voiceprint recognition in the medical field and makes it universal in different datasets.
topic Parkinson’s disease
ResNet50
S-DCGAN
sample augumentation
spectrogram
url https://ieeexplore.ieee.org/document/9257451/
work_keys_str_mv AT zhijingxu parkinsonx2019sdiseasedetectionbasedonspectrogramdeepconvolutionalgenerativeadversarialnetworksampleaugmentation
AT rongfeiwang parkinsonx2019sdiseasedetectionbasedonspectrogramdeepconvolutionalgenerativeadversarialnetworksampleaugmentation
AT juanwang parkinsonx2019sdiseasedetectionbasedonspectrogramdeepconvolutionalgenerativeadversarialnetworksampleaugmentation
AT dahaiyu parkinsonx2019sdiseasedetectionbasedonspectrogramdeepconvolutionalgenerativeadversarialnetworksampleaugmentation
_version_ 1724181978460192768