New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification
In medical datasets classification, support vector machine (SVM) is considered to be one of the most successful methods. However, most of the real-world medical datasets usually contain some outliers/noise and data often have class imbalance problems. In this paper, a fuzzy support machine (FSVM) fo...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2014-01-01
|
Series: | The Scientific World Journal |
Online Access: | http://dx.doi.org/10.1155/2014/536434 |
id |
doaj-c9c845a4e6d14bb3b3af34adaccea674 |
---|---|
record_format |
Article |
spelling |
doaj-c9c845a4e6d14bb3b3af34adaccea6742020-11-25T00:08:39ZengHindawi LimitedThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/536434536434New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets ClassificationXiaoqing Gu0Tongguang Ni1Hongyuan Wang2School of Information Science and Engineering, Changzhou University, Changzhou 213164, ChinaSchool of Information Science and Engineering, Changzhou University, Changzhou 213164, ChinaSchool of Information Science and Engineering, Changzhou University, Changzhou 213164, ChinaIn medical datasets classification, support vector machine (SVM) is considered to be one of the most successful methods. However, most of the real-world medical datasets usually contain some outliers/noise and data often have class imbalance problems. In this paper, a fuzzy support machine (FSVM) for the class imbalance problem (called FSVM-CIP) is presented, which can be seen as a modified class of FSVM by extending manifold regularization and assigning two misclassification costs for two classes. The proposed FSVM-CIP can be used to handle the class imbalance problem in the presence of outliers/noise, and enhance the locality maximum margin. Five real-world medical datasets, breast, heart, hepatitis, BUPA liver, and pima diabetes, from the UCI medical database are employed to illustrate the method presented in this paper. Experimental results on these datasets show the outperformed or comparable effectiveness of FSVM-CIP.http://dx.doi.org/10.1155/2014/536434 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaoqing Gu Tongguang Ni Hongyuan Wang |
spellingShingle |
Xiaoqing Gu Tongguang Ni Hongyuan Wang New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification The Scientific World Journal |
author_facet |
Xiaoqing Gu Tongguang Ni Hongyuan Wang |
author_sort |
Xiaoqing Gu |
title |
New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification |
title_short |
New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification |
title_full |
New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification |
title_fullStr |
New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification |
title_full_unstemmed |
New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification |
title_sort |
new fuzzy support vector machine for the class imbalance problem in medical datasets classification |
publisher |
Hindawi Limited |
series |
The Scientific World Journal |
issn |
2356-6140 1537-744X |
publishDate |
2014-01-01 |
description |
In medical datasets classification, support vector machine (SVM) is considered to be one of the most successful methods. However, most of the real-world medical datasets usually contain some outliers/noise and data often have class imbalance problems. In this paper, a fuzzy support machine (FSVM) for the class imbalance problem (called FSVM-CIP) is presented, which can be seen as a modified class of FSVM by extending manifold regularization and assigning two misclassification costs for two classes. The proposed FSVM-CIP can be used to handle the class imbalance problem in the presence of outliers/noise, and enhance the locality maximum margin. Five real-world medical datasets, breast, heart, hepatitis, BUPA liver, and pima diabetes, from the UCI medical database are employed to illustrate the method presented in this paper. Experimental results on these datasets show the outperformed or comparable effectiveness of FSVM-CIP. |
url |
http://dx.doi.org/10.1155/2014/536434 |
work_keys_str_mv |
AT xiaoqinggu newfuzzysupportvectormachinefortheclassimbalanceprobleminmedicaldatasetsclassification AT tongguangni newfuzzysupportvectormachinefortheclassimbalanceprobleminmedicaldatasetsclassification AT hongyuanwang newfuzzysupportvectormachinefortheclassimbalanceprobleminmedicaldatasetsclassification |
_version_ |
1725415106771156992 |