Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
Given the exponentially increasing volume of heterogenous medical cases, it is difficult to efficiently perform similarity-based reasoning (SBR) on a centralized machine. In this paper, we investigate how to perform SBR using MapReduce (SBRMR), which is an inference framework for data-intensive appl...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co., Ltd.
2016-08-01
|
Series: | Digital Communications and Networks |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352864816300323 |
id |
doaj-997a09ca3b084ec8924721fd26660671 |
---|---|
record_format |
Article |
spelling |
doaj-997a09ca3b084ec8924721fd266606712021-02-02T00:12:30ZengKeAi Communications Co., Ltd.Digital Communications and Networks2352-86482016-08-012314515010.1016/j.dcan.2016.07.003Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduceHaiyan Yu0Jiang Shen1Man Xu2School of Economics and Management, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Management and Economics, Tianjin University, Tianjin 300072, ChinaBusiness School, Nankai University, Tianjin 300711, ChinaGiven the exponentially increasing volume of heterogenous medical cases, it is difficult to efficiently perform similarity-based reasoning (SBR) on a centralized machine. In this paper, we investigate how to perform SBR using MapReduce (SBRMR), which is an inference framework for data-intensive applications over clusters of computers. To combine the similarities from the individual machines, a mixed integer optimization problem is formulated to filter the priority reference cases. Besides, a resilient mapping mechanism is employed using a quadratic optimization model for weighting the attributes and making the neighborhoods in the same class compact, hence improving the inference capacity. Our experiments on classifying the medical cases demonstrate that SBRMR has approximately 4.1% improvement in classification accuracy over SBR, which suggests that SBRMR is an efficient and resilient similarity-based inference approach.http://www.sciencedirect.com/science/article/pii/S2352864816300323 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Haiyan Yu Jiang Shen Man Xu |
spellingShingle |
Haiyan Yu Jiang Shen Man Xu Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce Digital Communications and Networks |
author_facet |
Haiyan Yu Jiang Shen Man Xu |
author_sort |
Haiyan Yu |
title |
Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce |
title_short |
Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce |
title_full |
Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce |
title_fullStr |
Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce |
title_full_unstemmed |
Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce |
title_sort |
resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in mapreduce |
publisher |
KeAi Communications Co., Ltd. |
series |
Digital Communications and Networks |
issn |
2352-8648 |
publishDate |
2016-08-01 |
description |
Given the exponentially increasing volume of heterogenous medical cases, it is difficult to efficiently perform similarity-based reasoning (SBR) on a centralized machine. In this paper, we investigate how to perform SBR using MapReduce (SBRMR), which is an inference framework for data-intensive applications over clusters of computers. To combine the similarities from the individual machines, a mixed integer optimization problem is formulated to filter the priority reference cases. Besides, a resilient mapping mechanism is employed using a quadratic optimization model for weighting the attributes and making the neighborhoods in the same class compact, hence improving the inference capacity. Our experiments on classifying the medical cases demonstrate that SBRMR has approximately 4.1% improvement in classification accuracy over SBR, which suggests that SBRMR is an efficient and resilient similarity-based inference approach. |
url |
http://www.sciencedirect.com/science/article/pii/S2352864816300323 |
work_keys_str_mv |
AT haiyanyu resilientparallelsimilaritybasedreasoningforclassifyingheterogeneousmedicalcasesinmapreduce AT jiangshen resilientparallelsimilaritybasedreasoningforclassifyingheterogeneousmedicalcasesinmapreduce AT manxu resilientparallelsimilaritybasedreasoningforclassifyingheterogeneousmedicalcasesinmapreduce |
_version_ |
1724314304698646528 |