Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce

Given the exponentially increasing volume of heterogenous medical cases, it is difficult to efficiently perform similarity-based reasoning (SBR) on a centralized machine. In this paper, we investigate how to perform SBR using MapReduce (SBRMR), which is an inference framework for data-intensive appl...

Full description

Bibliographic Details
Main Authors: Haiyan Yu, Jiang Shen, Man Xu
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2016-08-01
Series:Digital Communications and Networks
Online Access:http://www.sciencedirect.com/science/article/pii/S2352864816300323
id doaj-997a09ca3b084ec8924721fd26660671
record_format Article
spelling doaj-997a09ca3b084ec8924721fd266606712021-02-02T00:12:30ZengKeAi Communications Co., Ltd.Digital Communications and Networks2352-86482016-08-012314515010.1016/j.dcan.2016.07.003Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduceHaiyan Yu0Jiang Shen1Man Xu2School of Economics and Management, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaCollege of Management and Economics, Tianjin University, Tianjin 300072, ChinaBusiness School, Nankai University, Tianjin 300711, ChinaGiven the exponentially increasing volume of heterogenous medical cases, it is difficult to efficiently perform similarity-based reasoning (SBR) on a centralized machine. In this paper, we investigate how to perform SBR using MapReduce (SBRMR), which is an inference framework for data-intensive applications over clusters of computers. To combine the similarities from the individual machines, a mixed integer optimization problem is formulated to filter the priority reference cases. Besides, a resilient mapping mechanism is employed using a quadratic optimization model for weighting the attributes and making the neighborhoods in the same class compact, hence improving the inference capacity. Our experiments on classifying the medical cases demonstrate that SBRMR has approximately 4.1% improvement in classification accuracy over SBR, which suggests that SBRMR is an efficient and resilient similarity-based inference approach.http://www.sciencedirect.com/science/article/pii/S2352864816300323
collection DOAJ
language English
format Article
sources DOAJ
author Haiyan Yu
Jiang Shen
Man Xu
spellingShingle Haiyan Yu
Jiang Shen
Man Xu
Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
Digital Communications and Networks
author_facet Haiyan Yu
Jiang Shen
Man Xu
author_sort Haiyan Yu
title Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
title_short Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
title_full Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
title_fullStr Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
title_full_unstemmed Resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in MapReduce
title_sort resilient parallel similarity-based reasoning for classifying heterogeneous medical cases in mapreduce
publisher KeAi Communications Co., Ltd.
series Digital Communications and Networks
issn 2352-8648
publishDate 2016-08-01
description Given the exponentially increasing volume of heterogenous medical cases, it is difficult to efficiently perform similarity-based reasoning (SBR) on a centralized machine. In this paper, we investigate how to perform SBR using MapReduce (SBRMR), which is an inference framework for data-intensive applications over clusters of computers. To combine the similarities from the individual machines, a mixed integer optimization problem is formulated to filter the priority reference cases. Besides, a resilient mapping mechanism is employed using a quadratic optimization model for weighting the attributes and making the neighborhoods in the same class compact, hence improving the inference capacity. Our experiments on classifying the medical cases demonstrate that SBRMR has approximately 4.1% improvement in classification accuracy over SBR, which suggests that SBRMR is an efficient and resilient similarity-based inference approach.
url http://www.sciencedirect.com/science/article/pii/S2352864816300323
work_keys_str_mv AT haiyanyu resilientparallelsimilaritybasedreasoningforclassifyingheterogeneousmedicalcasesinmapreduce
AT jiangshen resilientparallelsimilaritybasedreasoningforclassifyingheterogeneousmedicalcasesinmapreduce
AT manxu resilientparallelsimilaritybasedreasoningforclassifyingheterogeneousmedicalcasesinmapreduce
_version_ 1724314304698646528