A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing

Recently, the importance of mobile cloud computing has increased. Mobile devices can collect personal data from various sensors within a shorter period of time and sensor-based data consists of valuable information from users. Advanced computation power and data analysis technology based on cloud co...

Full description

Bibliographic Details
Main Authors: Jae-Hee Hur, Sun-Young Ihm, Young-Ho Park
Format: Article
Language:English
Published: Hindawi-Wiley 2017-01-01
Series:Wireless Communications and Mobile Computing
Online Access:http://dx.doi.org/10.1155/2017/6817627
id doaj-d78c4af60a9e4c47b0c1e31dc5c9b528
record_format Article
spelling doaj-d78c4af60a9e4c47b0c1e31dc5c9b5282020-11-25T01:17:02ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772017-01-01201710.1155/2017/68176276817627A Variable Impacts Measurement in Random Forest for Mobile Cloud ComputingJae-Hee Hur0Sun-Young Ihm1Young-Ho Park2Department of IT Engineering, Sookmyung Women’s University, Cheongpa-ro 47-gil 100, Yongsan-gu, Seoul 04310, Republic of KoreaBig Data Using Research Center, Sookmyung Women’s University, Cheongpa-ro 47-gil 100, Yongsan-gu, Seoul 04310, Republic of KoreaDepartment of IT Engineering, Sookmyung Women’s University, Cheongpa-ro 47-gil 100, Yongsan-gu, Seoul 04310, Republic of KoreaRecently, the importance of mobile cloud computing has increased. Mobile devices can collect personal data from various sensors within a shorter period of time and sensor-based data consists of valuable information from users. Advanced computation power and data analysis technology based on cloud computing provide an opportunity to classify massive sensor data into given labels. Random forest algorithm is known as black box model which is hardly able to interpret the hidden process inside. In this paper, we propose a method that analyzes the variable impact in random forest algorithm to clarify which variable affects classification accuracy the most. We apply Shapley Value with random forest to analyze the variable impact. Under the assumption that every variable cooperates as players in the cooperative game situation, Shapley Value fairly distributes the payoff of variables. Our proposed method calculates the relative contributions of the variables within its classification process. In this paper, we analyze the influence of variables and list the priority of variables that affect classification accuracy result. Our proposed method proves its suitability for data interpretation in black box model like a random forest so that the algorithm is applicable in mobile cloud computing environment.http://dx.doi.org/10.1155/2017/6817627
collection DOAJ
language English
format Article
sources DOAJ
author Jae-Hee Hur
Sun-Young Ihm
Young-Ho Park
spellingShingle Jae-Hee Hur
Sun-Young Ihm
Young-Ho Park
A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing
Wireless Communications and Mobile Computing
author_facet Jae-Hee Hur
Sun-Young Ihm
Young-Ho Park
author_sort Jae-Hee Hur
title A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing
title_short A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing
title_full A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing
title_fullStr A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing
title_full_unstemmed A Variable Impacts Measurement in Random Forest for Mobile Cloud Computing
title_sort variable impacts measurement in random forest for mobile cloud computing
publisher Hindawi-Wiley
series Wireless Communications and Mobile Computing
issn 1530-8669
1530-8677
publishDate 2017-01-01
description Recently, the importance of mobile cloud computing has increased. Mobile devices can collect personal data from various sensors within a shorter period of time and sensor-based data consists of valuable information from users. Advanced computation power and data analysis technology based on cloud computing provide an opportunity to classify massive sensor data into given labels. Random forest algorithm is known as black box model which is hardly able to interpret the hidden process inside. In this paper, we propose a method that analyzes the variable impact in random forest algorithm to clarify which variable affects classification accuracy the most. We apply Shapley Value with random forest to analyze the variable impact. Under the assumption that every variable cooperates as players in the cooperative game situation, Shapley Value fairly distributes the payoff of variables. Our proposed method calculates the relative contributions of the variables within its classification process. In this paper, we analyze the influence of variables and list the priority of variables that affect classification accuracy result. Our proposed method proves its suitability for data interpretation in black box model like a random forest so that the algorithm is applicable in mobile cloud computing environment.
url http://dx.doi.org/10.1155/2017/6817627
work_keys_str_mv AT jaeheehur avariableimpactsmeasurementinrandomforestformobilecloudcomputing
AT sunyoungihm avariableimpactsmeasurementinrandomforestformobilecloudcomputing
AT younghopark avariableimpactsmeasurementinrandomforestformobilecloudcomputing
AT jaeheehur variableimpactsmeasurementinrandomforestformobilecloudcomputing
AT sunyoungihm variableimpactsmeasurementinrandomforestformobilecloudcomputing
AT younghopark variableimpactsmeasurementinrandomforestformobilecloudcomputing
_version_ 1725148682577248256