Analysis and Modeling of Social In uence in High Performance Computing Workloads

High Performance Computing (HPC) is becoming a common tool in many research areas. Social influence (e.g., project collaboration) among increasing users of HPC systems creates bursty behavior in underlying workloads. This bursty behavior is increasingly common with the advent of grid computing and c...

Full description

Bibliographic Details
Main Author: Zheng, Shuai
Other Authors: Keyes, David E.
Language:en
Published: 2012
Online Access:Zheng, S. (2011). Analysis and Modeling of Social In uence in High Performance Computing Workloads. KAUST Research Repository. https://doi.org/10.25781/KAUST-I9A16
http://hdl.handle.net/10754/209388
id ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-209388
record_format oai_dc
spelling ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-2093882021-09-15T05:06:42Z Analysis and Modeling of Social In uence in High Performance Computing Workloads Zheng, Shuai Keyes, David E. Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division Ahmadia, Aron Zhang, Xiangliang High Performance Computing (HPC) is becoming a common tool in many research areas. Social influence (e.g., project collaboration) among increasing users of HPC systems creates bursty behavior in underlying workloads. This bursty behavior is increasingly common with the advent of grid computing and cloud computing. Mining the user bursty behavior is important for HPC workloads prediction and scheduling, which has direct impact on overall HPC computing performance. A representative work in this area is the Mixed User Group Model (MUGM), which clusters users according to the resource demand features of their submissions, such as duration time and parallelism. However, MUGM has some difficulties when implemented in real-world system. First, representing user behaviors by the features of their resource demand is usually difficult. Second, these features are not always available. Third, measuring the similarities among users is not a well-defined problem. In this work, we propose a Social Influence Model (SIM) to identify, analyze, and quantify the level of social influence across HPC users. The advantage of the SIM model is that it finds HPC communities by analyzing user job submission time, thereby avoiding the difficulties of MUGM. An offline algorithm and a fast-converging, computationally-efficient online learning algorithm for identifying social groups are proposed. Both offline and online algorithms are applied on several HPC and grid workloads, including Grid 5000, EGEE 2005 and 2007, and KAUST Supercomputing Lab (KSL) BGP data. From the experimental results, we show the existence of a social graph, which is characterized by a pattern of dominant users and followers. In order to evaluate the effectiveness of identified user groups, we show the pattern discovered by the offline algorithm follows a power-law distribution, which is consistent with those observed in mainstream social networks. We finally conclude the thesis and discuss future directions of our work. 2012-02-04T08:36:22Z 2012-02-04T08:36:22Z 2011-06 Thesis Zheng, S. (2011). Analysis and Modeling of Social In uence in High Performance Computing Workloads. KAUST Research Repository. https://doi.org/10.25781/KAUST-I9A16 10.25781/KAUST-I9A16 http://hdl.handle.net/10754/209388 en
collection NDLTD
language en
sources NDLTD
description High Performance Computing (HPC) is becoming a common tool in many research areas. Social influence (e.g., project collaboration) among increasing users of HPC systems creates bursty behavior in underlying workloads. This bursty behavior is increasingly common with the advent of grid computing and cloud computing. Mining the user bursty behavior is important for HPC workloads prediction and scheduling, which has direct impact on overall HPC computing performance. A representative work in this area is the Mixed User Group Model (MUGM), which clusters users according to the resource demand features of their submissions, such as duration time and parallelism. However, MUGM has some difficulties when implemented in real-world system. First, representing user behaviors by the features of their resource demand is usually difficult. Second, these features are not always available. Third, measuring the similarities among users is not a well-defined problem. In this work, we propose a Social Influence Model (SIM) to identify, analyze, and quantify the level of social influence across HPC users. The advantage of the SIM model is that it finds HPC communities by analyzing user job submission time, thereby avoiding the difficulties of MUGM. An offline algorithm and a fast-converging, computationally-efficient online learning algorithm for identifying social groups are proposed. Both offline and online algorithms are applied on several HPC and grid workloads, including Grid 5000, EGEE 2005 and 2007, and KAUST Supercomputing Lab (KSL) BGP data. From the experimental results, we show the existence of a social graph, which is characterized by a pattern of dominant users and followers. In order to evaluate the effectiveness of identified user groups, we show the pattern discovered by the offline algorithm follows a power-law distribution, which is consistent with those observed in mainstream social networks. We finally conclude the thesis and discuss future directions of our work.
author2 Keyes, David E.
author_facet Keyes, David E.
Zheng, Shuai
author Zheng, Shuai
spellingShingle Zheng, Shuai
Analysis and Modeling of Social In uence in High Performance Computing Workloads
author_sort Zheng, Shuai
title Analysis and Modeling of Social In uence in High Performance Computing Workloads
title_short Analysis and Modeling of Social In uence in High Performance Computing Workloads
title_full Analysis and Modeling of Social In uence in High Performance Computing Workloads
title_fullStr Analysis and Modeling of Social In uence in High Performance Computing Workloads
title_full_unstemmed Analysis and Modeling of Social In uence in High Performance Computing Workloads
title_sort analysis and modeling of social in uence in high performance computing workloads
publishDate 2012
url Zheng, S. (2011). Analysis and Modeling of Social In uence in High Performance Computing Workloads. KAUST Research Repository. https://doi.org/10.25781/KAUST-I9A16
http://hdl.handle.net/10754/209388
work_keys_str_mv AT zhengshuai analysisandmodelingofsocialinuenceinhighperformancecomputingworkloads
_version_ 1719480887409639424