A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks

In complex networks, identifying influential nodes is of great importance for its wide applications. Traditional centrality methods are usually directly based on topological structures of networks, and different centrality methods consider different structural characteristics related to the function...

Full description

Bibliographic Details
Main Authors: Gouheng Zhao, Peng Jia, Cheng Huang, Anmin Zhou, Yong Fang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9050733/
id doaj-2b7b3d52863d4038876d0a21af159445
record_format Article
spelling doaj-2b7b3d52863d4038876d0a21af1594452021-03-30T01:31:58ZengIEEEIEEE Access2169-35362020-01-018654626547110.1109/ACCESS.2020.29842869050733A Machine Learning Based Framework for Identifying Influential Nodes in Complex NetworksGouheng Zhao0https://orcid.org/0000-0002-9696-276XPeng Jia1Cheng Huang2https://orcid.org/0000-0002-5871-946XAnmin Zhou3Yong Fang4College of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaIn complex networks, identifying influential nodes is of great importance for its wide applications. Traditional centrality methods are usually directly based on topological structures of networks, and different centrality methods consider different structural characteristics related to the functional importance. However, in many scenarios, it always exists a complex and nonlinear relationship between the functional importance of a node and its various features including local location, global location, etc., which is hard to be described by one centrality. In order to solve this problem, this paper proposes a framework based on machine learning to measure the importance of nodes in the propagation scenario. This framework first constructs the feature vector of each node based on the existing centrality methods which can reflect nodes' different topological structures and the infection rate which is an important factor in the propagation scenarios, then labels each node based on the real propagation ability obtained from simulated propagation experiments based on SIR model, last uses seven machine learning algorithms to learn the complex relationship between the real propagation ability of each node and its various structural features. The experimental results in real-world networks show that the classification accuracy of the model based on machine learning is generally higher than that of the traditional centrality methods based on one certain topology.https://ieeexplore.ieee.org/document/9050733/Complex networksinfluential nodesmachine learningcentrality
collection DOAJ
language English
format Article
sources DOAJ
author Gouheng Zhao
Peng Jia
Cheng Huang
Anmin Zhou
Yong Fang
spellingShingle Gouheng Zhao
Peng Jia
Cheng Huang
Anmin Zhou
Yong Fang
A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks
IEEE Access
Complex networks
influential nodes
machine learning
centrality
author_facet Gouheng Zhao
Peng Jia
Cheng Huang
Anmin Zhou
Yong Fang
author_sort Gouheng Zhao
title A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks
title_short A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks
title_full A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks
title_fullStr A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks
title_full_unstemmed A Machine Learning Based Framework for Identifying Influential Nodes in Complex Networks
title_sort machine learning based framework for identifying influential nodes in complex networks
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In complex networks, identifying influential nodes is of great importance for its wide applications. Traditional centrality methods are usually directly based on topological structures of networks, and different centrality methods consider different structural characteristics related to the functional importance. However, in many scenarios, it always exists a complex and nonlinear relationship between the functional importance of a node and its various features including local location, global location, etc., which is hard to be described by one centrality. In order to solve this problem, this paper proposes a framework based on machine learning to measure the importance of nodes in the propagation scenario. This framework first constructs the feature vector of each node based on the existing centrality methods which can reflect nodes' different topological structures and the infection rate which is an important factor in the propagation scenarios, then labels each node based on the real propagation ability obtained from simulated propagation experiments based on SIR model, last uses seven machine learning algorithms to learn the complex relationship between the real propagation ability of each node and its various structural features. The experimental results in real-world networks show that the classification accuracy of the model based on machine learning is generally higher than that of the traditional centrality methods based on one certain topology.
topic Complex networks
influential nodes
machine learning
centrality
url https://ieeexplore.ieee.org/document/9050733/
work_keys_str_mv AT gouhengzhao amachinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT pengjia amachinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT chenghuang amachinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT anminzhou amachinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT yongfang amachinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT gouhengzhao machinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT pengjia machinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT chenghuang machinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT anminzhou machinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
AT yongfang machinelearningbasedframeworkforidentifyinginfluentialnodesincomplexnetworks
_version_ 1724186876585181184