Inferring meaningful communities from topology-constrained correlation networks.

Community structure detection is an important tool in graph analysis. This can be done, among other ways, by solving for the partition set which optimizes the modularity scores [Formula: see text]. Here it is shown that topological constraints in correlation graphs induce over-fragmentation of commu...

Full description

Bibliographic Details
Main Authors: Jose Sergio Hleap, Christian Blouin
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4237410?pdf=render
id doaj-2255f91a930b42ada571108793cfe70b
record_format Article
spelling doaj-2255f91a930b42ada571108793cfe70b2020-11-25T02:13:55ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-01911e11343810.1371/journal.pone.0113438Inferring meaningful communities from topology-constrained correlation networks.Jose Sergio HleapChristian BlouinCommunity structure detection is an important tool in graph analysis. This can be done, among other ways, by solving for the partition set which optimizes the modularity scores [Formula: see text]. Here it is shown that topological constraints in correlation graphs induce over-fragmentation of community structures. A refinement step to this optimization based on Linear Discriminant Analysis (LDA) and a statistical test for significance is proposed. In structured simulation constrained by topology, this novel approach performs better than the optimization of modularity alone. This method was also tested with two empirical datasets: the Roll-Call voting in the 110th US Senate constrained by geographic adjacency, and a biological dataset of 135 protein structures constrained by inter-residue contacts. The former dataset showed sub-structures in the communities that revealed a regional bias in the votes which transcend party affiliations. This is an interesting pattern given that the 110th Legislature was assumed to be a highly polarized government. The [Formula: see text]-amylase catalytic domain dataset (biological dataset) was analyzed with and without topological constraints (inter-residue contacts). The results without topological constraints showed differences with the topology constrained one, but the LDA filtering did not change the outcome of the latter. This suggests that the LDA filtering is a robust way to solve the possible over-fragmentation when present, and that this method will not affect the results where there is no evidence of over-fragmentation.http://europepmc.org/articles/PMC4237410?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Jose Sergio Hleap
Christian Blouin
spellingShingle Jose Sergio Hleap
Christian Blouin
Inferring meaningful communities from topology-constrained correlation networks.
PLoS ONE
author_facet Jose Sergio Hleap
Christian Blouin
author_sort Jose Sergio Hleap
title Inferring meaningful communities from topology-constrained correlation networks.
title_short Inferring meaningful communities from topology-constrained correlation networks.
title_full Inferring meaningful communities from topology-constrained correlation networks.
title_fullStr Inferring meaningful communities from topology-constrained correlation networks.
title_full_unstemmed Inferring meaningful communities from topology-constrained correlation networks.
title_sort inferring meaningful communities from topology-constrained correlation networks.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Community structure detection is an important tool in graph analysis. This can be done, among other ways, by solving for the partition set which optimizes the modularity scores [Formula: see text]. Here it is shown that topological constraints in correlation graphs induce over-fragmentation of community structures. A refinement step to this optimization based on Linear Discriminant Analysis (LDA) and a statistical test for significance is proposed. In structured simulation constrained by topology, this novel approach performs better than the optimization of modularity alone. This method was also tested with two empirical datasets: the Roll-Call voting in the 110th US Senate constrained by geographic adjacency, and a biological dataset of 135 protein structures constrained by inter-residue contacts. The former dataset showed sub-structures in the communities that revealed a regional bias in the votes which transcend party affiliations. This is an interesting pattern given that the 110th Legislature was assumed to be a highly polarized government. The [Formula: see text]-amylase catalytic domain dataset (biological dataset) was analyzed with and without topological constraints (inter-residue contacts). The results without topological constraints showed differences with the topology constrained one, but the LDA filtering did not change the outcome of the latter. This suggests that the LDA filtering is a robust way to solve the possible over-fragmentation when present, and that this method will not affect the results where there is no evidence of over-fragmentation.
url http://europepmc.org/articles/PMC4237410?pdf=render
work_keys_str_mv AT josesergiohleap inferringmeaningfulcommunitiesfromtopologyconstrainedcorrelationnetworks
AT christianblouin inferringmeaningfulcommunitiesfromtopologyconstrainedcorrelationnetworks
_version_ 1724903258073333760