Inferring meaningful communities from topology-constrained correlation networks.

Community structure detection is an important tool in graph analysis. This can be done, among other ways, by solving for the partition set which optimizes the modularity scores [Formula: see text]. Here it is shown that topological constraints in correlation graphs induce over-fragmentation of commu...

Full description

Bibliographic Details
Main Authors:	Jose Sergio Hleap, Christian Blouin
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2014-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC4237410?pdf=render

id	doaj-2255f91a930b42ada571108793cfe70b
record_format	Article
spelling	doaj-2255f91a930b42ada571108793cfe70b2020-11-25T02:13:55ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-01911e11343810.1371/journal.pone.0113438Inferring meaningful communities from topology-constrained correlation networks.Jose Sergio HleapChristian BlouinCommunity structure detection is an important tool in graph analysis. This can be done, among other ways, by solving for the partition set which optimizes the modularity scores [Formula: see text]. Here it is shown that topological constraints in correlation graphs induce over-fragmentation of community structures. A refinement step to this optimization based on Linear Discriminant Analysis (LDA) and a statistical test for significance is proposed. In structured simulation constrained by topology, this novel approach performs better than the optimization of modularity alone. This method was also tested with two empirical datasets: the Roll-Call voting in the 110th US Senate constrained by geographic adjacency, and a biological dataset of 135 protein structures constrained by inter-residue contacts. The former dataset showed sub-structures in the communities that revealed a regional bias in the votes which transcend party affiliations. This is an interesting pattern given that the 110th Legislature was assumed to be a highly polarized government. The [Formula: see text]-amylase catalytic domain dataset (biological dataset) was analyzed with and without topological constraints (inter-residue contacts). The results without topological constraints showed differences with the topology constrained one, but the LDA filtering did not change the outcome of the latter. This suggests that the LDA filtering is a robust way to solve the possible over-fragmentation when present, and that this method will not affect the results where there is no evidence of over-fragmentation.http://europepmc.org/articles/PMC4237410?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jose Sergio Hleap Christian Blouin
spellingShingle	Jose Sergio Hleap Christian Blouin Inferring meaningful communities from topology-constrained correlation networks. PLoS ONE
author_facet	Jose Sergio Hleap Christian Blouin
author_sort	Jose Sergio Hleap
title	Inferring meaningful communities from topology-constrained correlation networks.
title_short	Inferring meaningful communities from topology-constrained correlation networks.
title_full	Inferring meaningful communities from topology-constrained correlation networks.
title_fullStr	Inferring meaningful communities from topology-constrained correlation networks.
title_full_unstemmed	Inferring meaningful communities from topology-constrained correlation networks.
title_sort	inferring meaningful communities from topology-constrained correlation networks.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2014-01-01
description	Community structure detection is an important tool in graph analysis. This can be done, among other ways, by solving for the partition set which optimizes the modularity scores [Formula: see text]. Here it is shown that topological constraints in correlation graphs induce over-fragmentation of community structures. A refinement step to this optimization based on Linear Discriminant Analysis (LDA) and a statistical test for significance is proposed. In structured simulation constrained by topology, this novel approach performs better than the optimization of modularity alone. This method was also tested with two empirical datasets: the Roll-Call voting in the 110th US Senate constrained by geographic adjacency, and a biological dataset of 135 protein structures constrained by inter-residue contacts. The former dataset showed sub-structures in the communities that revealed a regional bias in the votes which transcend party affiliations. This is an interesting pattern given that the 110th Legislature was assumed to be a highly polarized government. The [Formula: see text]-amylase catalytic domain dataset (biological dataset) was analyzed with and without topological constraints (inter-residue contacts). The results without topological constraints showed differences with the topology constrained one, but the LDA filtering did not change the outcome of the latter. This suggests that the LDA filtering is a robust way to solve the possible over-fragmentation when present, and that this method will not affect the results where there is no evidence of over-fragmentation.
url	http://europepmc.org/articles/PMC4237410?pdf=render
work_keys_str_mv	AT josesergiohleap inferringmeaningfulcommunitiesfromtopologyconstrainedcorrelationnetworks AT christianblouin inferringmeaningfulcommunitiesfromtopologyconstrainedcorrelationnetworks
_version_	1724903258073333760

Inferring meaningful communities from topology-constrained correlation networks.

Similar Items