(Hyper)Graph Embedding and Classification via Simplicial Complexes

This paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures...

Full description

Bibliographic Details
Main Authors: Alessio Martino, Alessandro Giuliani, Antonello Rizzi
Format: Article
Language:English
Published: MDPI AG 2019-10-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/12/11/223
id doaj-12ec282392e6461a86cd350e208d607b
record_format Article
spelling doaj-12ec282392e6461a86cd350e208d607b2020-11-24T22:10:06ZengMDPI AGAlgorithms1999-48932019-10-01121122310.3390/a12110223a12110223(Hyper)Graph Embedding and Classification via Simplicial ComplexesAlessio Martino0Alessandro Giuliani1Antonello Rizzi2Department of Information Engineering, Electronics and Telecommunications, University of Rome “La Sapienza”, Via Eudossiana 18, 00184 Rome, ItalyDepartment of Environment and Health, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, ItalyDepartment of Information Engineering, Electronics and Telecommunications, University of Rome “La Sapienza”, Via Eudossiana 18, 00184 Rome, ItalyThis paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures (i.e., information granules) on the top of which an embedding space can be built by means of symbolic histograms. In the embedding space, any Euclidean pattern recognition system can be used, possibly equipped with feature selection capabilities in order to select the most informative symbols. The selected symbols can be analysed by field-experts in order to extract further knowledge about the process to be modelled by the learning system, hence the proposed modelling strategy can be considered as a grey-box. The proposed embedding has been tested on thirty benchmark datasets for graph classification and, further, we propose two real-world applications, namely predicting proteins’ enzymatic function and solubility propensity starting from their 3D structure in order to give an example of the knowledge discovery phase which can be carried out starting from the proposed embedding strategy.https://www.mdpi.com/1999-4893/12/11/223granular computingembedding spacesgraph embeddingtopological data analysissimplicial complexescomputational biologyprotein contact networkscomplex networkscomplex systems
collection DOAJ
language English
format Article
sources DOAJ
author Alessio Martino
Alessandro Giuliani
Antonello Rizzi
spellingShingle Alessio Martino
Alessandro Giuliani
Antonello Rizzi
(Hyper)Graph Embedding and Classification via Simplicial Complexes
Algorithms
granular computing
embedding spaces
graph embedding
topological data analysis
simplicial complexes
computational biology
protein contact networks
complex networks
complex systems
author_facet Alessio Martino
Alessandro Giuliani
Antonello Rizzi
author_sort Alessio Martino
title (Hyper)Graph Embedding and Classification via Simplicial Complexes
title_short (Hyper)Graph Embedding and Classification via Simplicial Complexes
title_full (Hyper)Graph Embedding and Classification via Simplicial Complexes
title_fullStr (Hyper)Graph Embedding and Classification via Simplicial Complexes
title_full_unstemmed (Hyper)Graph Embedding and Classification via Simplicial Complexes
title_sort (hyper)graph embedding and classification via simplicial complexes
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2019-10-01
description This paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures (i.e., information granules) on the top of which an embedding space can be built by means of symbolic histograms. In the embedding space, any Euclidean pattern recognition system can be used, possibly equipped with feature selection capabilities in order to select the most informative symbols. The selected symbols can be analysed by field-experts in order to extract further knowledge about the process to be modelled by the learning system, hence the proposed modelling strategy can be considered as a grey-box. The proposed embedding has been tested on thirty benchmark datasets for graph classification and, further, we propose two real-world applications, namely predicting proteins’ enzymatic function and solubility propensity starting from their 3D structure in order to give an example of the knowledge discovery phase which can be carried out starting from the proposed embedding strategy.
topic granular computing
embedding spaces
graph embedding
topological data analysis
simplicial complexes
computational biology
protein contact networks
complex networks
complex systems
url https://www.mdpi.com/1999-4893/12/11/223
work_keys_str_mv AT alessiomartino hypergraphembeddingandclassificationviasimplicialcomplexes
AT alessandrogiuliani hypergraphembeddingandclassificationviasimplicialcomplexes
AT antonellorizzi hypergraphembeddingandclassificationviasimplicialcomplexes
_version_ 1725809312469614592