Seeded Graph Matching via Large Neighborhood Statistics

Copyright © 2019 by SIAM We study a well known noisy model of the graph isomorphism problem. In this model, the goal is to perfectly recover the vertex correspondence between two edge-correlated graphs, with an initial seed set of correctly matched vertex pairs revealed as side information. Specific...

Full description

Bibliographic Details
Main Authors: Mossel, Elchanan (Author), Xu, Jiaming (Author)
Format: Article
Language:English
Published: 2021-11-09T15:19:02Z.
Subjects:
Online Access:Get fulltext
LEADER 02304 am a22001453u 4500
001 137916
042 |a dc 
100 1 0 |a Mossel, Elchanan  |e author 
700 1 0 |a Xu, Jiaming  |e author 
245 0 0 |a Seeded Graph Matching via Large Neighborhood Statistics 
260 |c 2021-11-09T15:19:02Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/137916 
520 |a Copyright © 2019 by SIAM We study a well known noisy model of the graph isomorphism problem. In this model, the goal is to perfectly recover the vertex correspondence between two edge-correlated graphs, with an initial seed set of correctly matched vertex pairs revealed as side information. Specifically, the model first generates a parent graph G0 from Erdos-Rényi random graph G(n, p) and then obtains two children graphs G1 and G2 by subsampling the edge set of G0 twice independently with probability s = Θ(1). The vertex correspondence between G1 and G2 is obscured by randomly permuting the vertex labels of G1 according to a latent permutation π∗. Finally, for each i, π∗(i) is revealed independently with probability α as seeds. In the sparse graph regime where np ≤ n for any < 1/6, we give a polynomial-time algorithm which perfectly recovers π∗, provided that nps2 − log n → +∞ and α ≥ n−1+3. This further leads to a sub-exponential-time, exp nO(), matching algorithm even without seeds. On the contrary, if nps2 − log n = O(1), then perfect recovery is information-theoretically impossible as long as α is bounded away from 1. In the dense graph regime, where np = bna, for fixed constants a, b ∈ (0, 1], we give a polynomial-time algorithm which succeeds when b = O(s) and α = Ω (np)−b1/ac log n. In particular, when a = 1/k for an integer k ≥ 1, α = Ω(log n/n) suffices, yielding a quasi-polynomial-time nO(log n) algorithm matching the best known algorithm by Barak et al. for the problem of graph matching without seeds when k ≥ 153 and extending their result to new values of p for k = 2, . . ., 152. Unlike previous work on graph matching, which used small neighborhoods or small subgraphs with a logarithmic number of vertices in order to match vertices, our algorithms match vertices if their large neighborhoods have a significant overlap in the number of seeds. 
546 |a en 
655 7 |a Article