Statistical limits of correlation detection in trees
In this paper we address the problem of testing whether two observed trees (t,t') are sampled either independently or from a joint distribution under which they are correlated. This problem, which we refer to as correlation detection in trees, plays a key role in the study of graph alignment for two correlated random graphs. Motivated by graph alignment, we investigate the conditions of existence of one-sided tests, i.e. tests which have vanishing type I error and non-vanishing power in the limit of large tree depth. For the correlated Galton-Watson model with Poisson offspring of mean λ>0 and correlation parameter s ∈ (0,1), we identify a phase transition in the limit of large degrees at s = √(α), where α∼ 0.3383 is Otter's constant. Namely, we prove that no such test exists for s ≤√(α), and that such a test exists whenever s > √(α), for λ large enough. This result sheds new light on the graph alignment problem in the sparse regime (with O(1) average node degrees) and on the performance of the MPAlign method studied in Ganassali et al. (2021), Piccioli et al. (2021), proving in particular the conjecture of Piccioli et al. (2021) that MPAlign succeeds in the partial recovery task for correlation parameter s>√(α) provided the average node degree λ is large enough.
READ FULL TEXT