List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

06/10/2022
by   Ilias Diakonikolas, et al.
5

We study the problem of list-decodable sparse mean estimation. Specifically, for a parameter α∈ (0, 1/2), we are given m points in ℝ^n, ⌊α m ⌋ of which are i.i.d. samples from a distribution D with unknown k-sparse mean μ. No assumptions are made on the remaining points, which form the majority of the dataset. The goal is to return a small list of candidates containing a vector μ such that μ- μ_2 is small. Prior work had studied the problem of list-decodable mean estimation in the dense setting. In this work, we develop a novel, conceptually simpler technique for list-decodable mean estimation. As the main application of our approach, we provide the first sample and computationally efficient algorithm for list-decodable sparse mean estimation. In particular, for distributions with “certifiably bounded” t-th moments in k-sparse directions and sufficiently light tails, our algorithm achieves error of (1/α)^O(1/t) with sample complexity m = (klog(n))^O(t)/α and running time poly(mn^t). For the special case of Gaussian inliers, our algorithm achieves the optimal error guarantee of Θ (√(log(1/α))) with quasi-polynomial sample and computational complexity. We complement our upper bounds with nearly-matching statistical query and low-degree polynomial testing lower bounds.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset