Terminal Embeddings in Sublinear Time

10/17/2021
by   Yeshwanth Cherapanamjeri, et al.
0

Recently (Elkin, Filtser, Neiman 2017) introduced the concept of a terminal embedding from one metric space (X,d_X) to another (Y,d_Y) with a set of designated terminals T⊂ X. Such an embedding f is said to have distortion ρ≥ 1 if ρ is the smallest value such that there exists a constant C>0 satisfying ∀ x∈ T ∀ q∈ X, C d_X(x, q) ≤ d_Y(f(x), f(q)) ≤ C ρ d_X(x, q) . In the case that X,Y are both Euclidean metrics with Y being m-dimensional, recently (Narayanan, Nelson 2019), following work of (Mahabadi, Makarychev, Makarychev, Razenshteyn 2018), showed that distortion 1+ϵ is achievable via such a terminal embedding with m = O(ϵ^-2log n) for n := |T|. This generalizes the Johnson-Lindenstrauss lemma, which only preserves distances within T and not to T from the rest of space. The downside is that evaluating the embedding on some q∈ℝ^d required solving a semidefinite program with Θ(n) constraints in m variables and thus required some superlinear poly(n) runtime. Our main contribution in this work is to give a new data structure for computing terminal embeddings. We show how to pre-process T to obtain an almost linear-space data structure that supports computing the terminal embedding image of any q∈ℝ^d in sublinear time O^* (n^1-Θ(ϵ^2) + d). To accomplish this, we leverage tools developed in the context of approximate nearest neighbor search.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset