Coresets for (k, ℓ)-Median Clustering under the Fréchet Distance

04/19/2021
by   Maike Buchin, et al.
0

We present an algorithm for computing ϵ-coresets for (k, ℓ)-median clustering of polygonal curves under the Fréchet distance. This type of clustering, which has recently drawn increasing popularity due to a growing number of applications, is an adaption of Euclidean k-median clustering: we are given a set of polygonal curves and want to compute k center curves such that the sum of distances from the given curves to their nearest center curve is minimal. Additionally, we restrict the complexity, i.e., number of vertices, of the center curves to be at most ℓ each, to suppress overfitting. We achieve this result by applying the improved ϵ-coreset framework by Braverman et al. to a generalized k-median problem over an arbitrary metric space. Later we combine this result with the recent result by Driemel et al. on the VC dimension of metric balls under the Fréchet distance. Finally, we show that ϵ-coresets can be used to improve an existing approximation algorithm for (1,ℓ)-median clustering under the Fréchet distance, taking a further step in the direction of practical (k,ℓ)-median clustering algorithms.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset