Persistence Codebooks for Topological Data Analysis

02/13/2018
by   Bartosz Zielinski, et al.
0

Topological data analysis, such as persistent homology has shown beneficial properties for machine learning in many tasks. Topological representations, such as the persistence diagram (PD), however, have a complex structure (multiset of intervals) which makes it difficult to combine with typical machine learning workflows. We present novel compact fixed-size vectorial representations of PDs based on clustering and bag of words encodings that cope well with the inherent sparsity of PDs. Our novel representations outperform state-of-the-art approaches from topological data analysis and are computationally more efficient.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset