Universal almost optimal compression and Slepian-Wolf coding in probabilistic polynomial time
In a lossless compression system with target lengths, a compressor C maps an integer m and a binary string x to an m-bit code p, and if m is sufficiently large, a decompressor D reconstructs x from p. We call a pair (m,x)achievable for ( C, D) if this reconstruction is successful. We introduce the notion of an optimal compressor C_opt, by the following universality property: For any compressor-decompressor pair ( C, D), there exists a decompressor D' such that if (m,x) is achievable for ( C, D), then (m+Δ, x) is achievable for ( C_opt, D'), where Δ is some small value called the overhead. We show that there exists an optimal compressor that has only polylogarithmic overhead and works in probabilistic polynomial time. Differently said, for any pair ( C, D), no matter how slow C is, or even if C is non-computable, C_opt is a fixed compressor that in polynomial time produces codes almost as short as those of C. The cost is that the corresponding decompressor is slower. We also show that each such optimal compressor can be used for distributed compression, in which case it can achieve optimal compression rates, as given in the Slepian-Wolf theorem, and even for the Kolmogorov complexity variant of this theorem. Moreover, the overhead is logarithmic in the number of sources, and unlike previous implementations of Slepian-Wolf coding, meaningful compression can still be achieved if the number of sources is much larger than the length of the compressed strings.
READ FULL TEXT