Label-Aware Neural Tangent Kernel: Toward Better Generalization and Local Elasticity

10/22/2020
by   Shuxiao Chen, et al.
0

As a popular approach to modeling the dynamics of training overparametrized neural networks (NNs), the neural tangent kernels (NTK) are known to fall behind real-world NNs in generalization ability. This performance gap is in part due to the label agnostic nature of the NTK, which renders the resulting kernel not as locally elastic as NNs <cit.>. In this paper, we introduce a novel approach from the perspective of label-awareness to reduce this gap for the NTK. Specifically, we propose two label-aware kernels that are each a superimposition of a label-agnostic part and a hierarchy of label-aware parts with increasing complexity of label dependence, using the Hoeffding decomposition. Through both theoretical and empirical evidence, we show that the models trained with the proposed kernels better simulate NNs in terms of generalization ability and local elasticity.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset