LS-Tree: Model Interpretation When the Data Are Linguistic

02/11/2019
by   Jianbo Chen, et al.
22

We study the problem of interpreting trained classification models in the setting of linguistic data sets. Leveraging a parse tree, we propose to assign least-squares based importance scores to each word of an instance by exploiting syntactic constituency structure. We establish an axiomatic characterization of these importance scores by relating them to the Banzhaf value in coalitional game theory. Based on these importance scores, we develop a principled method for detecting and quantifying interactions between words in a sentence. We demonstrate that the proposed method can aid in interpretability and diagnostics for several widely-used language models.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset