Most interpretability research in NLP focuses on understanding the behav...
Most works on transformers trained with the Masked Language Modeling (ML...
Methods for improving the efficiency of deep network training (i.e. the
...
Methods for understanding the decisions of and mechanisms underlying dee...
Representational sparsity is known to affect robustness to input
perturb...
While the relative trade-offs between sparse and distributed representat...
Class selectivity, typically defined as how different a neuron's respons...