Matthew L. Leavitt

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Kyunghyun Cho
218 publications
Jonathan Frankle
28 publications
Ari S. Morcos
28 publications
Ari Morcos
17 publications
Angelica Chen
16 publications
Naomi Saphra
16 publications
Ziliang Zong
12 publications
Jessica Zosa Forde
12 publications
Davis Blalock
8 publications
Cody Blakeney
4 publications
Zachary Ankner
2 publications

research

∙ 09/13/2023

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

Most interpretability research in NLP focuses on understanding the behav...

0 Angelica Chen, et al. ∙

research

∙ 05/24/2023

Dynamic Masking Rate Schedules for MLM Pretraining

Most works on transformers trained with the Masked Language Modeling (ML...

0 Zachary Ankner, et al. ∙

research

∙ 11/01/2022

Reduce, Reuse, Recycle: Improving Training Efficiency with Distillation

Methods for improving the efficiency of deep network training (i.e. the ...

0 Cody Blakeney, et al. ∙

research

∙ 10/22/2020

Towards falsifiable interpretability research

Methods for understanding the decisions of and mechanisms underlying dee...

0 Matthew L. Leavitt, et al. ∙

research

∙ 10/14/2020

Linking average- and worst-case perturbation robustness via class selectivity and dimensionality

Representational sparsity is known to affect robustness to input perturb...

4 Matthew L. Leavitt, et al. ∙

research

∙ 07/08/2020

On the relationship between class selectivity, dimensionality, and robustness

While the relative trade-offs between sparse and distributed representat...

0 Matthew L. Leavitt, et al. ∙

research

∙ 03/03/2020

Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs

Class selectivity, typically defined as how different a neuron's respons...

10 Matthew L. Leavitt, et al. ∙

Matthew L. Leavitt

Featured Co-authors

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

Dynamic Masking Rate Schedules for MLM Pretraining

Reduce, Reuse, Recycle: Improving Training Efficiency with Distillation

Towards falsifiable interpretability research

Linking average- and worst-case perturbation robustness via class selectivity and dimensionality

On the relationship between class selectivity, dimensionality, and robustness

Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs

Sign in with Google

Consider DeepAI Pro