research ∙ 02/04/2023
Greedy Ordering of Layer Weight Matrices in Transformers Improves Translation
Prior work has attempted to understand the internal structures and funct...
research ∙ 02/15/2022