Tensor decomposition of convolutional and fully-connected layers is an
e...
We propose Rockmate to control the memory requirements when training PyT...
Large-scale transformer models have shown remarkable performance in lang...
Modern Deep Neural Networks (DNNs) require significant memory to store
w...
Memory footprint is one of the main limiting factors for large neural ne...
In modern neural networks like Transformers, linear layers require
signi...
A conventional approach to train neural ordinary differential equations
...
Most state of the art deep neural networks are overparameterized and exh...
Normalization is an important and vastly investigated technique in deep
...
In this paper, we propose a method, which allows us to alleviate or
comp...
Active subspace is a model reduction method widely used in the uncertain...
We introduce a new method for speeding up the inference of deep neural
n...
The low-rank tensor approximation is very promising for the compression ...