Sayeh Sharify

Featured Co-authors

Xin Wang
382 publications
Andreas Moshovos
16 publications
Milos Nikolic
14 publications
Zihao Deng
11 publications
Mostafa Mahmoud
8 publications
Patrick Judd
8 publications
Michael Orshansky
6 publications
Zissis Poulos
6 publications
Alberto Delmas Lascorz
4 publications
Alberto Delmas
4 publications
Dylan Malone Stuart
2 publications

research

∙ 07/11/2023

Mixed-Precision Quantization with Cross-Layer Dependencies

Quantization is commonly used to compress and accelerate deep neural net...

Zihao Deng, et al.

research

∙ 05/10/2018

Laconic Deep Learning Computing

We motivate a method for transparently identifying ineffectual computati...

Sayeh Sharify, et al.

research

∙ 04/17/2018

DPRed: Making Typical Activation Values Matter In Deep Learning Computing

We show that selecting a fixed precision for all activations in Convolut...

Alberto Delmas, et al.

research

∙ 03/09/2018

Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How

We show that, during inference with Convolutional Neural Networks (CNNs)...

Alberto Delmas, et al.

research

∙ 07/27/2017

Tartan: Accelerating Fully-Connected and Convolutional Layers in Deep Learning Networks by Exploiting Numerical Precision Variability

Tartan (TRT), a hardware accelerator for inference with Deep Neural Netw...

Alberto Delmas, et al.

research

∙ 06/23/2017

Loom: Exploiting Weight and Activation Precisions to Accelerate Convolutional Neural Networks

Loom (LM), a hardware inference accelerator for Convolutional Neural Net...

Sayeh Sharify, et al.

research

∙ 06/01/2017

Dynamic Stripes: Exploiting the Dynamic Precision Requirements of Activation Values in Neural Networks

Stripes is a Deep Neural Network (DNN) accelerator that uses bit-serial ...

Alberto Delmas, et al.
