The emergence of the Internet of Things (IoT) has resulted in a remarkab...
Specialized accelerators are increasingly used to meet the power-perform...
This work analyzes how attention-based Bidirectional Long Short-Term Mem...
Transformer-based language models such as BERT provide significant accur...
Conventional hardware-friendly quantization methods, such as fixed-point...