Decoding methods for large language models often trade off between diver...
Recent neural network-based language models have benefited greatly from
...
Textual knowledge bases such as Wikipedia require considerable effort to...
In the twilight of Moore's law, GPUs and other specialized hardware
acce...
TensorFlow Eager is a multi-stage, Python-embedded domain-specific langu...
Techniques such as ensembling and distillation promise model quality
imp...
There is rising interest in vector-space word embeddings and their use i...
Multitask learning algorithms are typically designed assuming some fixed...