In this work, we provide a recipe for training machine translation model...
Mixup is a popular data augmentation technique for training deep neural
...
Diffusion-based generative models learn to iteratively transfer unstruct...
Current deep learning approaches have shown good in-distribution
general...
Inspired from human cognition, machine learning systems are gradually
re...
Multi-head, key-value attention is the backbone of the widely successful...
Inducing causal relationships from observations is a classic problem in
...
Robust perception relies on both bottom-up and top-down signals. Bottom-...
We revisit the bias-variance tradeoff for neural networks in light of mo...