Correctly manipulating program terms in a compiler is surprisingly diffi...
Automatic differentiation (AD) is conventionally understood as a family ...
Algebraic effects and handlers support composable and structured control...
We decompose reverse-mode automatic differentiation into (forward-mode)
...
We present a novel programming language design that attempts to combine ...
How does one compile derivatives of tensor programs, such that the resul...
We present a general approach to batching arbitrary computations for
acc...
We introduce a convolutional neural network that operates directly on gr...
We show that unconverged stochastic gradient descent can be interpreted ...
Tuning hyperparameters of learning algorithms is hard because gradients ...
Markov chain Monte Carlo (MCMC) is a popular and successful general-purp...