Learning and remembering to use APIs are difficult. Several techniques h...
Multi-head attention is a driving force behind state-of-the-art transfor...
We propose heavy ball neural ordinary differential equations (HBNODEs),
...
We propose FMMformers, a class of efficient and flexible transformers
in...
Designing deep neural networks is an art that often involves an expensiv...
Stochastic gradient descent (SGD) with constant momentum and its variant...
Continuous Normalizing Flows (CNFs) have emerged as promising deep gener...