We study how vision-language models trained on Internet-scale data can b...
Training complex machine learning (ML) architectures requires a compute ...
We present two new classes of algorithms for efficient field integration...
The problem of efficient approximation of a linear operator induced by t...
We revisit the problem of learning mixtures of spherical Gaussians. Give...
Opinion summarization is the task of creating summaries capturing popula...
We introduce chefs' random tables (CRTs), a new class of non-trigonometr...
In this work we introduce KERNELIZED TRANSFORMER, a generic, scalable, d...
Hierarchical clustering is a critical task in numerous domains. Many
app...
Bottom-up algorithms such as the classic hierarchical agglomerative
clus...
Transformers-based models, such as BERT, have been one of the most succe...
Bayesian nonparametric (BNP) models provide elegant methods for discover...
The fundamental task of general density estimation has been of keen inte...
Accurate and transparent prediction of cancer survival times on the leve...
Linear approximations to the decision boundary of a complex model have b...
We introduce contextual explanation networks (CENs)---a class of models ...
Indian Buffet Process based models are an elegant way for discovering
un...
We provide a solution for elementary science test using instructional
ma...
Kernel methods are ubiquitous tools in machine learning. They have prove...
Nonparametric mixture models based on the Dirichlet process are an elega...
A single, stationary topic model such as latent Dirichlet allocation is
...