Large Language Models (LLMs) are becoming increasingly smart and autonom...
Generative Large Language Models (LLMs) have demonstrated remarkable res...
GAN-generated image detection now becomes the first line of defense agai...
The explosive growth of language models and their applications have led ...
Instruction-tuned LMs such as ChatGPT, FLAN, and InstructGPT are finetun...
The field of natural language processing (NLP) has made significant stri...
Reinforcement learning (RL) is one of the most important branches of AI....
Prompt Tuning, conditioning on task-specific learned prompt vectors, has...
Multitask prompted finetuning (MTF) has been shown to help large languag...
The crystallization of modeling methods around the Transformer architect...
Nowadays, online screen sharing and remote cooperation are becoming
ubiq...
Recent state-of-the-art computer vision systems are trained from natural...
Machine learning models are vulnerable to data inference attacks, such a...
The current standard approach to scaling transformer language models tra...
Nowadays, there is an explosive growth of screen contents due to the wid...
Large language models have recently been shown to attain reasonable zero...
We demonstrate that, hidden within one-layer randomly weighted neural
ne...
Most existing Vision-and-Language (V L) models rely on pre-trained vis...
A major challenge in deploying transformer models is their prohibitive
i...
Pruning is an effective method to reduce the memory footprint and
comput...
We demonstrate that transformers obtain impressive performance even when...
Federated learning is an improved version of distributed machine learnin...
Phrase localization is a task that studies the mapping from textual phra...
In this paper we apply self-knowledge distillation to text summarization...
Planning is one of the main approaches used to improve agents' working
e...
We introduce AdaHessian, a second order stochastic optimization algorith...
The standard normalization method for neural network (NN) models used in...
Since hardware resources are limited, the objective of training deep lea...
Transformer based architectures have become de-facto models used for a r...
We improve the informativeness of models for conditional text generation...
Question answering (QA) has achieved promising progress recently. Howeve...
Most existing sentiment analysis approaches heavily rely on a large amou...
Emojis have gained incredible popularity in recent years and become a ne...