In this work, we develop and release Llama 2, a collection of pretrained...
In-context learning (ICL) improves language models' performance on a var...
Recent work has shown that fine-tuning large pre-trained language models...
Self-supervised pretraining has made few-shot learning possible for many...
Large language models, which are often trained for hundreds of thousands...
Mixture of Experts layers (MoEs) enable efficient scaling of language mo...
Large-scale autoregressive language models such as GPT-3 are few-shot learners...
We propose EXAMS – a new benchmark dataset for cross-lingual and multilingual...
We describe our system for finding good answers in a community forum, as...
There are different definitions of what a troll is. Certainly, a troll c...
In this work, we propose to use linguistic annotations as a basis for a...
We present a new kind of question answering dataset, OpenBookQA, modeled...
We introduce a neural reading comprehension model that integrates extern...
Reading comprehension is a challenging task in natural language processi...
We transfer a key idea from the field of sentiment analysis to a new dom...
This paper describes two supervised baseline systems for the Story Cloze...