We integrate ir_datasets, ir_measures, and PyTerrier with TIRA in the In...
When asked, current large language models (LLMs) like ChatGPT claim that...
The Archive Query Log (AQL) is a previously unused, comprehensive query ...
We propose to use captions from the Web as a previously underutilized re...
Pairwise re-ranking models predict which of two documents is more releva...
Neural retrieval models are often trained on (subsets of) the millions o...
We introduce and study the task of clickbait spoiling: generating a shor...
Commercial web search engines employ near-duplicate detection to ensure ...
Web archive analytics is the exploitation of publicly accessible web pag...
Recently, neural networks have been successfully employed to improve upo...
Web search queries can be rather ambiguous: Is "paris hilton" meant to f...
The prerequisite of many approaches to authorship analysis is a represen...
Dagstuhl Seminar 19461 "Conversational Search" was held on 10-15 Novembe...
An abstractive snippet is an originally created piece of text to summari...
This paper discusses the potential for creating academic resources (tool...
We present CAM (comparative argumentative machine), a novel open-domain ...
Clickbait has grown to become a nuisance to social media users and socia...
We study text reuse related to Wikipedia at scale by compiling the first...
We study feature selection as a means to optimize the baseline clickbait...