How do people internalize visualizations: as images or information? In t...
Cross-task generalization is a significant outcome that defines mastery ...
Large Language Models (LLMs) have gained widespread popularity due to th...
Recent research has shown that language models exploit `artifacts' in
be...
Evaluation of models on benchmarks is unreliable without knowing the deg...
Several benchmarks have been built with heavy investment in resources to...
The electrical power grid is a critical infrastructure, with disruptions...
Traditional deep learning interpretability methods which are suitable fo...
Alluvial diagrams are a popular technique for visualizing flow and relat...
We develop a visual analytics system, NewsKaleidoscope, to investigate t...
Reinforcement Learning (RL) is a widely-used technique in many domains,
...
Visual data storytelling is gaining importance as a means of presenting
...
A `state of the art' model A surpasses humans in a benchmark B, but fail...
Models that surpass human performance on several popular benchmarks disp...
Neural language models have achieved human level performance across seve...