Large Language Models (LLMs), now used daily by millions of users, can e...
Without proper safeguards, large language models will readily follow mal...
Public figures receive a disproportionate amount of abuse on social medi...
We introduce VisoGender, a novel dataset for benchmarking gender bias in...
Vision-language models are growing in popularity and public visibility t...
The generative AI revolution in recent years has been spurred by an expa...
Large language models (LLMs) are used to generate content for a wide ran...
Online sexism is a widespread and harmful phenomenon. Automated tools ca...
The emergence of large language models (LLMs) represents a major advance...
Annotating abusive language is expensive, logistically complex and creat...
The growing capability and availability of generative language models ha...
Textual data can pose a risk of serious harm. These harms can be categor...
Vision-language models can encode societal biases and stereotypes, but t...
Detecting online hate is a complex task, and low-performing models have ...
Hateful memes pose a unique challenge for current machine learning syste...