MAILEX: Email Event and Argument Extraction

05/22/2023
by Saurabh Srivastava, et al.

In this work, we present the first dataset, MAILEX, for performing event extraction from conversational email threads. To this end, we first proposed a new taxonomy covering 10 event types and 76 arguments in the email domain. Our final dataset includes ∼4K emails annotated with ∼9K event instances. To understand the task challenges, we conducted a series of experiments comparing two common lines of approaches for event extraction: sequence labeling and generative end-to-end extraction (including few-shot GPT-3.5). Our results showed that email event extraction is far from solved, owing to challenges such as extracting non-continuous, shared trigger spans, extracting non-named-entity arguments, and modeling the email conversational history. Our work thus calls for further investigation of this domain-specific event extraction task. The source code and dataset can be obtained from <https://github.com/salokr/Email-Event-Extraction>.
