Want to Identify, Extract and Normalize Adverse Drug Reactions in Tweets? Use RoBERTa
This paper presents our approach for task 2 and task 3 of Social Media Mining for Health (SMM4H) 2020 shared tasks. In task 2, we have to differentiate adverse drug reaction (ADR) tweets from nonADR tweets and is treated as binary classification. Task3 involves extracting ADR mentions and then mapping them to MedDRA codes. Extracting ADR mentions is treated as sequence labeling and normalizing ADR mentions is treated as multi-class classification. Our system is based on pre-trained language model RoBERTa and it achieves a) F1-score of 58 70.1 and relaxed F1-score of 35 5.8 in both the tasks with significant improvements over average scores.
READ FULL TEXT