LITL at SMM4H: An Old-school Feature-based Classifier for Identifying Adverse Effects in Tweets

SMM4H (COLING) 2020 · Ludovic Tanguy, Lydia-Mai Ho-Dac, Cécile Fabre, Roxane Bois, Touati Mohamed Yacine Haddad, Claire Ibarboure, Marie Joyau, François Le moal, Jade Moiilic, Laura Roudaut, Mathilde Simounet, Irena Stankovic, Mickaela Vandewaetere ·

This paper describes our participation to the SMM4H shared task 2. We designed a rule-based classifier that estimates whether a tweet mentions an adverse effect associated to a medication. Our system addresses English and French, and is based on a number of specific word lists and features. These cues were mostly obtained through an extensive corpus analysis of the provided training data. Different weighting schemes were tested (manually tuned or based on a logistic regression), the best one achieving a F1 score of 0.31 for English and 0.15 for French.

PDF Abstract