no code implementations • EMNLP (WNUT) 2020 • Shriphani Palakodety, Ashiqur KhudaBukhsh
We release an annotated data set of 1, 000 comments in ten Romanized languages as a social media evaluation benchmark.
no code implementations • 31 Aug 2020 • Ashiqur R. KhudaBukhsh, Shriphani Palakodety, Tom M. Mitchell
In this work, we utilize a single Skip-gram model trained on a multilingual corpus yielding polyglot word embeddings, and present a novel finding that a surprisingly simple constrained nearest-neighbor sampling technique in this embedding space can retrieve bilingual lexicons, even in harsh social media data sets predominantly written in English and Romanized Hindi and often exhibiting code switching.
no code implementations • 30 Jan 2020 • Ashiqur R. KhudaBukhsh, Shriphani Palakodety, Jaime G. Carbonell
Code mixing (or code switching) is a common phenomenon observed in social-media content generated by a linguistically diverse user-base.
no code implementations • 8 Oct 2019 • Shriphani Palakodety, Ashiqur R. KhudaBukhsh, Jaime G. Carbonell
The Rohingya refugee crisis is one of the biggest humanitarian crises of modern times with more than 600, 000 Rohingyas rendered homeless according to the United Nations High Commissioner for Refugees.
no code implementations • 11 Sep 2019 • Shriphani Palakodety, Ashiqur R. KhudaBukhsh, Jaime G. Carbonell
The recent Pulwama terror attack (February 14, 2019, Pulwama, Kashmir) triggered a chain of escalating events between India and Pakistan adding another episode to their 70-year-old dispute over Kashmir.
no code implementations • 10 Dec 2015 • Abhimanu Kumar, Shriphani Palakodety, Chong Wang, Carolyn P. Rose, Eric P. Xing, Miaomiao Wen
Online discussion forums are complex webs of overlapping subcommunities (macrolevel structure, across threads) in which users enact different roles depending on which subcommunity they are participating in within a particular time point (microlevel structure, within threads).