1 code implementation • EMNLP (ACL) 2021 • Vibhu Bhatia, Vidya Prasad Akavoor, Sejin Paik, Lei Guo, Mona Jalal, Alyssa Smith, David Assefa Tofu, Edward Edberg Halim, Yimeng Sun, Margrit Betke, Prakash Ishwar, Derry Tanti Wijaya
We propose and guide users through a five-step end-to-end computational framing analysis framework grounded in media framing theory in communication research.
no code implementations • LREC 2022 • Anietie Andy, Reno Kriz, Sharath Chandra Guntuku, Derry Tanti Wijaya, Chris Callison-Burch
While popular Television (TV) shows are airing, some users interested in these shows publish social media posts about the show.
no code implementations • EMNLP 2021 • Mohammad Sadegh Rasooli, Chris Callison-Burch, Derry Tanti Wijaya
Our captioning results on Arabic are slightly better than that of its supervised model.
no code implementations • LREC 2022 • Carley Reardon, Sejin Paik, Ge Gao, Meet Parekh, Yanling Zhao, Lei Guo, Margrit Betke, Derry Tanti Wijaya
As such, we introduce a U. S. gun violence news dataset that contains news headline and image pairings from 840 news articles with 15K high-quality, crowdsourced annotations on emotional responses to the news pairings.
1 code implementation • PoliticalNLP (LREC) 2022 • Sha Lai, Yanru Jiang, Lei Guo, Margrit Betke, Prakash Ishwar, Derry Tanti Wijaya
We discuss the effectiveness of our approach by comparing the frames it generates in an unsupervised manner to the domain-expert-derived frames for the issue of gun violence, for which a supervised learning model for frame recognition exists.
no code implementations • COLING (CRAC) 2020 • Anietie Andy, Chris Callison-Burch, Derry Tanti Wijaya
Many people live-tweet televised events like Presidential debates and popular TV-shows and discuss people or characters in the event.
1 code implementation • 14 Nov 2024 • Mohammad Rifqi Farhansyah, Muhammad Zuhdi Fikri Johari, Afinzaki Amiral, Ayu Purwarianti, Kumara Ari Yuana, Derry Tanti Wijaya
However, most of these efforts have been focused on creating manual resources thus difficult to scale to more languages.
Optical Character Recognition
Optical Character Recognition (OCR)
1 code implementation • 1 Nov 2024 • David Anugraha, Garry Kuwanto, Lucky Susanto, Derry Tanti Wijaya, Genta Indra Winata
We present MetaMetrics-MT, an innovative metric designed to evaluate machine translation (MT) tasks by aligning closely with human preferences through Bayesian optimization with Gaussian Processes.
1 code implementation • 30 Oct 2024 • Garry Kuwanto, Chaitanya Agarwal, Genta Indra Winata, Derry Tanti Wijaya
Code-switching, the phenomenon of alternating between two or more languages in a single conversation, presents unique challenges for Natural Language Processing (NLP).
1 code implementation • 16 Oct 2024 • Genta Indra Winata, Frederikus Hudi, Patrick Amadeus Irawan, David Anugraha, Rifki Afina Putri, Yutong Wang, Adam Nohejl, Ubaidillah Ariq Prathama, Nedjma Ousidhoum, Afifa Amriani, Anar Rzayev, Anirban Das, Ashmari Pramodya, Aulia Adila, Bryan Wilie, Candy Olivia Mawalim, Ching Lam Cheng, Daud Abolade, Emmanuele Chersoni, Enrico Santus, Fariz Ikhwantri, Garry Kuwanto, Hanyang Zhao, Haryo Akbarianto Wibowo, Holy Lovenia, Jan Christian Blaise Cruz, Jan Wira Gotama Putra, Junho Myung, Lucky Susanto, Maria Angelica Riera Machin, Marina Zhukova, Michael Anugraha, Muhammad Farid Adilazuarda, Natasha Santosa, Peerat Limkonchotiwat, Raj Dabre, Rio Alexander Audino, Samuel Cahyawijaya, Shi-Xiong Zhang, Stephanie Yulia Salim, Yi Zhou, Yinxuan Gui, David Ifeoluwa Adelani, En-Shiun Annie Lee, Shogo Okada, Ayu Purwarianti, Alham Fikri Aji, Taro Watanabe, Derry Tanti Wijaya, Alice Oh, Chong-Wah Ngo
This benchmark includes a visual question answering (VQA) dataset with text-image pairs across 30 languages and dialects, spanning 9 language families and featuring over 1 million data points, making it the largest multicultural VQA benchmark to date.
1 code implementation • 3 Oct 2024 • Genta Indra Winata, David Anugraha, Lucky Susanto, Garry Kuwanto, Derry Tanti Wijaya
This makes MetaMetrics a powerful tool for improving the evaluation of generation tasks, ensuring that metrics are more representative of human judgment across diverse contexts.
1 code implementation • 6 Sep 2024 • Tahsina Hashem, Weiqing Wang, Derry Tanti Wijaya, Mohammed Eunus Ali, Yuan-Fang Li
In this paper, we develop a framework to generate faithful and salient text from mixed-modal data, which includes images and structured data ( represented in knowledge graphs or tables).
no code implementations • 14 Jul 2024 • Garry Kuwanto, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga, Teresiah W. Nyoike, Aniefon D. Akpan, Nsima Ab Udouboh, Idongesit Udeme Archibong, Idara Effiong Moses, Ifeoluwatayo A. Ige, Benjamin Ajibade, Olumide Benjamin Awokoya, Idris Abdulmumin, Saminu Mohammad Aliyu, Ruqayya Nasir Iro, Ibrahim Said Ahmad, Deontae Smith, Praise-EL Michaels, David Ifeoluwa Adelani, Derry Tanti Wijaya, Anietie Andy
We conducted a comprehensive evaluation comparing our storyboard-based approach with traditional text translation-based methods in terms of accuracy and fluency.
no code implementations • 14 Jul 2024 • Ge Gao, Jongin Kim, Sejin Paik, Ekaterina Novozhilova, Yi Liu, Sarah T. Bonna, Margrit Betke, Derry Tanti Wijaya
Using the dataset BU-NEmo+ (Gao et al., 2022), we found that for emotion classification, the free-text explanations have a strong correlation with the dominant emotion elicited by the headlines.
no code implementations • Findings (EMNLP) 2021 • Isidora Chara Tourni, Lei Guo, Hengchang Hu, Edward Halim, Prakash Ishwar, Taufiq Daryanto, Mona Jalal, Boqi Chen, Margrit Betke, Fabian Zhafransyah, Sha Lai, Derry Tanti Wijaya
Additionally, we release the first multimodal news framing dataset related to gun violence in the U. S., curated and annotated by communication researchers.
no code implementations • 12 Aug 2023 • Tahsina Hashem, Weiqing Wang, Derry Tanti Wijaya, Mohammed Eunus Ali, Yuan-Fang Li
Knowledge Graph (KG)-to-Text generation aims at generating fluent natural-language text that accurately represents the information of a given knowledge graph.
1 code implementation • ICLR 2022 • Afra Feyza Akyürek, Ekin Akyürek, Derry Tanti Wijaya, Jacob Andreas
The key to this approach is a new family of subspace regularization schemes that encourage weight vectors for new classes to lie close to the subspace spanned by the weights of existing classes.
class-incremental learning
Few-Shot Class-Incremental Learning
+3
1 code implementation • NAACL 2021 • Nikzad Khani, Isidora Tourni, Mohammad Sadegh Rasooli, Chris Callison-Burch, Derry Tanti Wijaya
We find that images of words are not always invariant across languages, and that language pairs with shared culture, meaning having either a common language family, ethnicity or religion, have improved image translatability (i. e., have more similar images for similar words) compared to its converse, regardless of their geographic proximity.
Cultural Vocal Bursts Intensity Prediction
Low Resource Neural Machine Translation
+5
1 code implementation • 16 Apr 2021 • Mohammad Sadegh Rasooli, Chris Callison-Burch, Derry Tanti Wijaya
Our captioning results on Arabic are slightly better than that of its supervised model.
1 code implementation • 10 Apr 2021 • Alex Jones, Derry Tanti Wijaya
The explosion of user-generated content (UGC)--e. g. social media posts, comments, and reviews--has motivated the development of NLP applications tailored to these types of informal texts.
1 code implementation • RANLP (BUCC) 2021 • Alex Jones, Derry Tanti Wijaya
Obtaining high-quality parallel corpora is of paramount importance for training NMT systems.
no code implementations • ACL 2020 • Afra Feyza Aky{\"u}rek, Lei Guo, R Elanwar, a, Prakash Ishwar, Margrit Betke, Derry Tanti Wijaya
News framing refers to the practice in which aspects of specific issues are highlighted in the news to promote a particular interpretation.
no code implementations • CONLL 2019 • Siyi Liu, Lei Guo, Kate Mays, Margrit Betke, Derry Tanti Wijaya
We apply our frame detection approach in a large scale study of 88k news headlines about the coverage of gun violence in the U. S. between 2016 and 2018.
no code implementations • WS 2019 • Anietie Andy, Derry Tanti Wijaya, Chris Callison-Burch
Pre-scheduled events, such as TV shows and sports games, usually garner considerable attention from the public.
no code implementations • ACL 2018 • John Hewitt, Daphne Ippolito, Brendan Callahan, Reno Kriz, Derry Tanti Wijaya, Chris Callison-Burch
To facilitate research on the task, we introduce a large-scale multilingual corpus of images, each labeled with the word it represents.