no code implementations • COLING 2022 • Santiago Castro, Naihao Deng, Pingxuan Huang, Mihai Burzo, Rada Mihalcea
Existing video understanding datasets mostly focus on human interactions, with little attention being paid to the “in the wild” settings, where the videos are recorded outdoors.
1 code implementation • 22 Feb 2024 • Santiago Castro, Amir Ziai, Avneesh Saluja, Zhuoning Yuan, Rada Mihalcea
Recent years have witnessed a significant increase in the performance of Vision and Language tasks.
1 code implementation • 12 Sep 2023 • Oana Ignat, Santiago Castro, Weiji Li, Rada Mihalcea
We create and make publicly available the ACE (Action Co-occurrencE) dataset, consisting of a large graph of ~12k co-occurring pairs of visual actions and their corresponding video clips.
1 code implementation • 30 May 2023 • Santiago Castro, Oana Ignat, Rada Mihalcea
Joint vision-language models have shown great performance over a diverse set of tasks.
no code implementations • 21 May 2023 • Oana Ignat, Zhijing Jin, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu, Rada Mihalcea
Not surprisingly, this has, in turn, made many NLP researchers -- especially those at the beginning of their careers -- worry about what NLP research area they should focus on.
2 code implementations • 5 Oct 2022 • Ruben Villegas, Mohammad Babaeizadeh, Pieter-Jan Kindermans, Hernan Moraldo, Han Zhang, Mohammad Taghi Saffar, Santiago Castro, Julius Kunze, Dumitru Erhan
To the best of our knowledge, this is the first time a paper studies generating videos from time variable prompts.
Ranked #4 on Video Prediction on BAIR Robot Pushing
no code implementations • 14 Sep 2022 • Santiago Castro, Naihao Deng, Pingxuan Huang, Mihai Burzo, Rada Mihalcea
Existing video understanding datasets mostly focus on human interactions, with little attention being paid to the "in the wild" settings, where the videos are recorded outdoors.
Ranked #1 on Video Question Answering on WildQA
2 code implementations • 24 Mar 2022 • Santiago Castro, Fabian Caba Heilbron
Large-scale pretrained image-text models have shown incredible zero-shot performance in a handful of tasks, including video ones such as action recognition and text-to-video retrieval.
1 code implementation • 16 Feb 2022 • Oana Ignat, Santiago Castro, YuHang Zhou, Jiajun Bao, Dandan Shan, Rada Mihalcea
We consider the task of temporal human action localization in lifestyle vlogs.
1 code implementation • EMNLP 2021 • Oana Ignat, Santiago Castro, Hanwen Miao, Weiji Li, Rada Mihalcea
We aim to automatically identify human action reasons in online videos.
1 code implementation • ACL 2022 • Santiago Castro, Ruoyao Wang, Pingxuan Huang, Ian Stewart, Oana Ignat, Nan Liu, Jonathan C. Stroud, Rada Mihalcea
We propose fill-in-the-blanks as a video understanding evaluation framework and introduce FIBER -- a novel dataset consisting of 28, 000 videos and descriptions in support of this evaluation framework.
1 code implementation • LREC 2020 • Santiago Castro, Mahmoud Azab, Jonathan Stroud, Cristina Noujaim, Ruoyao Wang, Jia Deng, Rada Mihalcea
We introduce LifeQA, a benchmark dataset for video question answering that focuses on day-to-day real-life situations.
no code implementations • LREC 2020 • Luis Chiruzzo, Santiago Castro, Aiala Ros{\'a}
This paper presents the development of a corpus of 30, 000 Spanish tweets that were crowd-annotated with humor value and funniness score.
1 code implementation • ACL 2019 • Santiago Castro, Devamanyu Hazarika, Ver{\'o}nica P{\'e}rez-Rosas, Roger Zimmermann, Rada Mihalcea, Soujanya Poria
As a first step towards enabling the development of multimodal approaches for sarcasm detection, we propose a new sarcasm dataset, Multimodal Sarcasm Detection Dataset (MUStARD), compiled from popular TV shows.
1 code implementation • 5 Jun 2019 • Santiago Castro, Devamanyu Hazarika, Verónica Pérez-Rosas, Roger Zimmermann, Rada Mihalcea, Soujanya Poria
As a first step towards enabling the development of multimodal approaches for sarcasm detection, we propose a new sarcasm dataset, Multimodal Sarcasm Detection Dataset (MUStARD), compiled from popular TV shows.
1 code implementation • COLING 2018 • Santiago Castro, Jairo Bonanata, Aiala Ros{\'a}
In this work we propose a high coverage method that uses word vector representations to build a false friends classifier for any pair of languages, which we apply to the particular case of Spanish and Portuguese.
no code implementations • 17 Oct 2017 • Aiala Rosá, Luis Chiruzzo, Mathias Etcheverry, Santiago Castro
This article presents classifiers based on SVM and Convolutional Neural Networks (CNN) for the TASS 2017 challenge on tweets sentiment analysis.
1 code implementation • WS 2018 • Santiago Castro, Luis Chiruzzo, Aiala Rosá, Diego Garat, Guillermo Moncecchi
Computational Humor involves several tasks, such as humor recognition, humor generation, and humor scoring, for which it is useful to have human-curated data.
1 code implementation • 28 Mar 2017 • Santiago Castro, Matías Cubero, Diego Garat, Guillermo Moncecchi
While humor has been historically studied from a psychological, cognitive and linguistic standpoint, its study from a computational perspective is an area yet to be explored in Computational Linguistics.