no code implementations • NAACL (CALCS) 2021 • Dwija Parikh, Thamar Solorio
Code-switching is an omnipresent phenomenon in multilingual communities all around the world but remains a challenge for NLP systems due to the lack of proper data and processing techniques.
no code implementations • RANLP 2021 • Mahsa Shafaei, Christos Smailis, Ioannis Kakadiaris, Thamar Solorio
In this work, we explore different approaches to combine modalities for the problem of automated age-suitability rating of movie trailers.
no code implementations • 7 Apr 2025 • Toqeer Ehsan, Thamar Solorio
Named Entity Recognition (NER), a fundamental task in Natural Language Processing (NLP), has shown significant advancements for high-resource languages.
1 code implementation • 26 Mar 2025 • Aishik Mandal, Dana Atzil-Slonim, Thamar Solorio, Iryna Gurevych
The performance of our framework is comparable to the current state-of-the-art models on the E-DAIC dataset and enhances interpretability by predicting scores for each question.
no code implementations • 23 Jan 2025 • Injy Hamed, Caroline Sabty, Slim Abdennadher, Ngoc Thang Vu, Thamar Solorio, Nizar Habash
Language in the Arab world presents a complex diglossic and multilingual setting, involving the use of Modern Standard Arabic, various dialects and sub-dialects, as well as multiple European languages.
1 code implementation • 25 Nov 2024 • Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana, Noor Ahsan, Nevasini Sasikumar, Omkar Thawakar, Henok Biadglign Ademtew, Yahya Hmaiti, Amandeep Kumar, Kartik Kuckreja, Mykola Maslych, Wafa Al Ghallabi, Mihail Mihaylov, Chao Qin, Abdelrahman M Shaker, Mike Zhang, Mahardika Krisna Ihsani, Amiel Esplana, Monil Gokani, Shachar Mirkin, Harsh Singh, Ashay Srivastava, Endre Hamerlik, Fathinah Asma Izzati, Fadillah Adamsyah Maani, Sebastian Cavada, Jenny Chim, Rohit Gupta, Sanjay Manjunath, Kamila Zhumakhanova, Feno Heriniaina Rabevohitra, Azril Amirudin, Muhammad Ridzuan, Daniya Kareem, Ketan More, Kunyang Li, Pramesh Shakya, Muhammad Saad, Amirpouya Ghasemaghaei, Amirbek Djanibekov, Dilshod Azizov, Branislava Jankovic, Naman Bhatia, Alvaro Cabrera, Johan Obando-Ceron, Olympiah Otieno, Fabian Farestam, Muztoba Rabbani, Sanoojan Baliah, Santosh Sanjeev, Abduragim Shtanchaev, Maheen Fatima, Thao Nguyen, Amrin Kareem, Toluwani Aremu, Nathan Xavier, Amit Bhatkal, Hawau Toyin, Aman Chadha, Hisham Cholakkal, Rao Muhammad Anwer, Michael Felsberg, Jorma Laaksonen, Thamar Solorio, Monojit Choudhury, Ivan Laptev, Mubarak Shah, Salman Khan, Fahad Khan
In pursuit of culturally diverse global multimodal models, our proposed All Languages Matter Benchmark (ALM-bench) represents the largest and most comprehensive effort to date for evaluating LMMs across 100 languages.
no code implementations • 7 Nov 2024 • Israel Abebe Azime, Atnafu Lambebo Tonja, Tadesse Destaw Belay, Yonas Chanie, Bontu Fufa Balcha, Negasi Haile Abadi, Henok Biadglign Ademtew, Mulubrhan Abebe Nerea, Debela Desalegn Yadeta, Derartu Dagne Geremew, Assefa Atsbiha tesfau, Philipp Slusallek, Thamar Solorio, Dietrich Klakow
In this work, we explore LLM evaluation challenges for low-resource language understanding and introduce ProverbEval, LLM evaluation benchmark for low-resource languages based on proverbs to focus on low-resource language understanding in culture-specific scenarios.
no code implementations • 28 Oct 2024 • Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury
The disparity in the languages commonly studied in Natural Language Processing (NLP) is typically reflected by referring to languages as low vs high-resourced.
no code implementations • 7 Oct 2024 • Ibrahim Aldarmaki, Thamar Solorio, Bhiksha Raj, Hanan Aldarmaki
Neural multi-channel speech enhancement models, in particular those based on the U-Net architecture, demonstrate promising performance and generalization potential.
no code implementations • 1 Jul 2024 • Jesus-German Ortiz-Barajas, Helena Gomez-Adorno, Thamar Solorio
We present HyperLoader, a simple approach that combines different parameter-efficient fine-tuning methods in a multi-task setting.
no code implementations • 24 Jun 2024 • Haryo Akbarianto Wibowo, Thamar Solorio, Alham Fikri Aji
Knowledge distillation (KD) has proven to be a successful strategy to improve the performance of smaller models in many NLP tasks.
1 code implementation • 12 Jun 2024 • Elaheh Baharlouei, Mahsa Shafaei, Yigeng Zhang, Hugo Jair Escalante, Thamar Solorio
To tackle this problem, we propose a novel end-to-end multimodal system for the task of comic mischief detection.
no code implementations • 10 Jun 2024 • David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song, Henok Biadglign Ademtew, Hernán Maina, Holy Lovenia, Israel Abebe Azime, Jan Christian Blaise Cruz, Jay Gala, Jiahui Geng, Jesus-German Ortiz-Barajas, Jinheon Baek, Jocelyn Dunstan, Laura Alonso Alemany, Kumaranage Ravindu Yasas Nagasinghe, Luciana Benotti, Luis Fernando D'Haro, Marcelo Viridiano, Marcos Estecha-Garitagoitia, Maria Camila Buitrago Cabrera, Mario Rodríguez-Cantelar, Mélanie Jouitteau, Mihail Mihaylov, Mohamed Fazli Mohamed Imam, Muhammad Farid Adilazuarda, Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Naome Etori, Olivier Niyomugisha, Paula Mónica Silva, Pranjal Chitale, Raj Dabre, Rendi Chevi, Ruochen Zhang, Ryandito Diandaru, Samuel Cahyawijaya, Santiago Góngora, Soyeong Jeong, Sukannya Purkayastha, Tatsuki Kuribayashi, Teresa Clifford, Thanmay Jayakumar, Tiago Timponi Torrent, Toqeer Ehsan, Vladimir Araujo, Yova Kementchedjhieva, Zara Burzo, Zheng Wei Lim, Zheng Xin Yong, Oana Ignat, Joan Nwatu, Rada Mihalcea, Thamar Solorio, Alham Fikri Aji
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data.
1 code implementation • 30 May 2024 • Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka, Thamar Solorio
Aspect-Based Sentiment Analysis (ABSA) has experienced tremendous expansion and diversity due to various shared tasks spanning several languages and fields and organized via SemEval workshops and Germeval.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+2
1 code implementation • 10 May 2024 • Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, Tom Hope, Dirk Hovy, Jonathan K. Kummerfeld, Anne Lauscher, Kevin Leyton-Brown, Sheng Lu, Mausam, Margot Mieskes, Aurélie Névéol, Danish Pruthi, Lizhen Qu, Roy Schwartz, Noah A. Smith, Thamar Solorio, Jingyan Wang, Xiaodan Zhu, Anna Rogers, Nihar B. Shah, Iryna Gurevych
We hope that our work will help set the agenda for research in machine-assisted scientific quality control in the age of AI, within the NLP community and beyond.
1 code implementation • 8 Apr 2024 • Yigeng Zhang, Fabio A. González, Thamar Solorio
Reading comprehension continues to be a crucial research focus in the NLP community.
no code implementations • 8 Apr 2024 • Atnafu Lambebo Tonja, Fazlourrahman Balouchzahi, Sabur Butt, Olga Kolesnikova, Hector Ceballos, Alexander Gelbukh, Thamar Solorio
The paper focuses on the marginalization of indigenous language communities in the face of rapid technological advancements.
1 code implementation • 3 Apr 2024 • Emilio Villa-Cueva, A. Pastor López-Monroy, Fernando Sánchez-Vega, Thamar Solorio
Zero-Shot Cross-lingual Transfer (ZS-XLT) utilizes a model trained in a source language to make predictions in another language, often with a performance loss.
1 code implementation • 27 Mar 2024 • Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Meriem Beloucif, Christine de Kock, Oumaima Hourrane, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Krishnapriya Vishnubhotla, Seid Muhie Yimam, Saif M. Mohammad
We present the first shared task on Semantic Textual Relatedness (STR).
1 code implementation • 16 Feb 2024 • David Romero, Thamar Solorio
We present Q-ViD, a simple approach for video question answering (video QA), that unlike prior methods, which are based on complex architectures, computationally expensive pipelines or use closed models like GPTs, Q-ViD relies on a single instruction-aware open vision-language model (InstructBLIP) to tackle videoQA using frame descriptions.
Ranked #14 on
Zero-Shot Video Question Answer
on NExT-QA
2 code implementations • 13 Feb 2024 • Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine de Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Winata, Seid Muhie Yimam, Saif M. Mohammad
Exploring and quantifying semantic relatedness is central to representing language and holds significant implications across various NLP tasks.
2 code implementations • 23 Sep 2023 • Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka, Thamar Solorio
Aspect-based sentiment analysis (ABSA) delves into understanding sentiments specific to distinct elements within a user-generated review.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+1
1 code implementation • 18 Sep 2023 • Yigeng Zhang, Mahsa Shafaei, Fabio A. González, Thamar Solorio
In this work, we introduce a pioneering research challenge: evaluating positive and potentially harmful messages within music products.
no code implementations • 16 Sep 2023 • Shuguang Chen, Leonardo Neves, Thamar Solorio
In recent years, large pre-trained language models (PLMs) have achieved remarkable performance on many natural language processing benchmarks.
no code implementations • 12 Sep 2023 • Luis Chiruzzo, Marvin Agüero-Torales, Gustavo Giménez-Lugo, Aldo Alvarez, Yliana Rodríguez, Santiago Góngora, Thamar Solorio
We present the first shared task for detecting and analyzing code-switching in Guarani and Spanish, GUA-SPA at IberLEF 2023.
1 code implementation • 1 May 2023 • Sadat Shahriar, Thamar Solorio
Subjectivity and difference of opinion are key social phenomena, and it is crucial to take these into account in the annotation and detection process of derogatory textual content.
no code implementations • 23 Mar 2023 • Zheng-Xin Yong, Ruochen Zhang, Jessica Zosa Forde, Skyler Wang, Arjun Subramonian, Holy Lovenia, Samuel Cahyawijaya, Genta Indra Winata, Lintang Sutawika, Jan Christian Blaise Cruz, Yin Lin Tan, Long Phan, Rowena Garcia, Thamar Solorio, Alham Fikri Aji
While code-mixing is a common linguistic practice in many parts of the world, collecting high-quality and low-cost code-mixed data remains a challenge for natural language processing (NLP) research.
no code implementations • 10 Feb 2023 • Marco Farina, Duccio Pappadopulo, Anant Gupta, Leslie Huang, Ozan İrsoy, Thamar Solorio
Driven by encouraging results on a wide range of tasks, the field of NLP is experiencing an accelerated race to develop bigger language models.
1 code implementation • 19 Dec 2022 • Genta Indra Winata, Alham Fikri Aji, Zheng-Xin Yong, Thamar Solorio
Code-Switching, a common phenomenon in written text and conversation, has been studied over decades by the natural language processing (NLP) research community.
1 code implementation • 14 Oct 2022 • Shuguang Chen, Leonardo Neves, Thamar Solorio
In this work, we take the named entity recognition task in the English language as a case study and explore style transfer as a data augmentation method to increase the size and diversity of training data in low-resource scenarios.
1 code implementation • 11 Apr 2022 • Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka, Thamar Solorio
Aspect-based sentiment analysis (ABSA) is a natural language processing problem that requires analyzing user-generated reviews to determine: a) The target entity being reviewed, b) The high-level aspect to which it belongs, and c) The sentiment expressed toward the targets and the aspects.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+1
no code implementations • 19 Feb 2022 • Shuguang Chen, Gustavo Aguilar, Anirudh Srinivasan, Mona Diab, Thamar Solorio
For the unsupervised setting, we provide the following language pairs: English and Spanish-English (Eng-Spanglish), and English and Modern Standard Arabic-Egyptian Arabic (Eng-MSAEA) in both directions.
1 code implementation • 5 Oct 2021 • Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka, Thamar Solorio
Aspect-based sentiment analysis (ABSA) is an NLP task that entails processing user-generated reviews to determine (i) the target being evaluated, (ii) the aspect category to which it belongs, and (iii) the sentiment expressed towards the target and aspect pair.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+1
1 code implementation • Findings (EMNLP) 2021 • Yigeng Zhang, Mahsa Shafaei, Fabio Gonzalez, Thamar Solorio
In this paper, we introduce the task of predicting severity of age-restricted aspects of movie content based solely on the dialogue script.
1 code implementation • EMNLP 2021 • Shuguang Chen, Gustavo Aguilar, Leonardo Neves, Thamar Solorio
Current work in named entity recognition (NER) shows that data augmentation techniques can produce more robust models.
1 code implementation • NAACL (SocialNLP) 2021 • Shuguang Chen, Leonardo Neves, Thamar Solorio
Performance of neural models for named entity recognition degrades over time, becoming stale.
no code implementations • 23 Feb 2021 • Thamar Solorio, Mahsa Shafaei, Christos Smailis, Isabelle Augenstein, Margaret Mitchell, Ingrid Stapf, Ioannis Kakadiaris
This white paper summarizes the authors' structured brainstorming regarding ethical considerations for creating an extensive repository of online content labeled with tags that describe potentially questionable content for young viewers.
no code implementations • 26 Jan 2021 • Mahsa Shafaei, Christos Smailis, Ioannis A. Kakadiaris, Thamar Solorio
In this work, we explore different approaches to combine modalities for the problem of automated age-suitability rating of movie trailers.
no code implementations • 25 Jan 2021 • Thamar Solorio, Mahsa Shafaei, Christos Smailis, Mona Diab, Theodore Giannakopoulos, Heng Ji, Yang Liu, Rada Mihalcea, Smaranda Muresan, Ioannis Kakadiaris
This white paper presents a summary of the discussions regarding critical considerations to develop an extensive repository of online videos annotated with labels indicating questionable content.
no code implementations • 2 Jan 2021 • Amirreza Shirani, Giai Tran, Hieu Trinh, Franck Dernoncourt, Nedim Lipka, Paul Asente, Jose Echevarria, Thamar Solorio
We evaluate a range of state-of-the-art models on this novel dataset by organizing a shared task and inviting multiple researchers to model emphasis in this new domain.
no code implementations • Findings (EMNLP) 2021 • Gustavo Aguilar, Bryan McCann, Tong Niu, Nazneen Rajani, Nitish Keskar, Thamar Solorio
To alleviate these challenges, we propose a character-based subword module (char2subword) that learns the subword embedding table in pre-trained models like BERT.
1 code implementation • WNUT (ACL) 2021 • Shuguang Chen, Gustavo Aguilar, Leonardo Neves, Thamar Solorio
Multimodal named entity recognition (MNER) requires to bridge the gap between language understanding and visual context.
no code implementations • SEMEVAL 2020 • Parth Patwa, Gustavo Aguilar, Sudipta Kar, Suraj Pandey, Srinivas PYKL, Björn Gambäck, Tanmoy Chakraborty, Thamar Solorio, Amitava Das
In this paper, we present the results of the SemEval-2020 Task 9 on Sentiment Analysis of Code-Mixed Tweets (SentiMix 2020).
no code implementations • SEMEVAL 2020 • Amirreza Shirani, Franck Dernoncourt, Nedim Lipka, Paul Asente, Jose Echevarria, Thamar Solorio
In this paper, we present the main findings and compare the results of SemEval-2020 Task 10, Emphasis Selection for Written Text in Visual Media.
no code implementations • LREC 2020 • Gustavo Aguilar, Sudipta Kar, Thamar Solorio
To facilitate research in this direction, we propose a centralized benchmark for Linguistic Code-switching Evaluation (LinCE) that combines ten corpora covering four different code-switched language pairs (i. e., Spanish-English, Nepali-English, Hindi-English, and Modern Standard Arabic-Egyptian Arabic) and four tasks (i. e., language identification, named entity recognition, part-of-speech tagging, and sentiment analysis).
2 code implementations • ACL 2020 • Amirreza Shirani, Franck Dernoncourt, Jose Echevarria, Paul Asente, Nedim Lipka, Thamar Solorio
In this paper, we aim to learn associations between visual attributes of fonts and the verbal context of the texts they are typically applied to.
1 code implementation • LREC 2020 • Niloofar Safi Samghabadi, Parth Patwa, Srinivas PYKL, Prerana Mukherjee, Amitava Das, Thamar Solorio
In recent times, the focus of the NLP community has increased towards offensive language, aggression, and hate-speech detection. This paper presents our system for TRAC-2 shared task on {``}Aggression Identification{''} (sub-task A) and {``}Misogynistic Aggression Identification{''} (sub-task B).
no code implementations • LREC 2020 • Niloofar Safi Samghabadi, Adri{\'a}n Pastor L{\'o}pez Monroy, Thamar Solorio
We also investigate the possibility of designing a framework to monitor the streams of users{'} online messages and detects the signs of cyberbullying as early as possible.
no code implementations • LREC 2020 • Mahsa Shafaei, Niloofar Safi Samghabadi, Sudipta Kar, Thamar Solorio
Movies help us learn and inspire societal change.
no code implementations • WS 2016 • Giovanni Molina, Fahad AlGhamdi, Mahmoud Ghoneim, Abdelati Hawwari, Nicolas Rey-Villamizar, Mona Diab, Thamar Solorio
We present an overview of the second shared task on language identification in code-switched data.
no code implementations • WS 2016 • Fahad AlGhamdi, Giovanni Molina, Mona Diab, Thamar Solorio, Abdelati Hawwari, Victor Soto, Julia Hirschberg
We address the problem of Part of Speech tagging (POS) in the context of linguistic code switching (CS).
1 code implementation • ACL 2020 • Gustavo Aguilar, Thamar Solorio
We show the effectiveness of this transfer learning step by outperforming multilingual BERT and homologous CS-unaware ELMo models and establishing a new state of the art in CS tasks, such as NER and POS tagging.
no code implementations • 11 Sep 2019 • Gustavo Aguilar, Thamar Solorio
On the other hand, the global attention spots the most relevant words in the sequence.
no code implementations • EMNLP (ALW) 2020 • Niloofar Safi Samghabadi, Afsheen Hatami, Mahsa Shafaei, Sudipta Kar, Thamar Solorio
We experiment with this model on our dataset and later present the analysis.
no code implementations • RANLP 2019 • Suraj Maharjan, Deepthi Mave, Prasha Shrestha, Manuel Montes, Fabio A. Gonz{\'a}lez, Thamar Solorio
An author{'}s way of presenting a story through his/her writing style has a great impact on whether the story will be liked by readers or not.
no code implementations • EMNLP 2020 • Sudipta Kar, Gustavo Aguilar, Mirella Lapata, Thamar Solorio
This paper considers the problem of characterizing stories by inferring properties such as theme and style using written synopses and reviews of movies.
no code implementations • 21 Aug 2019 • Mahsa Shafaei, Niloofar Safi Samghabadi, Sudipta Kar, Thamar Solorio
In this paper, our goal is to predict the suitability of the movie content for children and young adults based on scripts.
1 code implementation • ACL 2019 • Amirreza Shirani, Franck Dernoncourt, Paul Asente, Nedim Lipka, Seokhwan Kim, Jose Echevarria, Thamar Solorio
In visual communication, text emphasis is used to increase the comprehension of written text to convey the author{'}s intent.
1 code implementation • WS 2017 • Gustavo Aguilar, Suraj Maharjan, Adrian Pastor López-Monroy, Thamar Solorio
Named Entity Recognition for social media data is challenging because of its inherent noisiness.
Ranked #22 on
Named Entity Recognition (NER)
on WNUT 2017
no code implementations • WS 2018 • Gustavo Aguilar, Fahad AlGhamdi, Victor Soto, Mona Diab, Julia Hirschberg, Thamar Solorio
In the third shared task of the Computational Approaches to Linguistic Code-Switching (CALCS) workshop, we focus on Named Entity Recognition (NER) on code-switched social-media data.
no code implementations • NAACL 2018 • Gustavo Aguilar, A. Pastor López-Monroy, Fabio A. González, Thamar Solorio
Our systems outperform the current F1 scores of the state of the art on the Workshop on Noisy User-generated Text 2017 dataset by 2. 45% and 3. 69%, establishing a more suitable approach for social media environments.
Ranked #18 on
Named Entity Recognition (NER)
on WNUT 2017
no code implementations • 3 May 2019 • Amirreza Shirani, Bowen Xu, David Lo, Thamar Solorio, Amin Alipour
The proposed dataset Stack Overflow is a useful resource to develop novel solutions, specifically data-hungry neural network models, for the prediction of relatedness in technical community question-answering forums.
1 code implementation • EMNLP 2018 • Suraj Maharjan, Manuel Montes, Fabio A. Gonz{\'a}lez, Thamar Solorio
Likability prediction of books has many uses.
no code implementations • COLING 2018 • Sudipta Kar, Suraj Maharjan, Thamar Solorio
Folksonomy of movies covers a wide range of heterogeneous information about movies, like the genre, plot structure, visual experiences, soundtracks, metadata, and emotional experiences from watching a movie.
no code implementations • COLING 2018 • Niloofar Safi Samghabadi, Deepthi Mave, Sudipta Kar, Thamar Solorio
This paper presents our system for "TRAC 2018 Shared Task on Aggression Identification".
no code implementations • SEMEVAL 2016 • Marc Franco-Salvador, Sudipta Kar, Thamar Solorio, Paolo Rosso
In this work we describe the system built for the three English subtasks of the SemEval 2016 Task 3 by the Department of Computer Science of the University of Houston (UH) and the Pattern Recognition and Human Language Technology (PRHLT) research center - Universitat Polit`ecnica de Val`encia: UH-PRHLT.
no code implementations • WS 2018 • Deepthi Mave, Suraj Maharjan, Thamar Solorio
In this paper, we detail our work on comparing different word-level language identification systems for code-switched Hindi-English data and a standard Spanish-English dataset.
no code implementations • NAACL 2018 • Adrian Pastor L{\'o}pez-Monroy, Fabio A. Gonz{\'a}lez, Manuel Montes, Hugo Jair Escalante, Thamar Solorio
The intensive use of e-communications in everyday life has given rise to new threats and risks.
1 code implementation • NAACL 2018 • Suraj Maharjan, Sudipta Kar, Manuel Montes-y-Gomez, Fabio A. Gonzalez, Thamar Solorio
Books have the power to make us feel happiness, sadness, pain, surprise, or sorrow.
no code implementations • LREC 2018 • Sudipta Kar, Suraj Maharjan, A. Pastor López-Monroy, Thamar Solorio
In this paper, we set out to the task of collecting a corpus of movie plot synopses and tags.
no code implementations • SEMEVAL 2017 • Sudipta Kar, Suraj Maharjan, Thamar Solorio
In this paper, we present our systems for the {``}SemEval-2017 Task-5 on Fine-Grained Sentiment Analysis on Financial Microblogs and News{''}.
no code implementations • WS 2017 • Niloofar Safi Samghabadi, Suraj Maharjan, Alan Sprague, Raquel Diaz-Sprague, Thamar Solorio
Although social media has made it easy for people to connect on a virtually unlimited basis, it has also opened doors to people who misuse it to undermine, harass, humiliate, threaten and bully others.
no code implementations • EACL 2017 • Prasha Shrestha, Sebastian Sierra, Fabio Gonz{\'a}lez, Manuel Montes, Paolo Rosso, Thamar Solorio
We present a model to perform authorship attribution of tweets using Convolutional Neural Networks (CNNs) over character n-grams.
no code implementations • EACL 2017 • Suraj Maharjan, John Arevalo, Manuel Montes, Fabio A. Gonz{\'a}lez, Thamar Solorio
We investigate the value of feature engineering and neural network models for predicting successful writing.
9 code implementations • 7 Feb 2017 • John Arevalo, Thamar Solorio, Manuel Montes-y-Gómez, Fabio A. González
The Gated Multimodal Unit (GMU) model is intended to be used as an internal unit in a neural network architecture whose purpose is to find an intermediate representation based on a combination of data from different modalities.
no code implementations • WS 2016 • Mohammed Attia, Suraj Maharjan, Younes Samih, Laura Kallmeyer, Thamar Solorio
The evaluation results of our system on the test set is 88. 1{\%} (79. 0{\%} for TRUE only) f-measure for Task-1 on detecting semantic similarity, and 76. 0{\%} (42. 3{\%} when excluding RANDOM) for Task-2 on identifying finer-grained semantic relations.
no code implementations • LREC 2016 • Prasha Shrestha, Nicolas Rey-Villamizar, Farig Sadeque, Ted Pedersen, Steven Bethard, Thamar Solorio
Health support forums have become a rich source of data that can be used to improve health care outcomes.
no code implementations • 7 Feb 2014 • John David Osborne, Binod Gyawali, Thamar Solorio
We used MetaMap and YTEX as a basis for the construc- tion of two separate systems to participate in the 2013 ShARe/CLEF eHealth Task 1[9], the recognition of clinical concepts.
no code implementations • LREC 2014 • Thamar Solorio, Ragib Hasan, Mainul Mizan
This paper describes the corpus of sockpuppet cases we gathered from Wikipedia.