no code implementations • 10 Jul 2024 • Gaurav Sahu, Issam H. Laradji
Low-resource extractive text summarization is a vital but heavily underexplored area of research.
1 code implementation • 8 Jul 2024 • Gaurav Sahu, Abhay Puri, Juan Rodriguez, Alexandre Drouin, Perouz Taslakian, Valentina Zantedeschi, Alexandre Lacoste, David Vazquez, Nicolas Chapados, Christopher Pal, Sai Rajeswar Mudumba, Issam Hadj Laradji
Second, unlike existing benchmarks focusing on answering single queries, InsightBench evaluates agents based on their ability to perform end-to-end data analytics, including formulating questions, interpreting answers, and generating a summary of insights and actionable steps.
no code implementations • 19 Nov 2023 • Nishant Mishra, Gaurav Sahu, Iacer Calixto, Ameen Abu-Hanna, Issam H. Laradji
Generating high-quality summaries for chat dialogs often requires large labeled datasets.
no code implementations • 16 Nov 2023 • Gaurav Sahu, Olga Vechtomova, Issam H. Laradji
While SSL is popular for image and text classification, it is relatively underexplored for the task of extractive text summarization.
1 code implementation • 22 Oct 2023 • Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, Issam H. Laradji
Our specific PromptMix method consists of two steps: 1) generate challenging text augmentations near class boundaries; however, generating borderline examples increases the risk of false positives in the dataset, so we 2) relabel the text augmentations using a prompting-based LLM classifier to enhance the correctness of labels in the generated data.
1 code implementation • 18 Jul 2023 • Liam Hebert, Gaurav Sahu, Yuxuan Guo, Nanda Kishore Sreenivas, Lukasz Golab, Robin Cohen
We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions.
no code implementations • 20 Dec 2022 • Brian D. Zimmerman, Gaurav Sahu, Olga Vechtomova
In this work, we propose Future Sight, a method for finetuning a pretrained generative transformer on the task of future conditioning.
no code implementations • 27 Oct 2022 • Olga Vechtomova, Gaurav Sahu
Subsequently, it is difficult for artists to rediscover audio segments that might be suitable for use in their compositions from thousands of hours of recordings.
1 code implementation • NLP4ConvAI (ACL) 2022 • Gaurav Sahu, Pau Rodriguez, Issam H. Laradji, Parmida Atighehchian, David Vazquez, Dzmitry Bahdanau
Data augmentation is a widely employed technique to alleviate the problem of data scarcity.
no code implementations • 11 Nov 2021 • Alexandre Parmentier, Robin Cohen, Xueguang Ma, Gaurav Sahu, Queenie Chen
In this paper, we present an approach for predicting trust links between peers in social media, one that is grounded in the artificial intelligence area of multiagent trust modeling.
no code implementations • 3 Jun 2021 • Olga Vechtomova, Gaurav Sahu, Dhruv Kumar
We describe a real-time system that receives a live audio stream from a jam session and generates lyric lines that are congruent with the live music being played.
no code implementations • 3 May 2021 • Gaurav Sahu, Robin Cohen, Olga Vechtomova
This paper envisions a multi-agent system for detecting the presence of hate speech in online social media platforms such as Twitter and Facebook.
no code implementations • NLP4MusA 2020 • Olga Vechtomova, Gaurav Sahu, Dhruv Kumar
We present a system for generating novel lyrics lines conditioned on music audio.
1 code implementation • COLING 2020 • Kashif Khan, Gaurav Sahu, Vikash Balasubramanian, Lili Mou, Olga Vechtomova
Generating relevant responses in a dialog is challenging, and requires not only proper modeling of context in the conversation but also being able to generate fluent sentences during inference.
no code implementations • EACL 2021 • Gaurav Sahu, Olga Vechtomova
Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data.
5 code implementations • 12 Apr 2019 • Gaurav Sahu
In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition.
1 code implementation • EMNLP 2018 • Amrith Krishna, Bishal Santra, Sasi Prasanth Bandaru, Gaurav Sahu, Vishnu Dutt Sharma, Pavankumar Satuluri, Pawan Goyal
The configurational information in sentences of a free word order language such as Sanskrit is of limited use.