1 code implementation • COLING 2022 • Danfeng Guo, Arpit Gupta, Sanchit Agarwal, Jiun-Yu Kao, Shuyang Gao, Arijit Biswas, Chien-Wei Lin, Tagyoung Chung, Mohit Bansal
Learning from multimodal data has become a popular research topic in recent years.
1 code implementation • 31 Oct 2023 • Yohan Jo, Xinyan Zhao, Arijit Biswas, Nikoletta Basiou, Vincent Auvray, Nikolaos Malandrakis, Angeliki Metallinou, Alexandros Potamianos
While most task-oriented dialogues assume conversations between the agent and one user at a time, dialogue systems are increasingly expected to communicate simultaneously with multiple users who make decisions collaboratively.
no code implementations • 18 Aug 2023 • Guanxin Jiang, Lars Villemoes, Arijit Biswas
We show how a neural network can be trained on individual intrusive listening test scores to predict a distribution of scores for each pair of reference and coded input stereo or binaural signals.
no code implementations • 7 Aug 2023 • Arijit Biswas, Harald Mundt
In this study, we propose an auditory-inspired frontend for the existing VMAF that creates videos of reference and coded spectrograms, and we extend VMAF to measure coded audio quality.
no code implementations • 23 Sep 2022 • Arijit Biswas, Guanxin Jiang
Automatic coded audio quality predictors are typically designed for evaluating single channels without considering any spatial aspects.
no code implementations • 22 Nov 2021 • Sanchit Agarwal, Jan Jezabek, Arijit Biswas, Emre Barut, Shuyang Gao, Tagyoung Chung
Most popular goal-oriented dialogue agents are capable of understanding the conversational context.
no code implementations • 30 Aug 2021 • Guanxin Jiang, Arijit Biswas, Christian Bergler, Andreas Maier
Automatic coded audio quality assessment is an important task whose progress is hampered by the scarcity of human annotations, poor generalization to unseen codecs, bitrates, and content types, and a lack of flexibility in existing approaches.
no code implementations • NAACL 2021 • Anish Acharya, Suranjit Adhikari, Sanchit Agarwal, Vincent Auvray, Nehal Belgamwar, Arijit Biswas, Shubhra Chandra, Tagyoung Chung, Maryam Fazel-Zarandi, Raefer Gabriel, Shuyang Gao, Rahul Goel, Dilek Hakkani-Tur, Jan Jezabek, Abhay Jha, Jiun-Yu Kao, Prakash Krishnan, Peter Ku, Anuj Goyal, Chien-Wei Lin, Qing Liu, Arindam Mandal, Angeliki Metallinou, Vishal Naik, Yi Pan, Shachi Paul, Vittorio Perera, Abhishek Sethi, Minmin Shen, Nikko Strom, Eddie Wang
Finally, we evaluate our system using a typical movie ticket booking task and show that the dialogue simulator is an essential component of the system that leads to over $50\%$ improvement in turn-level action signature prediction accuracy.
no code implementations • 16 Nov 2020 • Chien-Wei Lin, Vincent Auvray, Daniel Elkind, Arijit Biswas, Maryam Fazel-Zarandi, Nehal Belgamwar, Shubhra Chandra, Matt Zhao, Angeliki Metallinou, Tagyoung Chung, Charlie Shucheng Zhu, Suranjit Adhikari, Dilek Hakkani-Tur
Our approach includes a novel goal-sampling technique for sampling plausible user goals and a dialog simulation technique that uses heuristic interplay between the user and the system (Alexa), where the user tries to achieve the sampled goal.
no code implementations • 1 Jul 2019 • Ahmed Mustafa, Arijit Biswas, Christian Bergler, Julia Schottenhamml, Andreas Maier
Recently, autoregressive deep generative models such as WaveNet and SampleRNN have been used as speech vocoders to scale up the perceptual quality of the reconstructed signals without increasing the coding rate.
no code implementations • 10 Jan 2018 • Ashutosh Kumar, Arijit Biswas, Subhajit Sanyal
Exploring the space of all plausible orders could help us better understand the relationships between the various entities in an e-commerce ecosystem, namely the customers and the products they purchase.
no code implementations • 21 Sep 2017 • Arijit Biswas, Mukul Bhutani, Subhajit Sanyal
E-commerce websites such as Amazon, Alibaba, Flipkart, and Walmart sell billions of products.
no code implementations • 22 Nov 2016 • Soumya Roy, Vinay P. Namboodiri, Arijit Biswas
Previous works on object detection model the problem as a structured regression problem that ranks the correct bounding boxes higher than the background ones.
no code implementations • 17 Sep 2016 • Ankit Gandhi, Arjun Sharma, Arijit Biswas, Om Deshmukh
The proposed network has M+1 components in total, where M is the number of modalities.
no code implementations • 12 Jul 2016 • Sohil Shah, Kuldeep Kulkarni, Arijit Biswas, Ankit Gandhi, Om Deshmukh, Larry Davis
Typical textual descriptions that accompany online videos are 'weak', i.e., they mention the main concepts in the video but not their corresponding spatio-temporal locations.
no code implementations • CVPR 2013 • Arijit Biswas, Devi Parikh
Active learning provides useful tools to reduce annotation costs without compromising classifier performance.