no code implementations • 4 Oct 2024 • Ksheeraja Raghavan, Samiran Gode, Ankit Shah, Surabhi Raghavan, Wolfram Burgard, Bhiksha Raj, Rita Singh
The data produced using the framework serves as a benchmark for anomaly detection applications, potentially enhancing the performance of models trained on audio data, particularly in handling out-of-distribution cases.
no code implementations • 21 Aug 2024 • Minghao Liu, Zonglin Di, Jiaheng Wei, Zhongruo Wang, Hengxiang Zhang, Ruixuan Xiao, Haoyu Wang, Jinlong Pang, Hao Chen, Ankit Shah, Hongxin Wei, Xinlei He, Zhaowei Zhao, Haobo Wang, Lei Feng, Jindong Wang, James Davis, Yang Liu
Furthermore, we design three benchmark datasets focused on label noise detection, label noise learning, and class-imbalanced learning.
no code implementations • 1 Nov 2023 • Dinesh Sharma, Ankit Shah, Chaitra Gopalappa
Human immunodeficiency virus (HIV) is a major public health concern in the United States, with about 1. 2 million people living with HIV and 35, 000 newly infected each year.
no code implementations • 11 Oct 2023 • Joseph Konan, Shikhar Agnihotri, Ojas Bhargave, Shuo Han, Yunyang Zeng, Ankit Shah, Bhiksha Raj
Within the ambit of VoIP (Voice over Internet Protocol) telecommunications, the complexities introduced by acoustic transformations merit rigorous analysis.
no code implementations • 2 Oct 2023 • Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphael Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh
We hypothesize that for attacks to be transferrable, it is sufficient if the proxy can approximate the target model in the neighborhood of the harmful query.
1 code implementation • 29 Sep 2023 • Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
This paper aims to understand the nature of noise in pre-training datasets and to mitigate its impact on downstream tasks.
no code implementations • 25 Sep 2023 • Mark Lindsey, Ankit Shah, Francis Kubala, Richard M. Stern
Online Active Learning (OAL) is a paradigm that addresses this issue by simultaneously minimizing the amount of annotation required to train a classifier and adapting to changes in the data over the duration of the data collection process.
no code implementations • 23 Sep 2023 • Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj
Weak-label learning is a challenging task that requires learning from data "bags" containing positive and negative instances, but only the bag labels are known.
no code implementations • 18 Sep 2023 • ZiYi Yang, Shreyas S. Raman, Ankit Shah, Stefanie Tellex
Recent advancements in large language models (LLMs) have enabled a new research domain, LLM agents, for solving robotics and planning tasks by leveraging the world knowledge and general reasoning abilities of LLMs obtained during pretraining.
1 code implementation • 22 May 2023 • Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj
In this paper, we introduce imprecise label learning (ILL), a framework for the unification of learning with various imprecise label configurations.
Ranked #1 on Learning with noisy labels on mini WebVision 1.0
no code implementations • 18 May 2023 • Soumyadeep Hore, Jalal Ghadermazi, Diwas Paudel, Ankit Shah, Tapas K. Das, Nathaniel D. Bastian
The knowledge gained from our study on the adversary's ability to make specific evasive perturbations to different types of malicious packets can help defenders enhance the robustness of their NIDS against evolving adversarial attacks.
no code implementations • 16 Mar 2023 • Joseph Konan, Ojas Bhargave, Shikhar Agnihotri, Hojeong Lee, Ankit Shah, Shuo Han, Yunyang Zeng, Amanda Shu, Haohui Liu, Xuankai Chang, Hamza Khalid, Minseon Gwak, Kawon Lee, Minjeong Kim, Bhiksha Raj
In this paper, we present a method for fine-tuning models trained on the Deep Noise Suppression (DNS) 2020 Challenge to improve their performance on Voice over Internet Protocol (VoIP) applications.
no code implementations • 9 Mar 2023 • Benedict Quartey, Ankit Shah, George Konidaris
We propose an approach that maximizes experience reuse while learning to solve a given task by generating and simultaneously learning useful auxiliary tasks.
no code implementations • 7 Mar 2023 • Ankit Shah, Shuyi Chen, Kejun Zhou, Yue Chen, Bhiksha Raj
Preliminary results show (1) the proposed BECR can incur a more dispersed embedding on the test set, (2) BECR improves the PaSST model without extra computation complexity, and (3) STFT preprocessing outperforms CQT in all tasks we tested.
no code implementations • 22 Feb 2023 • Jason Xinyu Liu, ZiYi Yang, Ifrah Idrees, Sam Liang, Benjamin Schornstein, Stefanie Tellex, Ankit Shah
We propose Lang2LTL, a modular system and a software package that leverages large language models (LLMs) to ground temporal navigational commands to LTL specifications in environments without prior language data.
no code implementations • 17 Feb 2023 • Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Shah, Olivier Siohan
We achieve a new state-of-the-art of 12. 8% WER for visual speech recognition on the TED LRS3 dataset, which rivals the performance of audio-only models from just four years ago.
Ranked #1 on Lipreading on LRS3-TED (using extra training data)
no code implementations • 3 Aug 2022 • Soumyadeep Hore, Ankit Shah, Nathaniel D. Bastian
The current approaches are deterministic and one-time decision-making methods, which do not consider future uncertainties when prioritizing and selecting vulnerabilities for mitigation.
1 code implementation • 8 Jul 2022 • Clive Gomes, Hyejin Park, Patrick Kollman, Yi Song, Iffanice Houndayi, Ankit Shah
This project involved participation in the DCASE 2022 Competition (Task 6) which had two subtasks: (1) Automated Audio Captioning and (2) Language-Based Audio Retrieval.
no code implementations • 9 Jun 2022 • Yanwei Wang, Nadia Figueroa, Shen Li, Ankit Shah, Julie Shah
In this work, we identify the roots of this challenge as the failure of a learned continuous policy to satisfy the discrete plan implicit in the demonstration.
no code implementations • 18 May 2022 • Justin Hellermann, Qinzhuan Qian, Ankit Shah
Data augmentation is a key regularization method to support the forecast and classification performance of highly parameterized models in computer vision.
no code implementations • 11 Apr 2022 • Ankit Shah, Hira Dhamyal, Yang Gao, Daniel Arancibia, Mario Arancibia, Bhiksha Raj, Rita Singh
Lately, there has been a global effort by multiple research groups to detect COVID-19 from voice.
no code implementations • 4 Mar 2022 • Larry Tang, Po Hao Chou, Yi Yu Zheng, Ziqian Ge, Ankit Shah, Bhiksha Raj
We find that the baseline Siamese does not perform better by incorporating ontology information in the weak and multi-label scenario, but that the GCN does capture the ontology knowledge better for weak, multi-labeled data.
no code implementations • 10 Oct 2021 • Rita Singh, Ankit Shah, Hira Dhamyal
This paper reflects on the effect of several categories of medical conditions on human voice, focusing on those that may be hypothesized to have effects on voice, but for which the changes themselves may be subtle enough to have eluded observation in standard analytical examinations of the voice signal.
no code implementations • 6 Jul 2021 • Ankit Shah, Pritish Kamath, Shen Li, Patrick Craven, Kevin Landers, Kevin Oden, Julie Shah
When observing task demonstrations, human apprentices are able to identify whether a given task is executed correctly long before they gain expertise in actually performing that task.
no code implementations • 28 May 2021 • Ankit Shah, Srishti Singh, Shih-Yen Tao
In this paper, we present our work on the BioASQ pipeline.
no code implementations • 19 Mar 2021 • Anxiang Zhang, Ankit Shah, Bhiksha Raj
Thus, this paper introduces a novel semi-weak label learning paradigm as a middle ground to mitigate the problem.
1 code implementation • 5 Oct 2020 • Aigerim Bogyrbayeva, Sungwook Jang, Ankit Shah, Young Jae Jang, Changhyun Kwon
This paper proposes a reinforcement learning approach for nightly offline rebalancing operations in free-floating electric vehicle sharing systems (FFEVSS).
no code implementations • 4 Mar 2020 • Ankit Shah, Samir Wadhwania, Julie Shah
Defining sound and complete specifications for robots using formal languages is challenging, while learning formal specifications directly from demonstrations can lead to over-constrained task policies.
1 code implementation • 19 Feb 2020 • Serena Booth, Yilun Zhou, Ankit Shah, Julie Shah
To address these challenges, we introduce a flexible model inspection framework: Bayes-TrEx.
no code implementations • 9 Jan 2020 • Serena Booth, Ankit Shah, Yilun Zhou, Julie Shah
In this paper, we consider the problem of exploring the prediction level sets of a classifier using probabilistic programming.
1 code implementation • 26 Oct 2019 • Nicolas Turpault, Romain Serizel, Ankit Shah, Justin Salamon
This paper presents Task 4 of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 challenge and provides a first analysis of the challenge results.
Ranked #9 on Sound Event Detection on DESED
no code implementations • NeurIPS 2018 • Ankit Shah, Pritish Kamath, Julie A. Shah, Shen Li
When observing task demonstrations, human apprentices are able to identify whether a given task is executed correctly long before they gain expertise in actually performing that task.
1 code implementation • 28th International Joint Conference on Artificial Intelligence 2019 • Anurag Kumar, Ankit Shah, Alex Hauptmann, Bhiksha Raj
In the last couple of years, weakly labeled learning for sound events has turned out to be an exciting approach for audio event detection.
no code implementations • 13 Oct 2018 • Ankit Shah, Arunesh Sinha, Rajesh Ganesan, Sushil Jajodia, Hasan Cam
In order to explain this observation, we extend the earlier RL model to a game model and show that there exists defender policies that can be robust against any adversarial policy.
1 code implementation • 16 Sep 2018 • Ankit Shah, Jean Baptiste Lamare, Tuan Nguyen Anh, Alexander Hauptmann
Our experiments indicate a considerable improvement in object detection accuracy: +8. 51% for CM and +6. 20% for ACM.
no code implementations • 2 Sep 2018 • Ankit Shah, Tyler Vuong
Deep Reinforcement learning with appropriate constraints would look only for the relevant person in the image as opposed to an unconstrained approach where each individual objects in the image are ranked.
no code implementations • 1 Sep 2018 • Ankit Shah, Harini Kesavamoorthy, Poorva Rane, Pramati Kalwad, Alexander Hauptmann, Florian Metze
Moments capture a huge part of our lives.
1 code implementation • 24 Apr 2018 • Ankit Shah, Anurag Kumar, Alexander G. Hauptmann, Bhiksha Raj
In this work, we first describe a CNN based approach for weakly supervised training of audio events.
no code implementations • NIPS Workshop on Machine Learning for Audio 2018 • Benjamin Elizalde, Rohan Badlani, Ankit Shah, Anurag Kumar, and Bhiksha Raj.
Sounds are essential to how humans perceive and interact with the world.
no code implementations • 2 Nov 2017 • Rohan Badlani, Ankit Shah, Benjamin Elizalde, Anurag Kumar, Bhiksha Raj
The framework crawls videos using search queries corresponding to 78 sound event labels drawn from three datasets.
no code implementations • 20 Sep 2016 • Benjamin Elizalde, Ankit Shah, Siddharth Dalmia, Min Hun Lee, Rohan Badlani, Anurag Kumar, Bhiksha Raj, Ian Lane
The audio event detectors are trained on the labeled audio and ran on the unlabeled audio downloaded from YouTube.