no code implementations • 8 Jan 2025 • Tudor Jianu, Shayan Doust, Mengyun Li, Baoru Huang, Tuong Do, Hoan Nguyen, Karl Bates, Tung D. Ta, Sebastiano Fichera, Pierre Berthet-Rayne, Anh Nguyen
Endovascular navigation is a crucial aspect of minimally invasive procedures, where precise control of curvilinear instruments like guidewires is critical for successful interventions.
no code implementations • 12 Dec 2024 • Marah Abdin, Jyoti Aneja, Harkirat Behl, Sébastien Bubeck, Ronen Eldan, Suriya Gunasekar, Michael Harrison, Russell J. Hewett, Mojan Javaheripi, Piero Kauffmann, James R. Lee, Yin Tat Lee, Yuanzhi Li, Weishung Liu, Caio C. T. Mendes, Anh Nguyen, Eric Price, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Xin Wang, Rachel Ward, Yue Wu, Dingli Yu, Cyril Zhang, Yi Zhang
We present phi-4, a 14-billion parameter language model developed with a training recipe that is centrally focused on data quality.
no code implementations • 4 Dec 2024 • Nandini Gadhia, Michalis Smyrnakis, Po-Yu Liu, Damer Blake, Melanie Hay, Anh Nguyen, Dominic Richards, Dong Xia, Ritesh Krishna
In this article, a novel graph theoretic approach is proposed to infer a co-occurrence network from 16S microbiome data.
no code implementations • 3 Dec 2024 • Viet Nguyen, Anh Nguyen, Trung Dao, Khoi Nguyen, Cuong Pham, Toan Tran, Anh Tran
However, our study reveals its instability when handling different diffusion model backbones due to using a fixed guidance scale within the Variational Score Distillation (VSD) loss.
1 code implementation • 2 Dec 2024 • Sandesh Pokhrel, Sanjay Bhandari, Sharib Ali, Tryphon Lambrou, Anh Nguyen, Yash Raj Shrestha, Angus Watson, Danail Stoyanov, Prashnna Gyawali, Binod Bhattarai
Evaluations across multiple deep learning architectures and two publicly available benchmarks, Kvasir2 and Gastrovision, demonstrate the effectiveness of our approach compared to several state-of-the-art methods.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
1 code implementation • 23 Nov 2024 • Trong Thang Pham, Ngoc-Vuong Ho, Nhat-Tan Bui, Thinh Phan, Patel Brijesh, Donald Adjeroh, Gianfranco Doretto, Anh Nguyen, Carol C. Wu, Hien Nguyen, Ngan Le
Unlike existing datasets that include a raw sequence of gaze alongside a report, with significant misalignment between gaze location and report content, our FG-CXR dataset offers a more grained alignment between gaze attention and diagnosis transcript.
no code implementations • 22 Nov 2024 • Yuhang Song, Mario Gianni, Chenguang Yang, Kunyang Lin, Te-Chuan Chiu, Anh Nguyen, Chun-Yi Lee
To validate the proposed methodology, we conduct a series of experiments to assess the effectiveness of the enriched embeddings on fine-grained vision negatives.
no code implementations • 9 Nov 2024 • Zhaorui Tan, Xi Yang, Tan Pan, Tianyi Liu, Chen Jiang, Xin Guo, Qiufeng Wang, Anh Nguyen, Yuan Qi, Kaizhu Huang, Yuan Cheng
We validate the feasibility and benefits of learning a personalized ${X}_h$, showing that this representation is highly generalizable and transferable across various multi-modal medical tasks.
no code implementations • 3 Nov 2024 • Lu Qian, Yuqi Wang, Zimu Wang, Haiyang Zhang, Wei Wang, Ting Yu, Anh Nguyen
In domain-specific contexts, particularly mental health, abstractive summarization requires advanced techniques adept at handling specialized content to generate domain-relevant and faithful summaries.
no code implementations • 29 Oct 2024 • Tudor Jianu, Baoru Huang, Hoan Nguyen, Binod Bhattarai, Tuong Do, Erman Tjiputra, Quang Tran, Pierre Berthet-Rayne, Ngan Le, Sebastiano Fichera, Anh Nguyen
Endovascular surgical tool reconstruction represents an important factor in advancing endovascular tool navigation, which is an important step in endovascular surgery.
1 code implementation • 15 Oct 2024 • Bishal Thapaliya, Anh Nguyen, Yao Lu, Tian Xie, Igor Grudetskyi, Fudong Lin, Antonios Valkanas, Jingyu Liu, Deepayan Chakraborty, Bilel Fehri
We propose the Enhanced Cluster-aware Graph Network (ECGN), a novel method that addresses these issues by integrating cluster-specific training with synthetic node generation.
no code implementations • 15 Oct 2024 • Konstantinos Panagiotis Alexandridis, Ismail Elezi, Jiankang Deng, Anh Nguyen, Shan Luo
FRACAL devises a logit adjustment method that utilises the fractal dimension to estimate how uniformly classes are distributed in image space.
1 code implementation • 6 Oct 2024 • Zhaorui Tan, Xi Yang, Qiufeng Wang, Anh Nguyen, Kaizhu Huang
Vision models excel in image classification but struggle to generalize to unseen data, such as classifying images from unseen domains or discovering novel categories.
no code implementations • 26 Sep 2024 • Nghia Nguyen, Minh Nhat Vu, Tung D. Ta, Baoru Huang, Thieu Vo, Ngan Le, Anh Nguyen
Vision language models have played a key role in extracting meaningful features for various robotic applications.
no code implementations • 22 Sep 2024 • Huy Hoang Nguyen, An Vuong, Anh Nguyen, Ian Reid, Minh Nhat Vu
Grasp detection is a fundamental robotic task critical to the success of many industrial applications.
no code implementations • 23 Aug 2024 • Baoru Huang, Tuan Vo, Chayun Kongtongvattana, Giulio Dagnino, Dennis Kundrat, Wenqiang Chi, Mohamed Abdelaziz, Trevor Kwok, Tudor Jianu, Tuong Do, Hieu Le, Minh Nguyen, Hoan Nguyen, Erman Tjiputra, Quang Tran, Jianyang Xie, Yanda Meng, Binod Bhattarai, Zhaorui Tan, Hongbin Liu, Hong Seng Gan, Wei Wang, Xi Yang, Qiufeng Wang, Jionglong Su, Kaizhu Huang, Angelos Stefanidis, Min Guo, Bo Du, Rong Tao, Minh Vu, Guoyan Zheng, Yalin Zheng, Francisco Vasconcelos, Danail Stoyanov, Daniel Elson, Ferdinando Rodriguez y Baena, Anh Nguyen
Real-time visual feedback from catheterization analysis is crucial for enhancing surgical safety and efficiency during endovascular interventions.
no code implementations • 29 Jul 2024 • Tuan Van Vo, Minh Nhat Vu, Baoru Huang, An Vuong, Ngan Le, Thieu Vo, Anh Nguyen
Our work introduces a new framework for language-driven grasp detection, paving the way for language-driven robotic applications.
no code implementations • 26 Jul 2024 • Nhat Le, Khoa Do, Xuan Bui, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen
Our method achieves high-fidelity group dance motion and enables the generation with an unlimited number of dancers while consuming only a minimal and constant amount of memory.
Ranked #1 on Motion Synthesis on AIOZ-GDANCE
no code implementations • 25 Jul 2024 • Nghia Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Ngan Le, Thieu Vo, Anh Nguyen
Language-driven grasp detection is a fundamental yet challenging task in robotics with various industrial applications.
no code implementations • 18 Jul 2024 • Toan Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Quan Vuong, Ngan Le, Thieu Vo, Anh Nguyen
In this paper, we present a new approach for language-driven 6-DoF grasp detection in cluttered point clouds.
1 code implementation • 11 Jul 2024 • Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo
In this work, we delve deeper in this phenomenon by performing a comprehensive statistical analysis in the classification and intermediate layers of both balanced and imbalanced networks and we empirically show that aligning the activation function with the data distribution, enhances the performance in both balanced and imbalanced tasks.
Ranked #11 on Long-tail Learning on Places-LT
1 code implementation • 4 Jul 2024 • Qinkai Yu, Jianyang Xie, Anh Nguyen, He Zhao, Jiong Zhang, Huazhu Fu, Yitian Zhao, Yalin Zheng, Yanda Meng
Diabetic retinopathy (DR) is a complication of diabetes and usually takes decades to reach sight-threatening levels.
no code implementations • CVPR 2024 • An Dinh Vuong, Minh Nhat Vu, Baoru Huang, Nghia Nguyen, Hieu Le, Thieu Vo, Anh Nguyen
We approach the language-driven grasp detection task as a conditional generation problem.
no code implementations • 22 Apr 2024 • Marah Abdin, Jyoti Aneja, Hany Awadalla, Ahmed Awadallah, Ammar Ahmad Awan, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Qin Cai, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Weizhu Chen, Yen-Chun Chen, Yi-Ling Chen, Hao Cheng, Parul Chopra, Xiyang Dai, Matthew Dixon, Ronen Eldan, Victor Fragoso, Jianfeng Gao, Mei Gao, Min Gao, Amit Garg, Allie Del Giorno, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Wenxiang Hu, Jamie Huynh, Dan Iter, Sam Ade Jacobs, Mojan Javaheripi, Xin Jin, Nikos Karampatziakis, Piero Kauffmann, Mahoud Khademi, Dongwoo Kim, Young Jin Kim, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Yunsheng Li, Chen Liang, Lars Liden, Xihui Lin, Zeqi Lin, Ce Liu, Liyuan Liu, Mengchen Liu, Weishung Liu, Xiaodong Liu, Chong Luo, Piyush Madan, Ali Mahmoudzadeh, David Majercak, Matt Mazzola, Caio César Teodoro Mendes, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Liliang Ren, Gustavo de Rosa, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Yelong Shen, Swadheen Shukla, Xia Song, Masahiro Tanaka, Andrea Tupini, Praneetha Vaddamanu, Chunyu Wang, Guanhua Wang, Lijuan Wang, Shuohang Wang, Xin Wang, Yu Wang, Rachel Ward, Wen Wen, Philipp Witte, Haiping Wu, Xiaoxia Wu, Michael Wyatt, Bin Xiao, Can Xu, Jiahang Xu, Weijian Xu, Jilong Xue, Sonali Yadav, Fan Yang, Jianwei Yang, Yifan Yang, ZiYi Yang, Donghan Yu, Lu Yuan, Chenruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou
We introduce phi-3-mini, a 3. 8 billion parameter language model trained on 3. 3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3. 5 (e. g., phi-3-mini achieves 69% on MMLU and 8. 38 on MT-bench), despite being small enough to be deployed on a phone.
Ranked #5 on MMR total on MRR-Benchmark (using extra training data)
no code implementations • 20 Apr 2024 • Baoru Huang, Yida Wang, Anh Nguyen, Daniel Elson, Francisco Vasconcelos, Danail Stoyanov
In surgical oncology, screening colonoscopy plays a pivotal role in providing diagnostic assistance, such as biopsy, and facilitating surgical navigation, particularly in polyp detection.
no code implementations • 14 Apr 2024 • Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De
Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models.
no code implementations • 11 Apr 2024 • Olatunji Mumini Omisore, Toluwanimi Akinyemi, Anh Nguyen, Lei Wang
Thus, we offer a less expensive method for real-time tool segmentation and tracking during robot-assisted cardiac catheterization.
1 code implementation • 8 Apr 2024 • Giang Nguyen, Mohammad Reza Taesiri, Sunnie S. Y. Kim, Anh Nguyen
We build CHM-Corr++, an interactive interface for CHM-Corr, enabling users to edit the feature importance map provided by CHM-Corr and observe updated model decisions.
1 code implementation • 2 Apr 2024 • Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen
The results reveal that Samba achieved unparalleled performance on commonly used remote sensing datasets for semantic segmentation.
no code implementations • 19 Mar 2024 • Simon Klüttermann, Jérôme Rutinowski, Anh Nguyen, Britta Grimme, Moritz Roidl, Emmanuel Müller
In this contribution, we introduce a novel ensemble method for the re-identification of industrial entities, using images of chipwood pallets and galvanized metal plates as dataset examples.
1 code implementation • 18 Mar 2024 • Minh Tran, Winston Bounsavy, Khoa Vo, Anh Nguyen, Tri Nguyen, Ngan Le
Consequently, this compromised quality of visible features during the subsequent visible-to-amodal transition.
1 code implementation • 17 Jan 2024 • Tudor Jianu, Baoru Huang, Tuan Vo, Minh Nhat Vu, Jingxuan Kang, Hoan Nguyen, Olatunji Omisore, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen
Endovascular robots have been actively developed in both academia and industry.
no code implementations • 22 Dec 2023 • Tin Nguyen, Anh Nguyen
Training CNNs and ViTs with habitat-augmented data results in an improvement of up to +0. 83 and +0. 23 points on NABirds and CUB-200, respectively.
1 code implementation • 15 Dec 2023 • Huy Le, Tung Kieu, Anh Nguyen, Ngan Le
Text-video retrieval, a prominent sub-field within the domain of multimodal information retrieval, has witnessed remarkable growth in recent years.
no code implementations • 14 Dec 2023 • Bingbin Liu, Sebastien Bubeck, Ronen Eldan, Janardhan Kulkarni, Yuanzhi Li, Anh Nguyen, Rachel Ward, Yi Zhang
Specifically for solving grade school math, the smallest model size so far required to break the 80\% barrier on the GSM8K benchmark remains to be 34B.
Ranked #62 on Arithmetic Reasoning on GSM8K
no code implementations • CVPR 2024 • Mohammad Reza Taesiri, Tianjun Feng, Anh Nguyen, Cor-Paul Bezemer
To address this gap, we introduce GlitchBench, a novel benchmark derived from video game quality assurance tasks, to test and evaluate the reasoning capabilities of LMMs.
no code implementations • 20 Nov 2023 • Zimu Wang, Wei Wang, Qi Chen, Qiufeng Wang, Anh Nguyen
Deep learning-based natural language processing (NLP) models, particularly pre-trained language models (PLMs), have been revealed to be vulnerable to adversarial attacks.
no code implementations • 19 Nov 2023 • Chayun Kongtongvattana, Baoru Huang, Jingxuan Kang, Hoan Nguyen, Olajide Olufemi, Anh Nguyen
By computing the cosine similarity between these feature vectors, we gain a nuanced understanding of image similarity that goes beyond the limitations of traditional overlap-based measures.
no code implementations • 19 Nov 2023 • Tudor Jianu, Baoru Huang, Pierre Berthet-Rayne, Sebastiano Fichera, Anh Nguyen
Endovascular navigation, essential for diagnosing and treating endovascular diseases, predominantly hinges on fluoroscopic images due to the constraints in sensory feedback.
no code implementations • 31 Oct 2023 • Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De
In the era of the Internet of Things (IoT), the retrieval of relevant medical information has become essential for efficient clinical decision-making.
1 code implementation • 29 Oct 2023 • Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen
Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications.
Ranked #2 on Motion Synthesis on AIOZ-GDANCE
1 code implementation • NeurIPS 2023 • An Vuong, Minh Nhat Vu, Toan Tien Nguyen, Baoru Huang, Dzung Nguyen, Thieu Vo, Anh Nguyen
In this paper, we propose a language-driven scene synthesis task, which is a new task that integrates text prompts, human motion, and existing objects for scene synthesis.
Ranked #1 on Indoor Scene Synthesis on PRO-teXt
1 code implementation • 5 Oct 2023 • Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le
Open-Fusion harnesses the power of a pre-trained vision-language foundation model (VLFM) for open-set semantic comprehension and employs the Truncated Signed Distance Function (TSDF) for swift 3D scene reconstruction.
1 code implementation • 28 Sep 2023 • Yuhang Song, Anh Nguyen, Chun-Yi Lee
This paper tackles the critical challenge of object navigation in autonomous navigation systems, particularly focusing on the problem of target approach and episode termination in environments with long optimal episode length in Deep Reinforcement Learning (DRL) based methods.
1 code implementation • 24 Sep 2023 • Trong Thang Pham, Jacob Brecheisen, Anh Nguyen, Hien Nguyen, Ngan Le
In the field of chest X-ray (CXR) diagnosis, existing works often focus solely on determining where a radiologist looks, typically through tasks such as detection, segmentation, or classification.
1 code implementation • 18 Sep 2023 • An Dinh Vuong, Minh Nhat Vu, Hieu Le, Baoru Huang, Binh Huynh, Thieu Vo, Andreas Kugi, Anh Nguyen
Foundation models such as ChatGPT have made significant strides in robotic tasks due to their universal representation of real-world domains.
1 code implementation • 7 Jul 2023 • Baoru Huang, Yicheng Hu, Anh Nguyen, Stamatia Giannarou, Daniel S. Elson
In surgical oncology, it is challenging for surgeons to identify lymph nodes and completely resect cancer even with pre-operative imaging systems like PET and CT, because of the lack of reliable intraoperative visualization tools.
1 code implementation • 20 Jun 2023 • An Dinh Vuong, Toan Tien Nguyen, Minh Nhat Vu, Baoru Huang, Dzung Nguyen, Huynh Thi Thanh Binh, Thieu Vo, Anh Nguyen
Visual navigation, a foundational aspect of Embodied AI (E-AI), has been significantly studied in the past few years.
no code implementations • 14 Jun 2023 • Ronast Subedi, Rebati Raman Gaire, Sharib Ali, Anh Nguyen, Danail Stoyanov, Binod Bhattarai
This paper presents a solution to the cross-domain adaptation problem for 2D surgical image segmentation, explicitly considering the privacy protection of distributed datasets belonging to different centers.
no code implementations • 9 May 2023 • Changyu Zeng, Wei Wang, Anh Nguyen, Yutao Yue
We first present an innovative taxonomy, categorizing the existing SSL methods into four broad categories based on the pretexts' characteristics.
no code implementations • 16 Apr 2023 • Jingxuan Kang, Tudor Jianu, Baoru Huang, Binod Bhattarai, Ngan Le, Frans Coenen, Anh Nguyen
In this paper, we propose a new method to translate simulation images from an endovascular simulator to X-ray images.
1 code implementation • CVPR 2023 • Nhat Le, Thang Pham, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen
The proposed dataset consists of 16. 7 hours of paired music and 3D motion from in-the-wild videos, covering 7 dance styles and 16 music genres.
Ranked #4 on Motion Synthesis on AIOZ-GDANCE
1 code implementation • 17 Mar 2023 • Trong-Thang Pham, Nhat Le, Tuong Do, Hung Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen
In this paper, we present a new method to generate talking head animation with learnable style references.
1 code implementation • 4 Mar 2023 • Toan Nguyen, Minh Nhat Vu, An Vuong, Dzung Nguyen, Thieu Vo, Ngan Le, Anh Nguyen
In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds.
no code implementations • 25 Feb 2023 • Lam Pham, Cam Le, Dat Ngo, Anh Nguyen, Jasmin Lampert, Alexander Schindler, Ian McLoughlin
In this paper, we present a high-performance and light-weight deep learning model for Remote Sensing Image Classification (RSIC), the task of identifying the aerial scene of a remote sensing image.
1 code implementation • 25 Jan 2023 • Cong Dao Tran, Nhut Huy Pham, Anh Nguyen, Truong Son Hy, Tu Vu
This paper presents ViDeBERTa, a new pre-trained monolingual language model for Vietnamese, with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large, which are pre-trained on a large-scale corpus of high-quality and diverse Vietnamese texts using DeBERTa architecture.
no code implementations • 4 Nov 2022 • Anh Nguyen, Galen Pogoncheff, Ban Xuan Dong, Nam Bui, Hoang Truong, Nhat Pham, Linh Nguyen, Hoang Huu Nguyen, Sy Duong-Quy, Sangtae Ha, Tam Vu
Various intervention therapies ranging from pharmaceutical to hi-tech tailored solutions have been available to treat difficulty in falling asleep commonly caused by insomnia in modern life.
1 code implementation • 27 Oct 2022 • Zhaorui Tan, Xi Yang, Zihan Ye, Qiufeng Wang, Yuyao Yan, Anh Nguyen, Kaizhu Huang
Generating consistent and high-quality images from given texts is essential for visual-language understanding.
1 code implementation • 21 Sep 2022 • Nhat Le, Khanh Nguyen, Quang Tran, Erman Tjiputra, Bac Le, Anh Nguyen
In this paper, we propose a new uncertainty-aware label distribution learning method to improve the robustness of deep models against uncertainty and ambiguity.
Facial Expression Recognition Facial Expression Recognition (FER)
1 code implementation • 11 Sep 2022 • Konstantinos Panagiotis Alexandridis, Shan Luo, Anh Nguyen, Jiankang Deng, Stefanos Zafeiriou
The long-tailed distribution is a common phenomenon in the real world.
3 code implementations • 17 Aug 2022 • Baoru Huang, Jian-Qing Zheng, Anh Nguyen, Chi Xu, Ioannis Gkouzionis, Kunal Vyas, David Tuch, Stamatia Giannarou, Daniel S. Elson
Depth estimation is a crucial step for image-guided intervention in robotic surgery and laparoscopic imaging system.
1 code implementation • 26 Jul 2022 • Giang Nguyen, Mohammad Reza Taesiri, Anh Nguyen
Via a large-scale, human study on ImageNet and CUB, our correspondence-based explanations are found to be more useful to users than kNN explanations.
Ranked #68 on Fine-Grained Image Classification on CUB-200-2011
1 code implementation • 22 Jul 2022 • Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo
Major advancements have been made in the field of object detection and segmentation recently.
Ranked #14 on Instance Segmentation on LVIS v1.0 val
1 code implementation • 21 Jul 2022 • Bei Chen, Fengji Zhang, Anh Nguyen, Daoguang Zan, Zeqi Lin, Jian-Guang Lou, Weizhu Chen
A natural way to evaluate the quality and correctness of a code solution is to run it against a set of test cases, but the manual creation of such test cases is often costly and time-consuming.
Ranked #1 on Code Generation on APPS
1 code implementation • ICCV 2023 • Tuong Do, Binh X. Nguyen, Vuong Pham, Toan Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen
In this paper, we present a new multigraph topology for cross-silo federated learning.
1 code implementation • 19 Jul 2022 • Thang M. Pham, Seunghyun Yoon, Trung Bui, Anh Nguyen
While contextualized word embeddings have been a de-facto standard, learning contextualized phrase embeddings is less explored and being hindered by the lack of a human-annotated benchmark that tests machine understanding of phrase semantics given a context sentence or paragraph (instead of phrases alone).
no code implementations • 25 May 2022 • Mehdi Nourelahi, Lars Kotthoff, Peijie Chen, Anh Nguyen
Here, we perform the first, large-scale evaluation of the relations of the three criteria using 9 feature-importance methods and 12 ImageNet-trained CNNs that are of 3 training algorithms and 5 CNN architectures.
1 code implementation • 21 May 2022 • Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen
We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets.
Ranked #6 on Fine-Grained Image Classification on Stanford Dogs
2 code implementations • 1 Jan 2022 • Thuy T. Do, Du Nguyen, Anh Le, Anh Nguyen, Dong Nguyen, Nga Hoang, Uyen Le, Tuan Tran
This paper studies the reactions of social network users on the recommendation of using HCQ for COVID-19 treatment by analyzing the reaction patterns and sentiment of the tweets.
1 code implementation • CVPR 2022 • Hai Phan, Anh Nguyen
Face identification (FI) is ubiquitous and drives many high-stake decisions made by law enforcement.
1 code implementation • 7 Nov 2021 • Nhat Le, Khanh Nguyen, Anh Nguyen, Bac Le
Our network is designed to extract features from both facial and context regions independently, then learn them together using the attention module.
1 code implementation • 22 Oct 2021 • Thang M. Pham, Trung Bui, Long Mai, Anh Nguyen
We find two reasons why IM is not better than LOO: (1) deleting a single word from the input only marginally reduces a classifier's accuracy; and (2) a highly predictable word is always given near-zero attribution, regardless of its true importance to the classifier.
1 code implementation • 12 Oct 2021 • Anh Nguyen, Tuong Do, Minh Tran, Binh X. Nguyen, Chien Duong, Tu Phan, Erman Tjiputra, Quang D. Tran
We design a new Federated Autonomous Driving network (FADNet) that can improve the model stability, ensure convergence, and handle imbalanced data distribution problems while is being trained with federated learning methods.
no code implementations • 7 Oct 2021 • Mohamed Ghafoor, Anh Nguyen
Introduction: The extracellular matrix (ECM) is a networkof proteins and carbohydrates that has a structural and bio-chemical function.
2 code implementations • 6 Oct 2021 • Binh X. Nguyen, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen
Bridging the semantic gap between image and question is an important step to improve the accuracy of the Visual Question Answering (VQA) task.
Ranked #1 on Visual Question Answering (VQA) on GQA test-dev
1 code implementation • 4 Oct 2021 • Minh Q. Tran, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen
We design the student network such as it is light-weight and well suitable for deployment on a typical CPU.
no code implementations • 27 Sep 2021 • Anh Lam, Anh Nguyen, Bac Le
An attention gates filter features from the contraction path before combining with features on the expansion path, it enables our model to reduce the effect of non-traffic region features and focus more on crucial region features.
1 code implementation • 27 Sep 2021 • Thai-Vu Nguyen, Anh Nguyen, Nghia Le, Bac Le
Domain adaptation is a potential method to train a powerful deep neural network, which can handle the absence of labeled data.
no code implementations • 24 Aug 2021 • Xue Hu, Anh Nguyen, Ferdinando Rodriguez y Baena
In practice, by using a high-quality commercial RGB-D camera, our proposed visual tracking method achieves an accuracy of 1-2 degress and 2-4 mm on a model knee, which meets the standard for clinical applications.
no code implementations • 9 Jul 2021 • Baoru Huang, Jianqing Zheng, Anh Nguyen, David Tuch, Kunal Vyas, Stamatia Giannarou, Daniel S. Elson
Dense depth estimation and 3D reconstruction of a surgical scene are crucial steps in computer assisted surgery.
1 code implementation • 13 Jun 2021 • Renan A. Rojas-Gomez, Raymond A. Yeh, Minh N. Do, Anh Nguyen
Despite unconditional feature inversion being the foundation of many image synthesis applications, training an inverter demands a high computational budget, large decoding capacity and imposing conditions such as autoregressive priors.
1 code implementation • ICML Workshop INNF 2021 • Michael A. Alcorn, Anh Nguyen
In this paper, we propose an alternative approach for encoding feature identities, where each feature's identity is included alongside its value in the input.
1 code implementation • NeurIPS 2021 • Giang Nguyen, Daeyoung Kim, Anh Nguyen
Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stake applications.
2 code implementations • 19 May 2021 • Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran, Anh Nguyen
However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized.
Ranked #5 on Medical Visual Question Answering on PathVQA
1 code implementation • NeurIPS 2021 • Michael A. Alcorn, Anh Nguyen
In many multi-agent spatiotemporal systems, agents operate under the influence of shared, unobserved variables (e. g., the play a team is executing in a game of basketball).
Ranked #1 on Trajectory Modeling on NBA SportVU
1 code implementation • 14 Apr 2021 • Binh X. Nguyen, Binh D. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen
In this paper, we propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph, namely Graph-based Person Signature, and utilize Graph Convolutional Networks to learn the topological structure of the visual signature of a person.
Ranked #52 on Person Re-Identification on DukeMTMC-reID
no code implementations • 5 Apr 2021 • Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham
This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network.
1 code implementation • 4 Mar 2021 • Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller
In this paper, we propose a novel framework that can capture both the semantic and the paralinguistic information in the signal.
Speech Emotion Recognition Sound Audio and Speech Processing
1 code implementation • 20 Feb 2021 • Anh Nguyen, Anh Tran
With the thriving of deep learning and the widespread practice of using pre-trained networks, backdoor attacks have become an increasing security threat drawing many research interests in recent years.
1 code implementation • NeurIPS 2021 • Michael A. Alcorn, Anh Nguyen
Multi-agent spatiotemporal modeling is a challenging task from both an algorithmic design and computational complexity perspective.
no code implementations • Findings (ACL) 2021 • Thang M. Pham, Trung Bui, Long Mai, Anh Nguyen
Encouraging classifiers to capture word order information improves the performance on most GLUE tasks, SQuAD 2. 0 and out-of-samples.
Natural Language Inference Natural Language Understanding +2
no code implementations • 26 Dec 2020 • Dat Ngo, Lam Pham, Anh Nguyen, Ben Phan, Khoa Tran, Truong Nguyen
This paper proposes a robust deep learning framework used for classifying anomaly of respiratory cycles.
no code implementations • COLING 2020 • Kiet Nguyen, Vu Nguyen, Anh Nguyen, Ngan Nguyen
Due to the lack of benchmark datasets for Vietnamese, we present the Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset for the low-resource language as Vietnamese to evaluate MRC models.
1 code implementation • NeurIPS 2020 • Anh Nguyen, Anh Tran
In recent years, neural backdoor attack has been considered to be a potential security threat to deep learning systems.
no code implementations • 31 Jul 2020 • Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran
In this work, we propose a multimodal fusion approach to address the problem of autonomous navigation in complex environments such as collapsed cites, or natural caves.
Robotics
1 code implementation • 2 Jul 2020 • Ella M. Gale, Nicholas Martin, Ryan Blything, Anh Nguyen, Jeffrey S. Bowers
We find that the different measures provide different estimates of object selectivity, with precision and CCMAS measures providing misleadingly high estimates.
1 code implementation • 17 Jun 2020 • Hai-Long Trieu, Thy Thy Tran, Khoa N A Duong, Anh Nguyen, Makoto Miwa, Sophia Ananiadou
Motivation Recent neural approaches on event extraction from text mainly focus on flat events in general domain, while there are less attempts to detect nested and overlapping events.
Ranked #1 on Event Extraction on GENIA 2013
no code implementations • 16 Jun 2020 • Anh Nguyen, Dennis Kundrat, Giulio Dagnino, Wenqiang Chi, Mohamed E. M. K. Abdelaziz, Yao Guo, YingLiang Ma, Trevor M. Y. Kwok, Celia Riga, Guang-Zhong Yang
In this paper, we present FW-Net, an end-to-end and real-time deep learning framework for endovascular intervention.
1 code implementation • 16 Jun 2020 • Peijie Chen, Chirag Agarwal, Anh Nguyen
Increasingly more similarities between human vision and convolutional neural networks (CNNs) have been revealed in the past few years.
1 code implementation • CVPR 2020 • Naman Bansal, Chirag Agarwal, Anh Nguyen
Attribution methods can provide powerful insights into the reasons for a classifier's decision.
1 code implementation • 10 Oct 2019 • Qi Li, Long Mai, Michael A. Alcorn, Anh Nguyen
Large, pre-trained generative models have been increasingly popular and useful to both the research and wider communities.
1 code implementation • 9 Oct 2019 • Chirag Agarwal, Anh Nguyen
Perturbation-based explanation methods often measure the contribution of an input feature to an image classifier's outputs by heuristically removing it via e. g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples.
no code implementations • 25 Sep 2019 • Chirag Agarwal, Dan Schonfeld, Anh Nguyen
Interpretability methods often measure the contribution of an input feature to an image classifier's decisions by heuristically removing it via e. g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples.
1 code implementation • 18 Apr 2019 • Anh Nguyen, Jason Yosinski, Jeff Clune
A neuroscience method to understanding the brain is to find and study the preferred stimuli that highly activate an individual cell or groups of cells.
no code implementations • 23 Mar 2019 • Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis
We propose V2CNet, a new deep learning framework to automatically translate the demonstration videos to commands that can be directly used in robotic applications.
no code implementations • 23 Mar 2019 • Anh Nguyen
In this study, our long-term goal is to bridge the gap between computer vision and robotics by developing visual methods that can be used in real robots.
1 code implementation • CVPR 2019 • Michael A. Alcorn, Qi Li, Zhitao Gong, Chengfei Wang, Long Mai, Wei-Shinn Ku, Anh Nguyen
Using our framework and a self-assembled dataset of 3D objects, we investigate the vulnerability of DNNs to OoD poses of well-known objects in ImageNet.
no code implementations • 1 Nov 2018 • Chirag Agarwal, Anh Nguyen, Dan Schonfeld
Intuitively, the center loss encourages DNNs to simultaneously learns a center for the deep features of each class, and minimize the distances between the intra-class deep features and their corresponding class centers.
no code implementations • 27 Sep 2018 • Ella M. Gale, Anh Nguyen, Ryan Blything, Nicholas Martin and Jeffrey S. Bowers
These findings highlight the problem with current selectivity measures and show that new measures are required in order to provide a better assessment of learned representations in NNs.
1 code implementation • 23 Apr 2018 • Vishaal Munusamy Kabilan, Brandon Morris, Anh Nguyen
Training deep neural networks on images represented as grids of pixels has brought to light an interesting phenomenon known as adversarial examples.
1 code implementation • 16 Mar 2018 • Anh Nguyen, Thanh-Toan Do, Ian Reid, Darwin G. Caldwell, Nikos G. Tsagarakis
The key idea of our approach is the use of object descriptions to provide the detailed understanding of an object.
no code implementations • 9 Mar 2018 • Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter, Laurent Keller, Carole Knibbe, Peter Krcah, Richard E. Lenski, Hod Lipson, Robert MacCurdy, Carlos Maestre, Risto Miikkulainen, Sara Mitri, David E. Moriarty, Jean-Baptiste Mouret, Anh Nguyen, Charles Ofria, Marc Parizeau, David Parsons, Robert T. Pennock, William F. Punch, Thomas S. Ray, Marc Schoenauer, Eric Shulte, Karl Sims, Kenneth O. Stanley, François Taddei, Danesh Tarapore, Simon Thibault, Westley Weimer, Richard Watson, Jason Yosinski
Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them.
no code implementations • 3 Dec 2017 • Nader Akoury, Anh Nguyen
In this paper we propose Spatial PixelCNN, a conditional autoregressive model that generates images from small patches.
no code implementations • 1 Oct 2017 • Anh Nguyen, Dimitrios Kanoulas, Luca Muratore, Darwin G. Caldwell, Nikos G. Tsagarakis
We present a new method to translate videos to commands for robotic manipulation using Deep Recurrent Neural Networks (RNN).
2 code implementations • 21 Sep 2017 • Thanh-Toan Do, Anh Nguyen, Ian Reid
We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images.
1 code implementation • 22 Aug 2017 • Anh Nguyen, Thanh-Toan Do, Darwin G. Caldwell, Nikos G. Tsagarakis
Our method first creates the event image from a list of events that occurs in a very short time interval, then a Stacked Spatial LSTM Network (SP-LSTM) is used to learn the camera pose.
no code implementations • 16 Mar 2017 • Mohammed Sadegh Norouzzadeh, Anh Nguyen, Margaret Kosmala, Ali Swanson, Meredith Palmer, Craig Packer, Jeff Clune
Having accurate, detailed, and up-to-date information about the location and behavior of animals in the wild would revolutionize our ability to study and conserve ecosystems.
1 code implementation • CVPR 2017 • Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski
PPGNs are composed of 1) a generator network G that is capable of drawing a wide range of image types and 2) a replaceable "condition" network C that tells the generator what to draw.
5 code implementations • NeurIPS 2016 • Anh Nguyen, Alexey Dosovitskiy, Jason Yosinski, Thomas Brox, Jeff Clune
Understanding the inner workings of such computational brains is both fascinating basic science that is interesting in its own right - similar to why we study the human brain - and will enable researchers to further improve DNNs.
no code implementations • 11 Feb 2016 • Anh Nguyen, Jason Yosinski, Jeff Clune
Here, we introduce an algorithm that explicitly uncovers the multiple facets of each neuron by producing a synthetic visualization of each of the types of images that activate a neuron.
7 code implementations • 22 Jun 2015 • Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, Hod Lipson
The first is a tool that visualizes the activations produced on each layer of a trained convnet as it processes an image or video (e. g. a live webcam stream).
2 code implementations • CVPR 2015 • Anh Nguyen, Jason Yosinski, Jeff Clune
Here we show a related result: it is easy to produce images that are completely unrecognizable to humans, but that state-of-the-art DNNs believe to be recognizable objects with 99. 99% confidence (e. g. labeling with certainty that white noise static is a lion).