no code implementations • 28 Dec 2024 • Shuo Wang, Wanting Li, Yongcai Wang, Zhaoxin Fan, Zhe Huang, Xudong Cai, Jian Zhao, Deying Li
To address this challenge, this paper proposes MambaVO, which conducts robust initialization, Mamba-based sequential matching refinement, and smoothed training to enhance the matching quality and improve the pose estimation in deep visual odometry.
2 code implementations • 10 Nov 2024 • Xiaowei Yu, Zhe Huang, Zao Zhang
In this study, we propose a novel Feature Fusion Transferability Aware Transformer (FFTAT) to enhance ViT performance in UDA tasks.
Ranked #1 on Domain Adaptation on VisDA2017
no code implementations • 16 Sep 2024 • Neeloy Chakraborty, Yixiao Fang, Andre Schreiber, Tianchen Ji, Zhe Huang, Aganze Mihigo, Cassidy Wall, Abdulrahman Almana, Katherine Driggs-Campbell
Teleoperation is an important technology to enable supervisors to control agricultural robots remotely.
no code implementations • 8 Sep 2024 • Shuo Liu, Zhe Huang, Jun Zeng, Koushil Sreenath, Calin A. Belta
Most of this research limits discussions to CBFs with relative degree one with respect to the system dynamics.
no code implementations • 17 Aug 2024 • Shuo Wang, Yongcai Wang, Zhimin Xu, Yongyu Guo, Wanting Li, Zhe Huang, Xuewei Bai, Deying Li
GSLAMOT utilizes camera and LiDAR multimodal information as inputs and divides the representation of the dynamic scene into a semantic map for representing the static environment, a trajectory of the ego-agent, and an online maintained Tracklet Graph (TG) for tracking and predicting the 3D poses of the detected mobile objects.
1 code implementation • 1 Aug 2024 • Zhe Huang, Shuo Wang, Yongcai Wang, Wanting Li, Deying Li, Lei Wang
However, in collaborative perception, the quality of object detection based on a modality is highly sensitive to the relative pose errors among the agents.
no code implementations • 19 Jun 2024 • Zhe Huang, John Pohovey, Ananya Yammanuru, Katherine Driggs-Campbell
Large Language Models (LLM) and Vision Language Models (VLM) enable robots to ground natural language prompts into control actions to achieve tasks in an open world.
no code implementations • 17 May 2024 • Zhe Huang, Yizhe Zhao, Hao Xiao, Chenyan Wu, Lingting Ge
Recent advances in multi-view camera-only 3D object detection either rely on an accurate reconstruction of bird's-eye-view (BEV) 3D features or on traditional 2D perspective view (PV) image features.
1 code implementation • 10 Apr 2024 • Wenqian Li, Haozhi Wang, Zhe Huang, Yan Pang
Wasserstein distance is a principle measure of data divergence from a distributional standpoint.
2 code implementations • 15 Mar 2024 • Zhe Huang, Xiaowei Yu, Dajiang Zhu, Michael C. Hughes
In this paper, we introduce InterLUDE, a new approach to enhance SSL made of two parts that each benefit from labeled-unlabeled interaction.
no code implementations • 9 Mar 2024 • Zhe Huang, Xiaowei Yu, Benjamin S. Wessler, Michael C. Hughes
However, existing deep learning pipelines for assessing AS from echocardiograms have two key limitations.
1 code implementation • 25 Feb 2024 • Xudong Cai, Yongcai Wang, Zhe Huang, Yu Shao, Deying Li
Then the QPC is compressed by the same GPC, and is aggregated into a global descriptor by an attention-based aggregation module, to query the compressed Lidar map in the vector space.
no code implementations • 5 Dec 2023 • Yiqian Gan, Hao Xiao, Yizhe Zhao, Ethan Zhang, Zhe Huang, Xin Ye, Lingting Ge
Motion prediction has been an essential component of autonomous driving systems since it handles highly uncertain and complex scenarios involving moving agents of different types.
1 code implementation • 13 Oct 2023 • Peihua Mai, Ran Yan, Zhe Huang, Youjia Yang, Yan Pang
Large Language Models (LLMs) excel in natural language understanding by capturing hidden semantics in vector space.
3 code implementations • 19 Sep 2023 • Xiaowei Yu, Zhe Huang, Minheng Chen, Yao Xue, Tianming Liu, Dajiang Zhu
We theoretically prove the enhancement gained from positive noise by reducing the task complexity defined by information entropy and experimentally show the significant performance gain in large image datasets, such as the ImageNet.
no code implementations • 31 Jul 2023 • Soyeon Caren Han, Yihao Ding, Siwen Luo, Josiah Poon, HeeGuen Yoon, Zhe Huang, Paul Duuring, Eun Jung Holden
Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically.
1 code implementation • CVPR 2024 • Zhe Huang, Ruijie Jiang, Shuchin Aeron, Michael C. Hughes
Yet past benchmarks do not focus on medical tasks and rarely compare self- and semi- methods together on an equal footing.
no code implementations • 26 May 2023 • Zhe Huang, Yudian Li
Most generic object detectors are mainly built for standard object detection tasks such as COCO and PASCAL VOC.
1 code implementation • 25 May 2023 • Zhe Huang, Benjamin S. Wessler, Michael C. Hughes
To automate screening for AS, deep networks must learn to mimic a human expert's ability to identify views of the aortic valve then aggregate across these relevant images to produce a study-level diagnosis.
no code implementations • 23 Apr 2023 • Hongyu Sun, Yongcai Wang, Xudong Cai, Peng Wang, Zhe Huang, Deying Li, Yu Shao, Shuo Wang
To advance the research and practical solutions for bird strike prevention, in this paper, we present a large-scale challenging dataset AirBirds that consists of 118, 312 time-series images, where a total of 409, 967 bounding boxes of flying birds are manually, carefully annotated.
1 code implementation • 25 Aug 2022 • Zhe Huang, Mary-Joy Sidhom, Benjamin S. Wessler, Michael C. Hughes
Semi-supervised learning (SSL) promises improved accuracy compared to training classifiers on small labeled datasets by also training on many unlabeled images.
no code implementations • 21 Jul 2022 • Adam Villaflor, Zhe Huang, Swapnil Pande, John Dolan, Jeff Schneider
Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a generic sequence modeling problem.
no code implementations • 3 Jun 2022 • Fangfang Zhang, Zhe Huang, Lei Kou, Yang Li, Maoyong Cao, Fengying Ma
In this paper, a new 9D complex chaotic system with quaternion is proposed for the encryption of smart grid data.
no code implementations • 27 May 2022 • Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han
We propose V-Doc, a question-answering tool using document images and PDF, mainly for researchers and general non-deep learning experts looking to generate, process, and understand the document visual question answering tasks.
2 code implementations • 3 Mar 2022 • Shuijing Liu, Peixin Chang, Zhe Huang, Neeloy Chakraborty, Kaiwen Hong, Weihang Liang, D. Livingston McPherson, Junyi Geng, Katherine Driggs-Campbell
We study the problem of safe and intention-aware robot navigation in dense and interactive crowds.
no code implementations • CVPR 2022 • Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han
We propose V-Doc, a question-answering tool using document images and PDF, mainly for researchers and general non-deep learning experts looking to generate, process, and understand the document visual question answering tasks.
1 code implementation • 30 Jul 2021 • Zhe Huang, Gary Long, Benjamin Wessler, Michael C. Hughes
Semi-supervised image classification has shown substantial progress in learning from limited labeled data, but recent advances remain largely untested for clinical applications.
1 code implementation • 15 Jul 2021 • Zhe Huang, Ruohua Li, Kazuki Shin, Katherine Driggs-Campbell
Multi-pedestrian trajectory prediction is an indispensable element of autonomous systems that safely interact with crowds in unstructured environments.
no code implementations • 19 Sep 2020 • Nan Wu, Zhe Huang, Yiqiu Shen, Jungkyu Park, Jason Phang, Taro Makino, S. Gene Kim, Kyunghyun Cho, Laura Heacock, Linda Moy, Krzysztof J. Geras
Breast cancer is the most common cancer in women, and hundreds of thousands of unnecessary biopsies are done around the world at a tremendous cost.
1 code implementation • 30 Jun 2020 • Zhe Huang, Aamir Hasan, Kazuki Shin, Ruohua Li, Katherine Driggs-Campbell
Trajectory prediction is one of the key capabilities for robots to safely navigate and interact with pedestrians.
no code implementations • 12 Oct 2019 • Peter Du, Zhe Huang, Tianqi Liu, Ke Xu, Qichao Gao, Hussein Sibai, Katherine Driggs-Campbell, Sayan Mitra
As autonomous systems begin to operate amongst humans, methods for safe interaction must be investigated.
Robotics Multiagent Systems Signal Processing
no code implementations • 20 Sep 2019 • Zhe Huang, Weijiang Yu, Wayne Zhang, Litong Feng, Nong Xiao
Taking the residual result (the coarse de-rained result) between the rainy image sample (i. e. the input data) and the output of coarse stage (i. e. the learnt rain mask) as input, the fine stage continues to de-rain by removing the fine-grained rain streaks (e. g. light rain streaks and water mist) to get a rain-free and well-reconstructed output image via a unified contextual merging sub-network with dense blocks and a merging block.
2 code implementations • 20 Mar 2019 • Nan Wu, Jason Phang, Jungkyu Park, Yiqiu Shen, Zhe Huang, Masha Zorin, Stanisław Jastrzębski, Thibault Févry, Joe Katsnelson, Eric Kim, Stacey Wolfson, Ujas Parikh, Sushma Gaddam, Leng Leng Young Lin, Kara Ho, Joshua D. Weinstein, Beatriu Reig, Yiming Gao, Hildegard Toth, Kristine Pysarenko, Alana Lewin, Jiyon Lee, Krystal Airola, Eralda Mema, Stephanie Chung, Esther Hwang, Naziya Samreen, S. Gene Kim, Laura Heacock, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras
We present a deep convolutional neural network for breast cancer screening exam classification, trained and evaluated on over 200, 000 exams (over 1, 000, 000 images).