no code implementations • 13 Oct 2023 • Peihua Mai, Ran Yan, Zhe Huang, Youjia Yang, Yan Pang
Large Language Models (LLMs) shows powerful capability in natural language understanding by capturing hidden semantics in vector space.
no code implementations • 31 Jul 2023 • Soyeon Caren Han, Yihao Ding, Siwen Luo, Josiah Poon, HeeGuen Yoon, Zhe Huang, Paul Duuring, Eun Jung Holden
Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically.
1 code implementation • 18 Jul 2023 • Zhe Huang, Ruijie Jiang, Shuchin Aeron, Michael C. Hughes
This study contributes a carefully-designed benchmark to help answer a practitioner's key question: given a small labeled dataset and a limited budget of hours to spend on training, what gains from additional unlabeled images are possible and which methods best achieve them?
no code implementations • 26 May 2023 • Zhe Huang, Yudian Li
Most generic object detectors are mainly built for standard object detection tasks such as COCO and PASCAL VOC.
1 code implementation • 25 May 2023 • Zhe Huang, Benjamin S. Wessler, Michael C. Hughes
To automate screening for AS, deep networks must learn to mimic a human expert's ability to identify views of the aortic valve then aggregate across these relevant images to produce a study-level diagnosis.
no code implementations • 23 Apr 2023 • Hongyu Sun, Yongcai Wang, Xudong Cai, Peng Wang, Zhe Huang, Deying Li, Yu Shao, Shuo Wang
To advance the research and practical solutions for bird strike prevention, in this paper, we present a large-scale challenging dataset AirBirds that consists of 118, 312 time-series images, where a total of 409, 967 bounding boxes of flying birds are manually, carefully annotated.
1 code implementation • 25 Aug 2022 • Zhe Huang, Mary-Joy Sidhom, Benjamin S. Wessler, Michael C. Hughes
Semi-supervised learning (SSL) promises improved accuracy compared to training classifiers on small labeled datasets by also training on many unlabeled images.
no code implementations • 21 Jul 2022 • Adam Villaflor, Zhe Huang, Swapnil Pande, John Dolan, Jeff Schneider
Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a generic sequence modeling problem.
no code implementations • 3 Jun 2022 • Fangfang Zhang, Zhe Huang, Lei Kou, Yang Li, Maoyong Cao, Fengying Ma
In this paper, a new 9D complex chaotic system with quaternion is proposed for the encryption of smart grid data.
no code implementations • 27 May 2022 • Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han
We propose V-Doc, a question-answering tool using document images and PDF, mainly for researchers and general non-deep learning experts looking to generate, process, and understand the document visual question answering tasks.
2 code implementations • 3 Mar 2022 • Shuijing Liu, Peixin Chang, Zhe Huang, Neeloy Chakraborty, Kaiwen Hong, Weihang Liang, D. Livingston McPherson, Junyi Geng, Katherine Driggs-Campbell
We study the problem of safe and intention-aware robot navigation in dense and interactive crowds.
no code implementations • CVPR 2022 • Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han
We propose V-Doc, a question-answering tool using document images and PDF, mainly for researchers and general non-deep learning experts looking to generate, process, and understand the document visual question answering tasks.
1 code implementation • 30 Jul 2021 • Zhe Huang, Gary Long, Benjamin Wessler, Michael C. Hughes
Semi-supervised image classification has shown substantial progress in learning from limited labeled data, but recent advances remain largely untested for clinical applications.
1 code implementation • 15 Jul 2021 • Zhe Huang, Ruohua Li, Kazuki Shin, Katherine Driggs-Campbell
Multi-pedestrian trajectory prediction is an indispensable element of autonomous systems that safely interact with crowds in unstructured environments.
no code implementations • 19 Sep 2020 • Nan Wu, Zhe Huang, Yiqiu Shen, Jungkyu Park, Jason Phang, Taro Makino, S. Gene Kim, Kyunghyun Cho, Laura Heacock, Linda Moy, Krzysztof J. Geras
Breast cancer is the most common cancer in women, and hundreds of thousands of unnecessary biopsies are done around the world at a tremendous cost.
1 code implementation • 30 Jun 2020 • Zhe Huang, Aamir Hasan, Kazuki Shin, Ruohua Li, Katherine Driggs-Campbell
Trajectory prediction is one of the key capabilities for robots to safely navigate and interact with pedestrians.
no code implementations • 12 Oct 2019 • Peter Du, Zhe Huang, Tianqi Liu, Ke Xu, Qichao Gao, Hussein Sibai, Katherine Driggs-Campbell, Sayan Mitra
As autonomous systems begin to operate amongst humans, methods for safe interaction must be investigated.
Test
Robotics
Multiagent Systems
Signal Processing
no code implementations • 20 Sep 2019 • Zhe Huang, Weijiang Yu, Wayne Zhang, Litong Feng, Nong Xiao
Taking the residual result (the coarse de-rained result) between the rainy image sample (i. e. the input data) and the output of coarse stage (i. e. the learnt rain mask) as input, the fine stage continues to de-rain by removing the fine-grained rain streaks (e. g. light rain streaks and water mist) to get a rain-free and well-reconstructed output image via a unified contextual merging sub-network with dense blocks and a merging block.
2 code implementations • 20 Mar 2019 • Nan Wu, Jason Phang, Jungkyu Park, Yiqiu Shen, Zhe Huang, Masha Zorin, Stanisław Jastrzębski, Thibault Févry, Joe Katsnelson, Eric Kim, Stacey Wolfson, Ujas Parikh, Sushma Gaddam, Leng Leng Young Lin, Kara Ho, Joshua D. Weinstein, Beatriu Reig, Yiming Gao, Hildegard Toth, Kristine Pysarenko, Alana Lewin, Jiyon Lee, Krystal Airola, Eralda Mema, Stephanie Chung, Esther Hwang, Naziya Samreen, S. Gene Kim, Laura Heacock, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras
We present a deep convolutional neural network for breast cancer screening exam classification, trained and evaluated on over 200, 000 exams (over 1, 000, 000 images).