1 code implementation • ECCV 2020 • Jiwei Chen, Yubao Sun, Qingshan Liu, Rui Huang
The IDR module is designed to reconstruct the remaining details from the residual measurement vector, and MRU is employed to update the residual measurement vector and feed it into the next IDR module.
no code implementations • 23 Sep 2024 • Yan Xing, Qi'ao Xu, Jingcheng Zeng, Rui Huang, Sihua Gao, Weifeng Xu, Yuxiang Zhang, Wei Fan
Semi-supervised change detection (SSCD) utilizes partially labeled data and a large amount of unlabeled data to detect changes.
no code implementations • 6 Sep 2024 • Jiangxia Cao, Shen Wang, Gaode Chen, Rui Huang, Shuang Yang, Zhaojie Liu, Guorui Zhou
In addressing the persistent challenges of data-sparsity and cold-start issues in domain-expert recommender systems, Cross-Domain Recommendation (CDR) emerges as a promising methodology.
no code implementations • 22 Aug 2024 • Wuchao Li, Rui Huang, Haijun Zhao, Chi Liu, Kai Zheng, Qi Liu, Na Mou, Guorui Zhou, Defu Lian, Yang song, Wentian Bao, Enyun Yu, Wenwu Ou
Nevertheless, a straightforward combination of SR and DM leads to sub-optimal performance due to discrepancies in learning objectives (recommendation vs. noise reconstruction) and the respective learning spaces (non-stationary vs. stationary).
no code implementations • 21 Aug 2024 • Yuzhou Huang, Yiran Qin, Shunlin Lu, Xintao Wang, Rui Huang, Ying Shan, Ruimao Zhang
Traditional visual storytelling is complex, requiring specialized knowledge and substantial resources, yet often constrained by human creativity and creation precision.
1 code implementation • 13 Aug 2024 • Xianming Li, Julius Lipp, Aamir Shakir, Rui Huang, Jing Li
BM25, a widely-used lexical search algorithm, remains crucial in information retrieval despite the rise of pre-trained and large language models (PLMs/LLMs).
no code implementations • 13 Aug 2024 • Qi Song, Qingyong Hu, Chi Zhang, Yongquan Chen, Rui Huang
Additionally, the input-independent nature of initial queries also limits the learning capacity of Transformer-based models.
1 code implementation • 12 Aug 2024 • Long Xu, Shanghong Li, Yongquan Chen, Junkang Chen, Rui Huang, Feng Wu
To address this issue, we propose a click attention algorithm that expands the influence range of positive clicks based on the similarity between positively-clicked regions and the whole input.
no code implementations • 4 Jul 2024 • Laiyan Ding, Hualie Jiang, Jie Li, Yongquan Chen, Rui Huang
The first loss function is the dense depth consistency loss, which penalizes the difference between predicted depths in overlapping regions.
no code implementations • 8 May 2024 • Kai Zheng, Haijun Zhao, Rui Huang, Beichuan Zhang, Na Mou, Yanan Niu, Yang song, Hongning Wang, Kun Gai
To address this issue, we propose an improved ranking principle for multi-stage systems, namely the Generalized Probability Ranking Principle (GPRP), to emphasize both the selection bias in each stage of the system pipeline as well as the underlying interest of users.
1 code implementation • 7 May 2024 • Long Xu, Yongquan Chen, Rui Huang, Feng Wu, Shiwu Lai
Then the graph nodes will be aggregated to obtain structured interaction features.
no code implementations • 17 Apr 2024 • Chi Zhang, Qi Song, Feifei Li, Yongquan Chen, Rui Huang
Constructing vectorized high-definition maps from surround-view cameras has garnered significant attention in recent years.
no code implementations • 25 Mar 2024 • Laiyan Ding, Panwen Hu, Jie Li, Rui Huang
To address this RGB-TSDF distribution difference, we propose a two-stage network with a 3D RGB feature completion module that completes RGB features with meaningful values for occluded areas.
no code implementations • 29 Feb 2024 • Rui Huang, Sikun Yang, Heinz Koeppl
Modeling count-valued time series has been receiving increasing attention since count time series naturally arise in physical and social domains.
no code implementations • 22 Jan 2024 • Rui Huang, Qingyi Zhao, Yan Xing, Sihua Gao, Weifeng Xu, Yuxiang Zhang, Wei Fan
SEFF utilizes saliency maps of the neighboring scales to enhance the necessary features for fusing, resulting in more representative fused features.
1 code implementation • 4 Jan 2024 • Ruofei Wang, Renjie Wan, Zongyu Guo, Qing Guo, Rui Huang
Backdoor attack aims to deceive a victim model when facing backdoor instances while maintaining its performance on benign data.
no code implementations • 28 Dec 2023 • Rui Huang, Songyou Peng, Ayca Takmaz, Federico Tombari, Marc Pollefeys, Shiji Song, Gao Huang, Francis Engelmann
Therefore, we explore the use of image segmentation foundation models to automatically generate training labels for 3D segmentation.
1 code implementation • CVPR 2024 • Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan
Both quantitative and qualitative results on this evaluation dataset indicate that our SmartEdit surpasses previous methods, paving the way for the practical application of complex instruction-based image editing.
1 code implementation • 5 Dec 2023 • Rui Huang, Binbin Jiang, Qingyi Zhao, William Wang, Yuxiang Zhang, Qing Guo
Our approach surpasses state-of-the-art 2D change detection and NeRF-based methods by a significant margin.
1 code implementation • 13 Jun 2023 • Yin Fang, Xiaozhuan Liang, Ningyu Zhang, Kangwei Liu, Rui Huang, Zhuo Chen, Xiaohui Fan, Huajun Chen
Large Language Models (LLMs), with their remarkable task-handling capabilities and innovative outputs, have catalyzed significant advancements across a spectrum of fields.
Catalytic activity prediction Chemical-Disease Interaction Extraction +14
no code implementations • 29 Jan 2023 • Abhijit Mahabal, Jiyun Luo, Rui Huang, Michael Ellsworth, Rui Li
Creating a taxonomy of interests is expensive and human-effort intensive: not only do we need to identify nodes and interconnect them, in order to use the taxonomy, we must also connect the nodes to relevant entities such as users, pins, and queries.
no code implementations • 18 Jan 2023 • Rui Huang, Xuran Pan, Henry Zheng, Haojun Jiang, Zhifeng Xie, Shiji Song, Gao Huang
During the pre-training stage, we establish the correspondence of images and point clouds based on the readily available RGB-D data and use contrastive learning to align the image and point cloud representations.
1 code implementation • CVPR 2023 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
Specifically, cheap scene graph supervision data can be easily obtained by parsing image language descriptions into semantic graphs.
no code implementations • 27 Nov 2022 • Rui Huang, Ze Huang, Songzhi Su
We designed RepVGG-lite as the backbone network in our architecture, it is more discriminative than other general networks in the Place Recognition task.
1 code implementation • 21 Nov 2022 • Rui Huang, Ruofei Wang, Qing Guo, Jieda Wei, Yuxiang Zhang, Wei Fan, Yang Liu
Change detection (CD) is to decouple object changes (i. e., object missing or appearing) from background changes (i. e., environment variations) like light and season variations in two images captured in the same scene over a long time span, presenting critical applications in disaster management, urban development, etc.
1 code implementation • 17 Nov 2022 • Haojun Jiang, Jianke Zhang, Rui Huang, Chunjiang Ge, Zanlin Ni, Jiwen Lu, Jie zhou, Shiji Song, Gao Huang
However, as pre-trained models are scaling up, fully fine-tuning them on text-video retrieval datasets has a high risk of overfitting.
1 code implementation • 21 Oct 2022 • Rui Huang, Vincent W. S. Wong, Robert Schober
In the proposed system, RS facilitates the exploitation of the shared interests of the users in VR streaming, and IRS creates additional propagation channels to support the transmission of high-resolution 360-degree videos.
1 code implementation • 12 Oct 2022 • Chaofei Wang, Qisen Yang, Rui Huang, Shiji Song, Gao Huang
Knowledge distillation is an effective approach to learn compact models (students) with the supervision of large and strong models (teachers).
no code implementations • 26 Jul 2022 • Jiahui Zhang, Shitao Tang, Kejie Qiu, Rui Huang, Chuan Fang, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan
Visual relocalization has been a widely discussed problem in 3D vision: given a pre-constructed 3D visual map, the 6 DoF (Degrees-of-Freedom) pose of a query image is estimated.
1 code implementation • 19 Jul 2022 • Kangfu Mei, Vishal M. Patel, Rui Huang
The ultimate aim of image restoration like denoising is to find an exact correlation between the noisy and clear image domains.
1 code implementation • 15 Jul 2022 • Qing Guo, Ruofei Wang, Rui Huang, Shuifa Sun, Yuxiang Zhang
Change detection (CD) aims to detect change regions within an image pair captured at different times, playing a significant role in diverse real-world applications.
1 code implementation • CVPR 2022 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
Such design decomposes the process of HOI set prediction into two subsequent phases, i. e., an interaction proposal generation is first performed, and then followed by transforming the non-parametric interaction proposals into HOI predictions via a structure-aware Transformer.
Ranked #3 on Human-Object Interaction Detection on V-COCO
no code implementations • 23 May 2022 • Jincheng Huang, Ping Li, Rui Huang, Chen Na, Acong Zhang
Alternatively, it is possible to exploit the information about the presence of heterophilous neighbors for feature learning, so a hybrid message passing approach is devised to aggregate homophilious neighbors and diversify heterophilous neighbors based on edge classification.
no code implementations • 17 Mar 2022 • Jianwei Zhao, Qiang Zhai, Pengbo Zhao, Rui Huang, Hong Cheng
Geolocation is a fundamental component of route planning and navigation for unmanned vehicles, but GNSS-based geolocation fails under denial-of-service conditions.
1 code implementation • 14 Feb 2022 • Chunjiang Ge, Rui Huang, Mixue Xie, Zihang Lai, Shiji Song, Shuang Li, Gao Huang
Unsupervised domain adaption (UDA) aims to adapt models learned from a well-annotated source domain to a target domain, where only unlabeled samples are given.
1 code implementation • CVPR 2022 • Shutao Bai, Bingpeng Ma, Hong Chang, Rui Huang, Xilin Chen
To further improve SBM, an Integration-and-Distribution Module (IDM) is introduced to enhance frame-level representations.
1 code implementation • 8 Dec 2021 • Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang
Recent non-local self-attention methods have proven to be effective in capturing long-range dependencies for semantic segmentation.
no code implementations • NeurIPS 2021 • Zhiding Yu, Rui Huang, Wonmin Byeon, Sifei Liu, Guilin Liu, Thomas Breuel, Anima Anandkumar, Jan Kautz
It is therefore interesting to study how these two tasks can be coupled to benefit each other.
no code implementations • 27 Oct 2021 • Qi Song, Jie Li, Hao Guo, Rui Huang
Without any external training data, our proposed Denoised NL can achieve the state-of-the-art performance of 83. 5\% and 46. 69\% mIoU on Cityscapes and ADE20K, respectively.
1 code implementation • 12 Oct 2021 • Hualie Jiang, Laiyan Ding, Junjie Hu, Rui Huang
Unsupervised learning of depth from indoor monocular videos is challenging as the artificial environment contains many textureless regions.
1 code implementation • NeurIPS 2021 • Rui Huang, Andrew Geng, Yixuan Li
Detecting out-of-distribution (OOD) data has become a critical component in ensuring the safe deployment of machine learning models in the real world.
Ranked #12 on Out-of-Distribution Detection on ImageNet-1k vs SUN
1 code implementation • 18 Sep 2021 • Ran Gu, Jingyang Zhang, Rui Huang, Wenhui Lei, Guotai Wang, Shaoting Zhang
First, we present a domain composition method that represents one certain domain by a linear combination of a set of basis representations (i. e., a representation bank).
1 code implementation • 30 Aug 2021 • Hualie Jiang, Laiyan Ding, Zhenglong Sun, Rui Huang
We first propose an outlier masking technique that considers the occluded or dynamic pixels as statistical outliers in the photometric error map.
no code implementations • 29 Jun 2021 • Jie Li, Laiyan Ding, Rui Huang
3D semantic scene completion and 2D semantic segmentation are two tightly correlated tasks that are both essential for indoor scene understanding, because they predict the same semantic classes, using positively correlated high-level features.
no code implementations • 26 Jun 2021 • Jingxuan Li, Rui Huang, Wei Li, Kai Yao, Weiguo Tan
We integrate this ranking scheme with two frequency models and a GPT-2 styled language model, along with the acceptance model to yield 27. 80% and 37. 64% increase in TOP1 and TOP5 accuracy, respectively.
2 code implementations • NeurIPS 2021 • Yulin Wang, Rui Huang, Shiji Song, Zeyi Huang, Gao Huang
Inspired by this phenomenon, we propose a Dynamic Transformer to automatically configure a proper number of tokens for each input image.
Ranked #29 on Image Classification on CIFAR-100 (using extra training data)
4 code implementations • CVPR 2021 • Rui Huang, Yixuan Li
Detecting out-of-distribution (OOD) inputs is a central challenge for safely deploying machine learning models in the real world.
1 code implementation • CVPR 2021 • Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan
Detail Branch processes frames at original resolution to preserve the detailed visual clues, and Context Branch with a down-sampling strategy is employed to capture long-range contexts.
1 code implementation • 5 Apr 2021 • Yunhe Gao, Rui Huang, Yiwei Yang, Jie Zhang, Kainan Shao, Changjuan Tao, YuanYuan Chen, Dimitris N. Metaxas, Hongsheng Li, Ming Chen
Radiotherapy is a treatment where radiation is used to eliminate cancer cells.
1 code implementation • 2 Apr 2021 • Kangfu Mei, Shenglong Ye, Rui Huang
Deep Neural Network (DNN) based super-resolution algorithms have greatly improved the quality of the generated images.
1 code implementation • CVPR 2021 • Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu, Ping Tan
We present a new method for scene agnostic camera localization using dense scene matching (DSM), where a cost volume is constructed between a query image and a scene.
no code implementations • 27 Mar 2021 • Rui Huang, Chuan Fang, Kejie Qiu, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan
Secondly, we propose an AR mapping pipeline which takes the input from the scanning device and produces accurate AR Maps.
1 code implementation • 10 Mar 2021 • Qi Song, Kangfu Mei, Rui Huang
In this paper, we propose a new model, called Attention-Augmented Network (AttaNet), to capture both global context and multilevel semantics while keeping the efficiency high.
1 code implementation • 6 Feb 2021 • Hualie Jiang, Zhe Sheng, Siyu Zhu, Zilong Dong, Rui Huang
Besides, we also designed a more effective fusion module for our fusion scheme.
Ranked #1 on Depth Estimation on Matterport3D
1 code implementation • 3 Feb 2021 • Wenhui Lei, Haochen Mei, Zhengwentai Sun, Shan Ye, Ran Gu, Huan Wang, Rui Huang, Shichuan Zhang, Shaoting Zhang, Guotai Wang
Despite the stateof-the-art performance achieved by Convolutional Neural Networks (CNNs) for automatic segmentation of OARs, existing methods do not provide uncertainty estimation of the segmentation results for treatment planning, and their accuracy is still limited by several factors, including the low contrast of soft tissues in CT, highly imbalanced sizes of OARs and large inter-slice spacing.
no code implementations • IEEE Transactions on Wireless Communications 2021 • Rui Huang, Vincent W.S. Wong, Robert Schober
Grant-free multiple access (GFMA) is a promising paradigm to efficiently support uplink access of Internet of Things (IoT) devices.
1 code implementation • ICCV 2021 • Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan
Spotting objects that are visually adapted to their surroundings is challenging for both humans and AI.
2 code implementations • 7 Dec 2020 • Xu Yan, Jiantao Gao, Jie Li, Ruimao Zhang, Zhen Li, Rui Huang, Shuguang Cui
In practice, an initial semantic segmentation (SS) of a single sweep point cloud can be achieved by any appealing network and then flows into the semantic scene completion (SSC) module as the input.
Ranked #3 on 3D Semantic Scene Completion on SemanticKITTI
3D Semantic Scene Completion from a single RGB image 3D Semantic Segmentation +3
1 code implementation • NeurIPS 2020 • Yulin Wang, Kangchen Lv, Rui Huang, Shiji Song, Le Yang, Gao Huang
The accuracy of deep convolutional neural networks (CNNs) generally improves when fueled with high resolution images.
no code implementations • 28 Sep 2020 • Panwen Hu, Jiazhen Liu, Rui Huang
The attention mechanism has been proved to be helpful in solving the occlusion problem by a large number of existing methods.
3 code implementations • 22 Sep 2020 • Ran Gu, Guotai Wang, Tao Song, Rui Huang, Michael Aertsen, Jan Deprest, Sébastien Ourselin, Tom Vercauteren, Shaoting Zhang
Also, we propose a scale attention module implicitly emphasizing the most salient feature maps among multiple scales so that the CNN is adaptive to the size of an object.
no code implementations • 17 Aug 2020 • Rui Huang, Yuanjie Zheng, Zhiqiang Hu, Shaoting Zhang, Hongsheng Li
In most scenarios, one might obtain annotations of a single or a few organs from one training set, and obtain annotations of the the other organs from another set of training images.
no code implementations • 9 Aug 2020 • Lanston Hau Man Chu, Tejas Bhojraj, Rui Huang
This paper aims to solve machine learning optimization problem by using quantum circuit.
no code implementations • ECCV 2020 • Rui Huang, Wanyue Zhang, Abhijit Kundu, Caroline Pantofaru, David A. Ross, Thomas Funkhouser, Alireza Fathi
We use a U-Net style 3D sparse convolution network to extract features for each frame's LiDAR point-cloud.
no code implementations • 24 Jun 2020 • Kangfu Mei, Yao Lu, Qiaosi Yi, Hao-Yu Wu, Juncheng Li, Rui Huang
Perceptual learning approaches like perceptual loss are empirically powerful for such tasks but they usually rely on the pre-trained classification network to provide features, which are not necessarily optimal in terms of visual perception of image transformation.
1 code implementation • 3 Mar 2020 • Hualie Jiang, Laiyan Ding, Zhenglong Sun, Rui Huang
Unsupervised learning of depth and ego-motion from unlabelled monocular videos has recently drawn great attention, which avoids the use of expensive ground truth in the supervised one.
Ranked #58 on Monocular Depth Estimation on KITTI Eigen split
1 code implementation • 19 Nov 2019 • Kangfu Mei, Juncheng Li, Jiajie Zhang, Hao-Yu Wu, Jie Li, Rui Huang
However, plenty of studies have shown that global information is crucial for image restoration tasks like image demosaicing and enhancing.
1 code implementation • IJCNLP 2019 • Zuyi Bao, Rui Huang, Chen Li, Kenny Q. Zhu
Previous work on cross-lingual sequence labeling tasks either requires parallel data or bridges the two languages through word-byword matching.
no code implementations • 28 Jul 2019 • Yunhe Gao, Rui Huang, Ming Chen, Zhe Wang, Jincheng Deng, YuanYuan Chen, Yiwei Yang, Jie Zhang, Chanjuan Tao, Hongsheng Li
In this paper, we propose an end-to-end deep neural network for solving the problem of imbalanced large and small organ segmentation in head and neck (HaN) CT images.
no code implementations • 9 Mar 2019 • Panwen Hu, Zizheng Yan, Rui Huang, Feng Yin
Visual tracking is fragile in some difficult scenarios, for instance, appearance ambiguity and variation, occlusion can easily degrade most of visual trackers to some extent.
no code implementations • 27 Mar 2018 • Ervin Teng, Rui Huang, Bob Iannucci
Modern deep convolutional neural networks (CNNs) for image classification and object detection are often trained offline on large static datasets.
no code implementations • 9 Feb 2018 • Jun Xiang, Guoshuai Zhang, Jianhua Hou, Nong Sang, Rui Huang
Designing a robust affinity model is the key issue in multiple target tracking (MTT).
no code implementations • ICCV 2017 • Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang
How to effectively learn temporal variation of target appearance, to exclude the interference of cluttered background, while maintaining real-time response, is an essential problem of visual object tracking.
Ranked #5 on Visual Object Tracking on OTB-2013
no code implementations • 2 May 2017 • Rui Huang, Danping Zou, Richard Vaughan, Ping Tan
Image-based modeling techniques can now generate photo-realistic 3D models from images.
3 code implementations • ICCV 2017 • Rui Huang, Shu Zhang, Tianyu Li, Ran He
This paper proposes a Two-Pathway Generative Adversarial Network (TP-GAN) for photorealistic frontal view synthesis by simultaneously perceiving global structures and local details.
no code implementations • ICCV 2015 • Kan Liu, Bingpeng Ma, Wei zhang, Rui Huang
Pedestrian re-identification is a difficult problem due to the large variations in a person's appearance caused by different poses and viewpoints, illumination changes, and occlusions.
1 code implementation • 3 Feb 2015 • Xiaodan Liang, Qingxing Cao, Rui Huang, Liang Lin
The aim of this study is to provide an automatic computational framework to assist clinicians in diagnosing Focal Liver Lesions (FLLs) in Contrast-Enhancement Ultrasound (CEUS).
no code implementations • 2 Feb 2015 • Zhujin Liang, Xiaolong Wang, Rui Huang, Liang Lin
This paper aims at one newly raising task in vision and multimedia research: recognizing human actions from still images.
no code implementations • 24 Feb 2014 • Changxin Gao, Feifei Chen, Jin-Gang Yu, Rui Huang, Nong Sang
However, the task in tracking is to search for a specific object, rather than an object category as in detection.