Search Results for author: Haokui Zhang

Found 22 papers, 15 papers with code

ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer

3 code implementations 8 Mar 2022 Haokui Zhang, Wenze Hu, Xiaoyu Wang

Experiment results show that the proposed ParC-Net achieves better performance than popular light-weight ConvNets and vision transformer based models in common vision tasks and datasets, while having fewer parameters and faster inference speed.

Image Classification object-detection +3

Grafting Transformer on Automatically Designed Convolutional Neural Network for Hyperspectral Image Classification

1 code implementation 21 Oct 2021 Xizhe Xue, Haokui Zhang, Bei Fang, Zongwen Bai, Ying Li

Compared with search spaces proposed in previous works, the proposed hybrid search space is more aligned with the characteristic of HSI data, that is, HSIs have a relatively low spatial resolution and an extremely high spectral resolution.

Classification Hyperspectral Image Classification +1

Fcaformer: Forward Cross Attention in Hybrid Vision Transformer

2 code implementations ICCV 2023 Haokui Zhang, Wenze Hu, Xiaoyu Wang

Currently, one main research line in designing a more efficient vision transformer is reducing the computational cost of self attention modules by adopting sparse attention or using local attention windows.

Image Classification Knowledge Distillation

Bridging Sensor Gaps via Single-Direction Tuning for Hyperspectral Image Classification

1 code implementation 22 Sep 2023 Xizhe Xue, Haokui Zhang, Ying Li, Liuwei Wan, Zongwen Bai, Mike Zheng Shou

In this paper, to solve this problem, we propose the single-direction tuning (SDT) strategy, which serves as a bridge, allowing us to leverage existing labeled HSI datasets, and even RGB datasets, to enhance performance on new HSI datasets with limited samples.

Hyperspectral Image Classification Representation Learning

Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning

2 code implementations 7 Dec 2020 Haokui Zhang, Ying Li, Yenan Jiang, Peng Wang, Qiang Shen, Chunhua Shen

In contrast to previous approaches, we impose no restrictions on the source data sets: they do not have to be collected by the same sensors as the target data sets.

Classification General Classification +1

Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration

1 code implementation 24 Dec 2020 Haokui Zhang, Ying Li, Hao Chen, Chengrong Gong, Zongwen Bai, Chunhua Shen

For the inner search space, we propose a layer-wise architecture sharing strategy (LWAS), resulting in more flexible architectures and better performance.

Image Denoising Image Restoration +2

3D-ANAS: 3D Asymmetric Neural Architecture Search for Fast Hyperspectral Image Classification

1 code implementation 12 Jan 2021 Haokui Zhang, Chengrong Gong, Yunpeng Bai, Zongwen Bai, Ying Li

Correspondingly, different models need to be designed for different datasets, which further increases the workload of designing architectures; 2) the mainstream framework is a patch-to-pixel framework.

Classification General Classification +3

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

1 code implementation CVPR 2023 Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

In this paper, we propose a neural architecture representation model that can be used to estimate these attributes holistically.

Representation Learning

ParCNetV2: Oversized Kernel with Enhanced Attention

1 code implementation ICCV 2023 Ruihan Xu, Haokui Zhang, Wenze Hu, Shiliang Zhang, Xiaoyu Wang

Specifically, we propose a new convolutional neural network, ParCNetV2, that extends position-aware circular convolution (ParCNet) with oversized convolutions and bifurcate gate units to enhance attention.

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks

no code implementations 24 Nov 2018 Haokui Zhang, Ying Li, Peng Wang, Yu Liu, Chunhua Shen

Unlike RGB videos, the depth data in RGB-D videos provide key complementary information to tristimulus visual data, which could potentially improve action recognition accuracy.

Action Recognition Temporal Action Localization

Gradient Information Guided Deraining with A Novel Network and Adversarial Training

no code implementations 9 Oct 2019 Yinglong Wang, Haokui Zhang, Yu Liu, Qinfeng Shi, Bing Zeng

However, existing methods usually generalize poorly: almost all of them perform satisfactorily on a specific type of rain streak but may perform relatively poorly on other types.

Rain Removal

Hyperspectral Image Classification with Spatial Consistence Using Fully Convolutional Spatial Propagation Network

no code implementations 4 Aug 2020 Yenan Jiang, Ying Li, Shanrong Zou, Haokui Zhang, Yunpeng Bai

However, the existing CNN-based models operate at the patch level, in which each pixel is classified separately using a patch of the image around it.

Classification General Classification +1

Pseudo-LiDAR Based Road Detection

no code implementations 28 Jul 2021 Libo Sun, Haokui Zhang, Wei Yin

Specifically, we exploit pseudo-LiDAR using depth estimation, and propose a feature fusion network where RGB and learned depth information are fused for improved road detection.

Depth Estimation Self-Driving Cars

Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search

no code implementations 30 Jul 2021 Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang

Specifically, based on transformer, we propose a new network structure to compress the feature into a low dimensional space, and an inhomogeneous neighborhood relationship preserving (INRP) loss that aims to maintain high search accuracy.

Feature Compression Information Retrieval +2

Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning

1 code implementation 1 Jun 2023 Shengqin Jiang, Yaoyu Fang, Haokui Zhang, Qingshan Liu, Yuankai Qi, Yang Yang, Peng Wang

Rehearsal-based video incremental learning often employs knowledge distillation to mitigate catastrophic forgetting of previously learned data.

Incremental Learning Knowledge Distillation +1
