Search Results for author: Li Zhu

Found 28 papers, 8 papers with code

Noise Learning for Text Classification: A Benchmark

no code implementations • COLING 2022 • Bo Liu, Wandi Xu, Yuejia Xiang, XiaoJun Wu, Lejian He, BoWen Zhang, Li Zhu

However, we find that noise learning in text classification is relatively underdeveloped: 1. many methods that have been proven effective in the image domain are not explored in text classification, 2. it is difficult to conduct a fair comparison between previous studies as they do experiments in different noise settings.

text-classification Text Classification

Paper
Add Code

Multi-view Distillation based on Multi-modal Fusion for Few-shot Action Recognition(CLIP-$\mathrm{M^2}$DF)

no code implementations • 16 Jan 2024 • Fei Guo, Yikang Wang, Han Qi, Wenping Jin, Li Zhu

In each view, we fuse the prompt embedding as consistent information with visual and the global or local temporal context to overcome the overlapping distribution of classes and outliers.

Few-Shot action recognition Few Shot Action Recognition +1

Paper
Add Code

Forced Exploration in Bandit Problems

1 code implementation • 12 Dec 2023 • Han Qi, Fei Guo, Li Zhu

This paper aims to design a multi-armed bandit algorithm that can be implemented without using information about the reward distribution while still achieving substantial regret upper bounds.

Paper
Code

Consistency Prototype Module and Motion Compensation for Few-Shot Action Recognition (CLIP-CP$\mathbf{M^2}$C)

no code implementations • 2 Dec 2023 • Fei Guo, Li Zhu, Yikang Wang, Han Qi

Although some multi-modal works use labels as supplementary to construct prototypes of support videos, they can not use this information for query videos.

Domain Adaptation Few-Shot action recognition +2

Paper
Add Code

CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation

1 code implementation • 14 Nov 2023 • Weixiang Yan, Haitian Liu, Yunkun Wang, Yunzhe Li, Qian Chen, Wen Wang, Tingyu Lin, Weishan Zhao, Li Zhu, Shuiguang Deng, Hari Sundaram

To bridge these gaps between existing benchmarks and expectations from practical applications, we introduce CodeScope, an execution-based, multilingual, multi-task, multi-dimensional evaluation benchmark for comprehensively gauging LLM capabilities on coding tasks.

Code Generation

Paper
Code

Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition

1 code implementation • 5 Jul 2023 • Fei Guo, Li Zhu, YiWang Wang, Jing Sun

The second module (MLT) focuses on the Multiple-level feature of the support prototype and query sample to mine more information for the alignment, which operates on different level features.

Few-Shot action recognition Few Shot Action Recognition +1

Paper
Code

Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark

1 code implementation • 5 Jun 2023 • Shuyu Yang, Yinan Zhou, Yaxiong Wang, Yujiao Wu, Li Zhu, Zhedong Zheng

To verify the feasibility of learning from the generated data, we develop a new joint Attribute Prompt Learning and Text Matching Learning (APTM) framework, considering the shared knowledge between attribute and text.

Ranked #1 on Text based Person Retrieval on CUHK-PEDES

Attribute Image-text matching +7

114

Paper
Code

Discounted Thompson Sampling for Non-Stationary Bandit Problems

no code implementations • 18 May 2023 • Han Qi, Yue Wang, Li Zhu

Under mild assumptions, we show that DS-TS with Gaussian priors can achieve nearly optimal regret bound on the order of $\tilde{O}(\sqrt{TB_T})$ for abruptly changing and $\tilde{O}(T^{\beta})$ for smoothly changing, where $T$ is the number of time steps, $B_T$ is the number of breakpoints, $\beta$ is associated with the smoothly changing environment and $\tilde{O}$ hides the parameters independent of $T$ as well as logarithmic terms.

Thompson Sampling

Paper
Add Code

Two-stage MR Image Segmentation Method for Brain Tumors based on Attention Mechanism

no code implementations • 17 Apr 2023 • Li Zhu, Jiawei Jiang, Lin Lu, Jin Li

In response to this problem, we introduce the Coordinate Attention (CA) module to replace the Res Block to reduce the number of parameters, and cooperate with the spatial information extraction network above to strengthen the information extraction ability.

Brain Tumor Segmentation Generative Adversarial Network +3

Paper
Add Code

FSDNet-An efficient fire detection network for complex scenarios based on YOLOv3 and DenseNet

no code implementations • 15 Apr 2023 • Li Zhu, Jiahui Xiong, Wenxian Wu, Hongyu Yu

Fire is one of the common disasters in daily life.

Data Augmentation Fire Detection

Paper
Add Code

YOLO-Drone:Airborne real-time detection of dense small objects from high-altitude perspective

no code implementations • 14 Apr 2023 • Li Zhu, Jiahui Xiong, Feng Xiong, Hanzheng Hu, Zhengnan Jiang

With regards to UAVDT, the YOLO-Drone exhibits both high real-time inference speed of 53 FPS and a maximum mAP of 34. 04%.

Ranked #1 on Object Detection on India Driving Dataset

Object object-detection +1

Paper
Add Code

Landslide Susceptibility Prediction Modeling Based on Self-Screening Deep Learning Model

no code implementations • 12 Apr 2023 • Li Zhu, Lekai Liu, Changshi Yu

In conclusion, compared with some existing traditional machine learning, the SGCN-LSTM model proposed in this paper has higher landslide prediction accuracy and better robustness, and has a good application prospect in the LSP field.

Paper
Add Code

ISSTAD: Incremental Self-Supervised Learning Based on Transformer for Anomaly Detection and Localization

1 code implementation • 30 Mar 2023 • Wenping Jin, Fei Guo, Li Zhu

In the subsequent stage, we apply pixel-level data augmentation techniques to generate corrupted normal images and their corresponding pixel labels.

Anomaly Detection Data Augmentation +3

Paper
Code

Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond

1 code implementation • CVPR 2023 • Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang, Xiaoming Wei, Xiaolin Wei

In this paper, we introduce a novel Generative Adversarial Networks alike framework, referred to as GAN-MAE, where a generator is used to generate the masked patches according to the remaining visible patches, and a discriminator is employed to predict whether the patch is synthesized by the generator.

Representation Learning

Paper
Code

Uncertainty-Aware Image Captioning

no code implementations • 30 Nov 2022 • Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang, Xiaoming Wei, Xiaolin Wei

It is well believed that the higher uncertainty in a word of the caption, the more inter-correlated context information is required to determine it.

Caption Generation Image Captioning +1

Paper
Add Code

Progressive Text-to-Image Generation

no code implementations • 5 Oct 2022 • Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang

Recently, Vector Quantized AutoRegressive (VQ-AR) models have shown remarkable results in text-to-image synthesis by equally predicting discrete image tokens from the top left to bottom right in the latent space.

Denoising Text-to-Image Generation

Paper
Add Code

Progressive Glass Segmentation

no code implementations • 6 Sep 2022 • Letian Yu, Haiyang Mei, Wen Dong, Ziqi Wei, Li Zhu, Yuxin Wang, Xin Yang

First, we attempt to bridge the characteristic gap between different levels of features by developing a Discriminability Enhancement (DE) module which enables level-specific features to be a more discriminative representation, alleviating the features incompatibility for fusion.

Segmentation

Paper
Add Code

Detecting Small Objects in Thermal Images Using Single-Shot Detector

no code implementations • 25 Aug 2021 • Hao Zhang, Xianggong Hong, Li Zhu

In this paper, we proposed DDSSD (Dilation and Deconvolution Single Shot Multibox Detector), an enhanced SSD with a novel feature fusion module which can improve the performance over SSD for small object detection.

Object object-detection +1

Paper
Add Code

Generating Superpixels for High-resolution Images with Decoupled Patch Calibration

no code implementations • 19 Aug 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

Superpixel segmentation has recently seen important progress benefiting from the advances in differentiable deep learning.

Segmentation Superpixels +1

Paper
Add Code

ReGO: Reference-Guided Outpainting for Scenery Image

1 code implementation • 20 Jun 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

We aim to tackle the challenging yet practical scenery image outpainting task in this work.

Image Outpainting

Paper
Code

1st Place Solution to ICDAR 2021 RRC-ICTEXT End-to-end Text Spotting and Aesthetic Assessment on Integrated Circuit

no code implementations • 8 Apr 2021 • Qiyao Wang, Pengfei Li, Li Zhu, Yi Niu

For the text spotting task, we detect the characters on integrated circuit and classify them based on yolov5 detection model.

Text Spotting

Paper
Add Code

AINet: Association Implantation for Superpixel Segmentation

no code implementations • ICCV 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

However, simply applying a series of convolution operations with limited receptive fields can only implicitly perceive the relations between the pixel and its surrounding grids.

Segmentation

Paper
Add Code

DONet: Dual Objective Networks for Skin Lesion Segmentation

no code implementations • 19 Aug 2020 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

Skin lesion segmentation is a crucial step in the computer-aided diagnosis of dermoscopic images.

Lesion Segmentation Segmentation +2

Paper
Add Code

Sketch-Guided Scenery Image Outpainting

no code implementations • 17 Jun 2020 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

In this work, we take the image outpainting one step forward by allowing users to harvest personal custom outpainting results using sketches as the guidance.

Image Outpainting

Paper
Add Code

IoU-uniform R-CNN: Breaking Through the Limitations of RPN

1 code implementation • 11 Dec 2019 • Li Zhu, Zihao Xie, Liman Liu, Bo Tao, Wenbing Tao

Region Proposal Network (RPN) is the cornerstone of two-stage object detectors, it generates a sparse set of object proposals and alleviates the extrem foregroundbackground class imbalance problem during training.

Object object-detection +2

Paper
Code

Localization-aware Channel Pruning for Object Detection

no code implementations • 6 Nov 2019 • Zihao Xie, Wenbing Tao, Li Zhu, Lin Zhao

In this paper, based on discrimination-aware channel pruning (DCP) which is state-of-the-art pruning method for classification, we propose a localization-aware auxiliary network to find out the channels with key information for classification and regression so that we can conduct channel pruning directly for object detection, which saves lots of time and computing resources.

Classification General Classification +5

Paper
Add Code

VrR-VG: Refocusing Visually-Relevant Relationships

no code implementations • ICCV 2019 • Yuanzhi Liang, Yalong Bai, Wei zhang, Xueming Qian, Li Zhu, Tao Mei

Relationships encode the interactions among individual instances, and play a critical role in deep visual scene understanding.

Image Captioning Question Answering +3

Paper
Add Code

Multi-view Point Cloud Registration with Adaptive Convergence Threshold and its Application on 3D Model Retrieval

no code implementations • 25 Nov 2018 • Yaochen Li, Ying Liu, Rui Sun, Rui Guo, Li Zhu, Yong Qi

In this paper, we propose a framework to reconstruct the 3D models by the multi-view point cloud registration algorithm with adaptive convergence threshold, and subsequently apply it to 3D model retrieval.

Point Cloud Registration Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.