no code implementations • COLING 2022 • Bo Liu, Wandi Xu, Yuejia Xiang, XiaoJun Wu, Lejian He, BoWen Zhang, Li Zhu
However, we find that noise learning in text classification is relatively underdeveloped: 1. many methods that have been proven effective in the image domain are not explored in text classification, 2. it is difficult to conduct a fair comparison between previous studies as they do experiments in different noise settings.
1 code implementation • 13 Jan 2025 • Wenping Jin, Li Zhu, Jing Sun
Weakly supervised violence detection refers to the technique of training models to identify violent segments in videos using only video-level labels.
Anomaly Detection In Surveillance Videos Multiple Instance Learning +1
no code implementations • 27 Dec 2024 • Jingchun Lian, Lingyu Liu, Yaxiong Wang, Yujiao Wu, Li Zhu, Zhedong Zheng
Leveraging the MMTT dataset, we develop ForgeryTalker, an architecture designed for concurrent forgery localization and interpretation.
no code implementations • 26 Nov 2024 • Shuyu Yang, Yaxiong Wang, Li Zhu, Zhedong Zheng
To enable the training and evaluation of this new task, we construct a large-scale image-text Pedestrian Anomaly Behavior (PAB) benchmark, featuring a broad spectrum of actions, e. g., running, performing, playing soccer, and the corresponding anomalies, e. g., lying, being hit, and falling of the same identity.
no code implementations • 26 Nov 2024 • Yinan Zhou, Yuxin Chen, Haokun Lin, Shuyu Yang, Li Zhu, Zhongang Qi, Chen Ma, Ying Shan
In recent years, Multimodal Large Language Models (MLLMs) have increasingly emphasized grounding and referring capabilities to achieve detailed understanding and flexible user interaction.
1 code implementation • 26 Oct 2024 • Yimin Deng, Yuxia Wu, Guoshuai Zhao, Li Zhu, Xueming Qian
To enable better knowledge transfer, we design a prototype learning method integrating the supervised and pseudo signals from IND and OOD samples.
no code implementations • 8 Jul 2024 • Fei Guo, Yikang Wang, Han Qi, Li Zhu, Jing Sun
In the first branch, a Domain Temporal Encoder is employed to capture temporal features for both the source and target domains.
1 code implementation • 19 Jun 2024 • Weixiang Yan, Haitian Liu, Tengxiao Wu, Qian Chen, Wen Wang, Haoyuan Chai, Jiayi Wang, Weishan Zhao, Yixin Zhang, Renjun Zhang, Li Zhu, Xuandong Zhao
Existing clinical diagnostic evaluation benchmarks for evaluating medical agents powered by LLMs have severe limitations.
1 code implementation • 18 May 2024 • Han Qi, Guo Fei, Li Zhu
When the means of arms are independently generated from some distribution, we provide regret upper bounds for both algorithms and discuss the sub-linearity of bounds in relation to the distribution of means.
no code implementations • 25 Apr 2024 • Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, HaoNing Wu, Yixuan Gao, Yuqin Cao, ZiCheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Fengbin Guan, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao
A total of 196 participants have registered in the video track.
no code implementations • 16 Jan 2024 • Fei Guo, Yikang Wang, Han Qi, Wenping Jin, Li Zhu
In each view, we fuse the prompt embedding as consistent information with visual and the global or local temporal context to overcome the overlapping distribution of classes and outliers.
1 code implementation • 12 Dec 2023 • Han Qi, Fei Guo, Li Zhu
This paper aims to design a multi-armed bandit algorithm that can be implemented without using information about the reward distribution while still achieving substantial regret upper bounds.
no code implementations • 2 Dec 2023 • Fei Guo, Li Zhu, Yikang Wang, Han Qi
Although some multi-modal works use labels as supplementary to construct prototypes of support videos, they can not use this information for query videos.
1 code implementation • 14 Nov 2023 • Weixiang Yan, Haitian Liu, Yunkun Wang, Yunzhe Li, Qian Chen, Wen Wang, Tingyu Lin, Weishan Zhao, Li Zhu, Hari Sundaram, Shuiguang Deng
Finally, we systematically evaluate and analyze eight mainstream LLMs and demonstrate the superior breadth and challenges of CodeScope for evaluating LLMs on code understanding and generation tasks compared to other benchmarks.
1 code implementation • 5 Jul 2023 • Fei Guo, Li Zhu, YiWang Wang, Jing Sun
The second module (MLT) focuses on the Multiple-level feature of the support prototype and query sample to mine more information for the alignment, which operates on different level features.
1 code implementation • 5 Jun 2023 • Shuyu Yang, Yinan Zhou, Yaxiong Wang, Yujiao Wu, Li Zhu, Zhedong Zheng
To verify the feasibility of learning from the generated data, we develop a new joint Attribute Prompt Learning and Text Matching Learning (APTM) framework, considering the shared knowledge between attribute and text.
Ranked #1 on Text based Person Retrieval on ICFG-PEDES (using extra training data)
no code implementations • 18 May 2023 • Han Qi, Yue Wang, Li Zhu
Under mild assumptions, we show that DS-TS with Gaussian priors can achieve nearly optimal regret bound on the order of $\tilde{O}(\sqrt{TB_T})$ for abruptly changing and $\tilde{O}(T^{\beta})$ for smoothly changing, where $T$ is the number of time steps, $B_T$ is the number of breakpoints, $\beta$ is associated with the smoothly changing environment and $\tilde{O}$ hides the parameters independent of $T$ as well as logarithmic terms.
no code implementations • 17 Apr 2023 • Li Zhu, Jiawei Jiang, Lin Lu, Jin Li
In response to this problem, we introduce the Coordinate Attention (CA) module to replace the Res Block to reduce the number of parameters, and cooperate with the spatial information extraction network above to strengthen the information extraction ability.
no code implementations • 15 Apr 2023 • Li Zhu, Jiahui Xiong, Wenxian Wu, Hongyu Yu
Fire is one of the common disasters in daily life.
no code implementations • 14 Apr 2023 • Li Zhu, Jiahui Xiong, Feng Xiong, Hanzheng Hu, Zhengnan Jiang
With regards to UAVDT, the YOLO-Drone exhibits both high real-time inference speed of 53 FPS and a maximum mAP of 34. 04%.
Ranked #1 on Object Detection on India Driving Dataset
no code implementations • 12 Apr 2023 • Li Zhu, Lekai Liu, Changshi Yu
In conclusion, compared with some existing traditional machine learning, the SGCN-LSTM model proposed in this paper has higher landslide prediction accuracy and better robustness, and has a good application prospect in the LSP field.
1 code implementation • 30 Mar 2023 • Wenping Jin, Fei Guo, Li Zhu
In the subsequent stage, we apply pixel-level data augmentation techniques to generate corrupted normal images and their corresponding pixel labels.
Ranked #72 on Anomaly Detection on MVTec AD (using extra training data)
1 code implementation • CVPR 2023 • Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang, Xiaoming Wei, Xiaolin Wei
In this paper, we introduce a novel Generative Adversarial Networks alike framework, referred to as GAN-MAE, where a generator is used to generate the masked patches according to the remaining visible patches, and a discriminator is employed to predict whether the patch is synthesized by the generator.
no code implementations • 30 Nov 2022 • Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang, Xiaoming Wei, Xiaolin Wei
It is well believed that the higher uncertainty in a word of the caption, the more inter-correlated context information is required to determine it.
no code implementations • 5 Oct 2022 • Zhengcong Fei, Mingyuan Fan, Li Zhu, Junshi Huang
Recently, Vector Quantized AutoRegressive (VQ-AR) models have shown remarkable results in text-to-image synthesis by equally predicting discrete image tokens from the top left to bottom right in the latent space.
no code implementations • 6 Sep 2022 • Letian Yu, Haiyang Mei, Wen Dong, Ziqi Wei, Li Zhu, Yuxin Wang, Xin Yang
First, we attempt to bridge the characteristic gap between different levels of features by developing a Discriminability Enhancement (DE) module which enables level-specific features to be a more discriminative representation, alleviating the features incompatibility for fusion.
no code implementations • 25 Aug 2021 • Hao Zhang, Xianggong Hong, Li Zhu
In this paper, we proposed DDSSD (Dilation and Deconvolution Single Shot Multibox Detector), an enhanced SSD with a novel feature fusion module which can improve the performance over SSD for small object detection.
no code implementations • 19 Aug 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang
Superpixel segmentation has recently seen important progress benefiting from the advances in differentiable deep learning.
1 code implementation • 20 Jun 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang
We aim to tackle the challenging yet practical scenery image outpainting task in this work.
no code implementations • 8 Apr 2021 • Qiyao Wang, Pengfei Li, Li Zhu, Yi Niu
For the text spotting task, we detect the characters on integrated circuit and classify them based on yolov5 detection model.
no code implementations • ICCV 2021 • Yaxiong Wang, Yunchao Wei, Yujiao Wu, Xueming Qian, Li Zhu, Yi Yang
Superpixel segmentation has seen significant progress benefiting from the deep convolutional networks.
no code implementations • 19 Aug 2020 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang
Skin lesion segmentation is a crucial step in the computer-aided diagnosis of dermoscopic images.
no code implementations • 17 Jun 2020 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang
In this work, we take the image outpainting one step forward by allowing users to harvest personal custom outpainting results using sketches as the guidance.
1 code implementation • 11 Dec 2019 • Li Zhu, Zihao Xie, Liman Liu, Bo Tao, Wenbing Tao
Region Proposal Network (RPN) is the cornerstone of two-stage object detectors, it generates a sparse set of object proposals and alleviates the extrem foregroundbackground class imbalance problem during training.
no code implementations • 6 Nov 2019 • Zihao Xie, Wenbing Tao, Li Zhu, Lin Zhao
In this paper, based on discrimination-aware channel pruning (DCP) which is state-of-the-art pruning method for classification, we propose a localization-aware auxiliary network to find out the channels with key information for classification and regression so that we can conduct channel pruning directly for object detection, which saves lots of time and computing resources.
no code implementations • ICCV 2019 • Yuanzhi Liang, Yalong Bai, Wei zhang, Xueming Qian, Li Zhu, Tao Mei
Relationships encode the interactions among individual instances, and play a critical role in deep visual scene understanding.
no code implementations • 25 Nov 2018 • Yaochen Li, Ying Liu, Rui Sun, Rui Guo, Li Zhu, Yong Qi
In this paper, we propose a framework to reconstruct the 3D models by the multi-view point cloud registration algorithm with adaptive convergence threshold, and subsequently apply it to 3D model retrieval.