Search Results for author: Yanye Lu

Found 12 papers, 9 papers with code

Beyond Text: Frozen Large Language Models in Visual Signal Comprehension

1 code implementation12 Mar 2024 Lei Zhu, Fangyun Wei, Yanye Lu

To achieve this, we present the Vision-to-Language Tokenizer, abbreviated as V2T Tokenizer, which transforms an image into a ``foreign language'' with the combined aid of an encoder-decoder, the LLM vocabulary, and a CLIP model.

Deblurring Image Captioning +5

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label

1 code implementation27 Feb 2024 Xinliang Zhang, Lei Zhu, Hangzhou He, Lujia Jin, Yanye Lu

In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.

Segmentation Weakly supervised Semantic Segmentation +1

Multi-level Asymmetric Contrastive Learning for Medical Image Segmentation Pre-training

no code implementations21 Sep 2023 Shuang Zeng, Lei Zhu, Xinliang Zhang, Zifeng Tian, Qian Chen, Lujia Jin, Jiayi Wang, Yanye Lu

In this work, we propose a novel asymmetric contrastive learning framework named JCL for medical image segmentation with self-supervised pre-training.

Contrastive Learning Image Segmentation +3

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

no code implementations9 Aug 2023 Lei Zhu, Hangzhou He, Xinliang Zhang, Qian Chen, Shuang Zeng, Qiushi Ren, Yanye Lu

Existing methods adopt an online-trained classification branch to provide pseudo annotations for supervising the segmentation branch.

Classification Segmentation +3

One-Pot Multi-Frame Denoising

no code implementations18 Feb 2023 Lujia Jin, Shi Zhao, Lei Zhu, Qian Chen, Yanye Lu

Therefore, it is necessary to avoid the restriction of clean labels and make full use of noisy data for model training.

Denoising

Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization

1 code implementation16 Jul 2022 Lei Zhu, Qian Chen, Lujia Jin, Yunfei You, Yanye Lu

Classification activation map (CAM), utilizing the classification structure to generate pixel-wise localization maps, is a crucial mechanism for weakly supervised object localization (WSOL).

Object Weakly-Supervised Object Localization

Weakly Supervised Object Localization as Domain Adaption

1 code implementation CVPR 2022 Lei Zhu, Qi She, Qian Chen, Yunfei You, Boyu Wang, Yanye Lu

To avoid this problem, this work provides a novel perspective that models WSOL as a domain adaption (DA) task, where the score estimator trained on the source/image domain is tested on the target/pixel domain to locate objects.

Classification Domain Adaptation +2

Content-Noise Complementary Learning for Medical Image Denoising

2 code implementations IEEE Transactions on Medical Imaging 2022 Mufeng Geng, Xiangxi Meng, Jiangyuan Yu, Lei Zhu, Lujia Jin, Zhe Jiang, Bin Qiu, Hui Li, Hanjing Kong, Jianmin Yuan, Kun Yang, Hongming Shan, Hongbin Han, Zhi Yang, Qiushi Ren, Yanye Lu

In this study, we propose a simple yet effective strategy, the content-noise complementary learning (CNCL) strategy, in which two deep learning predictors are used to learn the respective content and noise of the image dataset complementarily.

Generative Adversarial Network Image Denoising +1

Background-aware Classification Activation Map for Weakly Supervised Object Localization

1 code implementation29 Dec 2021 Lei Zhu, Qi She, Qian Chen, Xiangxi Meng, Mufeng Geng, Lujia Jin, Zhe Jiang, Bin Qiu, Yunfei You, Yibao Zhang, Qiushi Ren, Yanye Lu

In our B-CAM, two image-level features, aggregated by pixel-level features of potential background and object locations, are used to purify the object feature from the object-related background and to represent the feature of the pure-background sample, respectively.

Classification Object +1

Unifying Nonlocal Blocks for Neural Networks

1 code implementation ICCV 2021 Lei Zhu, Qi She, Duo Li, Yanye Lu, Xuejing Kang, Jie Hu, Changhu Wang

The nonlocal-based blocks are designed for capturing long-range spatial-temporal dependencies in computer vision tasks.

Action Recognition Image Classification +2

Bayesian Statistics Guided Label Refurbishment Mechanism: Mitigating Label Noise in Medical Image Classification

1 code implementation23 Jun 2021 Mengdi Gao, Ximeng Feng, Mufeng Geng, Zhe Jiang, Lei Zhu, Xiangxi Meng, Chuanqing Zhou, Qiushi Ren, Yanye Lu

BLRM utilizes maximum a posteriori probability (MAP) in the Bayesian statistics and the exponentially time-weighted technique to selectively correct the labels of noisy images.

Image Classification Medical Image Classification

Learning the Superpixel in a Non-iterative and Lifelong Manner

1 code implementation CVPR 2021 Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu

Superpixel is generated by automatically clustering pixels in an image into hundreds of compact partitions, which is widely used to perceive the object contours for its excellent contour adherence.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.