Search Results for author: Lechao Cheng

Found 40 papers, 18 papers with code

LoopGaussian: Creating 3D Cinemagraph with Multi-view Images via Eulerian Motion Field

no code implementations13 Apr 2024 Jiyang Li, Lechao Cheng, Zhangye Wang, Tingting Mu, Jingxuan He

In this paper, inspired by significant progress in the field of novel view synthesis (NVS) achieved by 3D Gaussian Splatting (3D-GS), we propose LoopGaussian to elevate cinemagraph from 2D image space to 3D space using 3D Gaussian modeling.

Novel View Synthesis Scene Generation

Revisiting the Power of Prompt for Visual Tuning

1 code implementation4 Feb 2024 Yuzhu Wang, Lechao Cheng, Chaowei Fang, Dingwen Zhang, Manni Duan, Meng Wang

Inspired by the observation that the prompt tokens tend to share high mutual information with patch tokens, we propose initializing prompts with downstream token prototypes.

Visual Prompt Tuning

Open-Vocabulary Video Relation Extraction

1 code implementation25 Dec 2023 Wentao Tian, Zheng Wang, Yuqian Fu, Jingjing Chen, Lechao Cheng

A comprehensive understanding of videos is inseparable from describing the action with its contextual action-object interactions.

Action Classification Action Understanding +3

Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation

1 code implementation14 Dec 2023 Jingxuan He, Lechao Cheng, Chaowei Fang, Zunlei Feng, Tingting Mu, Mingli Song

Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Integrating UMLS Knowledge into Large Language Models for Medical Question Answering

no code implementations4 Oct 2023 Rui Yang, Edison Marrese-Taylor, Yuhe Ke, Lechao Cheng, Qingyu Chen, Irene Li

Our research demonstrates the effectiveness of using UMLS-augmented LLMs and highlights the potential application value of LLMs in in medical question-answering.

Question Answering Text Generation

NLPBench: Evaluating Large Language Models on Solving NLP Problems

1 code implementation27 Sep 2023 Linxin Song, Jieyu Zhang, Lechao Cheng, Pengyuan Zhou, Tianyi Zhou, Irene Li

Recent developments in large language models (LLMs) have shown promise in enhancing the capabilities of natural language processing (NLP).

Benchmarking Math

ScrollTimes: Tracing the Provenance of Paintings as a Window into History

no code implementations15 Jun 2023 Wei zhang, Wong Kam-Kwai, Yitian Chen, Ailing Jia, Luwei Wang, Jian-Wei Zhang, Lechao Cheng, Huamin Qu, Wei Chen

The study of cultural artifact provenance, tracing ownership and preservation, holds significant importance in archaeology and art history.

Improving Knowledge Distillation via Regularizing Feature Norm and Direction

1 code implementation26 May 2023 Yuzhu Wang, Lechao Cheng, Manni Duan, Yongheng Wang, Zunlei Feng, Shu Kong

Finally, we propose a rather simple loss term (dubbed ND loss) to simultaneously (1) encourage student to produce large-\emph{norm} features, and (2) align the \emph{direction} of student features and teacher class-means.

Domain Adaptation Knowledge Distillation

Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation

1 code implementation15 May 2023 Fangwen Wu, Jingxuan He, Yufei Yin, Yanbin Hao, Gang Huang, Lechao Cheng

This study introduces an efficacious approach, Masked Collaborative Contrast (MCC), to highlight semantic regions in weakly supervised semantic segmentation.

Contrastive Learning Weakly supervised Semantic Segmentation +1

Life Regression based Patch Slimming for Vision Transformers

no code implementations11 Apr 2023 Jiawei Chen, Lin Chen, Jiang Yang, Tianqi Shi, Lechao Cheng, Zunlei Feng, Mingli Song

In this study, we tackle the patch slimming problem from a different perspective by proposing a life regression module that determines the lifespan of each image patch in one go.

regression

ViT-Calibrator: Decision Stream Calibration for Vision Transformer

no code implementations10 Apr 2023 Lin Chen, Zhijie Jia, Tian Qiu, Lechao Cheng, Jie Lei, Zunlei Feng, Mingli Song

In this work, we propose a new paradigm dubbed Decision Stream Calibration that boosts the performance of general Vision Transformers.

Propheter: Prophetic Teacher Guided Long-Tailed Distribution Learning

1 code implementation9 Apr 2023 Wenxiang Xu, Yongcheng Jing, Linyun Zhou, Wenqi Huang, Lechao Cheng, Zunlei Feng, Mingli Song

This is specifically achieved by devising an elaborated ``prophetic'' teacher, termed as ``Propheter'', that aims to learn the potential class distributions.

Data Augmentation

Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation

1 code implementation CVPR 2023 Tianli Zhang, Mengqi Xue, Jiangtao Zhang, Haofei Zhang, Yu Wang, Lechao Cheng, Jie Song, Mingli Song

Most existing online knowledge distillation(OKD) techniques typically require sophisticated modules to produce diverse knowledge for improving students' generalization ability.

Knowledge Distillation

Model Doctor for Diagnosing and Treating Segmentation Error

1 code implementation17 Feb 2023 Zhijie Jia, Lin Chen, Kaiwen Hu, Lechao Cheng, Zunlei Feng, Mingli Song

Despite the remarkable progress in semantic segmentation tasks with the advancement of deep neural networks, existing U-shaped hierarchical typical segmentation networks still suffer from local misclassification of categories and inaccurate target boundaries.

Segmentation Semantic Segmentation

Team DETR: Guide Queries as a Professional Team in Detection Transformers

1 code implementation14 Feb 2023 Tian Qiu, Linyun Zhou, Wenxiang Xu, Lechao Cheng, Zunlei Feng, Mingli Song

Recent proposed DETR variants have made tremendous progress in various scenarios due to their streamlined processes and remarkable performance.

Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt

no code implementations CVPR 2023 Hao Li, Dingwen Zhang, Nian Liu, Lechao Cheng, Yalun Dai, Chao Zhang, Xinggang Wang, Junwei Han

Inspired by the recent success of the Prompting technique, we introduce a new pre-training method that boosts QEIS models by giving Saliency Prompt for queries/kernels.

Instance Segmentation Semantic Segmentation +1

Text-Guided Mask-free Local Image Retouching

no code implementations15 Dec 2022 Zerun Liu, Fan Zhang, Jingxuan He, Jin Wang, Zhangye Wang, Lechao Cheng

In the realm of multi-modality, text-guided image retouching techniques emerged with the advent of deep learning.

Image Retouching

SASFormer: Transformers for Sparsely Annotated Semantic Segmentation

1 code implementation5 Dec 2022 Hui Su, Yue Ye, Wei Hua, Lechao Cheng, Mingli Song

In this work, we propose a simple yet effective sparse annotated semantic segmentation framework based on segformer, dubbed SASFormer, that achieves remarkable performance.

Segmentation Semantic Segmentation +2

Transferability Estimation Based On Principal Gradient Expectation

no code implementations29 Nov 2022 Huiyan Qi, Lechao Cheng, Jingjing Chen, Yue Yu, Xue Song, Zunlei Feng, Yu-Gang Jiang

Transfer learning aims to improve the performance of target tasks by transferring knowledge acquired in source tasks.

Transfer Learning

A Survey of Neural Trees

1 code implementation7 Sep 2022 Haoling Li, Jie Song, Mengqi Xue, Haofei Zhang, Jingwen Ye, Lechao Cheng, Mingli Song

This survey aims to present a comprehensive review of NTs and attempts to identify how they enhance the model interpretability.

Combating Noisy Labels in Long-Tailed Image Classification

no code implementations1 Sep 2022 Chaowei Fang, Lechao Cheng, Huiyan Qi, Dingwen Zhang

Most existing methods that cope with noisy labels usually assume that the class distributions are well balanced, which has insufficient capacity to deal with the practical scenarios where training samples have imbalanced distributions.

Classification Image Classification

ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition

1 code implementation22 Aug 2022 Mengqi Xue, Qihan Huang, Haofei Zhang, Lechao Cheng, Jie Song, Minghui Wu, Mingli Song

The global prototypes are adopted to provide the global view of objects to guide local prototypes to concentrate on the foreground while eliminating the influence of the background.

Decision Making Explainable artificial intelligence +1

Re-Attention Transformer for Weakly Supervised Object Localization

1 code implementation3 Aug 2022 Hui Su, Yue Ye, Zhiwei Chen, Mingli Song, Lechao Cheng

Weakly supervised object localization is a challenging task which aims to localize objects with coarse annotations such as image categories.

Object Weakly-Supervised Object Localization

Long-term Leap Attention, Short-term Periodic Shift for Video Classification

1 code implementation12 Jul 2022 Hao Zhang, Lechao Cheng, Yanbin Hao, Chong-Wah Ngo

By replacing a vanilla 2D attention with the LAPS, we could adapt a static transformer into a video one, with zero extra parameters and neglectable computation overhead ($\sim$2. 6\%).

Video Classification

KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing

1 code implementation21 Jun 2022 Xuanhan Wang, Jingkuan Song, Xiaojia Chen, Lechao Cheng, Lianli Gao, Heng Tao Shen

In this article, we propose a Knowledge Embedded RCNN (KE-RCNN) to identify attributes by leveraging rich knowledges, including implicit knowledge (e. g., the attribute ``above-the-hip'' for a shirt requires visual/geometry relations of shirt-hip) and explicit knowledge (e. g., the part of ``shorts'' cannot have the attribute of ``hoodie'' or ``lining'').

Attribute

Cross-Modality High-Frequency Transformer for MR Image Super-Resolution

no code implementations29 Mar 2022 Chaowei Fang, Dingwen Zhang, Liang Wang, Yulun Zhang, Lechao Cheng, Junwei Han

Improving the resolution of magnetic resonance (MR) image data is critical to computer-aided diagnosis and brain function analysis.

Image Super-Resolution Vocal Bursts Intensity Prediction

Weakly Supervised Semantic Segmentation via Alternative Self-Dual Teaching

no code implementations17 Dec 2021 Dingwen Zhang, Wenyuan Zeng, Guangyu Guo, Chaowei Fang, Lechao Cheng, Ming-Ming Cheng, Junwei Han

Current weakly supervised semantic segmentation (WSSS) frameworks usually contain the separated mask-refinement model and the main semantic region mining model.

Knowledge Distillation Weakly supervised Semantic Segmentation +1

Boundary Knowledge Translation based Reference Semantic Segmentation

no code implementations1 Aug 2021 Lechao Cheng, Zunlei Feng, Xinchao Wang, Ya Jie Liu, Jie Lei, Mingli Song

In this paper, we introduce a novel Reference semantic segmentation Network (Ref-Net) to conduct visual boundary knowledge translation.

Segmentation Semantic Segmentation +1

Edge-competing Pathological Liver Vessel Segmentation with Limited Labels

1 code implementation1 Aug 2021 Zunlei Feng, Zhonghua Wang, Xinchao Wang, Xiuming Zhang, Lechao Cheng, Jie Lei, Yuexuan Wang, Mingli Song

The diagnosis of MVI needs discovering the vessels that contain hepatocellular carcinoma cells and counting their number in each vessel, which depends heavily on experiences of the doctor, is largely subjective and time-consuming.

Segmentation whole slide images

Visual Boundary Knowledge Translation for Foreground Segmentation

1 code implementation1 Aug 2021 Zunlei Feng, Lechao Cheng, Xinchao Wang, Xiang Wang, Yajie Liu, Xiangtong Du, Mingli Song

To this end, we propose a Translation Segmentation Network (Trans-Net), which comprises a segmentation network and two boundary discriminators.

Foreground Segmentation Image Segmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.