Search Results for author: Yanwei Fu

Found 195 papers, 84 papers with code

dS^2LBI: Exploring Structural Sparsity on Deep Network via Differential Inclusion Paths

no code implementations ICML 2020 Yanwei Fu, Chen Liu, Donghao Li, Xinwei Sun, Jinshan Zeng, Yuan YAO

Over-parameterization is ubiquitous nowadays in training neural networks to benefit both optimization in seeking global optima and generalization in reducing prediction error.

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

no code implementations11 Feb 2025 Sixiao Zheng, Zimian Peng, Yanpeng Zhou, Yi Zhu, Hang Xu, Xiangru Huang, Yanwei Fu

In this paper, we introduce VidCRAFT3, a novel framework for precise image-to-video generation that enables control over camera motion, object motion, and lighting direction simultaneously.

Image to Video Generation

UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control

1 code implementation9 Feb 2025 Kaizhen Zhu, Mokai Pan, Yuexin Ma, Yanwei Fu, Jingyi Yu, Jingya Wang, Ye Shi

We demonstrate that existing diffusion bridges employing Doob's $h$-transform constitute a special case of our framework, emerging when the terminal penalty coefficient in the SOC cost function tends to infinity.

Image Restoration

A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs

1 code implementation20 Jan 2025 Chang Wan, Ke Fan, Xinwei Sun, Yanwei Fu, MingLu Li, Yunliang Jiang, ZhongLong Zheng

To address these challenges, we propose a novel Lipschitz-constrained Functional Gradient GANs learning (Li-CFG) method to stabilize the training of GAN and provide a theoretical foundation for effectively increasing the diversity of synthetic samples by reducing the neighborhood size of the latent vector.

Diversity Image Generation

Making Your Dreams A Reality: Decoding the Dreams into a Coherent Video Story from fMRI Signals

no code implementations16 Jan 2025 Yanwei Fu, Jianxiong Gao, Baofeng Yang, Jianfeng Feng

By combining subjective dream experiences with objective neurophysiological data, we aim to understand the visual aspects of dreams and create complete video narratives.

Language Modeling Language Modelling

Adaptive Pruning of Pretrained Transformer via Differential Inclusions

no code implementations6 Jan 2025 Yizhuo Ding, Ke Fan, Yikai Wang, Xinwei Sun, Yanwei Fu

Therefore, the solution path identifies a Transformer weight family with various sparsity levels, offering greater flexibility and customization.

Low-rank compression

SparseGrasp: Robotic Grasping via 3D Semantic Gaussian Splatting from Sparse Multi-View RGB Images

no code implementations3 Dec 2024 Junqiu Yu, Xinlin Ren, Yongchong Gu, Haitao Lin, Tianyu Wang, Yi Zhu, Hang Xu, Yu-Gang Jiang, xiangyang xue, Yanwei Fu

Language-guided robotic grasping is a rapidly advancing field where robots are instructed using human language to grasp specific objects.

3DGS Robotic Grasping

Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization

no code implementations2 Dec 2024 Lingyun Zhang, Yu Xie, Yanwei Fu, Ping Chen

As large-scale diffusion models continue to advance, they excel at producing high-quality images but often generate unwanted content, such as sexually explicit or violent content.

Denoising Few-Shot Learning +1

ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos

no code implementations28 Nov 2024 Yuqian Fu, Runze Wang, Yanwei Fu, Danda Pani Paudel, Xuanjing Huang, Luc van Gool

In this paper, we focus on the Ego-Exo Object Correspondence task, an emerging challenge in the field of computer vision that aims to map objects across ego-centric and exo-centric views.

Object Object Localization +1

MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

1 code implementation25 Nov 2024 Chenjie Cao, Chaohui Yu, Shang Liu, Fan Wang, xiangyang xue, Yanwei Fu

We introduce MVGenMaster, a multi-view diffusion model enhanced with 3D priors to address versatile Novel View Synthesis (NVS) tasks.

Novel View Synthesis

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

1 code implementation15 Nov 2024 Boyuan Jiang, Xiaobin Hu, Donghao Luo, Qingdong He, Chengming Xu, Jinlong Peng, Jiangning Zhang, Chengjie Wang, Yunsheng Wu, Yanwei Fu

Although image-based virtual try-on has made considerable progress, emerging approaches still encounter challenges in producing high-fidelity and robust fitting images across diverse scenarios.

Virtual Try-on

Robust Network Learning via Inverse Scale Variational Sparsification

no code implementations27 Sep 2024 Zhiling Zhou, Zirui Liu, Chengming Xu, Yanwei Fu, Xinwei Sun

While neural networks have made significant strides in many AI tasks, they remain vulnerable to a range of noise types, including natural corruptions, adversarial noise, and low-resolution artifacts.

SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model

no code implementations5 Sep 2024 Weipeng Tan, Chuming Lin, Chengming Xu, Xiaozhong Ji, Junwei Zhu, Chengjie Wang, Yunsheng Wu, Yanwei Fu

Specifically, we first introduce the novel probabilistic style prior learning to model the intrinsic style as a Gaussian distribution using facial expressions and audio embedding.

Diversity Talking Head Generation

Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image

1 code implementation4 Aug 2024 Xinlin Ren, Chenjie Cao, Yanwei Fu, xiangyang xue

Additionally, we examine the impact of varying feature resolutions and evaluate both pixel-wise and patch-wise consistent losses, providing insights into effective strategies for improving NSR performance.

Surface Reconstruction

Unified Lexical Representation for Interpretable Visual-Language Alignment

1 code implementation25 Jul 2024 YiFan Li, Yikai Wang, Yanwei Fu, Dongyu Ru, Zheng Zhang, Tong He

On the other hand, lexical representation, a vector whose element represents the similarity between the sample and a word from the vocabulary, is a natural sparse representation and interpretable, providing exact matches for individual words.

Cross-Modal Retrieval Language Modelling

ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context

no code implementations13 Jul 2024 Sixiao Zheng, Yanwei Fu

To address these issues, we propose ContextualStory, a novel framework designed to generate coherent story frames and extend frames for story continuation.

Story Continuation Story Visualization +2

EFCNet: Every Feature Counts for Small Medical Object Segmentation

no code implementations26 Jun 2024 Lingjie Kong, Qiaoling Wei, Chengming Xu, Han Chen, Yanwei Fu

In response to this challenge, we propose a novel model named EFCNet for small object segmentation in medical images.

Decoder Image Segmentation +2

CustAny: Customizing Anything from A Single Example

2 code implementations17 Jun 2024 Lingjie Kong, Kai Wu, Xiaobin Hu, Wenhui Han, Jinlong Peng, Chengming Xu, Donghao Luo, Mengtian Li, Jiangning Zhang, Chengjie Wang, Yanwei Fu

The primary issue of promoting zero-shot object customization from specific domains to the general domain is to establish a large-scale general ID dataset for model pre-training, which is time-consuming and labor-intensive.

Object Virtual Try-on

Adaptive Slot Attention: Object Discovery with Dynamic Slot Number

1 code implementation CVPR 2024 Ke Fan, Zechen Bai, Tianjun Xiao, Tong He, Max Horn, Yanwei Fu, Francesco Locatello, Zheng Zhang

Moreover, our analysis substantiates that our method exhibits the capability to dynamically adapt the slot number according to each instance's complexity, offering the potential for further exploration in slot attention research.

Decoder Object +1

Hyper-Transformer for Amodal Completion

no code implementations30 May 2024 Jianxiong Gao, Xuelin Qian, Longfei Liang, Junwei Han, Yanwei Fu

The multi-scale features from the image branch guide the hyper transformer in learning shape priors and in generating the weights for dynamic convolution tailored to each instance.

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation

no code implementations28 May 2024 Qilin Wang, Zhengkai Jiang, Chengming Xu, Jiangning Zhang, Yabiao Wang, Xinyi Zhang, Yun Cao, Weijian Cao, Chengjie Wang, Yanwei Fu

This enables accurate alignment of pose and shape in the generated videos, providing a robust framework capable of handling a wide range of body shapes and dynamic hand movements.

Image Animation

3D StreetUnveiler with Semantic-Aware 2DGS

no code implementations28 May 2024 Jingwei Xu, Yikai Wang, Yiqun Zhao, Yanwei Fu, Shenghua Gao

The mesh representation of the empty street can be extracted for further applications.

3D Inpainting Autonomous Driving

Image-Text-Image Knowledge Transferring for Lifelong Person Re-Identification with Hybrid Clothing States

no code implementations26 May 2024 Qizao Wang, Xuelin Qian, Bin Li, Yanwei Fu, xiangyang xue

To tackle the challenges of knowledge granularity mismatch and knowledge presentation mismatch that occurred in LReID-Hybrid, we take advantage of the consistency and generalization of the text space, and propose a novel framework, dubbed $Teata$, to effectively align, transfer and accumulate knowledge in an "image-text-image" closed loop.

Person Re-Identification Transfer Learning

Content and Salient Semantics Collaboration for Cloth-Changing Person Re-Identification

1 code implementation26 May 2024 Qizao Wang, Xuelin Qian, Bin Li, Lifeng Chen, Yanwei Fu, xiangyang xue

Specifically, we propose the Content and Salient Semantics Collaboration (CSSC) framework, facilitating cross-parallel semantics interaction and refinement.

Cloth-Changing Person Re-Identification

Towards Global Optimal Visual In-Context Learning Prompt Selection

no code implementations24 May 2024 Chengming Xu, Chen Liu, Yikai Wang, Yuan YAO, Yanwei Fu

Visual In-Context Learning (VICL) is a prevailing way to transfer visual foundation models to new tasks by leveraging contextual information contained in in-context examples to enhance learning and prediction of query sample.

Colorization Foreground Segmentation +4

ArtWeaver: Advanced Dynamic Style Integration via Diffusion Model

no code implementations24 May 2024 Chengming Xu, Kai Hu, Qilin Wang, Donghao Luo, Jiangning Zhang, Xiaobin Hu, Yanwei Fu, Chengjie Wang

Stylized Text-to-Image Generation (STIG) aims to generate images from text prompts and style reference images.

Denoising model +1

A Generalization Theory of Cross-Modality Distillation with Contrastive Learning

no code implementations6 May 2024 Hangyu Lin, Chen Liu, Chengming Xu, Zhengqi Gao, Yanwei Fu, Yuan YAO

For instance, one typically aims to minimize the L2 distance or contrastive loss between the learned features of pairs of samples in the source (e. g. image) and the target (e. g. sketch) modalities.

Contrastive Learning

NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation

no code implementations27 Mar 2024 Jingyang Huo, Yikai Wang, Xuelin Qian, Yun Wang, Chong Li, Jianfeng Feng, Yanwei Fu

Recent fMRI-to-image approaches mainly focused on associating fMRI signals with specific conditions of pre-trained diffusion models.

Image Reconstruction

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation

no code implementations26 Mar 2024 Qilin Wang, Jiangning Zhang, Chengming Xu, Weijian Cao, Ying Tai, Yue Han, Yanhao Ge, Hong Gu, Chengjie Wang, Yanwei Fu

Facial Appearance Editing (FAE) aims to modify physical attributes, such as pose, expression and lighting, of human facial images while preserving attributes like identity and background, showing great importance in photograph.

Attribute Semantic Composition

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT

no code implementations24 Feb 2024 Sixiao Zheng, Jingyang Huo, Yu Wang, Yanwei Fu

We propose an Intelligent Director framework, utilizing LENS to generate descriptions for images and video frames and combining ChatGPT to generate coherent captions while recommending appropriate music names.

Retrieval Style Transfer

Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability

no code implementations19 Feb 2024 Xuelin Qian, Yu Wang, Simian Luo, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue, Bo Zhao, Tiejun Huang, Yunsheng Wu, Yanwei Fu

In this paper, we extend auto-regressive models to 3D domains, and seek a stronger ability of 3D shape generation by improving auto-regressive models at capacity and scalability simultaneously.

3D Generation 3D Shape Generation +1

Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector

1 code implementation5 Feb 2024 Yuqian Fu, Yu Wang, Yixuan Pan, Lian Huai, Xingyu Qiu, Zeyu Shangguan, Tong Liu, Yanwei Fu, Luc van Gool, Xingqun Jiang

This paper studies the challenging cross-domain few-shot object detection (CD-FSOD), aiming to develop an accurate object detector for novel domains with minimal labeled examples.

Cross-Domain Few-Shot Cross-Domain Few-Shot Object Detection +3

Repositioning the Subject within Image

1 code implementation30 Jan 2024 Yikai Wang, Chenjie Cao, Ke Fan, Qiaole Dong, YiFan Li, xiangyang xue, Yanwei Fu

To assess SEELE's effectiveness in subject repositioning, we assemble a real-world subject repositioning dataset called ReS.

Image Inpainting Image Manipulation

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo

1 code implementation22 Jan 2024 Chenjie Cao, Xinlin Ren, Yanwei Fu

Recent advancements in learning-based Multi-View Stereo (MVS) methods have prominently featured transformer-based models with attention mechanisms.

3D Reconstruction Depth Estimation +1

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations

1 code implementation30 Dec 2023 Yilan Dong, Chunlin Yu, Ruiyang Ha, Ye Shi, Yuexin Ma, Lan Xu, Yanwei Fu, Jingya Wang

Existing gait recognition benchmarks mostly include minor clothing variations in the laboratory environments, but lack persistent changes in appearance over time and space.

Gait Recognition

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

no code implementations12 Dec 2023 Jianxiong Gao, Yuqian Fu, Yun Wang, Xuelin Qian, Jianfeng Feng, Yanwei Fu

In this paper, we introduce Recon3DMind, an innovative task aimed at reconstructing 3D visuals from Functional Magnetic Resonance Imaging (fMRI) signals, marking a significant advancement in the fields of cognitive neuroscience and computer vision.

Brain Decoding Decoder +1

Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation

1 code implementation4 Dec 2023 Qiaole Dong, Bo Zhao, Yanwei Fu

Recently, Google proposes DDVM which for the first time demonstrates that a general diffusion model for image-to-image translation task works impressively well on optical flow estimation task without any specific designs like RAFT.

Image-to-Image Translation Optical Flow Estimation +1

fMRI-PTE: A Large-scale fMRI Pretrained Transformer Encoder for Multi-Subject Brain Activity Decoding

no code implementations1 Nov 2023 Xuelin Qian, Yun Wang, Jingyang Huo, Jianfeng Feng, Yanwei Fu

The exploration of brain activity and its decoding from fMRI data has been a longstanding pursuit, driven by its potential applications in brain-computer interfaces, medical diagnostics, and virtual reality.

Rethinking Amodal Video Segmentation from Learning Supervised Signals with Object-centric Representation

1 code implementation ICCV 2023 Ke Fan, Jingshi Lei, Xuelin Qian, Miaopeng Yu, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu

Furthermore, we propose a multi-view fusion layer based temporal module which is equipped with a set of object slots and interacts with features from different views by attention mechanism to fulfill sufficient object representation completion.

Object Video Segmentation +1

Doubly Robust Proximal Causal Learning for Continuous Treatments

1 code implementation22 Sep 2023 Yong Wu, Yanwei Fu, Shouyan Wang, Xinwei Sun

To address these challenges, we propose a kernel-based DR estimator that can well handle continuous treatments.

Unsupervised Open-Vocabulary Object Localization in Videos

1 code implementation ICCV 2023 Ke Fan, Zechen Bai, Tianjun Xiao, Dominik Zietlow, Max Horn, Zixu Zhao, Carl-Johann Simon-Gabriel, Mike Zheng Shou, Francesco Locatello, Bernt Schiele, Thomas Brox, Zheng Zhang, Yanwei Fu, Tong He

In this paper, we show that recent advances in video representation learning and pre-trained vision-language models allow for substantial improvements in self-supervised video object localization.

Object Object Localization +1

Object-Centric Multiple Object Tracking

1 code implementation ICCV 2023 Zixu Zhao, Jiaze Wang, Max Horn, Yizhuo Ding, Tong He, Zechen Bai, Dominik Zietlow, Carl-Johann Simon-Gabriel, Bing Shuai, Zhuowen Tu, Thomas Brox, Bernt Schiele, Yanwei Fu, Francesco Locatello, Zheng Zhang, Tianjun Xiao

Unsupervised object-centric learning methods allow the partitioning of scenes into entities without additional localization information and are excellent candidates for reducing the annotation burden of multiple-object tracking (MOT) pipelines.

Multiple Object Tracking Object +3

Coarse-to-Fine Amodal Segmentation with Shape Prior

1 code implementation ICCV 2023 Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu

To address this issue, we propose a convolution refine module to inject fine-grained information and provide a more precise amodal object segmentation based on visual features and coarse-predicted segmentation.

Object Segmentation +1

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

no code implementations30 Aug 2023 Tianyu Wang, YiFan Li, Haitao Lin, xiangyang xue, Yanwei Fu

The target instruction is then forwarded to a visual grounding system for object pose and size estimation, following which the robot grasps the object accordingly.

Language Modeling Language Modelling +4

Rethinking Person Re-identification from a Projection-on-Prototypes Perspective

no code implementations21 Aug 2023 Qizao Wang, Xuelin Qian, Bin Li, Yanwei Fu, xiangyang xue

In this paper, we rethink the role of the classifier in person Re-ID, and advocate a new perspective to conceive the classifier as a projection from image features to class prototypes.

Person Re-Identification Person Retrieval +3

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification

1 code implementation21 Aug 2023 Qizao Wang, Xuelin Qian, Bin Li, xiangyang xue, Yanwei Fu

Cloth-changing person Re-IDentification (Re-ID) is a particularly challenging task, suffering from two limitations of inferior discriminative features and limited training samples.

Attribute Cloth-Changing Person Re-Identification +4

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning

1 code implementation6 Aug 2023 Linbo Wang, Jing Wu, Xianyong Fang, Zhengyi Liu, Chenjie Cao, Yanwei Fu

First, we propose a Local Feature Consensus (LFC) plugin block to augment the features of existing models.

Pushing the Limits of 3D Shape Generation at Scale

no code implementations20 Jun 2023 Yu Wang, Xuelin Qian, Jingyang Huo, Tiejun Huang, Bo Zhao, Yanwei Fu

Through the adaptation of the Auto-Regressive model and the utilization of large language models, we have developed a remarkable model with an astounding 3. 6 billion trainable parameters, establishing it as the largest 3D shape generation model to date, named Argus-3D.

3D Generation 3D Shape Generation +2

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation

1 code implementation CVPR 2023 Jingyang Huo, Qiang Sun, Boyan Jiang, Haitao Lin, Yanwei Fu

Technically, we introduce a two-stage module that combine local slot attention and CLIP model to produce geometry-enhanced representation from such input.

Vision and Language Navigation

LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model

3 code implementations CVPR 2024 Chenjie Cao, Yunuo Cai, Qiaole Dong, Yikai Wang, Yanwei Fu

As an exemplar, we leverage LeftRefill to address two different challenges: reference-guided inpainting and novel view synthesis, based on the pre-trained StableDiffusion.

Image Inpainting Image Manipulation +2

Faster OreFSDet : A Lightweight and Effective Few-shot Object Detector for Ore Images

1 code implementation2 May 2023 Yang Zhang, Le Cheng, Yuting Peng, Chengming Xu, Yanwei Fu, Bo Wu, Guodong Sun

For the ore particle size detection, obtaining a sizable amount of high-quality ore labeled data is time-consuming and expensive.

Object object-detection +1

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

1 code implementation CVPR 2023 Yun He, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

Most existing point cloud upsampling methods have roughly three steps: feature extraction, feature expansion and 3D coordinate prediction.

point cloud upsampling

Joint fMRI Decoding and Encoding with Latent Embedding Alignment

no code implementations26 Mar 2023 Xuelin Qian, Yikai Wang, Yanwei Fu, Xinwei Sun, xiangyang xue, Jianfeng Feng

Our Latent Embedding Alignment (LEA) model concurrently recovers visual stimuli from fMRI signals and predicts brain activity from images within a unified framework.

Image Generation

Learning Versatile 3D Shape Generation with Improved AR Models

no code implementations26 Mar 2023 Simian Luo, Xuelin Qian, Yanwei Fu, yinda zhang, Ying Tai, Zhenyu Zhang, Chengjie Wang, xiangyang xue

Auto-Regressive (AR) models have achieved impressive results in 2D image generation by modeling joint distributions in the grid space.

3D Shape Generation Image Generation +1

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation

no code implementations11 Mar 2023 Chenjie Cao, Xinlin Ren, xiangyang xue, Yanwei Fu

To address these problems, we first apply one of the state-of-the-art learning-based MVS methods, --MVSFormer, to overcome intractable scenarios such as textureless and reflections regions suffered by traditional PatchMatch methods, but it fails in a few large scenes' reconstructions.

Co-Attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-Identification

1 code implementation Asian Conference on Computer Vision (ACCV) 2023 Qizao Wang, Xuelin Qian, Yanwei Fu, xiangyang xue

In this paper, we first design a novel Shape Semantics Embedding (SSE) module to encode body shape semantic information, which is one of the essential clues to distinguish pedestrians when their clothes change.

Cloth-Changing Person Re-Identification Person Retrieval +1

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints

1 code implementation ICCV 2023 Chenjie Cao, Yanwei Fu

Learning robust local image feature matching is a fundamental low-level vision task, which has been widely explored in the past few years.

Pose Estimation Visual Localization

Entity-Level Text-Guided Image Manipulation

1 code implementation22 Feb 2023 Yikai Wang, Jianan Wang, Guansong Lu, Hang Xu, Zhenguo Li, Wei zhang, Yanwei Fu

In the image manipulation phase, SeMani adopts a generative model to synthesize new images conditioned on the entity-irrelevant regions and target text descriptions.

Denoising Image Manipulation

StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning

2 code implementations CVPR 2023 Yuqian Fu, Yu Xie, Yanwei Fu, Yu-Gang Jiang

Thus, inspired by vanilla adversarial learning, a novel model-agnostic meta Style Adversarial training (StyleAdv) method together with a novel style adversarial attack method is proposed for CD-FSL.

Adversarial Attack cross-domain few-shot learning

Exploring Efficient Few-shot Adaptation for Vision Transformers

1 code implementation6 Jan 2023 Chengming Xu, Siqian Yang, Yabiao Wang, Zhanxiong Wang, Yanwei Fu, xiangyang xue

Essentially, despite ViTs have been shown to enjoy comparable or even better performance on other vision tasks, it is still very nontrivial to efficiently finetune the ViTs in real-world FSL scenarios.

Few-Shot Learning

Vocabulary-informed Zero-shot and Open-set Learning

1 code implementation3 Jan 2023 Yanwei Fu, Xiaomei Wang, Hanze Dong, Yu-Gang Jiang, Meng Wang, xiangyang xue, Leonid Sigal

Despite significant progress in object categorization, in recent years, a number of important challenges remain; mainly, the ability to learn from limited labeled data and to recognize object classes within large, potentially open, set of labels.

Object Categorization Open Set Learning +1

Knockoffs-SPR: Clean Sample Selection in Learning with Noisy Labels

1 code implementation2 Jan 2023 Yikai Wang, Yanwei Fu, Xinwei Sun

While Knockoffs-SPR can be regarded as a sample selection module for a standard supervised training pipeline, we further combine it with a semi-supervised algorithm to exploit the support of noisy data as unlabeled data.

Learning with noisy labels regression

Split-PU: Hardness-aware Training Strategy for Positive-Unlabeled Learning

1 code implementation30 Nov 2022 Chengming Xu, Chen Liu, Siqian Yang, Yabiao Wang, Shijie Zhang, Lijie Jia, Yanwei Fu

Since only part of the most confident positive samples are available and evidence is not enough to categorize the rest samples, many of these unlabeled data may also be the positive samples.

Binary Classification

PatchMix Augmentation to Identify Causal Features in Few-shot Learning

no code implementations29 Nov 2022 Chengming Xu, Chen Liu, Xinwei Sun, Siqian Yang, Yabiao Wang, Chengjie Wang, Yanwei Fu

We theoretically show that such an augmentation mechanism, different from existing ones, is able to identify the causal features.

Data Augmentation Few-Shot Learning +1

RankDNN: Learning to Rank for Few-shot Learning

1 code implementation28 Nov 2022 Qianyu Guo, Hongtong Gong, Xujun Wei, Yanwei Fu, Weifeng Ge, Yizhou Yu, Wenqiang Zhang

This paper introduces a new few-shot learning pipeline that casts relevance ranking for image retrieval as binary ranking relation classification.

Few-Shot Learning Image Classification +5

Self-supervised Amodal Video Object Segmentation

1 code implementation23 Oct 2022 Jian Yao, Yuxin Hong, Chiyu Wang, Tianjun Xiao, Tong He, Francesco Locatello, David Wipf, Yanwei Fu, Zheng Zhang

The key intuition is that the occluded part of an object can be explained away if that part is visible in other frames, possibly deformed as long as the deformation can be reasonably learned.

Object Segmentation +6

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors

2 code implementations12 Oct 2022 Chenjie Cao, Qiaole Dong, Yanwei Fu

Specifically, given one corrupt image, we present the Transformer Structure Restorer (TSR) module to restore holistic structural priors at low image resolution, which are further upsampled by Simple Structure Upsampler (SSU) module to higher image resolution.

Image Inpainting

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning

1 code implementation11 Oct 2022 Yuqian Fu, Yu Xie, Yanwei Fu, Jingjing Chen, Yu-Gang Jiang

Concretely, to solve the data imbalance problem between the source data with sufficient examples and the auxiliary target data with limited examples, we build our model under the umbrella of multi-expert learning.

cross-domain few-shot learning Knowledge Distillation

Specialized Re-Ranking: A Novel Retrieval-Verification Framework for Cloth Changing Person Re-Identification

no code implementations7 Oct 2022 Renjie Zhang, Yu Fang, Huaxin Song, Fangbin Wan, Yanwei Fu, Hirokazu Kato, Yang Wu

Cloth changing person re-identification(Re-ID) can work under more complicated scenarios with higher security than normal Re-ID and biometric techniques and is therefore extremely valuable in applications.

Cloth-Changing Person Re-Identification Re-Ranking +1

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling

1 code implementation18 Aug 2022 Boyan Jiang, Xinlin Ren, Mingsong Dou, xiangyang xue, Yanwei Fu, yinda zhang

Recent progress in 4D implicit representation focuses on globally controlling the shape and motion with low dimensional latent vectors, which is prone to missing surface details and accumulating tracking error.

3D Shape Modeling 4D reconstruction +1

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth

1 code implementation4 Aug 2022 Chenjie Cao, Xinlin Ren, Yanwei Fu

In this paper, we propose a pre-trained ViT enhanced MVS network called MVSFormer, which can learn more reliable feature representations benefited by informative priors from ViT.

3D Reconstruction Point Clouds +1

Learning Prior Feature and Attention Enhanced Image Inpainting

1 code implementation3 Aug 2022 Chenjie Cao, Qiaole Dong, Yanwei Fu

To this end, this paper incorporates the pre-training based Masked AutoEncoder (MAE) into the inpainting model, which enjoys richer informative priors to enhance the inpainting process.

Image Inpainting Image Restoration +2

Vision Transformers: From Semantic Segmentation to Dense Prediction

3 code implementations19 Jul 2022 Li Zhang, Jiachen Lu, Sixiao Zheng, Xinxuan Zhao, Xiatian Zhu, Yanwei Fu, Tao Xiang, Jianfeng Feng, Philip H. S. Torr

Extensive experiments show that our methods achieve appealing performance on a variety of dense prediction tasks (e. g., object detection and instance segmentation and semantic segmentation) as well as image classification.

Image Classification Instance Segmentation +6

RCLane: Relay Chain Prediction for Lane Detection

no code implementations19 Jul 2022 Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, xiangyang xue

This is because most of the existing lane detection methods either treat the lane detection as a dense prediction or a detection task, few of them consider the unique topologies (Y-shape, Fork-shape, nearly horizontal lane) of the lane markers, which leads to sub-optimal solution.

Lane Detection Prediction

Local Slot Attention for Vision-and-Language Navigation

1 code implementation17 Jun 2022 Yifeng Zhuang, Qiang Sun, Yanwei Fu, Lifeng Chen, xiangyang xue

Since the attention mechanism in the transformer architecture can better integrate inter- and intra-modal information of vision and language.

Navigate Vision and Language Navigation

Wavelet Prior Attention Learning in Axial Inpainting Network

no code implementations7 Jun 2022 Chenjie Cao, Chengrong Wang, Yuntao Zhang, Yanwei Fu

Image inpainting is the task of filling masked or unknown regions of an image with visually realistic contents, which has been remarkably improved by Deep Neural Networks (DNNs) recently.

Decoder Image Inpainting +1

Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions

no code implementations9 May 2022 Chilam Cheang, Haitao Lin, Yanwei Fu, xiangyang xue

This paper studies the task of any objects grasping from the known categories by free-form language instructions.

Object Object Localization +1

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches

no code implementations9 May 2022 Haitao Lin, Chilam Cheang, Yanwei Fu, xiangyang xue

The physical robot experiments confirm the utility of our method in object-cluttered scenes.

Density-preserving Deep Point Cloud Compression

no code implementations CVPR 2022 Yun He, Xinlin Ren, Danhang Tang, yinda zhang, xiangyang xue, Yanwei Fu

To address this, we propose a novel deep point cloud compression method that preserves local density information.

Decoder

Reinforcing Generated Images via Meta-learning for One-Shot Fine-Grained Visual Recognition

no code implementations22 Apr 2022 Satoshi Tsutsui, Yanwei Fu, David Crandall

One-shot fine-grained visual recognition often suffers from the problem of having few training examples for new fine-grained classes.

Diversity Fine-Grained Image Classification +4

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

no code implementations21 Apr 2022 Chao Wen, yinda zhang, Chenjie Cao, Zhuwen Li, xiangyang xue, Yanwei Fu

We study the problem of shape generation in 3D mesh representation from a small number of color images with or without camera poses.

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation

1 code implementation CVPR 2022 Jianan Wang, Guansong Lu, Hang Xu, Zhenguo Li, Chunjing Xu, Yanwei Fu

Existing text-guided image manipulation methods aim to modify the appearance of the image or to edit a few objects in a virtual or simple scenario, which is far from practical application.

Image Manipulation

DST: Dynamic Substitute Training for Data-free Black-box Attack

no code implementations CVPR 2022 Wenxuan Wang, Xuelin Qian, Yanwei Fu, xiangyang xue

With the wide applications of deep neural network models in various computer vision tasks, more and more works study the model vulnerability to adversarial examples.

Knowledge Distillation

ImpDet: Exploring Implicit Fields for 3D Object Detection

no code implementations31 Mar 2022 Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, xiangyang xue

Conventional 3D object detection approaches concentrate on bounding boxes representation learning with several parameters, i. e., localization, dimension, and orientation.

3D Object Detection Object +2

A Framework of Meta Functional Learning for Regularising Knowledge Transfer

no code implementations28 Mar 2022 Pan Li, Yanwei Fu, Shaogang Gong

The MFL computes meta-knowledge on functional regularisation generalisable to different learning tasks by which functional training on limited labelled data promotes more discriminative functions to be learned.

cross-domain few-shot learning Transfer Learning

Recent Few-Shot Object Detection Algorithms: A Survey with Performance Comparison

no code implementations27 Mar 2022 Tianying Liu, Lu Zhang, Yang Wang, Jihong Guan, Yanwei Fu, Jiajia Zhao, Shuigeng Zhou

To this end, the Few-Shot Object Detection (FSOD) has been topical recently, as it mimics the humans' ability of learning to learn, and intelligently transfers the learned generic object knowledge from the common heavy-tailed, to the novel long-tailed object classes.

Few-Shot Object Detection Meta-Learning +3

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation

no code implementations22 Mar 2022 Yuxin Hong, Xuelin Qian, Simian Luo, xiangyang xue, Yanwei Fu

To this end, this paper proposes a novel model of learning to Quantize, Scrabble, and Craft (QS-Craft) for conditional human motion animation.

Generative Adversarial Network

Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning

1 code implementation15 Mar 2022 Yuqian Fu, Yu Xie, Yanwei Fu, Jingjing Chen, Yu-Gang Jiang

The key challenge of CD-FSL lies in the huge data shift between source and target domains, which is typically in the form of totally different visual styles.

cross-domain few-shot learning Self-Supervised Learning

H4D: Human 4D Modeling by Learning Neural Compositional Representation

no code implementations CVPR 2022 Boyan Jiang, yinda zhang, Xingkui Wei, xiangyang xue, Yanwei Fu

A simple yet effective linear motion model is proposed to provide a rough and regularized motion estimation, followed by per-frame compensation for pose and geometry details with the residual encoded in the auxiliary code.

3D Reconstruction Future prediction +2

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding

2 code implementations CVPR 2022 Qiaole Dong, Chenjie Cao, Yanwei Fu

The proposed model restores holistic image structures with a powerful attention-based transformer model in a fixed low-resolution sketch space.

Image Inpainting

Clustering by the Probability Distributions from Extreme Value Theory

1 code implementation20 Feb 2022 Sixiao Zheng, Ke Fan, Yanxi Hou, Jianfeng Feng, Yanwei Fu

In contrast, the GPD fits the distribution of distance to the centroid exceeding a sufficiently large threshold, leading to a more stable performance of GPD k-means.

Clustering

Learning To Memorize Feature Hallucination for One-Shot Image Generation

no code implementations CVPR 2022 Yu Xie, Yanwei Fu, Ying Tai, Yun Cao, Junwei Zhu, Chengjie Wang

In this paper, we propose a novel model to explicitly learn and memorize reusable features that can help hallucinate novel category images.

Hallucination Image Generation

FEDERATED LEARNING FRAMEWORK BASED ON TRIMMED MEAN AGGREGATION RULES

no code implementations29 Sep 2021 Wang Tian Xiang, Meiyue Shao, Yanwei Fu, Riheng Jia, Feilong Lin, ZhongLong Zheng

Typically, aggregation rules are utilized to protect the model from the attacks in federated learning.

Federated Learning

Relative Instance Credibility Inference for Learning with Noisy Labels

no code implementations29 Sep 2021 Yikai Wang, Xinwei Sun, Yanwei Fu

Specifically, we re-purpose a sparse linear model with incidental parameters as a unified Relative Instance Credibility Inference (RICI) framework, which will detect and remove outliers in the forward pass of each mini-batch and use the remaining instances to train the network.

Learning with noisy labels

An Improved Composite Functional Gradient Learning by Wasserstein Regularization for Generative adversarial networks

no code implementations29 Sep 2021 Chang Wan, Yanwei Fu, Ke Fan, Jinshan Zeng, Ming Zhong, Riheng Jia, MingLu Li, ZhongLong Zheng

However, the discriminator using logistic regression from the CFG framework is gradually hard to discriminate between real and fake images while the training steps go on.

Image Generation regression

The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk Screening by Eye-region Manifestations

no code implementations18 Sep 2021 Yanwei Fu, Feng Li, Paula boned Fustel, Lei Zhao, Lijie Jia, Haojie Zheng, Qiang Sun, Shisong Rong, Haicheng Tang, xiangyang xue, Li Yang, Hong Li, Jiao Xie Wenxuan Wang, Yuan Li, Wei Wang, Yantao Pei, Jianmin Wang, Xiuqi Wu, Yanhua Zheng, Hongxia Tian, Mengwei Gu

The image-level performance of COVID-19 prescreening model in the China-Spain multicenter study achieved an AUC of 0. 913 (95% CI, 0. 898-0. 927), with a sensitivity of 0. 695 (95% CI, 0. 643-0. 748), a specificity of 0. 904 (95% CI, 0. 891 -0. 919), an accuracy of 0. 875(0. 861-0. 889), and a F1 of 0. 611(0. 568-0. 655).

Binary Classification Specificity

Deep Hybrid Self-Prior for Full 3D Mesh Generation

1 code implementation ICCV 2021 Xingkui Wei, Zhengqing Chen, Yanwei Fu, Zhaopeng Cui, yinda zhang

We present a deep learning pipeline that leverages network self-prior to recover a full 3D model consisting of both a triangular mesh and a texture map from the colored 3D point cloud.

Surface Reconstruction

A Unified Efficient Pyramid Transformer for Semantic Segmentation

no code implementations29 Jul 2021 Fangrui Zhu, Yi Zhu, Li Zhang, Chongruo wu, Yanwei Fu, Mu Li

Semantic segmentation is a challenging problem due to difficulties in modeling context in complex scenes and class confusions along boundaries.

Segmentation Semantic Segmentation

Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data

1 code implementation26 Jul 2021 Yuqian Fu, Yanwei Fu, Yu-Gang Jiang

Secondly, a novel disentangle module together with a domain classifier is proposed to extract the disentangled domain-irrelevant and domain-specific features.

cross-domain few-shot learning

Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos

no code implementations25 Jul 2021 Yuqian Fu, Yanwei Fu, Yu-Gang Jiang

To achieve this, a novel Mesh-based Video Action Imitation (M-VAI) method is proposed by us.

Human Dynamics

SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation

no code implementations CVPR 2022 Haitao Lin, Zichang Liu, Chilam Cheang, Yanwei Fu, Guodong Guo, xiangyang xue

The concatenation of the observed point cloud and symmetric one reconstructs a coarse object shape, thus facilitating object center (3D translation) and 3D size estimation.

Object Optical Character Recognition (OCR)

Rapid COVID-19 Risk Screening by Eye-region Manifestations

no code implementations12 Jun 2021 Yanwei Fu, Lei Zhao, Haojie Zheng, Qiang Sun, Li Yang, Hong Li, Jiao Xie, xiangyang xue, Feng Li, Yuan Li, Wei Wang, Yantao Pei, Jianmin Wang, Xiuqi Wu, Yanhua Zheng, Hongxia Tian Mengwei Gu1

It is still nontrivial to develop a new fast COVID-19 screening method with the easier access and lower cost, due to the technical and cost limitations of the current testing methods in the medical resource-poor districts.

Ethics

The Image Local Autoregressive Transformer

1 code implementation NeurIPS 2021 Chenjie Cao, Yuxin Hong, Xiang Li, Chengrong Wang, Chengming Xu, xiangyang xue, Yanwei Fu

To address these limitations, we propose a novel model -- image Local Autoregressive Transformer (iLAT), to better facilitate the locally guided image synthesis.

Image Generation

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection

1 code implementation4 Jun 2021 Zekun Luo, Zheng Fang, Sixiao Zheng, Yabiao Wang, Yanwei Fu

Non-Maximum Suppression (NMS) is essential for object detection and affects the evaluation results by incorporating False Positives (FP) and False Negatives (FN), especially in crowd occlusion scenes.

object-detection Object Detection +1

Delving into Data: Effectively Substitute Training for Black-box Attack

no code implementations CVPR 2021 Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, xiangyang xue

Previous substitute training approaches focus on stealing the knowledge of the target model based on real training data or synthetic data, without exploring what kind of data can further improve the transferability between the substitute and target models.

Adversarial Attack

Learning a Sketch Tensor Space for Image Inpainting of Man-made Scenes

1 code implementation ICCV 2021 Chenjie Cao, Yanwei Fu

To this end, this paper proposes learning a Sketch Tensor (ST) space for inpainting man-made scenes.

Decoder Image Inpainting

Learning Dynamic Alignment via Meta-filter for Few-shot Learning

1 code implementation CVPR 2021 Chengming Xu, Chen Liu, Li Zhang, Chengjie Wang, Jilin Li, Feiyue Huang, xiangyang xue, Yanwei Fu

Our insight is that these methods would lead to poor adaptation with redundant matching, and leveraging channel-wise adjustment is the key to well adapting the learned knowledge to new classes.

Few-Shot Learning Position

Incrementally Zero-Shot Detection by an Extreme Value Analyzer

no code implementations23 Mar 2021 Sixiao Zheng, Yanwei Fu, Yanxi Hou

However, zero-shot learning models assume that all seen classes should be known beforehand, while incremental learning models cannot recognize unseen classes.

class-incremental learning Class Incremental Learning +4

Learning Compositional Representation for 4D Captures with Neural ODE

no code implementations CVPR 2021 Boyan Jiang, yinda zhang, Xingkui Wei, xiangyang xue, Yanwei Fu

To model the motion, a neural Ordinary Differential Equation (ODE) is trained to update the initial state conditioned on the learned motion code, and a decoder takes the shape code and the updated state code to reconstruct the 3D model at each time stamp.

4D reconstruction Decoder

A Simple Feature Augmentation for Domain Generalization

no code implementations ICCV 2021 Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales

The topical domain generalization (DG) problem asks trained models to perform well on an unseen target domain with different data statistics from the source training domains.

Data Augmentation Domain Generalization

Whose hand is this? Person Identification from Egocentric Hand Gestures

no code implementations17 Nov 2020 Satoshi Tsutsui, Yanwei Fu, David Crandall

But while one's own face is not frequently visible, their hands are: in fact, hands are among the most common objects in one's own field of view.

Gesture Recognition Person Identification

Data-efficient Alignment of Multimodal Sequences by Aligning Gradient Updates and Internal Feature Distributions

1 code implementation15 Nov 2020 Jianan Wang, Boyang Li, Xiangyu Fan, Jing Lin, Yanwei Fu

The task of video and text sequence alignment is a prerequisite step toward joint understanding of movie videos and screenplays.

Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition

1 code implementation20 Oct 2020 Yuqian Fu, Li Zhang, Junke Wang, Yanwei Fu, Yu-Gang Jiang

Humans can easily recognize actions with only a few examples given, while the existing video recognition models still heavily rely on the large-scale labeled data inputs.

Few Shot Action Recognition Meta-Learning +2

M3Lung-Sys: A Deep Learning System for Multi-Class Lung Pneumonia Screening from CT Imaging

no code implementations7 Oct 2020 Xuelin Qian, Huazhu Fu, Weiya Shi, Tao Chen, Yanwei Fu, Fei Shan, xiangyang xue

To counter the outbreak of COVID-19, the accurate diagnosis of suspected cases plays a crucial role in timely quarantine, medical treatment, and preventing the spread of the pandemic.

A New Screening Method for COVID-19 based on Ocular Feature Recognition by Machine Learning Tools

no code implementations4 Sep 2020 Yanwei Fu, Feng Li, Wenxuan Wang, Haicheng Tang, Xuelin Qian, Mengwei Gu, xiangyang xue

After more than four months study, we found that the confirmed cases of COVID-19 present the consistent ocular pathological symbols; and we propose a new screening method of analyzing the eye-region images, captured by common CCD and CMOS cameras, could reliably make a rapid risk screening of COVID-19 with very high accuracy.

BIG-bench Machine Learning Ethics +2

Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking

1 code implementation ECCV 2020 Jinlong Peng, Changan Wang, Fangbin Wan, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu

Existing Multiple-Object Tracking (MOT) methods either follow the tracking-by-detection paradigm to conduct object detection, feature extraction and data association separately, or have two of the three subtasks integrated to form a partially end-to-end solution.

Multiple Object Tracking Object +3

How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning

2 code implementations15 Jul 2020 Yikai Wang, Li Zhang, Yuan YAO, Yanwei Fu

We rank the credibility of pseudo-labeled instances along the regularization path of their corresponding incidental parameters, and the most trustworthy pseudo-labeled examples are preserved as the augmented labeled instances.

Data Augmentation Few-Shot Learning

DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths

1 code implementation4 Jul 2020 Yanwei Fu, Chen Liu, Donghao Li, Xinwei Sun, Jinshan Zeng, Yuan YAO

Over-parameterization is ubiquitous nowadays in training neural networks to benefit both optimization in seeking global optima and generalization in reducing prediction error.

Self-supervised Video Object Segmentation

no code implementations22 Jun 2020 Fangrui Zhu, Li Zhang, Yanwei Fu, Guodong Guo, Weidi Xie

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a. k. a.

Object One-shot visual object segmentation +4

Long-Term Cloth-Changing Person Re-identification

no code implementations26 May 2020 Xuelin Qian, Wenxuan Wang, Li Zhang, Fangrui Zhu, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, xiangyang xue

Specifically, we consider that under cloth-changes, soft-biometrics such as body shape would be more reliable.

Cloth-Changing Person Re-Identification

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt

1 code implementation CVPR 2020 Hangyu Lin, Yanwei Fu, Yu-Gang Jiang, xiangyang xue

Unfortunately, the representation learned by SketchRNN is primarily for the generation tasks, rather than the other tasks of recognition and retrieval of sketches.

Retrieval Self-Supervised Learning +1

Instance Credibility Inference for Few-Shot Learning

1 code implementation CVPR 2020 Yikai Wang, Chengming Xu, Chen Liu, Li Zhang, Yanwei Fu

To measure the credibility of each pseudo-labeled instance, we then propose to solve another linear regression hypothesis by increasing the sparsity of the incidental parameters and rank the pseudo-labeled instances with their sparsity degree.

Data Augmentation Few-Shot Image Classification +2

When Person Re-identification Meets Changing Clothes

no code implementations9 Mar 2020 Fangbin Wan, Yang Wu, Xuelin Qian, Yixiong Chen, Yanwei Fu

We find that changing clothes makes ReID a much harder problem in the sense of bringing difficulties to learning effective representations and also challenges the generalization ability of previous ReID models to identify persons with unseen (new) clothes.

Person Re-Identification Person Search