Search Results for author: Tieniu Tan

Found 114 papers, 45 papers with code

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

1 code implementation • CVPR 2023 • Zhengxiong Luo, Dayou Chen, Yingya Zhang, Yan Huang, Liang Wang, Yujun Shen, Deli Zhao, Jingren Zhou, Tieniu Tan

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data distribution.

Ranked #7 on Video Generation on UCF-101

Code Generation Denoising +4

6,005

Paper
Code

Session-based Recommendation with Graph Neural Networks

7 code implementations • 1 Nov 2018 • Shu Wu, Yuyuan Tang, Yanqiao Zhu, Liang Wang, Xing Xie, Tieniu Tan

To obtain accurate item embedding and take complex transitions of items into account, we propose a novel method, i. e. Session-based Recommendation with Graph Neural Networks, SR-GNN for brevity.

Ranked #1 on Session-Based Recommendations on Gowalla

Session-Based Recommendations

4,094

Paper
Code

A Light CNN for Deep Face Representation with Noisy Labels

17 code implementations • 9 Nov 2015 • Xiang Wu, Ran He, Zhenan Sun, Tieniu Tan

This paper presents a Light CNN framework to learn a compact embedding on the large-scale face data with massive noisy labels.

Ranked #2 on Age-Invariant Face Recognition on CAFR

Face Identification Face Recognition +2

834

Paper
Code

Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

2 code implementations • CVPR 2019 • Xuecai Hu, Haoyuan Mu, Xiangyu Zhang, Zilei Wang, Tieniu Tan, Jian Sun

In this work, we propose a novel method called Meta-SR to firstly solve super-resolution of arbitrary scale factor (including non-integer scale factors) with a single model.

Image Super-Resolution

544

Paper
Code

A Comprehensive Survey on Test-Time Adaptation under Distribution Shifts

1 code implementation • 27 Mar 2023 • Jian Liang, Ran He, Tieniu Tan

Test-time adaptation (TTA), an emerging paradigm, has the potential to adapt a pre-trained model to unlabeled data during testing, before making predictions.

Source-Free Domain Adaptation Test-time Adaptation

487

Paper
Code

Unfolding the Alternating Optimization for Blind Super Resolution

1 code implementation • NeurIPS 2020 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

More importantly, \textit{Restorer} is trained with the kernel estimated by \textit{Estimator}, instead of ground-truth kernel, thus \textit{Restorer} could be more tolerant to the estimation error of \textit{Estimator}.

Ranked #2 on Blind Super-Resolution on Set5 - 2x upscaling

Blind Super-Resolution Burst Image Super-Resolution +1

229

Paper
Code

End-to-end Alternating Optimization for Blind Super Resolution

1 code implementation • 14 May 2021 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

More importantly, \textit{Restorer} is trained with the kernel estimated by \textit{Estimator}, instead of the ground-truth kernel, thus \textit{Restorer} could be more tolerant to the estimation error of \textit{Estimator}.

Ranked #2 on Blind Super-Resolution on DIV2KRK - 4x upscaling

Blind Super-Resolution Super-Resolution

229

Paper
Code

End-to-end Alternating Optimization for Real-World Blind Super Resolution

2 code implementations • 17 Aug 2023 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

To address this issue, instead of considering these two problems independently, we adopt an alternating optimization algorithm, which can estimate the degradation and restore the SR image in a single model.

Blind Super-Resolution Super-Resolution

229

Paper
Code

GAIA: A Transfer Learning System of Object Detection that Fits Your Needs

1 code implementation • CVPR 2021 • Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang

Transfer learning with pre-training on large-scale datasets has played an increasingly significant role in computer vision and natural language processing recently.

object-detection Object Detection +1

184

Paper
Code

BEVBert: Multimodal Map Pre-training for Language-guided Navigation

1 code implementation • ICCV 2023 • Dong An, Yuankai Qi, Yangguang Li, Yan Huang, Liang Wang, Tieniu Tan, Jing Shao

Concretely, we build a local metric map to explicitly aggregate incomplete observations and remove duplicates, while modeling navigation dependency in a global topological map.

Ranked #2 on Visual Navigation on R2R

Vision and Language Navigation Visual Navigation

160

Paper
Code

Learning the Degradation Distribution for Blind Image Super-Resolution

1 code implementation • CVPR 2022 • Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

Compared with previous deterministic degradation models, PDM could model more diverse degradations and generate HR-LR pairs that may better cover the various degradations of test images, and thus prevent the SR model from over-fitting to specific ones.

Image Super-Resolution

157

Paper
Code

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation

1 code implementation • CVPR 2021 • Zhengxiong Luo, Zhicheng Wang, Yan Huang, Tieniu Tan, Erjin Zhou

However, for bottom-up methods, which need to handle a large variance of human scales and labeling ambiguities, the current practice seems unreasonable.

Pose Estimation regression

121

Paper
Code

Hierarchical Graph Convolutional Networks for Semi-supervised Node Classification

1 code implementation • 13 Feb 2019 • Fenyu Hu, Yanqiao Zhu, Shu Wu, Liang Wang, Tieniu Tan

Graph convolutional networks (GCNs) have been successfully applied in node classification tasks of network mining.

Ranked #6 on Node Classification on Cora with Public Split: fixed 20 nodes per class

Classification General Classification +1

114

Paper
Code

Vision Transformer with Super Token Sampling

1 code implementation • CVPR 2023 • Huaibo Huang, Xiaoqiang Zhou, Jie Cao, Ran He, Tieniu Tan

STA decomposes vanilla global attention into multiplications of a sparse association map and a low-dimensional attention, leading to high efficiency in capturing global dependencies.

Semantic Segmentation Superpixels

109

Paper
Code

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation

1 code implementation • CVPR 2023 • Yueming Lyu, Tianwei Lin, Fu Li, Dongliang He, Jing Dong, Tieniu Tan

Our key idea is to investigate and identify a space, namely delta image and text space that has well-aligned distribution between CLIP visual feature differences of two images and CLIP textual embedding differences of source and target texts.

Image Manipulation

Paper
Code

Improving Zero-Shot Generalization for CLIP with Synthesized Prompts

1 code implementation • ICCV 2023 • Zhengbo Wang, Jian Liang, Ran He, Nan Xu, Zilei Wang, Tieniu Tan

Thereafter, we fine-tune CLIP with off-the-shelf methods by combining labeled and synthesized features.

Generalized Zero-Shot Learning Transfer Learning +1

Paper
Code

Neighbor-view Enhanced Model for Vision and Language Navigation

1 code implementation • 15 Jul 2021 • Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan

Specifically, our NvEM utilizes a subject module and a reference module to collect contexts from neighbor views.

Ranked #82 on Vision and Language Navigation on VLN Challenge

Navigate Vision and Language Navigation

Paper
Code

OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling

1 code implementation • NeurIPS 2023 • Yi-Fan Zhang, Qingsong Wen, Xue Wang, Weiqi Chen, Liang Sun, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

Online updating of time series forecasting models aims to address the concept drifting problem by efficiently updating forecasting models based on streaming data.

Time Series Time Series Forecasting

Paper
Code

AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation

1 code implementation • 25 Apr 2023 • Yi-Fan Zhang, Xue Wang, Kexin Jin, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

In particular, when the adaptation target is a series of domains, the adaptation accuracy of AdaNPC is 50% higher than advanced TTA methods.

Domain Generalization Test-time Adaptation

Paper
Code

TAGNN: Target Attentive Graph Neural Networks for Session-based Recommendation

1 code implementation • 6 May 2020 • Feng Yu, Yanqiao Zhu, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

However, these methods compress a session into one fixed representation vector without considering the target items to be predicted.

Ranked #3 on Session-Based Recommendations on yoochoose1

Session-Based Recommendations

Paper
Code

Free Lunch for Domain Adversarial Training: Environment Label Smoothing

1 code implementation • The Eleventh International Conference on Learning Representations (ICLR 2023) 2023 • Yifan Zhang, Xue Wang, Jian Liang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

A fundamental challenge for machine learning models is how to generalize learned models for out-of-distribution (OOD) data.

Ranked #4 on Domain Adaptation on Office-Home

Domain Generalization

Paper
Code

Debiasing Multimodal Large Language Models

1 code implementation • 8 Mar 2024 • Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

In the realms of computer vision and natural language processing, Large Vision-Language Models (LVLMs) have become indispensable tools, proficient in generating textual descriptions based on visual inputs.

Fairness Question Answering

Paper
Code

Dynamic Graph Representation for Partially Occluded Biometrics

2 code implementations • 1 Dec 2019 • Min Ren, Yunlong Wang, Zhenan Sun, Tieniu Tan

During dynamic graph matching, we propose a novel strategy to measure the distances of both nodes and adjacent matrixes.

Graph Matching

Paper
Code

Fully Sparse Fusion for 3D Object Detection

1 code implementation • 24 Apr 2023 • Yingyan Li, Lue Fan, Yang Liu, Zehao Huang, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang, Tieniu Tan

In this paper, we study how to effectively leverage image modality in the emerging fully sparse architecture.

3D Instance Segmentation 3D Object Detection +3

Paper
Code

A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation

1 code implementation • 6 Feb 2024 • Zhengbo Wang, Jian Liang, Lijun Sheng, Ran He, Zilei Wang, Tieniu Tan

Extensive results on 17 datasets validate that our method surpasses or achieves comparable results with state-of-the-art methods on few-shot classification, imbalanced learning, and out-of-distribution generalization.

Out-of-Distribution Generalization

Paper
Code

Semantic 3D-aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field

1 code implementation • 3 Feb 2023 • Tianxiang Ma, Bingchuan Li, Qian He, Jing Dong, Tieniu Tan

CNeRF divides the image by semantic regions and learns an independent neural radiance field for each region, and finally fuses them and renders the complete image.

Paper
Code

RiDDLE: Reversible and Diversified De-identification with Latent Encryptor

1 code implementation • CVPR 2023 • Dongze Li, Wei Wang, Kang Zhao, Jing Dong, Tieniu Tan

This work presents RiDDLE, short for Reversible and Diversified De-identification with Latent Encryptor, to protect the identity information of people from being misused.

De-identification

Paper
Code

CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation

1 code implementation • 27 Nov 2018 • Junsong Fan, Zhao-Xiang Zhang, Tieniu Tan, Chunfeng Song, Jun Xiao

Weakly supervised semantic segmentation with only image-level labels saves large human effort to annotate pixel-level labels.

Segmentation Weakly supervised segmentation +2

Paper
Code

Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

1 code implementation • CVPR 2022 • Zengjie Song, Yuxi Wang, Junsong Fan, Tieniu Tan, Zhaoxiang Zhang

Sound source localization in visual scenes aims to localize objects emitting the sound in a given image.

Contrastive Learning

Paper
Code

Transferable Sparse Adversarial Attack

2 code implementations • CVPR 2022 • Ziwen He, Wei Wang, Jing Dong, Tieniu Tan

The experiment shows that our method has improved the transferability by a large margin under a similar sparsity setting compared with state-of-the-art methods.

Adversarial Attack Quantization

Paper
Code

Semantic Prompt for Few-Shot Image Recognition

1 code implementation • CVPR 2023 • Wentao Chen, Chenyang Si, Zhang Zhang, Liang Wang, Zilei Wang, Tieniu Tan

Instead of the naive exploitation of semantic information for remedying classifiers, we explore leveraging semantic information as prompts to tune the visual feature extraction network adaptively.

Few-Shot Learning

Paper
Code

Pointly-Supervised Panoptic Segmentation

1 code implementation • 25 Oct 2022 • Junsong Fan, Zhaoxiang Zhang, Tieniu Tan

In this paper, we propose a new approach to applying point-level annotations for weakly-supervised panoptic segmentation.

Panoptic Segmentation Segmentation +3

Paper
Code

3D Shape Temporal Aggregation for Video-Based Clothing-Change Person Re-Identication

1 code implementation • Asian Conference on Computer Vision 2023 • Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan

However, existing Re-ID methods usually generate 3D body shapes without considering identity modeling, which severely weakens the discriminability of 3D human shapes.

3D Shape Generation Person Re-Identification

Paper
Code

IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis

3 code implementations • NeurIPS 2018 • Huaibo Huang, Zhihang Li, Ran He, Zhenan Sun, Tieniu Tan

On the other hand, the inference model is encouraged to classify between the generated and real samples while the generator tries to fool it as GANs.

Image Generation

Paper
Code

Learning Instance-level Spatial-Temporal Patterns for Person Re-identification

1 code implementation • ICCV 2021 • Min Ren, Lingxiao He, Xingyu Liao, Wu Liu, Yunlong Wang, Tieniu Tan

In this paper, we propose a novel Instance-level and Spatial-Temporal Disentangled Re-ID method (InSTD), to improve Re-ID accuracy.

Ranked #14 on Person Re-Identification on DukeMTMC-reID

Image Retrieval Person Re-Identification +1

Paper
Code

GraphAIR: Graph Representation Learning with Neighborhood Aggregation and Interaction

1 code implementation • 5 Nov 2019 • Fenyu Hu, Yanqiao Zhu, Shu Wu, Weiran Huang, Liang Wang, Tieniu Tan

Then, in order to better capture the complicated non-linearity of graph data, we present a novel GraphAIR framework which models the neighborhood interaction in addition to neighborhood aggregation.

Ranked #5 on Node Classification on Cora with Public Split: fixed 20 nodes per class

Community Detection General Classification +3

Paper
Code

AdaptGuard: Defending Against Universal Attacks for Model Adaptation

1 code implementation • ICCV 2023 • Lijun Sheng, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

To address this issue, we propose a model preprocessing framework, named AdaptGuard, to improve the security of model adaptation algorithms.

Knowledge Distillation Transfer Learning

Paper
Code

DeepFirearm: Learning Discriminative Feature Representation for Fine-grained Firearm Retrieval

1 code implementation • 8 Jun 2018 • Jiedong Hao, Jing Dong, Wei Wang, Tieniu Tan

There are great demands for automatically regulating inappropriate appearance of shocking firearm images in social media or identifying firearm types in forensics.

Image Retrieval Retrieval

Paper
Code

Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models

1 code implementation • 18 Feb 2024 • Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan

In this work, we adopt the intuition that the LVLM tends to respond logically consistently for existent objects but inconsistently for hallucinated objects.

Hallucination Object

Paper
Code

Learning Pose-invariant 3D Object Reconstruction from Single-view Images

1 code implementation • 3 Apr 2020 • Bo Peng, Wei Wang, Jing Dong, Tieniu Tan

Learning to reconstruct 3D shapes using 2D images is an active research topic, with benefits of not requiring expensive 3D data.

3D Object Reconstruction Domain Adaptation

Paper
Code

Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations

1 code implementation • 16 Jul 2022 • Wentao Chen, Zhang Zhang, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan

Different from previous cross-domain FSL work (CD-FSL) that considers the domain shift between base and novel classes, the new problem, termed cross-domain cross-set FSL (CDSC-FSL), requires few-shot learners not only to adapt to the new domain, but also to be consistent between different domains within each novel class.

Few-Shot Learning

Paper
Code

Exploiting Semantic Attributes for Transductive Zero-Shot Learning

1 code implementation • 17 Mar 2023 • Zhengbo Wang, Jian Liang, Zilei Wang, Tieniu Tan

To address this issue, we present a novel transductive ZSL method that produces semantic attributes of the unseen data and imposes them on the generative process.

Attribute Generative Adversarial Network +1

Paper
Code

What Is the Best Practice for CNNs Applied to Visual Instance Retrieval?

1 code implementation • 5 Nov 2016 • Jiedong Hao, Jing Dong, Wei Wang, Tieniu Tan

Based on the evaluation results, we also identify the best choices for different factors and propose a new multi-scale image feature representation method to encode the image effectively.

Image Retrieval Retrieval

Paper
Code

Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning

no code implementations • ECCV 2018 • Chenyang Si, Ya Jing, Wei Wang, Liang Wang, Tieniu Tan

Skeleton-based action recognition has made great progress recently, but many problems still remain unsolved.

Ranked #81 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Human-Object Interaction Detection +2

Paper
Add Code

Geometry Guided Adversarial Facial Expression Synthesis

no code implementations • 10 Dec 2017 • Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan

An expression invariant face recognition experiment is also performed to further show the advantages of our proposed method.

Face Recognition Face Transfer +2

Paper
Add Code

Deep Supervised Discrete Hashing

no code implementations • NeurIPS 2017 • Qi Li, Zhenan Sun, Ran He, Tieniu Tan

Benefit from recent advances in deep learning, deep hashing methods have achieved promising results for image retrieval.

Deep Hashing General Classification +1

Paper
Add Code

Anti-Makeup: Learning A Bi-Level Adversarial Network for Makeup-Invariant Face Verification

no code implementations • 12 Sep 2017 • Yi Li, Lingxiao Song, Xiang Wu, Ran He, Tieniu Tan

This paper proposes a learning from generation approach for makeup-invariant face verification by introducing a bi-level adversarial network (BLAN).

Face Verification

Paper
Add Code

Coupled Deep Learning for Heterogeneous Face Recognition

no code implementations • 8 Apr 2017 • Xiang Wu, Lingxiao Song, Ran He, Tieniu Tan

CDL seeks a shared feature space in which the heterogeneous face matching problem can be approximately treated as a homogeneous face matching problem.

Face Recognition Heterogeneous Face Recognition

Paper
Add Code

Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition

no code implementations • 8 Aug 2017 • Ran He, Xiang Wu, Zhenan Sun, Tieniu Tan

To avoid the over-fitting problem on small-scale heterogeneous face data, a correlation prior is introduced on the fully-connected layers of WCNN network to reduce parameter space.

Ranked #3 on Face Verification on BUAA-VisNir

Face Recognition Heterogeneous Face Recognition

Paper
Add Code

Multimodal Memory Modelling for Video Captioning

no code implementations • 17 Nov 2016 • Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

In this paper, we propose a Multimodal Memory Model (M3) to describe videos, which builds a visual and textual shared memory to model the long-term visual-textual dependency and further guide global visual attention on described targets.

Sentence Video Captioning

Paper
Add Code

DeMeshNet: Blind Face Inpainting for Deep MeshFace Verification

no code implementations • 16 Nov 2016 • Shu Zhang, Ran He, Tieniu Tan

The occlusions incurred by random meshes severely degenerate the performance of face verification systems, which raises the MeshFace verification problem between MeshFace and daily photos.

Face Alignment Face Verification +1

Paper
Add Code

ICE: Information Credibility Evaluation on Social Media via Representation Learning

no code implementations • 29 Sep 2016 • Qiang Liu, Shu Wu, Feng Yu, Liang Wang, Tieniu Tan

In this paper, we propose a novel representation learning method, Information Credibility Evaluation (ICE), to learn representations of information credibility on social media.

Feature Engineering Representation Learning

Paper
Add Code

Learning Structured Ordinal Measures for Video based Face Recognition

no code implementations • 9 Jul 2015 • Ran He, Tieniu Tan, Larry Davis, Zhenan Sun

This paper presents a structured ordinal measure method for video-based face recognition that simultaneously learns ordinal filters and structured ordinal features.

Face Recognition

Paper
Add Code

Deep Semantic Ranking Based Hashing for Multi-Label Image Retrieval

no code implementations • CVPR 2015 • Fang Zhao, Yongzhen Huang, Liang Wang, Tieniu Tan

Research efforts have been devoted to learning compact binary codes that preserve semantic similarity based on labels.

Multi-Label Image Retrieval Retrieval +2

Paper
Add Code

Deep Steganalysis: End-to-End Learning with Supervisory Information beyond Class Labels

no code implementations • 27 Jun 2018 • Wei Wang, Jing Dong, Yinlong Qian, Tieniu Tan

Recently, deep learning has shown its power in steganalysis.

Steganalysis

Paper
Add Code

Variational Capsules for Image Analysis and Synthesis

no code implementations • 11 Jul 2018 • Huaibo Huang, Lingxiao Song, Ran He, Zhenan Sun, Tieniu Tan

Variational capsules model an image as a composition of entities in a probabilistic model.

Attribute General Classification +2

Paper
Add Code

Accelerating Deep Neural Networks with Spatial Bottleneck Modules

no code implementations • 7 Sep 2018 • Junran Peng, Lingxi Xie, Zhao-Xiang Zhang, Tieniu Tan, Jingdong Wang

This paper presents an efficient module named spatial bottleneck for accelerating the convolutional layers in deep neural networks.

Paper
Add Code

Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search

no code implementations • 22 Sep 2018 • Ya Jing, Chenyang Si, Jun-Bo Wang, Wei Wang, Liang Wang, Tieniu Tan

To exploit the multilevel corresponding visual contents, we propose a pose-guided multi-granularity attention network (PMA).

Person Search Sentence +1

Paper
Add Code

Relevance Topic Model for Unstructured Social Group Activity Recognition

no code implementations • NeurIPS 2013 • Fang Zhao, Yongzhen Huang, Liang Wang, Tieniu Tan

Unstructured social group activity recognition in web videos is a challenging task due to 1) the semantic gap between class labels and low-level visual features and 2) the lack of labeled training data.

Attribute Group Activity Recognition +1

Paper
Add Code

Multistage Adversarial Losses for Pose-Based Human Image Synthesis

no code implementations • CVPR 2018 • Chenyang Si, Wei Wang, Liang Wang, Tieniu Tan

Human image synthesis has extensive practical applications e. g. person re-identification and data augmentation for human pose estimation.

Data Augmentation Image Generation +2

Paper
Add Code

M3: Multimodal Memory Modelling for Video Captioning

no code implementations • CVPR 2018 • Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

Inspired by the facts that memory modelling poses potential advantages to long-term sequential problems [35] and working memory is the key factor of visual attention [33], we propose a Multimodal Memory Model (M3) to describe videos, which builds a visual and textual shared memory to model the long-term visual-textual dependency and further guide visual attention on described visual targets to solve visual-textual alignments.

Sentence Video Captioning

Paper
Add Code

End-to-end View Synthesis for Light Field Imaging with Pseudo 4DCNN

no code implementations • ECCV 2018 • Yunlong Wang, Fei Liu, Zilei Wang, Guangqi Hou, Zhenan Sun, Tieniu Tan

Limited angular resolution has become the main bottleneck of microlens-based plenoptic cameras towards practical vision applications.

Computational Efficiency Depth Estimation

Paper
Add Code

Deformable Object Matching via Deformation Decomposition based 2D Label MRF

no code implementations • CVPR 2014 • Kangwei Liu, Junge Zhang, Kaiqi Huang, Tieniu Tan

The MRF energy function is derived from the deformation decomposition model.

Paper
Add Code

ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering

no code implementations • CVPR 2016 • Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li

For spectral embedding/clustering, it is still an open problem on how to construct an relation graph to reflect the intrinsic structures in data.

Clustering graph construction +5

Paper
Add Code

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-Identification

no code implementations • CVPR 2017 • Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, Tieniu Tan

Accordingly, a demanding need is to recognize a person under different cameras, which is called person re-identification.

Metric Learning Video-Based Person Re-Identification

Paper
Add Code

Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution

no code implementations • ICCV 2017 • Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan

Most modern face super-resolution methods resort to convolutional neural networks (CNN) to infer high-resolution (HR) face images.

Ranked #3 on Face Hallucination on FFHQ 512 x 512 - 16x upscaling

Face Hallucination Image Super-Resolution

Paper
Add Code

Cross-spectral Face Completion for NIR-VIS Heterogeneous Face Recognition

no code implementations • 10 Feb 2019 • Ran He, Jie Cao, Lingxiao Song, Zhenan Sun, Tieniu Tan

This paper models high resolution heterogeneous face synthesis as a complementary combination of two components, a texture inpainting component and pose correction component.

Face Generation Face Recognition +3

Paper
Add Code

An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition

no code implementations • CVPR 2019 • Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, Tieniu Tan

Nevertheless, how to effectively extract discriminative spatial and temporal features is still a challenging problem.

Ranked #51 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

Fast Supervised Discrete Hashing

no code implementations • 7 Apr 2019 • Jie Gui, Tongliang Liu, Zhenan Sun, DaCheng Tao, Tieniu Tan

Rather than adopting this method, FSDH uses a very simple yet effective regression of the class labels of training examples to the corresponding hash code to accelerate the algorithm.

regression

Paper
Add Code

Supervised Discrete Hashing with Relaxation

no code implementations • 7 Apr 2019 • Jie Gui, Tongliang Liu, Zhenan Sun, DaCheng Tao, Tieniu Tan

In SDHR, the regression target is instead optimized.

General Classification regression +1

Paper
Add Code

Progressive Cluster Purification for Transductive Few-shot Learning

no code implementations • 10 Jun 2019 • Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, Tieniu Tan

Furthermore, the inter-class classification and the intra-class transduction are extremely flexible to be repeated several times to progressively purify the clusters.

Few-Shot Learning General Classification

Paper
Add Code

Efficient Neural Architecture Transformation Searchin Channel-Level for Object Detection

no code implementations • 5 Sep 2019 • Junran Peng, Ming Sun, Zhao-Xiang Zhang, Tieniu Tan, Junjie Yan

With the combination of these two designs, an architecture transformation scheme could be discovered to adapt a network designed for image classification to task of object detection.

Image Classification Neural Architecture Search +3

Paper
Add Code

POD: Practical Object Detection with Scale-Sensitive Network

no code implementations • ICCV 2019 • Junran Peng, Ming Sun, Zhao-Xiang Zhang, Tieniu Tan, Junjie Yan

Scale-sensitive object detection remains a challenging task, where most of the existing methods could not learn it explicitly and are not robust to scale variance.

Object object-detection +1

Paper
Add Code

Efficient Neural Architecture Transformation Search in Channel-Level for Object Detection

no code implementations • NeurIPS 2019 • Junran Peng, Ming Sun, Zhao-Xiang Zhang, Tieniu Tan, Junjie Yan

Instead of searching and constructing an entire network, NATS explores the architecture space on the base of existing network and reusing its weights.

Image Classification Neural Architecture Search +3

Paper
Add Code

A3GAN: An Attribute-aware Attentive Generative Adversarial Network for Face Aging

no code implementations • 15 Nov 2019 • Yunfan Liu, Qi Li, Zhenan Sun, Tieniu Tan

Face aging, which aims at aesthetically rendering a given face to predict its future appearance, has received significant research attention in recent years.

Attribute Generative Adversarial Network

Paper
Add Code

Alignment Free and Distortion Robust Iris Recognition

no code implementations • 1 Dec 2019 • Min Ren, Caiyong Wang, Yunlong Wang, Zhenan Sun, Tieniu Tan

And illumination variations may cause irregular distortion of iris texture.

Iris Recognition

Paper
Add Code

A New Ensemble Method for Concessively Targeted Multi-model Attack

no code implementations • 19 Dec 2019 • Ziwen He, Wei Wang, Xinsheng Xuan, Jing Dong, Tieniu Tan

Thus, in this paper, we propose a new attack mechanism which performs the non-targeted attack when the targeted attack fails.

Image Classification

Paper
Add Code

Temporal Sparse Adversarial Attack on Sequence-based Gait Recognition

no code implementations • 22 Feb 2020 • Ziwen He, Wei Wang, Jing Dong, Tieniu Tan

In this paper, we demonstrate that the state-of-the-art gait recognition model is vulnerable to such attacks.

Adversarial Attack Gait Recognition +1

Paper
Add Code

Cosmetic-Aware Makeup Cleanser

no code implementations • 20 Apr 2020 • Yi Li, Huaibo Huang, Junchi Yu, Ran He, Tieniu Tan

Face verification aims at determining whether a pair of face images belongs to the same identity.

Face Parsing Face Verification +1

Paper
Add Code

Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels

no code implementations • CVPR 2020 • Junran Peng, Xingyuan Bu, Ming Sun, Zhao-Xiang Zhang, Tieniu Tan, Junjie Yan

Training with more data has always been the most stable and effective way of improving performance in deep learning era.

Long-tail Learning Object +2

Paper
Add Code

TFNet: Multi-Semantic Feature Interaction for CTR Prediction

no code implementations • 29 Jun 2020 • Shu Wu, Feng Yu, Xueli Yu, Qiang Liu, Liang Wang, Tieniu Tan, Jie Shao, Fan Huang

The CTR (Click-Through Rate) prediction plays a central role in the domain of computational advertising and recommender systems.

Ranked #31 on Click-Through Rate Prediction on Criteo

Click-Through Rate Prediction Recommendation Systems

Paper
Add Code

Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition

no code implementations • ECCV 2020 • Chenyang Si, Xuecheng Nie, Wei Wang, Liang Wang, Tieniu Tan, Jiashi Feng

Self-supervised learning (SSL) has been proved very effective at learning representations from unlabeled data in the image domain.

3D Action Recognition Self-Supervised Learning

Paper
Add Code

Employing Multi-Estimations for Weakly-Supervised Semantic Segmentation

no code implementations • ECCV 2020 • Junsong Fan, Zhao-Xiang Zhang, Tieniu Tan

Instead of struggling to refine a single seed, we propose a novel approach to alleviate the inaccurate seed problem by leveraging the segmentation model's robustness to learn from multiple seeds.

Segmentation Weakly supervised Semantic Segmentation +1

Paper
Add Code

Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification

no code implementations • ECCV 2020 • Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan

In this paper, we propose a novel Prediction, Recovery and Identification (PRI) model for LR re-id, which adaptively recovers missing details by predicting a preferable scale factor based on the image content.

Person Re-Identification Super-Resolution

Paper
Add Code

Style Intervention: How to Achieve Spatial Disentanglement with Style-based Generators?

no code implementations • 19 Nov 2020 • Yunfan Liu, Qi Li, Zhenan Sun, Tieniu Tan

Generative Adversarial Networks (GANs) with style-based generators (e. g. StyleGAN) successfully enable semantic control over image synthesis, and recent studies have also revealed that interpretable image translations could be obtained by modifying the latent code.

Attribute Disentanglement +2

Paper
Add Code

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations

no code implementations • 13 Dec 2020 • Zhengxiong Luo, Zhicheng Wang, Yuanhao Cai, GuanAn Wang, Yan Huang, Liang Wang, Erjin Zhou, Tieniu Tan, Jian Sun

Instead, we focus on exploiting multi-scale information from layers with different receptive-field sizes and then making full of use this information by improving the fusion method.

Pose Estimation

Paper
Add Code

Focal and Efficient IOU Loss for Accurate Bounding Box Regression

no code implementations • 20 Jan 2021 • Yi-Fan Zhang, Weiqiang Ren, Zhang Zhang, Zhen Jia, Liang Wang, Tieniu Tan

(ii) Most of the loss functions ignore the imbalance problem in BBR that the large number of anchor boxes which have small overlaps with the target boxes contribute most to the optimization of BBR.

object-detection Object Detection +2

Paper
Add Code

Graph Classification by Mixture of Diverse Experts

no code implementations • 29 Mar 2021 • Fenyu Hu, Liping Wang, Shu Wu, Liang Wang, Tieniu Tan

Graph classification is a challenging research problem in many applications across a broad range of domains.

General Classification Graph Classification

Paper
Add Code

Locate then Segment: A Strong Pipeline for Referring Image Segmentation

no code implementations • CVPR 2021 • Ya Jing, Tao Kong, Wei Wang, Liang Wang, Lei LI, Tieniu Tan

Referring image segmentation aims to segment the objects referred by a natural language expression.

Ranked #5 on Generalized Referring Expression Segmentation on gRefCOCO

Generalized Referring Expression Segmentation Image Segmentation +2

Paper
Add Code

Learning Domain Invariant Representations for Generalizable Person Re-Identification

no code implementations • 29 Mar 2021 • Yi-Fan Zhang, Zhang Zhang, Da Li, Zhen Jia, Liang Wang, Tieniu Tan

Generalizable person Re-Identification (ReID) has attracted growing attention in recent computer vision community.

Data Augmentation Domain Generalization +2

Paper
Add Code

SOGAN: 3D-Aware Shadow and Occlusion Robust GAN for Makeup Transfer

no code implementations • 21 Apr 2021 • Yueming Lyu, Jing Dong, Bo Peng, Wei Wang, Tieniu Tan

Since human faces are symmetrical in the UV space, we can conveniently remove the undesired shadow and occlusion from the reference image by carefully designing a Flip Attention Module (FAM).

Face Model Facial Makeup Transfer

Paper
Add Code

Robust Face-Swap Detection Based on 3D Facial Shape Information

no code implementations • 28 Apr 2021 • Weinan Guan, Wei Wang, Jing Dong, Bo Peng, Tieniu Tan

Maliciously-manipulated images or videos - so-called deep fakes - especially face-swap images and videos have attracted more and more malicious attackers to discredit some key figures.

Paper
Add Code

Few-Shot Learning with Part Discovery and Augmentation from Unlabeled Images

no code implementations • 25 May 2021 • Wentao Chen, Chenyang Si, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan

Few-shot learning is a challenging task since only few instances are given for recognizing an unseen class.

Ranked #3 on Unsupervised Few-Shot Image Classification on Tiered ImageNet 5-way (1-shot)

Few-Shot Learning Inductive Bias +2

Paper
Add Code

Adaptive Dilated Convolution For Human Pose Estimation

no code implementations • 22 Jul 2021 • Zhengxiong Luo, Zhicheng Wang, Yan Huang, Liang Wang, Tieniu Tan, Erjin Zhou

It can generate and fuse multi-scale features of the same spatial sizes by setting different dilation rates for different channels.

Pose Estimation

Paper
Add Code

Generalizable Person Re-identification Without Demographics

no code implementations • 29 Sep 2021 • Yifan Zhang, Feng Li, Zhang Zhang, Liang Wang, DaCheng Tao, Tieniu Tan

However, the convex condition of KL DRO may not hold for overparameterized neural networks, such that applying KL DRO often fails to generalize under distribution shifts in real scenarios.

Generalizable Person Re-identification

Paper
Add Code

Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

no code implementations • 1 Mar 2022 • Ke Han, Chenyang Si, Yan Huang, Liang Wang, Tieniu Tan

In this paper, we investigate the generalization problem of person re-identification (re-id), whose major challenge is the distribution shift on an unseen domain.

Generalizable Person Re-identification

Paper
Add Code

Disentangled Federated Learning for Tackling Attributes Skew via Invariant Aggregation and Diversity Transferring

no code implementations • 14 Jun 2022 • Zhengquan Luo, Yunlong Wang, Zilei Wang, Zhenan Sun, Tieniu Tan

Attributes skew hinders the current federated learning (FL) frameworks from consistent optimization directions among the clients, which inevitably leads to performance reduction and unstable convergence.

Federated Learning valid

Paper
Add Code

HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation

no code implementations • 11 Nov 2022 • Kaiduo Zhang, Muyi Sun, Jianxin Sun, Binghao Zhao, Kunbo Zhang, Zhenan Sun, Tieniu Tan

In this paper, we propose HumanDiffusion, a coarse-to-fine alignment diffusion framework, for text-driven person image generation.

Image Generation Retrieval +2

Paper
Add Code

Semantic-aware One-shot Face Re-enactment with Dense Correspondence Estimation

no code implementations • 23 Nov 2022 • Yunfan Liu, Qi Li, Zhenan Sun, Tieniu Tan

One-shot face re-enactment is a challenging task due to the identity mismatch between source and driving faces.

Disentanglement Generative Adversarial Network

Paper
Add Code

Human Image Generation: A Comprehensive Survey

no code implementations • 17 Dec 2022 • Zhen Jia, Zhang Zhang, Liang Wang, Tieniu Tan

Image and video synthesis has become a blooming topic in computer vision and machine learning communities along with the developments of deep generative models, due to its great academic and application value.

Data Augmentation Image Generation +2

Paper
Add Code

CFFT-GAN: Cross-domain Feature Fusion Transformer for Exemplar-based Image Translation

no code implementations • 3 Feb 2023 • Tianxiang Ma, Bingchuan Li, Wei Liu, Miao Hua, Jing Dong, Tieniu Tan

In this paper, we propose a more general learning approach by considering two domain features as a whole and learning both inter-domain correspondence and intra-domain potential information interactions.

Translation

Paper
Add Code

Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

no code implementations • 17 Apr 2023 • Weinan Guan, Wei Wang, Jing Dong, Bo Peng, Tieniu Tan

An important topic in manipulation detection is the localization of the fake regions.

Segmentation

Paper
Add Code

Clothing-Change Feature Augmentation for Person Re-Identification

no code implementations • CVPR 2023 • Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan

Specifically, to formulate meaningful clothing variations in the feature space, our method first estimates a clothing-change normal distribution with intra-ID cross-clothing variances.

Person Re-Identification

Paper
Add Code

GaFET: Learning Geometry-aware Facial Expression Translation from In-The-Wild Images

no code implementations • ICCV 2023 • Tianxiang Ma, Bingchuan Li, Qian He, Jing Dong, Tieniu Tan

In this paper, we introduce a novel Geometry-aware Facial Expression Translation (GaFET) framework, which is based on parametric 3D facial representations and can stably decoupled expression.

Facial Expression Translation

Paper
Add Code

Towards Realistic Unsupervised Fine-tuning with CLIP

no code implementations • 24 Aug 2023 • Jian Liang, Lijun Sheng, Zhengbo Wang, Ran He, Tieniu Tan

The emergence of vision-language models (VLMs), such as CLIP, has spurred a significant research effort towards their application for downstream supervised learning tasks.

Out-of-Distribution Detection

Paper
Add Code

MSRA-SR: Image Super-resolution Transformer with Multi-scale Shared Representation Acquisition

no code implementations • ICCV 2023 • Xiaoqiang Zhou, Huaibo Huang, Ran He, Zilei Wang, Jie Hu, Tieniu Tan

In particular, self-attention with cross-scale matching and convolution filters with different kernel sizes are designed to exploit the multi-scale features in images.

Image Super-Resolution

Paper
Add Code

Model-free Test Time Adaptation for Out-Of-Distribution Detection

no code implementations • 28 Nov 2023 • Yifan Zhang, Xue Wang, Tian Zhou, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

We demonstrate the effectiveness of \abbr through comprehensive experiments on multiple OOD detection benchmarks, extensive empirical studies show that \abbr significantly improves the performance of OOD detection over state-of-the-art methods.

Decision Making Out-of-Distribution Detection +2

Paper
Add Code

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

no code implementations • 18 Dec 2023 • Dongze Li, Kang Zhao, Wei Wang, Bo Peng, Yingya Zhang, Jing Dong, Tieniu Tan

Audio-driven talking head synthesis is a promising topic with wide applications in digital human, film making and virtual reality.

Talking Head Generation

Paper
Add Code

Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors

1 code implementation • 20 Dec 2023 • Yi-Fan Zhang, Zhang Zhang, Liang Wang, Tieniu Tan, Rong Jin

In an effort to address these issues, we delve into the realm of zero-shot machine-generated text detection.

Binary Classification Text Detection +1

Paper
Code

GraphDIVE: Graph Classification by Mixture of Diverse Experts

1 code implementation • journal 2021 • Fenyu Hu, Liping Wang, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

Graph classification is a challenging research problem in many applications across a broad range of domains.

Graph Classification

Paper
Code

Not all Minorities are Equal: Empty-Class-Aware Distillation for Heterogeneous Federated Learning

no code implementations • 4 Jan 2024 • Kuangpu Guo, Yuhe Ding, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

Data heterogeneity, characterized by disparities in local data distribution across clients, poses a significant challenge in federated learning.

Federated Learning Knowledge Distillation

Paper
Add Code

Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models

no code implementations • 6 Feb 2024 • Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

This paper proposes a \textbf{C}ollabo\textbf{ra}tive \textbf{F}ine-\textbf{T}uning (\textbf{CraFT}) approach for fine-tuning black-box VLMs to downstream tasks, where one only has access to the input prompts and the output predictions of the model.

Paper
Add Code

KEBench: A Benchmark on Knowledge Editing for Large Vision-Language Models

no code implementations • 12 Mar 2024 • Han Huang, Haitian Zhong, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

We conducted experiments of different editing methods on five LVLMs, and thoroughly analyze how these methods impact the models.

knowledge editing

Paper
Add Code

Artifact Feature Purification for Cross-domain Detection of AI-generated Images

no code implementations • 17 Mar 2024 • Zheling Meng, Bo Peng, Jing Dong, Tieniu Tan

We also find that the artifact features APN focuses on across generators and scenes are global and diverse.

Mutual Information Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.