Search Results for author: Tieniu Tan

Found 114 papers, 45 papers with code

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

1 code implementation CVPR 2023 Zhengxiong Luo, Dayou Chen, Yingya Zhang, Yan Huang, Liang Wang, Yujun Shen, Deli Zhao, Jingren Zhou, Tieniu Tan

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data distribution.

Code Generation Denoising +4

Session-based Recommendation with Graph Neural Networks

7 code implementations1 Nov 2018 Shu Wu, Yuyuan Tang, Yanqiao Zhu, Liang Wang, Xing Xie, Tieniu Tan

To obtain accurate item embedding and take complex transitions of items into account, we propose a novel method, i. e. Session-based Recommendation with Graph Neural Networks, SR-GNN for brevity.

Session-Based Recommendations

A Light CNN for Deep Face Representation with Noisy Labels

17 code implementations9 Nov 2015 Xiang Wu, Ran He, Zhenan Sun, Tieniu Tan

This paper presents a Light CNN framework to learn a compact embedding on the large-scale face data with massive noisy labels.

Face Identification Face Recognition +2

Meta-SR: A Magnification-Arbitrary Network for Super-Resolution

2 code implementations CVPR 2019 Xuecai Hu, Haoyuan Mu, Xiangyu Zhang, Zilei Wang, Tieniu Tan, Jian Sun

In this work, we propose a novel method called Meta-SR to firstly solve super-resolution of arbitrary scale factor (including non-integer scale factors) with a single model.

Image Super-Resolution

A Comprehensive Survey on Test-Time Adaptation under Distribution Shifts

1 code implementation27 Mar 2023 Jian Liang, Ran He, Tieniu Tan

Test-time adaptation (TTA), an emerging paradigm, has the potential to adapt a pre-trained model to unlabeled data during testing, before making predictions.

Source-Free Domain Adaptation Test-time Adaptation

Unfolding the Alternating Optimization for Blind Super Resolution

1 code implementation NeurIPS 2020 Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

More importantly, \textit{Restorer} is trained with the kernel estimated by \textit{Estimator}, instead of ground-truth kernel, thus \textit{Restorer} could be more tolerant to the estimation error of \textit{Estimator}.

Blind Super-Resolution Burst Image Super-Resolution +1

End-to-end Alternating Optimization for Blind Super Resolution

1 code implementation14 May 2021 Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

More importantly, \textit{Restorer} is trained with the kernel estimated by \textit{Estimator}, instead of the ground-truth kernel, thus \textit{Restorer} could be more tolerant to the estimation error of \textit{Estimator}.

Blind Super-Resolution Super-Resolution

End-to-end Alternating Optimization for Real-World Blind Super Resolution

2 code implementations17 Aug 2023 Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

To address this issue, instead of considering these two problems independently, we adopt an alternating optimization algorithm, which can estimate the degradation and restore the SR image in a single model.

Blind Super-Resolution Super-Resolution

GAIA: A Transfer Learning System of Object Detection that Fits Your Needs

1 code implementation CVPR 2021 Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang

Transfer learning with pre-training on large-scale datasets has played an increasingly significant role in computer vision and natural language processing recently.

object-detection Object Detection +1

BEVBert: Multimodal Map Pre-training for Language-guided Navigation

1 code implementation ICCV 2023 Dong An, Yuankai Qi, Yangguang Li, Yan Huang, Liang Wang, Tieniu Tan, Jing Shao

Concretely, we build a local metric map to explicitly aggregate incomplete observations and remove duplicates, while modeling navigation dependency in a global topological map.

Vision and Language Navigation Visual Navigation

Learning the Degradation Distribution for Blind Image Super-Resolution

1 code implementation CVPR 2022 Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, Tieniu Tan

Compared with previous deterministic degradation models, PDM could model more diverse degradations and generate HR-LR pairs that may better cover the various degradations of test images, and thus prevent the SR model from over-fitting to specific ones.

Image Super-Resolution

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation

1 code implementation CVPR 2021 Zhengxiong Luo, Zhicheng Wang, Yan Huang, Tieniu Tan, Erjin Zhou

However, for bottom-up methods, which need to handle a large variance of human scales and labeling ambiguities, the current practice seems unreasonable.

Pose Estimation regression

Vision Transformer with Super Token Sampling

1 code implementation CVPR 2023 Huaibo Huang, Xiaoqiang Zhou, Jie Cao, Ran He, Tieniu Tan

STA decomposes vanilla global attention into multiplications of a sparse association map and a low-dimensional attention, leading to high efficiency in capturing global dependencies.

Semantic Segmentation Superpixels

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation

1 code implementation CVPR 2023 Yueming Lyu, Tianwei Lin, Fu Li, Dongliang He, Jing Dong, Tieniu Tan

Our key idea is to investigate and identify a space, namely delta image and text space that has well-aligned distribution between CLIP visual feature differences of two images and CLIP textual embedding differences of source and target texts.

Image Manipulation

OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling

1 code implementation NeurIPS 2023 Yi-Fan Zhang, Qingsong Wen, Xue Wang, Weiqi Chen, Liang Sun, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

Online updating of time series forecasting models aims to address the concept drifting problem by efficiently updating forecasting models based on streaming data.

Time Series Time Series Forecasting

AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation

1 code implementation25 Apr 2023 Yi-Fan Zhang, Xue Wang, Kexin Jin, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

In particular, when the adaptation target is a series of domains, the adaptation accuracy of AdaNPC is 50% higher than advanced TTA methods.

Domain Generalization Test-time Adaptation

TAGNN: Target Attentive Graph Neural Networks for Session-based Recommendation

1 code implementation6 May 2020 Feng Yu, Yanqiao Zhu, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

However, these methods compress a session into one fixed representation vector without considering the target items to be predicted.

Session-Based Recommendations

Debiasing Multimodal Large Language Models

1 code implementation8 Mar 2024 Yi-Fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

In the realms of computer vision and natural language processing, Large Vision-Language Models (LVLMs) have become indispensable tools, proficient in generating textual descriptions based on visual inputs.

Fairness Question Answering

Dynamic Graph Representation for Partially Occluded Biometrics

2 code implementations1 Dec 2019 Min Ren, Yunlong Wang, Zhenan Sun, Tieniu Tan

During dynamic graph matching, we propose a novel strategy to measure the distances of both nodes and adjacent matrixes.

Graph Matching

Fully Sparse Fusion for 3D Object Detection

1 code implementation24 Apr 2023 Yingyan Li, Lue Fan, Yang Liu, Zehao Huang, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang, Tieniu Tan

In this paper, we study how to effectively leverage image modality in the emerging fully sparse architecture.

3D Instance Segmentation 3D Object Detection +3

A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation

1 code implementation6 Feb 2024 Zhengbo Wang, Jian Liang, Lijun Sheng, Ran He, Zilei Wang, Tieniu Tan

Extensive results on 17 datasets validate that our method surpasses or achieves comparable results with state-of-the-art methods on few-shot classification, imbalanced learning, and out-of-distribution generalization.

Out-of-Distribution Generalization

Semantic 3D-aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field

1 code implementation3 Feb 2023 Tianxiang Ma, Bingchuan Li, Qian He, Jing Dong, Tieniu Tan

CNeRF divides the image by semantic regions and learns an independent neural radiance field for each region, and finally fuses them and renders the complete image.

RiDDLE: Reversible and Diversified De-identification with Latent Encryptor

1 code implementation CVPR 2023 Dongze Li, Wei Wang, Kang Zhao, Jing Dong, Tieniu Tan

This work presents RiDDLE, short for Reversible and Diversified De-identification with Latent Encryptor, to protect the identity information of people from being misused.

De-identification

CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation

1 code implementation27 Nov 2018 Junsong Fan, Zhao-Xiang Zhang, Tieniu Tan, Chunfeng Song, Jun Xiao

Weakly supervised semantic segmentation with only image-level labels saves large human effort to annotate pixel-level labels.

Segmentation Weakly supervised segmentation +2

Transferable Sparse Adversarial Attack

2 code implementations CVPR 2022 Ziwen He, Wei Wang, Jing Dong, Tieniu Tan

The experiment shows that our method has improved the transferability by a large margin under a similar sparsity setting compared with state-of-the-art methods.

Adversarial Attack Quantization

Semantic Prompt for Few-Shot Image Recognition

1 code implementation CVPR 2023 Wentao Chen, Chenyang Si, Zhang Zhang, Liang Wang, Zilei Wang, Tieniu Tan

Instead of the naive exploitation of semantic information for remedying classifiers, we explore leveraging semantic information as prompts to tune the visual feature extraction network adaptively.

Few-Shot Learning

Pointly-Supervised Panoptic Segmentation

1 code implementation25 Oct 2022 Junsong Fan, Zhaoxiang Zhang, Tieniu Tan

In this paper, we propose a new approach to applying point-level annotations for weakly-supervised panoptic segmentation.

Panoptic Segmentation Segmentation +3

3D Shape Temporal Aggregation for Video-Based Clothing-Change Person Re-Identication

1 code implementation Asian Conference on Computer Vision 2023 Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan

However, existing Re-ID methods usually generate 3D body shapes without considering identity modeling, which severely weakens the discriminability of 3D human shapes.

3D Shape Generation Person Re-Identification

IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis

3 code implementations NeurIPS 2018 Huaibo Huang, Zhihang Li, Ran He, Zhenan Sun, Tieniu Tan

On the other hand, the inference model is encouraged to classify between the generated and real samples while the generator tries to fool it as GANs.

Image Generation

GraphAIR: Graph Representation Learning with Neighborhood Aggregation and Interaction

1 code implementation5 Nov 2019 Fenyu Hu, Yanqiao Zhu, Shu Wu, Weiran Huang, Liang Wang, Tieniu Tan

Then, in order to better capture the complicated non-linearity of graph data, we present a novel GraphAIR framework which models the neighborhood interaction in addition to neighborhood aggregation.

Community Detection General Classification +3

AdaptGuard: Defending Against Universal Attacks for Model Adaptation

1 code implementation ICCV 2023 Lijun Sheng, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

To address this issue, we propose a model preprocessing framework, named AdaptGuard, to improve the security of model adaptation algorithms.

Knowledge Distillation Transfer Learning

DeepFirearm: Learning Discriminative Feature Representation for Fine-grained Firearm Retrieval

1 code implementation8 Jun 2018 Jiedong Hao, Jing Dong, Wei Wang, Tieniu Tan

There are great demands for automatically regulating inappropriate appearance of shocking firearm images in social media or identifying firearm types in forensics.

Image Retrieval Retrieval

Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models

1 code implementation18 Feb 2024 Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan

In this work, we adopt the intuition that the LVLM tends to respond logically consistently for existent objects but inconsistently for hallucinated objects.

Hallucination Object

Learning Pose-invariant 3D Object Reconstruction from Single-view Images

1 code implementation3 Apr 2020 Bo Peng, Wei Wang, Jing Dong, Tieniu Tan

Learning to reconstruct 3D shapes using 2D images is an active research topic, with benefits of not requiring expensive 3D data.

3D Object Reconstruction Domain Adaptation

Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations

1 code implementation16 Jul 2022 Wentao Chen, Zhang Zhang, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan

Different from previous cross-domain FSL work (CD-FSL) that considers the domain shift between base and novel classes, the new problem, termed cross-domain cross-set FSL (CDSC-FSL), requires few-shot learners not only to adapt to the new domain, but also to be consistent between different domains within each novel class.

Few-Shot Learning

Exploiting Semantic Attributes for Transductive Zero-Shot Learning

1 code implementation17 Mar 2023 Zhengbo Wang, Jian Liang, Zilei Wang, Tieniu Tan

To address this issue, we present a novel transductive ZSL method that produces semantic attributes of the unseen data and imposes them on the generative process.

Attribute Generative Adversarial Network +1

What Is the Best Practice for CNNs Applied to Visual Instance Retrieval?

1 code implementation5 Nov 2016 Jiedong Hao, Jing Dong, Wei Wang, Tieniu Tan

Based on the evaluation results, we also identify the best choices for different factors and propose a new multi-scale image feature representation method to encode the image effectively.

Image Retrieval Retrieval

Geometry Guided Adversarial Facial Expression Synthesis

no code implementations10 Dec 2017 Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan

An expression invariant face recognition experiment is also performed to further show the advantages of our proposed method.

Face Recognition Face Transfer +2

Deep Supervised Discrete Hashing

no code implementations NeurIPS 2017 Qi Li, Zhenan Sun, Ran He, Tieniu Tan

Benefit from recent advances in deep learning, deep hashing methods have achieved promising results for image retrieval.

Deep Hashing General Classification +1

Anti-Makeup: Learning A Bi-Level Adversarial Network for Makeup-Invariant Face Verification

no code implementations12 Sep 2017 Yi Li, Lingxiao Song, Xiang Wu, Ran He, Tieniu Tan

This paper proposes a learning from generation approach for makeup-invariant face verification by introducing a bi-level adversarial network (BLAN).

Face Verification

Coupled Deep Learning for Heterogeneous Face Recognition

no code implementations8 Apr 2017 Xiang Wu, Lingxiao Song, Ran He, Tieniu Tan

CDL seeks a shared feature space in which the heterogeneous face matching problem can be approximately treated as a homogeneous face matching problem.

Face Recognition Heterogeneous Face Recognition

Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition

no code implementations8 Aug 2017 Ran He, Xiang Wu, Zhenan Sun, Tieniu Tan

To avoid the over-fitting problem on small-scale heterogeneous face data, a correlation prior is introduced on the fully-connected layers of WCNN network to reduce parameter space.

Face Recognition Heterogeneous Face Recognition

Multimodal Memory Modelling for Video Captioning

no code implementations17 Nov 2016 Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

In this paper, we propose a Multimodal Memory Model (M3) to describe videos, which builds a visual and textual shared memory to model the long-term visual-textual dependency and further guide global visual attention on described targets.

Sentence Video Captioning

DeMeshNet: Blind Face Inpainting for Deep MeshFace Verification

no code implementations16 Nov 2016 Shu Zhang, Ran He, Tieniu Tan

The occlusions incurred by random meshes severely degenerate the performance of face verification systems, which raises the MeshFace verification problem between MeshFace and daily photos.

Face Alignment Face Verification +1

ICE: Information Credibility Evaluation on Social Media via Representation Learning

no code implementations29 Sep 2016 Qiang Liu, Shu Wu, Feng Yu, Liang Wang, Tieniu Tan

In this paper, we propose a novel representation learning method, Information Credibility Evaluation (ICE), to learn representations of information credibility on social media.

Feature Engineering Representation Learning

Learning Structured Ordinal Measures for Video based Face Recognition

no code implementations9 Jul 2015 Ran He, Tieniu Tan, Larry Davis, Zhenan Sun

This paper presents a structured ordinal measure method for video-based face recognition that simultaneously learns ordinal filters and structured ordinal features.

Face Recognition

Accelerating Deep Neural Networks with Spatial Bottleneck Modules

no code implementations7 Sep 2018 Junran Peng, Lingxi Xie, Zhao-Xiang Zhang, Tieniu Tan, Jingdong Wang

This paper presents an efficient module named spatial bottleneck for accelerating the convolutional layers in deep neural networks.

Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search

no code implementations22 Sep 2018 Ya Jing, Chenyang Si, Jun-Bo Wang, Wei Wang, Liang Wang, Tieniu Tan

To exploit the multilevel corresponding visual contents, we propose a pose-guided multi-granularity attention network (PMA).

Person Search Sentence +1

Relevance Topic Model for Unstructured Social Group Activity Recognition

no code implementations NeurIPS 2013 Fang Zhao, Yongzhen Huang, Liang Wang, Tieniu Tan

Unstructured social group activity recognition in web videos is a challenging task due to 1) the semantic gap between class labels and low-level visual features and 2) the lack of labeled training data.

Attribute Group Activity Recognition +1

Multistage Adversarial Losses for Pose-Based Human Image Synthesis

no code implementations CVPR 2018 Chenyang Si, Wei Wang, Liang Wang, Tieniu Tan

Human image synthesis has extensive practical applications e. g. person re-identification and data augmentation for human pose estimation.

Data Augmentation Image Generation +2

M3: Multimodal Memory Modelling for Video Captioning

no code implementations CVPR 2018 Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan

Inspired by the facts that memory modelling poses potential advantages to long-term sequential problems [35] and working memory is the key factor of visual attention [33], we propose a Multimodal Memory Model (M3) to describe videos, which builds a visual and textual shared memory to model the long-term visual-textual dependency and further guide visual attention on described visual targets to solve visual-textual alignments.

Sentence Video Captioning

End-to-end View Synthesis for Light Field Imaging with Pseudo 4DCNN

no code implementations ECCV 2018 Yunlong Wang, Fei Liu, Zilei Wang, Guangqi Hou, Zhenan Sun, Tieniu Tan

Limited angular resolution has become the main bottleneck of microlens-based plenoptic cameras towards practical vision applications.

Computational Efficiency Depth Estimation

ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering

no code implementations CVPR 2016 Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li

For spectral embedding/clustering, it is still an open problem on how to construct an relation graph to reflect the intrinsic structures in data.

Clustering graph construction +5

Cross-spectral Face Completion for NIR-VIS Heterogeneous Face Recognition

no code implementations10 Feb 2019 Ran He, Jie Cao, Lingxiao Song, Zhenan Sun, Tieniu Tan

This paper models high resolution heterogeneous face synthesis as a complementary combination of two components, a texture inpainting component and pose correction component.

Face Generation Face Recognition +3

Fast Supervised Discrete Hashing

no code implementations7 Apr 2019 Jie Gui, Tongliang Liu, Zhenan Sun, DaCheng Tao, Tieniu Tan

Rather than adopting this method, FSDH uses a very simple yet effective regression of the class labels of training examples to the corresponding hash code to accelerate the algorithm.

regression

Progressive Cluster Purification for Transductive Few-shot Learning

no code implementations10 Jun 2019 Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, Tieniu Tan

Furthermore, the inter-class classification and the intra-class transduction are extremely flexible to be repeated several times to progressively purify the clusters.

Few-Shot Learning General Classification

Efficient Neural Architecture Transformation Searchin Channel-Level for Object Detection

no code implementations5 Sep 2019 Junran Peng, Ming Sun, Zhao-Xiang Zhang, Tieniu Tan, Junjie Yan

With the combination of these two designs, an architecture transformation scheme could be discovered to adapt a network designed for image classification to task of object detection.

Image Classification Neural Architecture Search +3

POD: Practical Object Detection with Scale-Sensitive Network

no code implementations ICCV 2019 Junran Peng, Ming Sun, Zhao-Xiang Zhang, Tieniu Tan, Junjie Yan

Scale-sensitive object detection remains a challenging task, where most of the existing methods could not learn it explicitly and are not robust to scale variance.

Object object-detection +1

A3GAN: An Attribute-aware Attentive Generative Adversarial Network for Face Aging

no code implementations15 Nov 2019 Yunfan Liu, Qi Li, Zhenan Sun, Tieniu Tan

Face aging, which aims at aesthetically rendering a given face to predict its future appearance, has received significant research attention in recent years.

Attribute Generative Adversarial Network

A New Ensemble Method for Concessively Targeted Multi-model Attack

no code implementations19 Dec 2019 Ziwen He, Wei Wang, Xinsheng Xuan, Jing Dong, Tieniu Tan

Thus, in this paper, we propose a new attack mechanism which performs the non-targeted attack when the targeted attack fails.

Image Classification

Temporal Sparse Adversarial Attack on Sequence-based Gait Recognition

no code implementations22 Feb 2020 Ziwen He, Wei Wang, Jing Dong, Tieniu Tan

In this paper, we demonstrate that the state-of-the-art gait recognition model is vulnerable to such attacks.

Adversarial Attack Gait Recognition +1

Cosmetic-Aware Makeup Cleanser

no code implementations20 Apr 2020 Yi Li, Huaibo Huang, Junchi Yu, Ran He, Tieniu Tan

Face verification aims at determining whether a pair of face images belongs to the same identity.

Face Parsing Face Verification +1

TFNet: Multi-Semantic Feature Interaction for CTR Prediction

no code implementations29 Jun 2020 Shu Wu, Feng Yu, Xueli Yu, Qiang Liu, Liang Wang, Tieniu Tan, Jie Shao, Fan Huang

The CTR (Click-Through Rate) prediction plays a central role in the domain of computational advertising and recommender systems.

Click-Through Rate Prediction Recommendation Systems

Employing Multi-Estimations for Weakly-Supervised Semantic Segmentation

no code implementations ECCV 2020 Junsong Fan, Zhao-Xiang Zhang, Tieniu Tan

Instead of struggling to refine a single seed, we propose a novel approach to alleviate the inaccurate seed problem by leveraging the segmentation model's robustness to learn from multiple seeds.

Segmentation Weakly supervised Semantic Segmentation +1

Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification

no code implementations ECCV 2020 Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan

In this paper, we propose a novel Prediction, Recovery and Identification (PRI) model for LR re-id, which adaptively recovers missing details by predicting a preferable scale factor based on the image content.

Person Re-Identification Super-Resolution

Style Intervention: How to Achieve Spatial Disentanglement with Style-based Generators?

no code implementations19 Nov 2020 Yunfan Liu, Qi Li, Zhenan Sun, Tieniu Tan

Generative Adversarial Networks (GANs) with style-based generators (e. g. StyleGAN) successfully enable semantic control over image synthesis, and recent studies have also revealed that interpretable image translations could be obtained by modifying the latent code.

Attribute Disentanglement +2

Efficient Human Pose Estimation by Learning Deeply Aggregated Representations

no code implementations13 Dec 2020 Zhengxiong Luo, Zhicheng Wang, Yuanhao Cai, GuanAn Wang, Yan Huang, Liang Wang, Erjin Zhou, Tieniu Tan, Jian Sun

Instead, we focus on exploiting multi-scale information from layers with different receptive-field sizes and then making full of use this information by improving the fusion method.

Pose Estimation

Focal and Efficient IOU Loss for Accurate Bounding Box Regression

no code implementations20 Jan 2021 Yi-Fan Zhang, Weiqiang Ren, Zhang Zhang, Zhen Jia, Liang Wang, Tieniu Tan

(ii) Most of the loss functions ignore the imbalance problem in BBR that the large number of anchor boxes which have small overlaps with the target boxes contribute most to the optimization of BBR.

object-detection Object Detection +2

Graph Classification by Mixture of Diverse Experts

no code implementations29 Mar 2021 Fenyu Hu, Liping Wang, Shu Wu, Liang Wang, Tieniu Tan

Graph classification is a challenging research problem in many applications across a broad range of domains.

General Classification Graph Classification

SOGAN: 3D-Aware Shadow and Occlusion Robust GAN for Makeup Transfer

no code implementations21 Apr 2021 Yueming Lyu, Jing Dong, Bo Peng, Wei Wang, Tieniu Tan

Since human faces are symmetrical in the UV space, we can conveniently remove the undesired shadow and occlusion from the reference image by carefully designing a Flip Attention Module (FAM).

Face Model Facial Makeup Transfer

Robust Face-Swap Detection Based on 3D Facial Shape Information

no code implementations28 Apr 2021 Weinan Guan, Wei Wang, Jing Dong, Bo Peng, Tieniu Tan

Maliciously-manipulated images or videos - so-called deep fakes - especially face-swap images and videos have attracted more and more malicious attackers to discredit some key figures.

Adaptive Dilated Convolution For Human Pose Estimation

no code implementations22 Jul 2021 Zhengxiong Luo, Zhicheng Wang, Yan Huang, Liang Wang, Tieniu Tan, Erjin Zhou

It can generate and fuse multi-scale features of the same spatial sizes by setting different dilation rates for different channels.

Pose Estimation

Generalizable Person Re-identification Without Demographics

no code implementations29 Sep 2021 Yifan Zhang, Feng Li, Zhang Zhang, Liang Wang, DaCheng Tao, Tieniu Tan

However, the convex condition of KL DRO may not hold for overparameterized neural networks, such that applying KL DRO often fails to generalize under distribution shifts in real scenarios.

Generalizable Person Re-identification

Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption

no code implementations1 Mar 2022 Ke Han, Chenyang Si, Yan Huang, Liang Wang, Tieniu Tan

In this paper, we investigate the generalization problem of person re-identification (re-id), whose major challenge is the distribution shift on an unseen domain.

Generalizable Person Re-identification

Disentangled Federated Learning for Tackling Attributes Skew via Invariant Aggregation and Diversity Transferring

no code implementations14 Jun 2022 Zhengquan Luo, Yunlong Wang, Zilei Wang, Zhenan Sun, Tieniu Tan

Attributes skew hinders the current federated learning (FL) frameworks from consistent optimization directions among the clients, which inevitably leads to performance reduction and unstable convergence.

Federated Learning valid

Semantic-aware One-shot Face Re-enactment with Dense Correspondence Estimation

no code implementations23 Nov 2022 Yunfan Liu, Qi Li, Zhenan Sun, Tieniu Tan

One-shot face re-enactment is a challenging task due to the identity mismatch between source and driving faces.

Disentanglement Generative Adversarial Network

Human Image Generation: A Comprehensive Survey

no code implementations17 Dec 2022 Zhen Jia, Zhang Zhang, Liang Wang, Tieniu Tan

Image and video synthesis has become a blooming topic in computer vision and machine learning communities along with the developments of deep generative models, due to its great academic and application value.

Data Augmentation Image Generation +2

CFFT-GAN: Cross-domain Feature Fusion Transformer for Exemplar-based Image Translation

no code implementations3 Feb 2023 Tianxiang Ma, Bingchuan Li, Wei Liu, Miao Hua, Jing Dong, Tieniu Tan

In this paper, we propose a more general learning approach by considering two domain features as a whole and learning both inter-domain correspondence and intra-domain potential information interactions.

Translation

Clothing-Change Feature Augmentation for Person Re-Identification

no code implementations CVPR 2023 Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan

Specifically, to formulate meaningful clothing variations in the feature space, our method first estimates a clothing-change normal distribution with intra-ID cross-clothing variances.

Person Re-Identification

GaFET: Learning Geometry-aware Facial Expression Translation from In-The-Wild Images

no code implementations ICCV 2023 Tianxiang Ma, Bingchuan Li, Qian He, Jing Dong, Tieniu Tan

In this paper, we introduce a novel Geometry-aware Facial Expression Translation (GaFET) framework, which is based on parametric 3D facial representations and can stably decoupled expression.

Facial Expression Translation

Towards Realistic Unsupervised Fine-tuning with CLIP

no code implementations24 Aug 2023 Jian Liang, Lijun Sheng, Zhengbo Wang, Ran He, Tieniu Tan

The emergence of vision-language models (VLMs), such as CLIP, has spurred a significant research effort towards their application for downstream supervised learning tasks.

Out-of-Distribution Detection

MSRA-SR: Image Super-resolution Transformer with Multi-scale Shared Representation Acquisition

no code implementations ICCV 2023 Xiaoqiang Zhou, Huaibo Huang, Ran He, Zilei Wang, Jie Hu, Tieniu Tan

In particular, self-attention with cross-scale matching and convolution filters with different kernel sizes are designed to exploit the multi-scale features in images.

Image Super-Resolution

Model-free Test Time Adaptation for Out-Of-Distribution Detection

no code implementations28 Nov 2023 Yifan Zhang, Xue Wang, Tian Zhou, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

We demonstrate the effectiveness of \abbr through comprehensive experiments on multiple OOD detection benchmarks, extensive empirical studies show that \abbr significantly improves the performance of OOD detection over state-of-the-art methods.

Decision Making Out-of-Distribution Detection +2

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

no code implementations18 Dec 2023 Dongze Li, Kang Zhao, Wei Wang, Bo Peng, Yingya Zhang, Jing Dong, Tieniu Tan

Audio-driven talking head synthesis is a promising topic with wide applications in digital human, film making and virtual reality.

Talking Head Generation

Assaying on the Robustness of Zero-Shot Machine-Generated Text Detectors

1 code implementation20 Dec 2023 Yi-Fan Zhang, Zhang Zhang, Liang Wang, Tieniu Tan, Rong Jin

In an effort to address these issues, we delve into the realm of zero-shot machine-generated text detection.

Binary Classification Text Detection +1

GraphDIVE: Graph Classification by Mixture of Diverse Experts

1 code implementation journal 2021 Fenyu Hu, Liping Wang, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

Graph classification is a challenging research problem in many applications across a broad range of domains.

Graph Classification

Not all Minorities are Equal: Empty-Class-Aware Distillation for Heterogeneous Federated Learning

no code implementations4 Jan 2024 Kuangpu Guo, Yuhe Ding, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

Data heterogeneity, characterized by disparities in local data distribution across clients, poses a significant challenge in federated learning.

Federated Learning Knowledge Distillation

Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models

no code implementations6 Feb 2024 Zhengbo Wang, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

This paper proposes a \textbf{C}ollabo\textbf{ra}tive \textbf{F}ine-\textbf{T}uning (\textbf{CraFT}) approach for fine-tuning black-box VLMs to downstream tasks, where one only has access to the input prompts and the output predictions of the model.

KEBench: A Benchmark on Knowledge Editing for Large Vision-Language Models

no code implementations12 Mar 2024 Han Huang, Haitian Zhong, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan

We conducted experiments of different editing methods on five LVLMs, and thoroughly analyze how these methods impact the models.

knowledge editing

Artifact Feature Purification for Cross-domain Detection of AI-generated Images

no code implementations17 Mar 2024 Zheling Meng, Bo Peng, Jing Dong, Tieniu Tan

We also find that the artifact features APN focuses on across generators and scenes are global and diverse.

Mutual Information Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.