Unleashing Text-to-Image Diffusion Models for Visual Perception

3D Object Detection Object +2

3D Small Object Detection with Dynamic Spatial Pruning

1 code implementation • 5 May 2023 • Xiuwei Xu, Zhihao Sun, Ziwei Wang, Hongmin Liu, Jie Zhou, Jiwen Lu

Specifically, we theoretically derive a dynamic spatial pruning (DSP) strategy to prune the redundant spatial representation of the 3D scene in a cascade manner according to the distribution of objects.

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

1 code implementation • 19 Mar 2024 • Zuyan Liu, Yuhao Dong, Yongming Rao, Jie Zhou, Jiwen Lu

In the realm of vision-language understanding, the proficiency of models in interpreting and reasoning over visual content has become a cornerstone for numerous applications.

Ranked #42 on Visual Question Answering on MM-Vet

visual instruction following Visual Question Answering

PCANet: A Simple Deep Learning Baseline for Image Classification?

2 code implementations • 14 Apr 2014 • Tsung-Han Chan, Kui Jia, Shenghua Gao, Jiwen Lu, Zinan Zeng, Yi Ma

In this work, we propose a very simple deep learning network for image classification which comprises only the very basic data processing components: cascaded principal component analysis (PCA), binary hashing, and block-wise histograms.
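
The last two components named above, binary hashing and block-wise histograms, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the toy response maps, the zero threshold, and the non-overlapping block layout are all assumptions made for illustration, and the cascaded PCA filter stage is replaced by hard-coded inputs.

```python
def binary_hash(filter_maps):
    """Combine L real-valued maps of equal shape into one integer-coded map.

    Each map is binarized (1 if value > 0, else 0) and used as one bit,
    so every code lies in the range [0, 2**L - 1].
    """
    rows, cols = len(filter_maps[0]), len(filter_maps[0][0])
    coded = [[0] * cols for _ in range(rows)]
    for bit, fmap in enumerate(filter_maps):
        for r in range(rows):
            for c in range(cols):
                if fmap[r][c] > 0:
                    coded[r][c] += 1 << bit
    return coded


def blockwise_histograms(coded, block, num_bins):
    """Histogram the codes of each non-overlapping block and concatenate."""
    features = []
    for r0 in range(0, len(coded) - block + 1, block):
        for c0 in range(0, len(coded[0]) - block + 1, block):
            hist = [0] * num_bins
            for r in range(r0, r0 + block):
                for c in range(c0, c0 + block):
                    hist[coded[r][c]] += 1
            features.extend(hist)
    return features


# Hypothetical 2x4 response maps standing in for the PCA filter outputs.
maps = [
    [[0.5, -1.0, 0.2, -0.3], [1.5, -0.1, -2.0, 0.7]],
    [[-0.4, -0.6, 0.9, 0.1], [0.3, 0.8, 0.5, -0.2]],
]
coded = binary_hash(maps)
features = blockwise_histograms(coded, block=2, num_bins=4)
```

With two maps the codes take four possible values, so each 2x2 block contributes a four-bin histogram to the final feature vector.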

Ranked #46 on Image Classification on MNIST

Classification Face Recognition +5

DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion

1 code implementation • CVPR 2023 • Wenliang Zhao, Yongming Rao, Weikang Shi, Zuyan Liu, Jie Zhou, Jiwen Lu

Unlike previous work that relies on carefully designed network architectures and loss functions to fuse the information from the source and target faces, we reformulate face swapping as a conditional inpainting task, performed by a powerful diffusion model guided by the desired face attributes (e.g., identity and landmarks).

Face Swapping

Human Trajectory Prediction via Counterfactual Analysis

1 code implementation • ICCV 2021 • Guangyi Chen, Junlong Li, Jiwen Lu, Jie Zhou

Most existing methods learn to predict future trajectories by behavior clues from history trajectories and interaction clues from environments.

Autonomous Vehicles counterfactual +1

LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images

1 code implementation • 11 Apr 2023 • Hui Li, Tianyang Xu, Xiao-Jun Wu, Jiwen Lu, Josef Kittler

In particular we adopt a learnable representation approach to the fusion task, in which the construction of the fusion network architecture is guided by the optimisation algorithm producing the learnable model.

Representation Learning

PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds

1 code implementation • CVPR 2021 • Yi Wei, Ziyi Wang, Yongming Rao, Jiwen Lu, Jie Zhou

In this paper, we propose a Point-Voxel Recurrent All-Pairs Field Transforms (PV-RAFT) method to estimate scene flow from point clouds.

Scene Flow Estimation

Group-aware Contrastive Regression for Action Quality Assessment

1 code implementation • ICCV 2021 • Xumin Yu, Yongming Rao, Wenliang Zhao, Jiwen Lu, Jie Zhou

Assessing action quality is challenging due to the subtle differences between videos and large variations in scores.

Ranked #2 on Action Quality Assessment on MTL-AQA

Action Quality Assessment regression

MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

1 code implementation • NeurIPS 2023 • Yinan Liang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie Zhou, Jiwen Lu

Due to the high price and heavy energy consumption of GPUs, deploying deep models on IoT devices such as microcontrollers makes significant contributions for ecological AI.

Image Classification

Cross-Modal Adapter for Text-Video Retrieval

1 code implementation • 17 Nov 2022 • Haojun Jiang, Jianke Zhang, Rui Huang, Chunjiang Ge, Zanlin Ni, Jiwen Lu, Jie Zhou, Shiji Song, Gao Huang

However, as pre-trained models are scaling up, fully fine-tuning them on text-video retrieval datasets has a high risk of overfitting.

Retrieval Video Retrieval

Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution

1 code implementation • 26 Jul 2022 • Cheng Ma, Jingyi Zhang, Jie Zhou, Jiwen Lu

On the other hand, we propose a parallel network which includes two branches of cascaded lookup tables which process different components of the input low-resolution images.

Image Super-Resolution

Person Re-identification via Attention Pyramid

1 code implementation • 11 Aug 2021 • Guangyi Chen, Tianpei Gu, Jiwen Lu, Jin-An Bao, Jie Zhou

Experimental results demonstrate the superiority of our method, which outperforms the state-of-the-art methods by a large margin with limited computational cost.

Ranked #21 on Person Re-Identification on MSMT17

3D Instance Segmentation 3D Semantic Segmentation +1

SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation

1 code implementation • 18 Dec 2020 • An Tao, Yueqi Duan, Yi Wei, Jiwen Lu, Jie Zhou

Most existing point cloud instance and semantic segmentation methods rely heavily on strong supervision signals, which require point-level labels for every point in the scene.

A Simple Baseline for Multi-Camera 3D Object Detection

1 code implementation • 22 Aug 2022 • Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Jie Zhou, Jiwen Lu

First, we extract multi-scale features and generate the perspective object proposals on each monocular image.

Autonomous Driving Monocular 3D Object Detection +2

SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation

1 code implementation • CVPR 2022 • Ziyi Wang, Yongming Rao, Xumin Yu, Jie Zhou, Jiwen Lu

Conventional point cloud semantic segmentation methods usually employ an encoder-decoder architecture, where mid-level features are locally aggregated to extract geometric information.

Image Segmentation Point Cloud Segmentation +2

Introspective Deep Metric Learning for Image Retrieval

2 code implementations • 9 May 2022 • Wenzhao Zheng, Chengkun Wang, Jie Zhou, Jiwen Lu

This paper proposes an introspective deep metric learning (IDML) framework for uncertainty-aware comparisons of images.

Image Classification Image Retrieval +2

Introspective Deep Metric Learning

2 code implementations • 11 Sep 2023 • Chengkun Wang, Wenzhao Zheng, Zheng Zhu, Jie Zhou, Jiwen Lu

This paper proposes an introspective deep metric learning (IDML) framework for uncertainty-aware comparisons of images.

Image Retrieval Metric Learning

Deep Relational Metric Learning

1 code implementation • ICCV 2021 • Wenzhao Zheng, Borui Zhang, Jiwen Lu, Jie Zhou

This paper presents a deep relational metric learning (DRML) framework for image clustering and retrieval.

Image Clustering Metric Learning +1

FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle Detection

1 code implementation • 17 May 2021 • Yi Wei, Shang Su, Jiwen Lu, Jie Zhou

To tackle this problem, we propose frustum-aware geometric reasoning (FGR) to detect vehicles in point clouds without any 3D annotations.

3D Object Detection object-detection

Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation

1 code implementation • 17 Oct 2021 • Yinghuan Shi, Jian Zhang, Tong Ling, Jiwen Lu, Yefeng Zheng, Qian Yu, Lei Qi, Yang Gao

In semi-supervised medical image segmentation, most previous works draw on the common assumption that higher entropy means higher uncertainty.

Image Segmentation Segmentation +2

Uncertainty-aware Score Distribution Learning for Action Quality Assessment

1 code implementation • CVPR 2020 • Yansong Tang, Zanlin Ni, Jiahuan Zhou, Danyang Zhang, Jiwen Lu, Ying Wu, Jie Zhou

Assessing action quality from videos has attracted growing attention in recent years.

Ranked #4 on Action Quality Assessment on AQA-7

Action Quality Assessment

RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection

2 code implementations • ICCV 2021 • Yongming Rao, Benlin Liu, Yi Wei, Jiwen Lu, Cho-Jui Hsieh, Jie Zhou

In particular, we propose to generate random layouts of a scene by making use of the objects in the synthetic CAD dataset and learn the 3D scene representation by applying object-level contrastive learning on two random scenes generated from the same set of synthetic objects.

3D Object Detection Contrastive Learning +3

Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement

2 code implementations • CVPR 2022 • Xiuwei Xu, Yifan Wang, Yu Zheng, Yongming Rao, Jie Zhou, Jiwen Lu

In this paper, we propose a weakly-supervised approach for 3D object detection, which makes it possible to train a strong 3D detector with position-level annotations (i.e., annotations of object centers).

3D Object Detection Domain Adaptation +3

Attributable Visual Similarity Learning

1 code implementation • CVPR 2022 • Borui Zhang, Wenzhao Zheng, Jie Zhou, Jiwen Lu

This paper proposes an attributable visual similarity learning (AVSL) framework for a more accurate and explainable similarity measure between images.

Ranked #3 on Metric Learning on CARS196 (using extra training data)

Metric Learning Semantic Similarity +1

Learning Probabilistic Ordinal Embeddings for Uncertainty-Aware Regression

1 code implementation • CVPR 2021 • Wanhua Li, Xiaoke Huang, Jiwen Lu, Jianjiang Feng, Jie Zhou

An ordinal distribution constraint is proposed to exploit the ordinal nature of regression.

Ranked #2 on Age Estimation on Adience

Aesthetics Quality Assessment Age And Gender Classification +3

Personalized Trajectory Prediction via Distribution Discrimination

1 code implementation • ICCV 2021 • Guangyi Chen, Junlong Li, Nuoxing Zhou, Liangliang Ren, Jiwen Lu

In this paper, we present a distribution discrimination (DisDis) method to predict personalized motion patterns by distinguishing the potential distributions.

Trajectory Prediction

OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression

1 code implementation • 6 Jun 2022 • Wanhua Li, Xiaoke Huang, Zheng Zhu, Yansong Tang, Xiu Li, Jie Zhou, Jiwen Lu

In this paper, we propose to learn the rank concepts from the rich semantic CLIP latent space.

Ranked #1 on Few-shot Age Estimation on MORPH Album2

Aesthetics Quality Assessment Few-shot Age Estimation +4

OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions

1 code implementation • ICCV 2023 • Chengkun Wang, Wenzhao Zheng, Zheng Zhu, Jie Zhou, Jiwen Lu

The pretrain-finetune paradigm in modern computer vision facilitates the success of self-supervised learning, which tends to achieve better transferability than supervised learning.

Image Classification object-detection +3

Self-Supervised Video Hashing via Bidirectional Transformers

1 code implementation • CVPR 2021 • Shuyan Li, Xiu Li, Jiwen Lu, Jie Zhou

Most existing unsupervised video hashing methods are built on unidirectional models with less reliable training objectives, which underuse the correlations among frames and the similarity structure between videos.

Retrieval Video Retrieval

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

1 code implementation • ICCV 2023 • Ziyi Wang, Xumin Yu, Yongming Rao, Jie Zhou, Jiwen Lu

In this paper, we propose a novel 3D-to-2D generative pre-training method that is adaptable to any point cloud model.

Ranked #6 on 3D Part Segmentation on ShapeNet-Part

3D Part Segmentation 3D Point Cloud Classification

Label2Label: A Language Modeling Framework for Multi-Attribute Learning

1 code implementation • 18 Jul 2022 • Wanhua Li, Zhexuan Cao, Jianjiang Feng, Jie Zhou, Jiwen Lu

As each sample is annotated with multiple attribute labels, these "words" will naturally form an unordered but meaningful "sentence", which depicts the semantic information of the corresponding sample.

Ranked #1 on Clothing Attribute Recognition on Clothing Attributes Dataset

Attribute Clothing Attribute Recognition +4

Generalizable Mixed-Precision Quantization via Attribution Rank Preservation

1 code implementation • ICCV 2021 • Ziwei Wang, Han Xiao, Jiwen Lu, Jie Zhou

On the contrary, our GMPQ searches the mixed-quantization policy that can be generalized to large-scale datasets with only a small amount of data, so that the search cost is significantly reduced without performance degradation.

Quantization

Efficient Meshy Neural Fields for Animatable Human Avatars

1 code implementation • 23 Mar 2023 • Xiaoke Huang, Yiji Cheng, Yansong Tang, Xiu Li, Jie Zhou, Jiwen Lu

Moreover, only minutes of optimization is enough for plausible reconstruction results.

Disentanglement Inverse Rendering

Similarity-Aware Fusion Network for 3D Semantic Segmentation

1 code implementation • 4 Jul 2021 • Linqing Zhao, Jiwen Lu, Jie Zhou

To address this, we employ a late fusion strategy where we first learn the geometric and contextual similarities between the input and back-projected (from 2D pixels) point clouds and utilize them to guide the fusion of two modalities to further exploit complementary information.

Ranked #21 on Semantic Segmentation on ScanNet

3D Semantic Segmentation

Instance Similarity Learning for Unsupervised Feature Representation

1 code implementation • ICCV 2021 • Ziwei Wang, Yunsong Wang, Ziyi Wu, Jiwen Lu, Jie Zhou

In this paper, we propose an instance similarity learning (ISL) method for unsupervised feature representation.

Image Classification Semantic Similarity +1

Shapley-NAS: Discovering Operation Contribution for Neural Architecture Search

1 code implementation • CVPR 2022 • Han Xiao, Ziwei Wang, Zheng Zhu, Jie Zhou, Jiwen Lu

Differentiable architecture search (DARTS) acquires the optimal architectures by optimizing the architecture parameters with gradient descent, which significantly reduces the search cost.

Ranked #1 on Neural Architecture Search on NAS-Bench-201, CIFAR-100

Neural Architecture Search

Token-Label Alignment for Vision Transformers

1 code implementation • ICCV 2023 • Han Xiao, Wenzhao Zheng, Zheng Zhu, Jie Zhou, Jiwen Lu

Data mixing strategies (e.g., CutMix) have shown the ability to greatly improve the performance of convolutional neural networks (CNNs).

Image Classification Semantic Segmentation +1

Deep Compositional Metric Learning

1 code implementation • CVPR 2021 • Wenzhao Zheng, Chengkun Wang, Jiwen Lu, Jie Zhou

In this paper, we propose a deep compositional metric learning (DCML) framework for effective and generalizable similarity measurement between images.

Relation Relational Reasoning +1

Graph-Based Social Relation Reasoning

1 code implementation • ECCV 2020 • Wanhua Li, Yueqi Duan, Jiwen Lu, Jianjiang Feng, Jie Zhou

Human beings are fundamentally sociable: we generally organize our social lives in terms of relations with other people.

Ranked #1 on Visual Social Relationship Recognition on PIPA

Deep Factorized Metric Learning

1 code implementation • CVPR 2023 • Chengkun Wang, Wenzhao Zheng, Junlong Li, Jie Zhou, Jiwen Lu

Learning a generalizable and comprehensive similarity metric to depict the semantic discrepancies between images is the foundation of many computer vision tasks.

Image Classification Metric Learning

An Improved Evaluation Framework for Generative Adversarial Networks

1 code implementation • 20 Mar 2018 • Shaohui Liu, Yi Wei, Jiwen Lu, Jie Zhou

Unlike most existing evaluation frameworks which transfer the representation of ImageNet inception model to map images onto the feature space, our framework uses a specialized encoder to acquire fine-grained domain-specific representation.

Diverse Sample Generation: Pushing the Limit of Generative Data-free Quantization

1 code implementation • 1 Sep 2021 • Haotong Qin, Yifu Ding, Xiangguo Zhang, Jiakai Wang, Xianglong Liu, Jiwen Lu

We first give a theoretical analysis that the diversity of synthetic samples is crucial for the data-free quantization, while in existing approaches, the synthetic data completely constrained by BN statistics experimentally exhibit severe homogenization at distribution and sample levels.

Data Free Quantization Image Classification

Narrative Action Evaluation with Prompt-Guided Multimodal Interaction

1 code implementation • 22 Apr 2024 • Shiyi Zhang, Sule Bai, Guangyi Chen, Lei Chen, Jiwen Lu, Junle Wang, Yansong Tang

NAE is a more challenging task because it requires both narrative flexibility and evaluation rigor.

Action Quality Assessment Multi-Task Learning +2

MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer

1 code implementation • 5 Mar 2024 • JianJian Cao, Peng Ye, Shengze Li, Chong Yu, Yansong Tang, Jiwen Lu, Tao Chen

To this end, we propose a novel framework named Multimodal Alignment-Guided Dynamic Token Pruning (MADTP) for accelerating various VLTs.

Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint

1 code implementation • 18 Dec 2022 • Borui Zhang, Wenzhao Zheng, Jie Zhou, Jiwen Lu

Deep learning has revolutionized human society, yet the black-box nature of deep neural networks hinders further application to reliability-demanded industries.

Separable Structure Modeling for Semi-supervised Video Object Segmentation

1 code implementation • 18 Feb 2021 • Wencheng Zhu, Jiahao Li, Jiwen Lu, Jie Zhou

Specifically, we first compute a pixel-wise similarity matrix by using representations of reference and target pixels and then select top-rank reference pixels for target pixel classification.
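
The two steps described above (a pixel-wise similarity matrix, then top-rank reference selection for classifying each target pixel) can be sketched in a few lines of pure Python. This is only an illustration under stated assumptions, not the paper's implementation: cosine similarity, similarity-weighted voting, the value of `k`, and all names and toy feature vectors are hypothetical choices.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_matrix(target_feats, reference_feats):
    """Rows index target pixels, columns index reference pixels."""
    return [[cosine(t, r) for r in reference_feats] for t in target_feats]


def classify_by_top_k(sim, reference_labels, k):
    """Label each target pixel by a similarity-weighted vote of its k best references."""
    predictions = []
    for row in sim:
        top = sorted(range(len(row)), key=lambda j: row[j], reverse=True)[:k]
        votes = {}
        for j in top:
            label = reference_labels[j]
            votes[label] = votes.get(label, 0.0) + row[j]
        predictions.append(max(votes, key=votes.get))
    return predictions


# Toy 2-D features: two object-like and two background-like reference pixels.
reference_feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
reference_labels = [1, 1, 0, 0]  # 1 = object, 0 = background (assumed encoding)
target_feats = [[0.8, 0.2], [0.2, 0.8]]

sim = similarity_matrix(target_feats, reference_feats)
predictions = classify_by_top_k(sim, reference_labels, k=2)
```

Restricting the vote to the top-ranked references keeps weakly related reference pixels from influencing the label of a target pixel.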

Ranked #45 on Semi-Supervised Video Object Segmentation on DAVIS 2017 (test-dev)

Object One-shot visual object segmentation +1

Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition

1 code implementation • 17 Jul 2022 • Yansong Tang, Xingyu Liu, Xumin Yu, Danyang Zhang, Jiwen Lu, Jie Zhou

Different from the conventional adversarial learning-based approaches for UDA, we utilize a self-supervision scheme to reduce the domain shift between two skeleton-based action datasets.

Action Recognition Self-Supervised Learning +2

TCOVIS: Temporally Consistent Online Video Instance Segmentation

1 code implementation • ICCV 2023 • Junlong Li, Bingyao Yu, Yongming Rao, Jie Zhou, Jiwen Lu

The core of our method consists of a global instance assignment strategy and a spatio-temporal enhancement module, which improve the temporal consistency of the features from two aspects.

Instance Segmentation Semantic Segmentation +1

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

1 code implementation • 1 Apr 2024 • Yixuan Zhu, Ao Li, Yansong Tang, Wenliang Zhao, Jie Zhou, Jiwen Lu

The recovery of occluded human meshes presents challenges for current methods due to the difficulty in extracting effective image features under severe occlusion.

Denoising Human Mesh Recovery

MetaAge: Meta-Learning Personalized Age Estimators

1 code implementation • 12 Jul 2022 • Wanhua Li, Jiwen Lu, Abudukelimu Wuerkaixi, Jianjiang Feng, Jie Zhou

Unlike most existing personalized methods that learn the parameters of a personalized estimator for each person in the training set, our method learns the mapping from identity information to age estimator parameters.

Ranked #1 on Age Estimation on ChaLearn 2015

Age Estimation Meta-Learning +1

Dense Hybrid Proposal Modulation for Lane Detection

1 code implementation • 28 Apr 2023 • Yuejian Wu, Linqing Zhao, Jiwen Lu, Haibin Yan

In addition to the shape and location constraints, we design a quality-aware classification loss to adaptively supervise each positive proposal so that the discriminative power can be further boosted.

Lane Detection

Automatic Data Augmentation by Learning the Deterministic Policy

1 code implementation • 18 Oct 2019 • Yinghuan Shi, Tiexin Qin, Yong Liu, Jiwen Lu, Yang Gao, Dinggang Shen

By introducing a unified optimization goal, DeepAugNet intends to combine data augmentation and deep model training in an end-to-end manner, which is realized by simultaneously training a hybrid architecture of a dueling deep Q-learning algorithm and a surrogate deep model.

Data Augmentation Q-Learning

Content-aware Warping for View Synthesis

1 code implementation • 22 Jan 2022 • Mantang Guo, Junhui Hou, Jing Jin, Hui Liu, Huanqiang Zeng, Jiwen Lu

To this end, we propose content-aware warping, which adaptively learns the interpolation weights for pixels of a relatively large neighborhood from their contextual information via a lightweight neural network.

Novel View Synthesis

Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

1 code implementation • 13 Apr 2023 • Ziwei Wang, Jiwen Lu, Han Xiao, Shengyu Liu, Jie Zhou

On the contrary, we obtain the optimal efficient networks by directly optimizing the compression policy with an accurate performance predictor, where ultrafast automated model compression for various computational cost constraints is achieved without complex compression policy search and evaluation.

Image Classification Model Compression +3

Probabilistic Deep Metric Learning for Hyperspectral Image Classification

1 code implementation • 15 Nov 2022 • Chengkun Wang, Wenzhao Zheng, Xian Sun, Jiwen Lu, Jie Zhou

We propose to learn a global probabilistic distribution for each pixel in the patch and a probabilistic metric to model the distance between distributions.

Classification Hyperspectral Image Classification +1

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

1 code implementation • 23 Mar 2024 • Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhang

Recent Vision Transformer Compression (VTC) works mainly follow a two-stage scheme, where the importance score of each model unit is first evaluated or preset in each submodule, followed by the sparsity score evaluation according to the target sparsity constraint.

Dimensionality Reduction

Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network

1 code implementation • 17 Dec 2021 • An Tao, Yueqi Duan, He Wang, Ziyi Wu, Pengliang Ji, Haowen Sun, Jie Zhou, Jiwen Lu

It results in a serious issue of lagged gradient, making the learned attack at the current step ineffective due to the architecture changes afterward.

3D Classification 3D Semantic Segmentation +2

Dynamics-aware Adversarial Attack of Adaptive Neural Networks

1 code implementation • 15 Oct 2022 • An Tao, Yueqi Duan, Yingqi Wang, Jiwen Lu, Jie Zhou

To address this issue, we propose a Leaded Gradient Method (LGM) and show the significant effects of the lagged gradient.

Adversarial Attack Computational Efficiency

Exploring Unified Perspective For Fast Shapley Value Estimation

1 code implementation • 2 Nov 2023 • Borui Zhang, Baotong Tian, Wenzhao Zheng, Jie Zhou, Jiwen Lu

Shapley values have emerged as a widely accepted and trustworthy tool, grounded in theoretical axioms, for addressing challenges posed by black-box models like deep neural networks.

Path Choice Matters for Clear Attribution in Path Methods

1 code implementation • 19 Jan 2024 • Borui Zhang, Wenzhao Zheng, Jie Zhou, Jiwen Lu

Rigorousness and clarity are both essential for interpretations of DNNs to engender human trust.

X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

1 code implementation • 23 Apr 2024 • Shuofeng Sun, Yongming Rao, Jiwen Lu, Haibin Yan

However, we contend that such an implicit high-dimensional structure modeling approach inadequately represents the local geometric structure of point clouds due to the absence of explicit structural information.

Segmentation

Deep Sparse Subspace Clustering

no code implementations • 25 Sep 2017 • Xi Peng, Jiashi Feng, Shijie Xiao, Jiwen Lu, Zhang Yi, Shuicheng Yan

In this paper, we present a deep extension of Sparse Subspace Clustering, termed Deep Sparse Subspace Clustering (DSSC).

Clustering valid

3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds

no code implementations • ICCV 2017 • Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu

Our method provides an automatic process that maps the raw data to the classification results.

Classification General Classification +4

Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition

no code implementations • 6 Apr 2016 • Ziyan Wang, Jiwen Lu, Ruogu Lin, Jianjiang Feng, Jie Zhou

Specifically, we construct a pair of deep convolutional neural networks (CNNs) for the RGB and depth data, and concatenate them at the top layer of the network with a loss function which learns a new feature space where both the correlated part and the individual part of the RGB-D information are well modelled.

Object Object Recognition

Automatic Subspace Learning via Principal Coefficients Embedding

no code implementations • 17 Nov 2014 • Xi Peng, Jiwen Lu, Zhang Yi, Rui Yan

In this paper, we address two challenging problems in unsupervised subspace learning: 1) how to automatically identify the feature dimension of the learned subspace (i.e., automatic subspace learning), and 2) how to learn the underlying subspace in the presence of Gaussian noise (i.e., robust subspace learning).

A Siamese Long Short-Term Memory Architecture for Human Re-Identification

no code implementations • European Conference on Computer Vision 2016 • Rahul Rama Varior, Bing Shuai, Jiwen Lu, Dong Xu, Gang Wang

Matching pedestrians across multiple camera views, known as human re-identification, is a challenging problem in visual surveillance.

Attribute Clothing Attribute Recognition +1

Multi-task CNN Model for Attribute Prediction

no code implementations • 4 Jan 2016 • Abrar H. Abdulnabi, Gang Wang, Jiwen Lu, Kui Jia

Each CNN will generate attribute-specific feature representations, and then we apply multi-task learning on the features to predict their attributes.

Ranked #2 on Clothing Attribute Recognition on Clothing Attributes Dataset

Nonlinear Local Metric Learning for Person Re-identification

no code implementations • 16 Nov 2015 • Siyuan Huang, Jiwen Lu, Jie Zhou, Anil K. Jain

In this paper, we propose a nonlinear local metric learning (NLML) method to improve the state-of-the-art performance of person re-identification on public datasets.

Metric Learning Person Re-Identification

Learning Invariant Color Features for Person Re-Identification

no code implementations • 4 Oct 2014 • Rahul Rama Varior, Gang Wang, Jiwen Lu

We model color feature generation as a learning problem by jointly learning a linear transformation and a dictionary to encode pixel values.

Face Recognition via Globality-Locality Preserving Projections

no code implementations • 6 Nov 2013 • Sheng Huang, Dan Yang, Fei Yang, Yongxin Ge, Xiaohong Zhang, Jiwen Lu

We present an improved Locality Preserving Projections (LPP) method, named Globality-Locality Preserving Projections (GLPP), to preserve both the global and local geometric structures of data.

Runtime Neural Pruning

no code implementations • NeurIPS 2017 • Ji Lin, Yongming Rao, Jiwen Lu, Jie Zhou

In this paper, we propose a Runtime Neural Pruning (RNP) framework which prunes the deep neural network dynamically at the runtime.

Deep Adversarial Metric Learning

no code implementations • CVPR 2018 • Yueqi Duan, Wenzhao Zheng, Xudong Lin, Jiwen Lu, Jie Zhou

Learning an effective distance metric between image pairs plays an important role in visual analysis, where the training procedure largely relies on hard negative samples.

Action Recognition reinforcement-learning +3

Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition

no code implementations • CVPR 2018 • Yansong Tang, Yi Tian, Jiwen Lu, Peiyang Li, Jie Zhou

In this paper, we propose a deep progressive reinforcement learning (DPRL) method for action recognition in skeleton-based videos, which aims to distil the most informative frames and discard ambiguous frames in sequences for recognizing actions.

Ranked #3 on Skeleton Based Action Recognition on UT-Kinect

Learning Globally Optimized Object Detector via Policy Gradient

no code implementations • CVPR 2018 • Yongming Rao, Dahua Lin, Jiwen Lu, Jie Zhou

In this paper, we propose a simple yet effective method to learn a globally optimized detector for object detection, which is a simple modification to the standard cross-entropy gradient inspired by the REINFORCE algorithm.

Object object-detection +1

Deep Hashing via Discrepancy Minimization

no code implementations • CVPR 2018 • Zhixiang Chen, Xin Yuan, Jiwen Lu, Qi Tian, Jie Zhou

This paper presents a discrepancy minimizing model to address the discrete optimization problem in hashing learning.

Deep Hashing

GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning

no code implementations • CVPR 2018 • Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie Zhou

Specifically, we design a deep reinforcement learning model to learn the structure of the graph for bitwise interaction mining, reducing the uncertainty of binary codes by maximizing the mutual information with inputs and related bits, so that the ambiguous bits receive additional instruction from the graph for confident binarization.

Binarization reinforcement-learning +2

Part-Activated Deep Reinforcement Learning for Action Prediction

no code implementations • ECCV 2018 • Lei Chen, Jiwen Lu, Zhanjie Song, Jie Zhou

In this paper, we propose a part-activated deep reinforcement learning (PA-DRL) method for action prediction.

reinforcement-learning Reinforcement Learning (RL)

Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking

no code implementations • ECCV 2018 • Minghao Guo, Jiwen Lu, Jie Zhou

In this paper, we propose a dual-agent deep reinforcement learning (DADRL) method for deformable face tracking, which generates bounding boxes and detects facial landmarks interactively from face videos.

Facial Landmark Detection reinforcement-learning +1

Deep Reinforcement Learning with Iterative Shift for Visual Tracking

no code implementations • ECCV 2018 • Liangliang Ren, Xin Yuan, Jiwen Lu, Ming Yang, Jie Zhou

Visual tracking is confronted by the dilemma to locate a target both accurately and efficiently, and make decisions online whether and how to adapt the appearance model or even restart tracking.

Motion Estimation Object +4

Collaborative Deep Reinforcement Learning for Multi-Object Tracking

no code implementations • ECCV 2018 • Liangliang Ren, Jiwen Lu, Zifeng Wang, Qi Tian, Jie Zhou

To address this, we develop a deep prediction-decision network in our C-DRL, which simultaneously detects and predicts objects under a unified network via deep reinforcement learning.

Multi-Object Tracking Object +2

Deep Variational Metric Learning

no code implementations • ECCV 2018 • Xudong Lin, Yueqi Duan, Qiyuan Dong, Jiwen Lu, Jie Zhou

Deep metric learning has been extensively explored recently, which trains a deep neural network to produce discriminative embedding features.

Deep Hashing Image Retrieval

Paper
Add Code

Graininess-Aware Deep Feature Learning for Pedestrian Detection

no code implementations • ECCV 2018 • Chunze Lin, Jiwen Lu, Gang Wang, Jie zhou

In this paper, we propose a graininess-aware deep feature learning method for pedestrian detection.

Pedestrian Detection

Paper
Add Code

Relaxation-Free Deep Hashing via Policy Gradient

no code implementations • ECCV 2018 • Xin Yuan, Liangliang Ren, Jiwen Lu, Jie zhou

In this paper, we propose a simple yet effective relaxation-free method to learn more effective binary codes via policy gradient for scalable image search.

Paper
Add Code

Improving Sample-based Evaluation for Generative Adversarial Networks

no code implementations • ICLR 2019 • Shaohui Liu*, Yi Wei*, Jiwen Lu, Jie zhou

Paper
Add Code

Discriminative Deep Metric Learning for Face Verification in the Wild

no code implementations • CVPR 2014 • Junlin Hu, Jiwen Lu, Yap-Peng Tan

This paper presents a new discriminative deep metric learning (DDML) method for face verification in the wild.

Face Verification Metric Learning

Paper
Add Code

Deep Transfer Metric Learning

no code implementations • CVPR 2015 • Junlin Hu, Jiwen Lu, Yap-Peng Tan

Conventional metric learning methods usually assume that the training and test samples are captured in similar scenarios so that their distributions are assumed to be the same.

Face Verification Metric Learning +1

Paper
Add Code

Multi-Manifold Deep Metric Learning for Image Set Classification

no code implementations • CVPR 2015 • Jiwen Lu, Gang Wang, Weihong Deng, Pierre Moulin, Jie zhou

In this paper, we propose a multi-manifold deep metric learning (MMDML) method for image set classification, which aims to recognize an object of interest from a set of image instances captured from varying viewpoints or under varying illuminations.

Classification General Classification +1

Paper
Add Code

Deep Hashing for Compact Binary Codes Learning

no code implementations • CVPR 2015 • Venice Erin Liong, Jiwen Lu, Gang Wang, Pierre Moulin, Jie zhou

In this paper, we propose a new deep hashing (DH) approach to learn compact binary codes for large scale visual search.

Deep Hashing

Paper
Add Code

Learning Compact Binary Descriptors With Unsupervised Deep Neural Networks

no code implementations • CVPR 2016 • Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie zhou

In this paper, we propose a new unsupervised deep learning approach called DeepBit to learn compact binary descriptor for efficient visual object matching.

Image Retrieval Object +3

Paper
Add Code

Modality and Component Aware Feature Fusion For RGB-D Scene Classification

no code implementations • CVPR 2016 • Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham

While convolutional neural networks (CNN) have been excellent for object recognition, the greater spatial variability in scene images typically meant that the standard full-image CNN features are suboptimal for scene classification.

General Classification Object Recognition +1

Paper
Add Code

Learning Deep Binary Descriptor With Multi-Quantization

no code implementations • CVPR 2017 • Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie zhou

In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching.

Binarization Image Retrieval +2

Paper
Add Code

Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network

no code implementations • CVPR 2017 • Ji Lin, Liangliang Ren, Jiwen Lu, Jianjiang Feng, Jie zhou

In this paper, we propose a consistent-aware deep learning (CADL) framework for person re-identification in a camera network.

object-detection Object Detection

Paper
Add Code

Multi-View Complementary Hash Tables for Nearest Neighbor Search

no code implementations • ICCV 2015 • Xianglong Liu, Lei Huang, Cheng Deng, Jiwen Lu, Bo Lang

have enjoyed the benefits of complementary hash tables and information fusion over multiple views.

Paper
Add Code

MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition

no code implementations • ICCV 2015 • Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham

We first construct deep CNN layers for color and depth separately, and then connect them with our carefully designed multi-modal layers, which fuse color and depth information by enforcing a common part to be shared by features of different modalities.

Object Object Recognition

Paper
Add Code

Multiple Feature Fusion via Weighted Entropy for Visual Tracking

no code implementations • ICCV 2015 • Lin Ma, Jiwen Lu, Jianjiang Feng, Jie zhou

It is desirable to combine multiple feature descriptors to improve the visual tracking performance because different features can provide complementary information to describe objects of interest.

Object Visual Object Tracking +1

Paper
Add Code

Simultaneous Local Binary Feature Learning and Encoding for Face Recognition

no code implementations • ICCV 2015 • Jiwen Lu, Venice Erin Liong, Jie zhou

In this paper, we propose a simultaneous local binary feature learning and encoding (SLBFLE) method for face recognition.

Object Object Tracking +1

Paper
Add Code

Local Subspace Collaborative Tracking

no code implementations • ICCV 2015 • Lin Ma, Xiaoqin Zhang, Weiming Hu, Junliang Xing, Jiwen Lu, Jie zhou

To address this, this paper presents a local subspace collaborative tracking method for robust visual tracking, where multiple linear and nonlinear subspaces are learned to better model the nonlinear relationship of object appearances.

Paper
Add Code

Learning Discriminative Aggregation Network for Video-Based Face Recognition

no code implementations • ICCV 2017 • Yongming Rao, Ji Lin, Jiwen Lu, Jie zhou

In this paper, we propose a discriminative aggregation network (DAN) for video face recognition, which aims to integrate information from video frames effectively and efficiently.

Face Recognition Metric Learning

Paper
Add Code

Attention-Aware Deep Reinforcement Learning for Video Face Recognition

no code implementations • ICCV 2017 • Yongming Rao, Jiwen Lu, Jie zhou

In this paper, we propose an attention-aware deep reinforcement learning (ADRL) method for video face recognition, which aims to discard the misleading and confounding frames and find the focuses of attention in face videos for person recognition.

Face Recognition Person Recognition +2

Paper
Add Code

Cross-Modal Deep Variational Hashing

no code implementations • ICCV 2017 • Venice Erin Liong, Jiwen Lu, Yap-Peng Tan, Jie zhou

In this paper, we propose a cross-modal deep variational hashing (CMDVH) method to learn compact binary codes for cross-modality multimedia retrieval.

Retrieval

Paper
Add Code

COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis

no code implementations • CVPR 2019 • Yansong Tang, Dajun Ding, Yongming Rao, Yu Zheng, Danyang Zhang, Lili Zhao, Jiwen Lu, Jie zhou

There are substantial instructional videos on the Internet, which enables us to acquire knowledge for completing various tasks.

Action Detection

Paper
Add Code

BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation

no code implementations • CVPR 2019 • Wanhua Li, Jiwen Lu, Jianjiang Feng, Chunjing Xu, Jie zhou, Qi Tian

Existing methods for age estimation usually apply a divide-and-conquer strategy to deal with heterogeneous data caused by the non-stationary aging process.

Ranked #2 on Age Estimation on FGNET

Age Estimation MORPH

Paper
Add Code

Conditional Single-view Shape Generation for Multi-view Stereo Reconstruction

no code implementations • CVPR 2019 • Yi Wei, Shaohui Liu, Wang Zhao, Jiwen Lu, Jie zhou

In this paper, we present a new perspective towards image-based shape generation.

3D Reconstruction

Paper
Add Code

Deep Fitting Degree Scoring Network for Monocular 3D Object Detection

no code implementations • CVPR 2019 • Lijie Liu, Jiwen Lu, Chunjing Xu, Qi Tian, Jie zhou

In this paper, we propose to learn a deep fitting degree scoring network for monocular 3D object detection, which aims to score fitting degree between proposals and object conclusively.

Ranked #7 on Vehicle Pose Estimation on KITTI Cars Hard

Monocular 3D Object Detection Object +2

Paper
Add Code

P$^2$GNet: Pose-Guided Point Cloud Generating Networks for 6-DoF Object Pose Estimation

no code implementations • 19 Dec 2019 • Peiyu Yu, Yongming Rao, Jiwen Lu, Jie zhou

Humans are able to perform fast and accurate object pose estimation even under severe occlusion by exploiting learned object model priors from everyday life.

6D Pose Estimation 6D Pose Estimation using RGB +1

Paper
Add Code

DotFAN: A Domain-transferred Face Augmentation Network for Pose and Illumination Invariant Face Recognition

no code implementations • 23 Feb 2020 • Hao-Chiang Shao, Kang-Yu Liu, Chia-Wen Lin, Jiwen Lu

With their aid, DotFAN can learn a disentangled face representation and effectively generate face images of various facial attributes while preserving the identity of augmented faces.

Kinship Verification Relational Reasoning

Paper
Add Code

Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation

no code implementations • 20 Mar 2020 • Yansong Tang, Jiwen Lu, Jie zhou

We believe the introduction of the COIN dataset will promote the future in-depth research on instructional video analysis for the community.

Action Detection

Paper
Add Code

Graph-based Kinship Reasoning Network

no code implementations • 22 Apr 2020 • Wanhua Li, Yingqiang Zhang, Kangchen Lv, Jiwen Lu, Jianjiang Feng, Jie zhou

In this paper, we propose a graph-based kinship reasoning (GKR) network for kinship verification, which aims to effectively perform relational reasoning on the extracted features of an image pair.

Ranked #3 on Kinship Verification on KinFaceW-II

Paper
Add Code

Latent Fingerprint Registration via Matching Densely Sampled Points

no code implementations • 12 May 2020 • Shan Gu, Jianjiang Feng, Jiwen Lu, Jie zhou

Given a pair of fingerprints to match, we bypass the minutiae extraction step and take uniformly sampled points as key points.

Clustering

Paper
Add Code

MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation

no code implementations • ECCV 2020 • Benlin Liu, Yongming Rao, Jiwen Lu, Jie zhou, Cho-Jui Hsieh

Knowledge Distillation (KD) has been one of the most popular methods to learn a compact model.

Knowledge Distillation Meta-Learning

Paper
Add Code

Reinforced Axial Refinement Network for Monocular 3D Object Detection

no code implementations • ECCV 2020 • Lijie Liu, Chufan Wu, Jiwen Lu, Lingxi Xie, Jie zhou, Qi Tian

Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.

Ranked #16 on Vehicle Pose Estimation on KITTI Cars Hard

Monocular 3D Object Detection Object +2

Paper
Add Code

Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification

no code implementations • ECCV 2020 • Guangyi Chen, Yuhao Lu, Jiwen Lu, Jie Zhou

Experimental results demonstrate that our DCML method explores credible and valuable training data and improves the performance of unsupervised domain adaptation.

Metric Learning Person Re-Identification +2

Paper
Add Code

Temporal Coherence or Temporal Motion: Which is More Critical for Video-based Person Re-identification?

no code implementations • ECCV 2020 • Guangyi Chen, Yongming Rao, Jiwen Lu, Jie zhou

Specifically, we disentangle the video representation into the temporal coherence and motion parts and randomly change the scale of the temporal motion features as the adversarial noise.

Video-Based Person Re-Identification

Paper
Add Code

Structural Deep Metric Learning for Room Layout Estimation

no code implementations • ECCV 2020 • Wenzhao Zheng, Jiwen Lu, Jie zhou

We employ a metric model and a layout encoder to map the RGB images and the ground-truth layouts to the embedding space, respectively, and a layout decoder to map the embeddings to the corresponding layouts, where the whole framework is trained in an end-to-end manner.

Metric Learning Room Layout Estimation

Paper
Add Code

Deep Hashing with Active Pairwise Supervision

no code implementations • ECCV 2020 • Ziwei Wang, Quan Zheng, Jiwen Lu, Jie zhou

In this paper, we propose a Deep Hashing method with Active Pairwise Supervision (DH-APS).

Deep Hashing

Paper
Add Code

Rotation-robust Intersection over Union for 3D Object Detection

no code implementations • ECCV 2020 • Yu Zheng, Danyang Zhang, Sinan Xie, Jiwen Lu, Jie zhou

In this paper, we propose a Rotation-robust Intersection over Union ($\textit{RIoU}$) for 3D object detection, which aims to jointly learn the overlap of rotated bounding boxes.

3D Object Detection Object +1

Paper
Add Code

Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning

no code implementations • ECCV 2020 • Liangliang Ren, Yangyang Song, Jiwen Lu, Jie zhou

Unlike most existing works that define room layout on a 2D image, we model the layout in 3D as a configuration of the camera and the room.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images

no code implementations • 19 Jan 2021 • Lei He, Jiwen Lu, Guanghui Wang, Shiyu Song, Jie zhou

In this paper, we first introduce the concept of semantic objectness to exploit the geometric relationship of these two tasks through an analysis of the imaging process, then propose a Semantic Object Segmentation and Depth Estimation Network (SOSD-Net) based on the objectness assumption.

Ranked #81 on Semantic Segmentation on NYU Depth v2

Monocular Depth Estimation Multi-Task Learning +3

Paper
Add Code

Rank-Consistency Deep Hashing for Scalable Multi-Label Image Search

no code implementations • 2 Feb 2021 • Cheng Ma, Jiwen Lu, Jie zhou

As hashing becomes an increasingly appealing technique for large-scale image retrieval, multi-label hashing is also attracting more attention for the ability to exploit multi-level semantic contents.

Clustering Deep Hashing +3

Paper
Add Code

WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition

no code implementations • CVPR 2021 • Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie zhou

In this paper, we contribute a new million-scale face benchmark containing noisy 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation protocol.

Ranked #1 on Face Verification on IJB-C (training dataset metric)

Attribute Face Recognition +1

Paper
Add Code

Meta-Mining Discriminative Samples for Kinship Verification

no code implementations • CVPR 2021 • Wanhua Li, Shiwei Wang, Jiwen Lu, Jianjiang Feng, Jie zhou

In the end, the samples in the unbalanced train batch are re-weighted by the learned meta-miner to optimize the kinship models.

Ranked #1 on Kinship Verification on KinFaceW-II

Kinship Verification

Paper
Add Code

SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation

no code implementations • 6 Apr 2021 • Jiabin Zhang, Zheng Zhu, Jiwen Lu, JunJie Huang, Guan Huang, Jie zhou

To make a better trade-off between accuracy and efficiency, we propose a novel multi-person pose estimation framework, SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation (SIMPLE).

Human Detection Multi-Person Pose Estimation

Paper
Add Code

Pseudo Facial Generation With Extreme Poses for Face Recognition

no code implementations • CVPR 2021 • Guoli Wang, Jiaqi Ma, Qian Zhang, Jiwen Lu, Jie zhou

Many of them settle it by generating fake frontal faces from extreme ones, whereas they are tough to maintain the identity information with high computational consumption and uncontrolled disturbances.

Paper
Add Code

Masked Face Recognition Challenge: The WebFace260M Track Report

no code implementations • 16 Aug 2021 • Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jia Guo, Jiwen Lu, Dalong Du, Jie zhou

There is a second phase of the challenge until October 1, 2021, and an ongoing leaderboard.

Kinship Verification Relational Reasoning

Paper
Add Code

Reasoning Graph Networks for Kinship Verification: from Star-shaped to Hierarchical

no code implementations • 6 Sep 2021 • Wanhua Li, Jiwen Lu, Abudukelimu Wuerkaixi, Jianjiang Feng, Jie zhou

To address this, we propose a Star-shaped Reasoning Graph Network (S-RGN).

Ranked #1 on Kinship Verification on KinFaceW-I

Paper
Add Code

Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection

no code implementations • ICCV 2021 • Bingyao Yu, Wanhua Li, Xiu Li, Jiwen Lu, Jie zhou

In this paper, we propose a Frequency-Aware Spatiotemporal Transformer (FAST) for video inpainting detection, which aims to simultaneously mine the traces of video inpainting from spatial, temporal, and frequency domains.

Video Inpainting

Paper
Add Code

Adaptive neighborhood Metric learning

no code implementations • 20 Jan 2022 • Kun Song, Junwei Han, Gong Cheng, Jiwen Lu, Feiping Nie

In this paper, we reveal that metric learning would suffer from a serious inseparable problem without informative sample mining.

Language Modelling Machine Translation +1

Paper
Add Code

A Roadmap for Big Model

no code implementations • 26 Mar 2022 • Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, HuaWei Shen, HUI ZHANG, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan YAO, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, LiWei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang

With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm.

Paper
Add Code

HyperDet3D: Learning a Scene-conditioned 3D Object Detector

no code implementations • CVPR 2022 • Yu Zheng, Yueqi Duan, Jiwen Lu, Jie zhou, Qi Tian

A bathtub in a library, a sink in an office, a bed in a laundry room -- the counter-intuition suggests that scene provides important prior knowledge for 3D object detection, which instructs to eliminate the ambiguous detection of similar objects.

3D Object Detection Object +1

Paper
Add Code

WebFace260M: A Benchmark for Million-Scale Deep Face Recognition

no code implementations • 21 Apr 2022 • Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, JunJie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Dalong Du, Jiwen Lu, Jie zhou

For a comprehensive evaluation of face matchers, three recognition tasks are performed under standard, masked and unbiased settings, respectively.