Search Results for author: Yu Liu

Found 266 papers, 100 papers with code

CIST System Report for ACL MultiLing 2013 -- Track 1: Multilingual Multi-document Summarization

no code implementations • WS 2013 • Lei Li, Wei Heng, JIA YU, Yu Liu, Shuhong Wan

Document Summarization Multi-Document Summarization

Paper
Add Code

Delineating Intra-Urban Spatial Connectivity Patterns by Travel-Activities: A Case Study of Beijing, China

no code implementations • 16 Jul 2014 • Chaogui Kang, Yu Liu, Lun Wu

Travel activities have been widely applied to quantify spatial interactions between places, regions and nations.

Physics and Society Social and Information Networks

Paper
Add Code

Modulation Classification via Gibbs Sampling Based on a Latent Dirichlet Bayesian Network

no code implementations • 4 Aug 2014 • Yu Liu, Osvaldo Simeone, Alexander M. Haimovich, Wei Su

A novel Bayesian modulation classification scheme is proposed for a single-antenna system over frequency-selective fading channels.

Classification General Classification

Paper
Add Code

Learning Relaxed Deep Supervision for Better Edge Detection

no code implementations • CVPR 2016 • Yu Liu, Michael S. Lew

We consider these false positives in the supervision, and are able to achieve high performance for better edge detection.

Edge Detection

Paper
Add Code

Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

no code implementations • 2 Jun 2016 • Yu Liu, Jianlong Fu, Tao Mei, Chang Wen Chen

Second, by using sGRU as basic units, the BMRNN is trained to align the local storylines into the global sequential timeline.

Video Captioning Visual Storytelling

Paper
Add Code

Crafting GBD-Net for Object Detection

1 code implementation • 8 Oct 2016 • Xingyu Zeng, Wanli Ouyang, Junjie Yan, Hongsheng Li, Tong Xiao, Kun Wang, Yu Liu, Yucong Zhou, Bin Yang, Zhe Wang, Hui Zhou, Xiaogang Wang

The effectiveness of GBD-Net is shown through experiments on three object detection datasets, ImageNet, Pascal VOC2007 and Microsoft COCO.

Object object-detection +1

182

Paper
Code

POI: Multiple Object Tracking with High Performance Detection and Appearance Feature

no code implementations • 19 Oct 2016 • Fengwei Yu, Wenbo Li, Quanquan Li, Yu Liu, Xiaohua Shi, Junjie Yan

In this paper, we explore the high-performance detection and deep learning based appearance feature, and show that they lead to significantly better MOT results in both online and offline setting.

Multiple Object Tracking Vocal Bursts Intensity Prediction

Paper
Add Code

Combinatorial Multi-Armed Bandit with General Reward Functions

no code implementations • NeurIPS 2016 • Wei Chen, Wei Hu, Fu Li, Jian Li, Yu Liu, Pinyan Lu

Our framework enables a much larger class of reward functions such as the $\max()$ function and nonlinear utility functions.

Paper
Add Code

On the Exploration of Convolutional Fusion Networks for Visual Recognition

no code implementations • 16 Nov 2016 • Yu Liu, Yanming Guo, Michael S. Lew

Despite recent advances in multi-scale deep representations, their limitations are attributed to expensive parameters and weak fusion modules.

Image Retrieval Retrieval +1

Paper
Add Code

Zoom Out-and-In Network with Recursive Training for Object Proposal

1 code implementation • 19 Feb 2017 • Hongyang Li, Yu Liu, Wanli Ouyang, Xiaogang Wang

In this paper, we propose a zoom-out-and-in network for generating object proposals.

Paper
Code

Learning Deep Features via Congenerous Cosine Loss for Person Recognition

1 code implementation • 22 Feb 2017 • Yu Liu, Hongyang Li, Xiaogang Wang

Person recognition aims at recognizing the same identity across time and space with complicated scenes and similar appearance.

Person Recognition

175

Paper
Code

Unsupervised Sequence Classification using Sequential Output Statistics

no code implementations • NeurIPS 2017 • Yu Liu, Jianshu Chen, Li Deng

Although it is harder to optimize in its functional form, a stochastic primal-dual gradient method is developed to effectively solve the problem.

Classification General Classification

Paper
Add Code

Quality Aware Network for Set to Set Recognition

1 code implementation • CVPR 2017 • Yu Liu, Junjie Yan, Wanli Ouyang

In this paper, the quality aware network (QAN) is proposed to confront this problem, where the quality of each sample can be automatically learned although such information is not explicitly provided in the training stage.

Ranked #6 on Face Verification on YouTube Faces DB

Face Verification Person Re-Identification

Paper
Code

Scale-Aware Face Detection

no code implementations • CVPR 2017 • Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu

Then the scale histogram guides the zoom-in and zoom-out of the image.

Face Detection

Paper
Add Code

MLBench: How Good Are Machine Learning Clouds for Binary Classification Tasks on Structured Data?

no code implementations • 29 Jul 2017 • Yu Liu, Hantian Zhang, Luyuan Zeng, Wentao Wu, Ce Zhang

We then compare the performance of the top winning code available from Kaggle with that of running machine learning clouds from both Azure and Amazon on mlbench.

BIG-bench Machine Learning Binary Classification +1

Paper
Add Code

Recurrent Scale Approximation for Object Detection in CNN

1 code implementation • ICCV 2017 • Yu Liu, Hongyang Li, Junjie Yan, Fangyin Wei, Xiaogang Wang, Xiaoou Tang

To further increase efficiency and accuracy, we (a): design a scale-forecast network to globally predict potential scales in the image since there is no need to compute maps on all levels of the pyramid.

Ranked #3 on Face Detection on Annotated Faces in the Wild

Face Detection Object +2

238

Paper
Code

Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection

1 code implementation • 13 Sep 2017 • Hongyang Li, Yu Liu, Wanli Ouyang, Xiaogang Wang

A key observation is that it is difficult to classify anchors of different sizes with the same set of features.

Ranked #2 on Region Proposal on COCO test-dev

object-detection Object Detection +1

Paper
Code

Learning a Recurrent Residual Fusion Network for Multimodal Matching

no code implementations • ICCV 2017 • Yu Liu, Yanming Guo, Erwin M. Bakker, Michael S. Lew

A major challenge in matching between vision and language is that they typically have completely different features and representations.

Paper
Add Code

Rethinking Feature Discrimination and Polymerization for Large-scale Recognition

1 code implementation • 2 Oct 2017 • Yu Liu, Hongyang Li, Xiaogang Wang

Feature matters.

Clustering Metric Learning

175

Paper
Code

Transferable Semi-supervised Semantic Segmentation

no code implementations • 18 Nov 2017 • Huaxin Xiao, Yunchao Wei, Yu Liu, Maojun Zhang, Jiashi Feng

The performance of deep learning based semantic segmentation models heavily depends on sufficient data with careful annotations.

Segmentation Semi-Supervised Semantic Segmentation

Paper
Add Code

Region-based Quality Estimation Network for Large-scale Person Re-identification

no code implementations • 23 Nov 2017 • Guanglu Song, Biao Leng, Yu Liu, Congrui Hetang, Shaofan Cai

One of the major restrictions on the performance of video-based person re-id is partial noise caused by occlusion, blur and illumination.

Large-Scale Person Re-Identification

Paper
Add Code

The artificial ecosystem: number soup (part II)

no code implementations • 12 Jan 2018 • Yu Liu

This paper is a follow-up work about the artificial ecosystem model: number soup (Liu and Sumpter, J. Royal Soc.

Paper
Add Code

Exploring Disentangled Feature Representation Beyond Face Identification

no code implementations • CVPR 2018 • Yu Liu, Fangyin Wei, Jing Shao, Lu Sheng, Junjie Yan, Xiaogang Wang

This paper proposes learning disentangled but complementary face features with minimal supervision by face identification.

Attribute Face Generation +1

Paper
Add Code

Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy

no code implementations • CVPR 2018 • Guanglu Song, Yu Liu, Ming Jiang, Yujie Wang, Junjie Yan, Biao Leng

Fully convolutional neural network (FCN) has been dominating the game of face detection task for a few years with its congenital capability of sliding-window-searching with shared kernels, which boiled down all the redundant calculation, and most recent state-of-the-art methods such as Faster-RCNN, SSD, YOLO and FPN use FCN as their backbone.

Face Detection Philosophy +1

Paper
Add Code

PAD-Net: A Perception-Aided Single Image Dehazing Network

1 code implementation • 8 May 2018 • Yu Liu, Guanlong Zhao

In this work, we investigate the possibility of replacing the $\ell_2$ loss with perceptually derived loss functions (SSIM, MS-SSIM, etc.)

Image Dehazing MS-SSIM +2

Paper
Code

MoNet: Deep Motion Exploitation for Video Object Segmentation

no code implementations • CVPR 2018 • Huaxin Xiao, Jiashi Feng, Guosheng Lin, Yu Liu, Maojun Zhang

In this paper, we propose a novel MoNet model to deeply exploit motion cues for boosting video object segmentation performance from two aspects, i. e., frame representation learning and segmentation refinement.

Object Optical Flow Estimation +5

Paper
Add Code

Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning

no code implementations • CVPR 2018 • Xingping Dong, Jianbing Shen, Wenguan Wang, Yu Liu, Ling Shao, Fatih Porikli

Hyperparameters are numerical presets whose values are assigned prior to the commencement of the learning process.

Hyperparameter Optimization Object Tracking +1

Paper
Add Code

Improved Techniques for Learning to Dehaze and Beyond: A Collective Study

1 code implementation • 30 Jun 2018 • Yu Liu, Guanlong Zhao, Boyuan Gong, Yang Li, Ritu Raj, Niraj Goel, Satya Kesav, Sandeep Gottimukkala, Zhangyang Wang, Wenqi Ren, DaCheng Tao

Here we explore two related but important tasks based on the recently released REalistic Single Image DEhazing (RESIDE) benchmark dataset: (i) single image dehazing as a low-level image restoration problem; and (ii) high-level visual understanding (e. g., object detection) of hazy images.

Image Dehazing Image Restoration +4

Paper
Code

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation

1 code implementation • 20 Jul 2018 • Hang Zhou, Yu Liu, Ziwei Liu, Ping Luo, Xiaogang Wang

Talking face generation aims to synthesize a sequence of face images that correspond to a clip of speech.

Lip Reading Retrieval +2

812

Paper
Code

Knowledge Graph Embedding with Entity Neighbors and Deep Memory Network

no code implementations • 11 Aug 2018 • Kai Wang, Yu Liu, Xiujuan Xu, Dan Lin

Knowledge Graph Embedding (KGE) aims to represent entities and relations of knowledge graph in a low-dimensional continuous vector space.

Knowledge Graph Embedding

Paper
Add Code

Transductive Centroid Projection for Semi-supervised Large-scale Recognition

no code implementations • ECCV 2018 • Yu Liu, Guanglu Song, Jing Shao, Xiao Jin, Xiaogang Wang

It is inspired by the observation of the weights in classification layer (called extit{anchors}) converge to the central direction of each class in hyperspace.

Clustering General Classification

Paper
Add Code

Learning to Measure Change: Fully Convolutional Siamese Metric Networks for Scene Change Detection

2 code implementations • 22 Oct 2018 • Enqiang Guo, Xinsha Fu, Jiawei Zhu, Min Deng, Yu Liu, Qing Zhu, Haifeng Li

A critical challenge problem of scene change detection is that noisy changes generated by varying illumination, shadows and camera viewpoint make variances of a scene difficult to define and measure since the noisy changes and semantic ones are entangled.

Change Detection Scene Change Detection

229

Paper
Code

T-GCN: A Temporal Graph ConvolutionalNetwork for Traffic Prediction

10 code implementations • 12 Nov 2018 • Ling Zhao, Yujiao Song, Chao Zhang, Yu Liu, Pu Wang, Tao Lin, Min Deng, Haifeng Li

However, traffic forecasting has always been considered an open scientific issue, owing to the constraints of urban road network topological structure and the law of dynamic change with time, namely, spatial dependence and temporal dependence.

Ranked #4 on Traffic Prediction on SZ-Taxi

Management Traffic Prediction

2,484

Paper
Code

Gradient Harmonized Single-stage Detector

9 code implementations • 13 Nov 2018 • Buyu Li, Yu Liu, Xiaogang Wang

Despite the great success of two-stage detectors, single-stage detector is still a more elegant and efficient way, yet suffers from the two well-known disharmonies during training, i. e. the huge difference in quantity between positive and negative examples as well as between easy and hard examples.

Ranked #165 on Object Detection on COCO test-dev

General Classification Object Detection +1

27,765

Paper
Code

How far from automatically interpreting deep learning

no code implementations • 19 Nov 2018 • Jinwei Zhao, Qizhou Wang, YuFei Wang, Xinhong Hei, Yu Liu

In other words, there is a gap between the deep learning model and the cognitive mode.

Paper
Add Code

Visual Localization Under Appearance Change: Filtering Approaches

3 code implementations • 20 Nov 2018 • Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Shin-Fang Ch'ng, Thanh-Toan Do, Ian Reid

Our approaches rely on local features with an encoding technique to represent an image as a single vector.

Visual Localization Visual Place Recognition

Paper
Code

How to improve the interpretability of kernel learning

no code implementations • 21 Nov 2018 • Jinwei Zhao, Qizhou Wang, YuFei Wang, Yu Liu, Zhenghao Shi, Xinhong Hei

In this paper, a quantitative index of the interpretability is proposed and its rationality is proved, and equilibrium problem between the interpretability and the generalization performance is analyzed.

BIG-bench Machine Learning

Paper
Add Code

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks

no code implementations • 24 Nov 2018 • Haokui Zhang, Ying Li, Peng Wang, Yu Liu, Chunhua Shen

Different from RGB videos, depth data in RGB-D videos provide key complementary information for tristimulus visual data which potentially could achieve accuracy improvement for action recognition.

Action Recognition Temporal Action Localization

Paper
Add Code

Derivative Estimation in Random Design

no code implementations • NeurIPS 2018 • Yu Liu, Kris De Brabanter

We propose a nonparametric derivative estimation method for random design without having to estimate the regression function.

regression

Paper
Add Code

Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes

no code implementations • 12 Jan 2019 • Yu Liu, Lingqiao Liu, Hamid Rezatofighi, Thanh-Toan Do, Qinfeng Shi, Ian Reid

As the post-processing step for object detection, non-maximum suppression (GreedyNMS) is widely used in most of the detectors for many years.

object-detection Object Detection

Paper
Add Code

RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion

no code implementations • CVPR 2019 • Jie Li, Yu Liu, Dong Gong, Qinfeng Shi, Xia Yuan, Chunxia Zhao, Ian Reid

RGB images differentiate from depth images as they carry more details about the color and texture information, which can be utilized as a vital complementary to depth for boosting the performance of 3D semantic scene completion (SSC).

Ranked #19 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion Scene Labeling

Paper
Add Code

Correlation Congruence for Knowledge Distillation

2 code implementations • ICCV 2019 • Baoyun Peng, Xiao Jin, Jiaheng Liu, Shunfeng Zhou, Yi-Chao Wu, Yu Liu, Dongsheng Li, Zhaoning Zhang

Most teacher-student frameworks based on knowledge distillation (KD) depend on a strong congruent constraint on instance level.

Face Recognition Image Classification +3

1,266

Paper
Code

Conditional Adversarial Generative Flow for Controllable Image Synthesis

no code implementations • CVPR 2019 • Rui Liu, Yu Liu, Xinyu Gong, Xiaogang Wang, Hongsheng Li

Flow-based generative models show great potential in image synthesis due to its reversible pipeline and exact log-likelihood target, yet it suffers from weak ability for conditional image synthesis, especially for multi-label or unaware conditions.

Image Generation

Paper
Add Code

Knowledge Distillation via Route Constrained Optimization

1 code implementation • ICCV 2019 • Xiao Jin, Baoyun Peng, Yi-Chao Wu, Yu Liu, Jiaheng Liu, Ding Liang, Xiaolin Hu

However, we find that the representation of a converged heavy model is still a strong constraint for training a small student model, which leads to a high lower bound of congruence loss.

Face Recognition Knowledge Distillation

570

Paper
Code

Extracting human emotions at different places based on facial expressions and spatial clustering analysis

no code implementations • 6 May 2019 • Yuhao Kang, Qingyuan Jia, Song Gao, Xiaohuan Zeng, Yueyao Wang, Stephan Angsuesser, Yu Liu, Xinyue Ye, Teng Fei

In this study, a novel framework for extracting human emotions from large-scale georeferenced photos at different places is proposed.

Clustering

Paper
Add Code

From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation

no code implementations • 6 Jun 2019 • Yu Liu, Li Deng, Jianshu Chen, Chang Wen Chen

To remove the need for the parallel training corpora has practical significance for real-world applications, and it is one of the main goals of unsupervised learning.

Binary Classification General Classification +4

Paper
Add Code

Use of OWL and Semantic Web Technologies at Pinterest

no code implementations • 3 Jul 2019 • Rafael S. Gonçalves, Matthew Horridge, Rui Li, Yu Liu, Mark A. Musen, Csongor I. Nyulas, Evelyn Obamos, Dhananjay Shrouty, David Temple

In this paper, we present the engineering of an OWL ontology---the Pinterest Taxonomy---that forms the core of Pinterest's knowledge graph, the Pinterest Taste Graph.

Paper
Add Code

Scalable Place Recognition Under Appearance Change for Autonomous Driving

no code implementations • ICCV 2019 • Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Thanh-Toan Do, Ian Reid

Our experiments show that, compared to state-of-the-art techniques, our method has much greater potential for large-scale place recognition for autonomous driving.

Autonomous Driving Visual Place Recognition

Paper
Add Code

Modeling Event Propagation via Graph Biased Temporal Point Process

no code implementations • 5 Aug 2019 • Weichang Wu, Huanxi Liu, Xiaohu Zhang, Yu Liu, Hongyuan Zha

Temporal point process is widely used for sequential data modeling.

Graph Representation Learning

Paper
Add Code

Exploiting temporal consistency for real-time video depth estimation

2 code implementations • ICCV 2019 • Haokui Zhang, Chunhua Shen, Ying Li, Yuanzhouhan Cao, Yu Liu, Youliang Yan

The temporal consistency loss is combined with the spatial loss to update the model in an end-to-end fashion.

Ranked #5 on Monocular Depth Estimation on Mid-Air Dataset

Monocular Depth Estimation

Paper
Code

In defense of OSVOS

no code implementations • 19 Aug 2019 • Yu Liu, Yutong Dai, Anh-Dzung Doan, Lingqiao Liu, Ian Reid

Through adding a common module, video loss, which we formulate with various forms of constraints (including weighted BCE loss, high-dimensional triplet loss, as well as a novel mixed instance-aware video loss), to train the parent network in the step (2), the network is then better prepared for the step (3), i. e. online fine-tuning on the target instance.

Depth Estimation Object +6

Paper
Add Code

Towards Flops-constrained Face Recognition

1 code implementation • 2 Sep 2019 • Yu Liu, Guanglu Song, Manyuan Zhang, Jihao Liu, Yucong Zhou, Junjie Yan

Large scale face recognition is challenging especially when the computational budget is limited.

Lightweight Face Recognition

127

Paper
Code

Boosting Throughput and Efficiency of Hardware Spiking Neural Accelerators using Time Compression Supporting Multiple Spike Codes

no code implementations • 10 Sep 2019 • Changqing Xu, Wenrui Zhang, Yu Liu, Peng Li

Using spiking speech and image recognition datasets, we demonstrate the feasibility of supporting large time compression ratios of up to 16x, delivering up to 15. 93x, 13. 88x, and 86. 21x improvements in throughput, energy dissipation, the tradeoffs between hardware area, runtime, energy, and classification accuracy, respectively based on different spike codes on a Xilinx Zynq-7000 FPGA.

Paper
Add Code

Meta Learning with Differentiable Closed-form Solver for Fast Video Object Segmentation

no code implementations • 28 Sep 2019 • Yu Liu, Lingqiao Liu, Haokui Zhang, Hamid Rezatofighi, Ian Reid

This paper tackles the problem of video object segmentation.

Meta-Learning Object +4

Paper
Add Code

Differentiable Kernel Evolution

no code implementations • ICCV 2019 • Yu Liu, Jihao Liu, Ailing Zeng, Xiaogang Wang

This paper proposes a differentiable kernel evolution (DKE) algorithm to find a better layer-operator for the convolutional neural network.

object-detection Object Detection

Paper
Add Code

Gradient Information Guided Deraining with A Novel Network and Adversarial Training

no code implementations • 9 Oct 2019 • Yinglong Wang, Haokui Zhang, Yu Liu, Qinfeng Shi, Bing Zeng

However, the existing methods usually do not have good generalization ability, which leads to the fact that almost all of existing methods have a satisfied performance on removing a specific type of rain streaks, but may have a relatively poor performance on other types of rain streaks.

Rain Removal

Paper
Add Code

A game method for improving the interpretability of convolution neural network

no code implementations • 21 Oct 2019 • Jinwei Zhao, Qizhou Wang, Fuqiang Zhang, Wanli Qiu, YuFei Wang, Yu Liu, Guo Xie, Weigang Ma, Bin Wang, Xinhong Hei

The reason is, we believe that: the network is essentially a perceptual model.

Paper
Add Code

Learning Where to Focus for Efficient Video Object Detection

1 code implementation • ECCV 2020 • Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan

Transferring existing image-based detectors to the video is non-trivial since the quality of frames is always deteriorated by part occlusion, rare pose, and motion blur.

Ranked #22 on Video Object Detection on ImageNet VID

Object object-detection +1

Paper
Code

Search to Distill: Pearls are Everywhere but not the Eyes

no code implementations • CVPR 2020 • Yu Liu, Xuhui Jia, Mingxing Tan, Raviteja Vemulapalli, Yukun Zhu, Bradley Green, Xiaogang Wang

Standard Knowledge Distillation (KD) approaches distill the knowledge of a cumbersome teacher model into the parameters of a student model with a pre-defined architecture.

Ensemble Learning Face Recognition +3

Paper
Add Code

Understanding the mesoscopic scaling patterns within cities

1 code implementation • 2 Jan 2020 • Lei Dong, Zhou Huang, Jiang Zhang, Yu Liu

Understanding quantitative relationships between urban elements is crucial for a wide range of applications.

Physics and Society

Paper
Code

Temporal Interlacing Network

4 code implementations • 17 Jan 2020 • Hao Shao, Shengju Qian, Yu Liu

In this way, a heavy temporal model is replaced by a simple interlacing operator.

Optical Flow Estimation Video Understanding

3,887

Paper
Code

Depth Based Semantic Scene Completion with Position Importance Aware Loss

1 code implementation • 29 Jan 2020 • Yu Liu, Jie Li, Xia Yuan, Chunxia Zhao, Roland Siegwart, Ian Reid, Cesar Cadena

We propose PALNet, a novel hybrid network for SSC based on single depth.

3D Semantic Segmentation Position

Paper
Code

Hyperspectral Classification Based on 3D Asymmetric Inception Network with Data Fusion Transfer Learning

1 code implementation • 11 Feb 2020 • Haokui Zhang, Yu Liu, Bei Fang, Ying Li, Lingqiao Liu, Ian Reid

Hyperspectral image(HSI) classification has been improved with convolutional neural network(CNN) in very recent years.

General Classification Transfer Learning

Paper
Code

3D Gated Recurrent Fusion for Semantic Scene Completion

no code implementations • 17 Feb 2020 • Yu Liu, Jie Li, Qingsen Yan, Xia Yuan, Chunxia Zhao, Ian Reid, Cesar Cadena

This paper tackles the problem of data fusion in the semantic scene completion (SSC) task, which can simultaneously deal with semantic labeling and scene completion.

Ranked #14 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion Scene Understanding

Paper
Add Code

DASNet: Dual attentive fully convolutional siamese networks for change detection of high resolution satellite images

2 code implementations • 7 Mar 2020 • Jie Chen, Ziyang Yuan, Jian Peng, Li Chen, Haozhe Huang, Jiawei Zhu, Yu Liu, Haifeng Li

However, the available methods focus mainly on the difference information between multitemporal remote sensing images and lack robustness to pseudo-change information.

Ranked #22 on Change detection for remote sensing images on CDD Dataset (season-varying)

Change Detection Change detection for remote sensing images

132

Paper
Code

Top-1 Solution of Multi-Moments in Time Challenge 2019

1 code implementation • 12 Mar 2020 • Manyuan Zhang, Hao Shao, Guanglu Song, Yu Liu, Junjie Yan

In this technical report, we briefly introduce the solutions of our team 'Efficient' for the Multi-Moments in Time challenge in ICCV 2019.

Action Recognition Video Understanding

468

Paper
Code

Revisiting the Sibling Head in Object Detector

2 code implementations • CVPR 2020 • Guanglu Song, Yu Liu, Xiaogang Wang

The ``shared head for classification and localization'' (sibling head), firstly denominated in Fast RCNN~\cite{girshick2015fast}, has been leading the fashion of the object detection community in the past five years.

Ranked #67 on Object Detection on COCO test-dev

Disentanglement General Classification +4

453

Paper
Code

1st Place Solutions for OpenImage2019 -- Object Detection and Instance Segmentation

2 code implementations • 17 Mar 2020 • Yu Liu, Guanglu Song, Yuhang Zang, Yan Gao, Enze Xie, Junjie Yan, Chen Change Loy, Xiaogang Wang

Given such good instance bounding box, we further design a simple instance-level semantic segmentation pipeline and achieve the 1st place on the segmentation challenge.

General Classification Instance Segmentation +6

453

Paper
Code

KPNet: Towards Minimal Face Detector

no code implementations • 17 Mar 2020 • Guanglu Song, Yu Liu, Yuhang Zang, Xiaogang Wang, Biao Leng, Qingsheng Yuan

The small receptive field and capacity of minimal neural networks limit their performance when using them to be the backbone of detectors.

Face Detection

Paper
Add Code

Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images

1 code implementation • CVPR 2020 • Hang Zhou, Jihao Liu, Ziwei Liu, Yu Liu, Xiaogang Wang

Though face rotation has achieved rapid progress in recent years, the lack of high-quality paired training data remains a great hurdle for existing methods.

3D Face Modelling Data Augmentation +1

482

Paper
Code

DPGN: Distribution Propagation Graph Network for Few-shot Learning

1 code implementation • CVPR 2020 • Ling Yang, Liangliang Li, Zilun Zhang, Xinyu Zhou, Erjin Zhou, Yu Liu

To combine the distribution-level relations and instance-level relations for all examples, we construct a dual complete graph network which consists of a point graph and a distribution graph with each node standing for an example.

Ranked #2 on Few-Shot Learning on Mini-ImageNet - 1-Shot Learning

Few-Shot Learning Relation

175

Paper
Code

Anisotropic Convolutional Networks for 3D Semantic Scene Completion

1 code implementation • CVPR 2020 • Jie Li, Kai Han, Peng Wang, Yu Liu, Xia Yuan

In contrast to the standard 3D convolution that is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wisely.

Ranked #4 on 3D Semantic Scene Completion from a single RGB image on NYUv2

3D Semantic Scene Completion from a single RGB image

Paper
Code

Change Detection in Heterogeneous Optical and SAR Remote Sensing Images via Deep Homogeneous Feature Fusion

no code implementations • 8 Apr 2020 • Xiao Jiang, Gang Li, Yu Liu, Xiao-Ping Zhang, You He

To solve this problem, this paper presents a new homogeneous transformation model termed deep homogeneous feature fusion (DHFF) based on image style transfer (IST).

Change Detection Style Transfer

Paper
Add Code

A novel multimodal approach for hybrid brain-computer interface

1 code implementation • 25 Apr 2020 • Zhe Sun, Zihao Huang, Feng Duan, Yu Liu

It has been already shown in literature that the hybrid of EEG and NIRS has better results than their respective individual signals.

Human-Computer Interaction Signal Processing

Paper
Code

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization

3 code implementations • CVPR 2021 • Junting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li

We propose to explicitly model the Actor-Context-Actor Relation, which is the relation between two actors based on their interactions with the context.

Ranked #2 on Action Recognition on AVA v2.1

Action Detection Action Recognition +5

2,986

Paper
Code

1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020

2 code implementations • 16 Jun 2020 • Siyu Chen, Junting Pan, Guanglu Song, Manyuan Zhang, Hao Shao, Ziyi Lin, Jing Shao, Hongsheng Li, Yu Liu

This technical report introduces our winning solution to the spatio-temporal action localization track, AVA-Kinetics Crossover, in ActivityNet Challenge 2020.

Relation Network Spatio-Temporal Action Localization +1

198

Paper
Code

Generalizing Tensor Decomposition for N-ary Relational Knowledge Bases

1 code implementation • 8 Jul 2020 • Yu Liu, Quanming Yao, Yong Li

With the rapid development of knowledge bases (KBs), link prediction task, which completes KBs with missing facts, has been broadly studied in especially binary relational KBs (a. k. a knowledge graph) with powerful tensor decomposition related methods.

Link Prediction Tensor Decomposition

Paper
Code

Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020

no code implementations • 20 Jul 2020 • Haisheng Su, Jinyuan Feng, Hao Shao, Zhenyu Jiang, Manyuan Zhang, Wei Wu, Yu Liu, Hongsheng Li, Junjie Yan

Specifically, in order to generate high-quality proposals, we consider several factors including the video feature encoder, the proposal generator, the proposal-proposal relations, the scale imbalance, and ensemble strategy.

Temporal Action Localization

Paper
Add Code

Deep Reinforcement Learning for Dynamic Spectrum Sensing and Aggregation in Multi-Channel Wireless Networks

no code implementations • 28 Jul 2020 • Yunzeng Li, Wensheng Zhang, Cheng-Xiang Wang, Jian Sun, Yu Liu

Then, the vacant channels in the selected segment will be aggregated for satisfying the user requirement.

Q-Learning Reinforcement Learning (RL)

Paper
Add Code

Dual Gaussian-based Variational Subspace Disentanglement for Visible-Infrared Person Re-Identification

1 code implementation • 6 Aug 2020 • Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

To solve the problem, we present a carefully designed dual Gaussian-based variational auto-encoder (DG-VAE), which disentangles an identity-discriminable and an identity-ambiguous cross-modality feature subspace, following a mixture-of-Gaussians (MoG) prior and a standard Gaussian distribution prior, respectively.

Disentanglement Person Re-Identification +2

Paper
Code

Single Cell Transcriptome Research in Human Placenta

no code implementations • 7 Aug 2020 • Hui Li, Qianhui Huang, Yu Liu, Lana X Garmire

Human placenta is a complex and heterogeneous organ interfacing between the mother and the fetus that supports fetal development.

Paper
Add Code

Discriminability Distillation in Group Representation Learning

no code implementations • ECCV 2020 • Manyuan Zhang, Guanglu Song, Hang Zhou, Yu Liu

We show the discrimiability knowledge has good properties that can be distilled by a light-weight distillation network and can be generalized on the unseen target set.

Representation Learning

Paper
Add Code

Image Retrieval for Structure-from-Motion via Graph Convolutional Network

no code implementations • 17 Sep 2020 • Shen Yan, Yang Pen, Shiming Lai, Yu Liu, Maojun Zhang

Conventional image retrieval techniques for Structure-from-Motion (SfM) suffer from the limit of effectively recognizing repetitive patterns and cannot guarantee to create just enough match pairs with high precision and high recall.

Binary Classification Image Retrieval +1

Paper
Add Code

Commands 4 Autonomous Vehicles (C4AV) Workshop Summary

no code implementations • 18 Sep 2020 • Thierry Deruyttere, Simon Vandenhende, Dusan Grujicic, Yu Liu, Luc van Gool, Matthew Blaschko, Tinne Tuytelaars, Marie-Francine Moens

In this work, we deviate from recent, popular task settings and consider the problem under an autonomous vehicle scenario.

Ranked #3 on Referring Expression Comprehension on Talk2Car

Autonomous Vehicles Referring Expression Comprehension +1

Paper
Add Code

MulDE: Multi-teacher Knowledge Distillation for Low-dimensional Knowledge Graph Embeddings

no code implementations • 14 Oct 2020 • Kai Wang, Yu Liu, Qian Ma, Quan Z. Sheng

Link prediction based on knowledge graph embeddings (KGE) aims to predict new triples to automatically construct knowledge graphs (KGs).

Knowledge Distillation Knowledge Graph Embedding +2

Paper
Add Code

On the Exploration of Incremental Learning for Fine-grained Image Retrieval

1 code implementation • 15 Oct 2020 • Wei Chen, Yu Liu, Weiping Wang, Tinne Tuytelaars, Erwin M. Bakker, Michael Lew

On the other hand, fine-tuning the learned representation only with the new classes leads to catastrophic forgetting.

Image Retrieval Incremental Learning +1

Paper
Code

Electronic properties of InAs/EuS/Al hybrid nanowires

no code implementations • 12 Nov 2020 • Chun-Xiao Liu, Sergej Schuwalow, Yu Liu, Kostas Vilkelis, A. L. R. Manesco, P. Krogstrup, Michael Wimmer

We study the electronic properties of InAs/EuS/Al heterostructures as explored in a recent experiment [S. Vaitiekenas \emph{et al.}, Nat.

Mesoscale and Nanoscale Physics

Paper
Add Code

High Fermi velocities and small cyclotron masses in LaAlGe

no code implementations • 9 Dec 2020 • Zhixiang Hu, Qianheng Du, Yu Liu, D. Graf, C. Petrovic

We report quantum oscillation measurements of LaAlGe, a Lorentz-violating type-II Weyl semimetal with tilted Weyl cones.

Mesoscale and Nanoscale Physics Materials Science

Paper
Add Code

Precision test of statistical dynamics with state-to-state ultracold chemistry

no code implementations • 31 Dec 2020 • Yu Liu, Ming-Guang Hu, Matthew A. Nichols, Dongzheng Yang, Daiqian Xie, Hua Guo, Kang-Kuen Ni

Chemical reactions represent a class of quantum problems that challenge both the current theoretical understanding and computational capabilities.

Chemical Physics Atomic Physics Quantum Physics

Paper
Add Code

Self-supervised Temporal Learning

no code implementations • 1 Jan 2021 • Hao Shao, Yu Liu, Hongsheng Li

Inspired by spatial-based contrastive SSL, we show that significant improvement can be achieved by a proposed temporal-based contrastive learning approach, which includes three novel and efficient modules: temporal augmentations, temporal memory bank and SSTL loss.

Contrastive Learning Retrieval +3

Paper
Add Code

Fast MNAS: Uncertainty-aware Neural Architecture Search with Lifelong Learning

no code implementations • 1 Jan 2021 • Jihao Liu, Yangting Sun, Ming Zhang, Boxiao Liu, Yu Liu

Further, a life-long knowledge pool together with a block similarity function is proposed to utilize the lifelong parameter knowledge and reduces the searching time by 2 times.

Fairness Neural Architecture Search

Paper
Add Code

Switchable K-Class Hyperplanes for Noise-Robust Representation Learning

no code implementations • ICCV 2021 • Boxiao Liu, Guanglu Song, Manyuan Zhang, Haihang You, Yu Liu

When collaborated with the popular ArcFace on million-level data representation learning, we found that the switchable manner in SKH can effectively eliminate the gradient conflict generated by real-world label noise on a single K-class hyperplane.

Model Optimization Representation Learning +1

Paper
Add Code

Deep Learning for Instance Retrieval: A Survey

no code implementations • 27 Jan 2021 • Wei Chen, Yu Liu, Weiping Wang, Erwin Bakker, Theodoros Georgiou, Paul Fieguth, Li Liu, Michael S. Lew

In recent years a vast amount of visual content has been generated and shared from many fields, such as social media platforms, medical imaging, and robotics.

Content-Based Image Retrieval Instance Search +1

Paper
Add Code

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

no code implementations • 9 Feb 2021 • Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

However, scaling up the classification task from thousands of semantic labels to millions of instance labels brings specific challenges including 1) the large-scale softmax computation; 2) the slow convergence due to the infrequent visiting of instance samples; and 3) the massive number of negative classes that can be noisy.

Classification General Classification +1

Paper
Add Code

ACDnet: An action detection network for real-time edge computing based on flow-guided feature approximation and memory aggregation

1 code implementation • 26 Feb 2021 • Yu Liu, Fan Yang, Dominique Ginhac

Interpreting human actions requires understanding the spatial and temporal context of the scenes.

Action Detection Edge-computing

Paper
Code

Lifelong Person Re-Identification via Adaptive Knowledge Accumulation

1 code implementation • CVPR 2021 • Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

In this work we explore a new and challenging ReID task, namely lifelong person re-identification (LReID), which enables to learn continuously across multiple domains and even generalise on new and unseen domains.

Incremental Learning Person Re-Identification

Paper
Code

Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph Embeddings

no code implementations • Findings (EMNLP) 2021 • Kai Wang, Yu Liu, Dan Lin, Quan Z. Sheng

Recent knowledge graph embedding (KGE) models based on hyperbolic geometry have shown great potential in a low-dimensional embedding space.

Knowledge Graph Embedding Knowledge Graph Embeddings +1

Paper
Add Code

Self-supervised Video Representation Learning by Context and Motion Decoupling

1 code implementation • CVPR 2021 • Lianghua Huang, Yu Liu, Bin Wang, Pan Pan, Yinghui Xu, Rong Jin

A key challenge in self-supervised video representation learning is how to effectively capture motion information besides context bias.

Action Recognition motion prediction +3

Paper
Code

Integrating Information Theory and Adversarial Learning for Cross-modal Retrieval

no code implementations • 11 Apr 2021 • Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew

Moreover, feature encoders (as a generator) project uni-modal features into a commonly shared space and attempt to fool the discriminator by maximizing its output information entropy.

Cross-Modal Retrieval Retrieval

Paper
Add Code

Role-Aware Modeling for N-ary Relational Knowledge Bases

1 code implementation • 20 Apr 2021 • Yu Liu, Quanming Yao, Yong Li

N-ary relational knowledge bases (KBs) represent knowledge with binary and beyond-binary relational facts.

Knowledge Graphs

Paper
Code

A Novel Binocular Eye-Tracking SystemWith Stereo Stimuli for 3D Gaze Estimation

no code implementations • 25 Apr 2021 • Jinglin Sun, Zhipeng Wu, Han Wang, Peiguang Jing, Yu Liu

However, most current eye trackers focus on 2D point of gaze (PoG) estimation and cannot provide accurate gaze depth. Concerning future applications such as HCI with 3D displays, we propose a novel binocular eye tracking device with stereo stimuli to provide highly accurate 3D PoG estimation.

Depth Estimation Gaze Estimation

Paper
Add Code

FNAS: Uncertainty-Aware Fast Neural Architecture Search

no code implementations • 25 May 2021 • Jihao Liu, Ming Zhang, Yangting Sun, Boxiao Liu, Guanglu Song, Yu Liu, Hongsheng Li

Further, an architecture knowledge pool together with a block similarity function is proposed to utilize parameter knowledge and reduces the searching time by 2 times.

Fairness Neural Architecture Search +1

Paper
Add Code

Communication Efficient SGD via Gradient Sampling With Bayes Prior

no code implementations • CVPR 2021 • Liuyihan Song, Kang Zhao, Pan Pan, Yu Liu, Yingya Zhang, Yinghui Xu, Rong Jin

Different from all of them, we regard large and small gradients selection as the exploitation and exploration of gradient information, respectively.

Image Classification object-detection +2

Paper
Add Code

Curvature Graph Neural Network

no code implementations • 30 Jun 2021 • Haifeng Li, Jun Cao, Jiawei Zhu, Yu Liu, Qing Zhu, Guohua Wu

And we propose Curvature Graph Neural Network (CGNN), which effectively improves the adaptive locality ability of GNNs by leveraging the structural property of graph curvature.

Node Classification

Paper
Add Code

Real-time Keypoints Detection for Autonomous Recovery of the Unmanned Ground Vehicle

no code implementations • 27 Jul 2021 • Jie Li, Sheng Zhang, Kai Han, Xia Yuan, Chunxia Zhao, Yu Liu

UGV-KPNet is computationally efficient with a small number of parameters and provides pixel-level accurate keypoints detection results in real-time.

Keypoint Detection

Paper
Add Code

Unified Regularity Measures for Sample-wise Learning and Generalization

no code implementations • 9 Aug 2021 • Chi Zhang, Xiaoning Ma, Yu Liu, Le Wang, Yuanqi SU, Yuehu Liu

Fundamental machine learning theory shows that different samples contribute unequally both in learning and testing processes.

Learning Theory Memorization

Paper
Add Code

Iterative Self-consistent Parallel Magnetic Resonance Imaging Reconstruction based on Nonlocal Low-Rank Regularization

no code implementations • 10 Aug 2021 • Ting Pan, Jizhong Duan, Junfeng Wang, Yu Liu

Recent methods have exploited the nonlocal self-similarity (NSS) of images by imposing nonlocal low-rankness of similar patches to achieve a superior performance.

Paper
Add Code

Spatio-temporal Parking Behaviour Forecasting and Analysis Before and During COVID-19

no code implementations • 15 Aug 2021 • Shuhui Gong, Xiaopeng Mo, Rui Cao, Yu Liu, Wei Tu, Ruibin Bai

Parking demand forecasting and behaviour analysis have received increasing attention in recent years because of their critical role in mitigating traffic congestion and understanding travel behaviours.

Epidemiology graph construction +1

Paper
Add Code

LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders

no code implementations • 22 Sep 2021 • Xiaoyu Chen, Chen Gong, Qiang He, Xinwen Hou, Yu Liu

Variational autoencoders (VAEs), as an important aspect of generative models, have received a lot of research interests and reached many successful applications.

Image Generation

Paper
Add Code

The $f$-Divergence Reinforcement Learning Framework

no code implementations • 24 Sep 2021 • Chen Gong, Qiang He, Yunpeng Bai, Zhou Yang, Xiaoyu Chen, Xinwen Hou, Xianjie Zhang, Yu Liu, Guoliang Fan

In FRL, the policy evaluation and policy improvement phases are simultaneously performed by minimizing the $f$-divergence between the learning policy and sampling policy, which is distinct from conventional DRL algorithms that aim to maximize the expected cumulative rewards.

Decision Making Mathematical Proofs +2

Paper
Add Code

UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning

3 code implementations • ICLR 2022 • Kunchang Li, Yali Wang, Gao Peng, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao

For Something-Something V1 and V2, our UniFormer achieves new state-of-the-art performances of 60. 8% and 71. 4% top-1 accuracy respectively.

Ranked #8 on Action Recognition on Something-Something V1

Action Classification Action Recognition +1

2,986

Paper
Code

Self-Slimming Vision Transformer

no code implementations • 29 Sep 2021 • Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu

Specifically, we first design a novel Token Slimming Module (TSM), which can boost the inference efficiency of ViTs by dynamic token aggregation.

Knowledge Distillation

Paper
Add Code

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP

no code implementations • 8 Oct 2021 • Jihao Liu, Hongsheng Li, Guanglu Song, Xin Huang, Yu Liu

Recently, transformer and multi-layer perceptron (MLP) architectures have achieved impressive results on various vision tasks.

Ranked #236 on Image Classification on ImageNet

Image Classification object-detection +2

Paper
Add Code

Rectifying the Data Bias in Knowledge Distillation

no code implementations • ICCV 2021 • Boxiao Liu, Shenghan Zhang, Guanglu Song, Haihang You, Yu Liu

In this paper, we first quantitatively define the uniformity of the sampled data for training, providing a unified view for methods that learn from biased data.

Ranked #1 on Face Verification on IJB-C (training dataset metric)

Face Recognition Face Verification +3

Paper
Add Code

RelationRS: Relationship Representation Network for Object Detection in Aerial Images

no code implementations • 13 Oct 2021 • Zhiming Liu, Xuefei Zhang, Chongyang Liu, Hao Wang, Chao Sun, Bin Li, Weifeng Sun, Pu Huang, Qingjun Li, Yu Liu, Haipeng Kuang, Jihong Xiu

To address these issues, we propose a relationship representation network for object detection in aerial images (RelationRS): 1) Firstly, multi-scale features are fused and enhanced by a dual relationship module (DRM) with conditional convolution.

Object object-detection +1

Paper
Add Code

Spatio-Temporal Urban Knowledge Graph Enabled Mobility Prediction

no code implementations • 1 Nov 2021 • Huandong Wang, Qiaohong Yu, Yu Liu, Depeng Jin, Yong Li

Further, a complex embedding model with elaborately designed scoring functions is proposed to measure the plausibility of facts in STKG to solve the knowledge graph completion problem, which considers temporal dynamics of the mobility patterns and utilizes PoI categories as the auxiliary information and background knowledge.

Knowledge Graph Completion

Paper
Add Code

Knowledge-driven Site Selection via Urban Knowledge Graph

no code implementations • 1 Nov 2021 • Yu Liu, Jingtao Ding, Yong Li

Specifically, motivated by distilled knowledge and rich semantics in KG, we firstly construct an urban KG (UrbanKG) with cities' key elements and semantic relationships captured.

Feature Engineering

Paper
Add Code

INTERN: A New Learning Paradigm Towards General Vision

no code implementations • 16 Nov 2021 • Jing Shao, Siyu Chen, Yangguang Li, Kun Wang, Zhenfei Yin, Yinan He, Jianing Teng, Qinghong Sun, Mengya Gao, Jihao Liu, Gengshi Huang, Guanglu Song, Yichao Wu, Yuming Huang, Fenggang Liu, Huan Peng, Shuo Qin, Chengyu Wang, Yujie Wang, Conghui He, Ding Liang, Yu Liu, Fengwei Yu, Junjie Yan, Dahua Lin, Xiaogang Wang, Yu Qiao

Enormous waves of technological innovations over the past several years, marked by the advances in AI technologies, are profoundly reshaping the industry and the society.

Paper
Add Code

Learning by Active Forgetting for Neural Networks

no code implementations • 21 Nov 2021 • Jian Peng, Xian Sun, Min Deng, Chao Tao, Bo Tang, Wenbo Li, Guohua Wu, QingZhu, Yu Liu, Tao Lin, Haifeng Li

This paper presents a learning model by active forgetting mechanism with artificial neural networks.

Paper
Add Code

A Bayesian Model for Online Activity Sample Sizes

no code implementations • 23 Nov 2021 • Thomas Richardson, Yu Liu, James McQueen, Doug Hains

Given observations on the number of unique users participating in an initial period, we present a simple but novel Bayesian method for predicting the number of additional individuals who will participate during a subsequent period.

Paper
Add Code

Reviewing continual learning from the perspective of human-level intelligence

no code implementations • 23 Nov 2021 • Yifan Chang, Wenbo Li, Jian Peng, Bo Tang, Yu Kang, Yinjie Lei, Yuanmiao Gui, Qing Zhu, Yu Liu, Haifeng Li

Different from previous reviews that mainly focus on the catastrophic forgetting phenomenon in CL, this paper surveys CL from a more macroscopic perspective based on the Stability Versus Plasticity mechanism.

Continual Learning

Paper
Add Code

Self-slimmed Vision Transformer

1 code implementation • 24 Nov 2021 • Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu

Specifically, we first design a novel Token Slimming Module (TSM), which can boost the inference efficiency of ViTs by dynamic token aggregation.

Knowledge Distillation

Paper
Code

Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing

no code implementations • 24 Nov 2021 • Yu Liu, Mingbo Zhao, Zhao Zhang, Haijun Zhang, Shuicheng Yan

Based on this dataset, we then propose the Arbitrary Virtual Try-On Network (AVTON) that is utilized for all-type clothes, which can synthesize realistic try-on images by preserving and trading off characteristics of the target clothes and the reference person.

Geometric Matching Virtual Try-on

Paper
Add Code

Optimization of RIS Configurations for Multiple-RIS-Aided mmWave Positioning Systems based on CRLB Analysis

no code implementations • 28 Nov 2021 • Yu Liu, Sheng Hong, Cunhua Pan, Yinlu Wang, Yijin Pan, Ming Chen

Reconfigurable intelligent surface (RIS) is a promising technology for future millimeter-wave (mmWave) communication systems.

Paper
Add Code

Overcome Anterograde Forgetting with Cycled Memory Networks

no code implementations • 4 Dec 2021 • Jian Peng, Dingqi Ye, Bo Tang, Yinjie Lei, Yu Liu, Haifeng Li

This work proposes a general framework named Cycled Memory Networks (CMN) to address the anterograde forgetting in neural networks for lifelong learning.

Transfer Learning

Paper
Add Code

Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution

1 code implementation • 9 Dec 2021 • Yunpeng Bai, Chen Gong, Bin Zhang, Guoliang Fan, Xinwen Hou, Yu Liu

HGCN-MIX models agents as well as their relationships as a hypergraph, where agents are nodes and hyperedges among nodes indicate that the corresponding agents can coordinate to achieve larger rewards.

reinforcement-learning Reinforcement Learning (RL) +4

Paper
Code

Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition

no code implementations • 22 Dec 2021 • Qingyuan Gong, Yu Liu, Liqiang Zhang, Renhe Liu

Visual place recognition (VPR) is a challenging task with the unbalance between enormous computational cost and high recognition performance.

Visual Place Recognition

Paper
Add Code

Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way

1 code implementation • CVPR 2022 • Qi Jia, Shuilian Yao, Yu Liu, Xin Fan, Risheng Liu, Zhongxuan Luo

To tackle camouflaged object detection (COD), we are inspired by humans attention coupled with the coarse-to-fine detection strategy, and thereby propose an iterative refinement framework, coined SegMaR, which integrates Segment, Magnify and Reiterate in a multi-stage detection fashion.

object-detection Object Detection

Paper
Code

Swift and Sure: Hardness-aware Contrastive Learning for Low-dimensional Knowledge Graph Embeddings

no code implementations • 3 Jan 2022 • Kai Wang, Yu Liu, Quan Z. Sheng

Knowledge graph embedding (KGE) has shown great potential in automatic knowledge graph (KG) completion and knowledge-driven tasks.

Contrastive Learning Knowledge Graph Embedding +1

Paper
Add Code

UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning

2 code implementations • 12 Jan 2022 • Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao

For Something-Something V1 and V2, our UniFormer achieves new state-of-the-art performances of 60. 9% and 71. 2% top-1 accuracy respectively.

Representation Learning

2,986

Paper
Code

UniFormer: Unifying Convolution and Self-attention for Visual Recognition

7 code implementations • 24 Jan 2022 • Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao

Different from the typical transformer blocks, the relation aggregators in our UniFormer block are equipped with local and global token affinity respectively in shallow and deep layers, allowing to tackle both redundancy and dependency for efficient and effective representation learning.

Ranked #151 on Image Classification on ImageNet

Image Classification object-detection +5

777

Paper
Code

Pedestrian Dead Reckoning System using Quasi-static Magnetic Field Detection

no code implementations • 24 Jan 2022 • Liqiang Zhang, Kai Guo, Yu Liu

Kalman filter-based Inertial Navigation System (INS) is a reliable and efficient method to estimate the position of a pedestrian indoors.

Position valid

Paper
Add Code

Meta Knowledge Distillation

no code implementations • 16 Feb 2022 • Jihao Liu, Boxiao Liu, Hongsheng Li, Yu Liu

Recent studies pointed out that knowledge distillation (KD) suffers from two degradation problems, the teacher-student gap and the incompatibility with strong data augmentations, making it not applicable to training state-of-the-art models, which are trained with advanced augmentations.

Ranked #133 on Image Classification on ImageNet

Data Augmentation Image Classification +1

Paper
Add Code

Mapping evolving population geography in China

1 code implementation • 4 Mar 2022 • Lei Dong, Rui Du, Yu Liu

China's demographic changes have important global economic and geopolitical implications.

Paper
Code

Dual-Domain Reconstruction Networks with V-Net and K-Net for fast MRI

no code implementations • 11 Mar 2022 • Xiaohan Liu, Yanwei Pang, Ruiqi Jin, Yu Liu, ZhenChang Wang

Purpose: To introduce a dual-domain reconstruction network with V-Net and K-Net for accurate MR image reconstruction from undersampled k-space data.

Image Reconstruction

Paper
Add Code

Privacy-Preserving Personalized Fitness Recommender System (P3FitRec): A Multi-level Deep Learning Approach

1 code implementation • 23 Mar 2022 • Xiao Liu, Bonan Gao, Basem Suleiman, Han You, Zisu Ma, Yu Liu, Ali Anaissi

Recommender systems have been successfully used in many domains with the help of machine learning algorithms.

Privacy Preserving Recommendation Systems

Paper
Code

HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding

1 code implementation • 30 Apr 2022 • Xiaosong Jia, Penghao Wu, Li Chen, Yu Liu, Hongyang Li, Junchi Yan

Based on these observations, we propose Heterogeneous Driving Graph Transformer (HDGT), a backbone modelling the driving scene as a heterogeneous graph with different types of nodes and edges.

Autonomous Driving graph construction +2

Paper
Code

HoVer-Trans: Anatomy-aware HoVer-Transformer for ROI-free Breast Cancer Diagnosis in Ultrasound Images

no code implementations • 17 May 2022 • Yuhao Mo, Chu Han, Yu Liu, Min Liu, Zhenwei Shi, Jiatai Lin, Bingchao Zhao, Chunwang Huang, Bingjiang Qiu, Yanfen Cui, Lei Wu, Xipeng Pan, Zeyan Xu, Xiaomei Huang, Zaiyi Liu, Ying Wang, Changhong Liang

In this study, we propose a novel ROI-free model for breast cancer diagnosis in ultrasound images with interpretable feature representations.

Anatomy Classification +1

Paper
Add Code

MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers

1 code implementation • CVPR 2023 • Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li

In this paper, we propose Mixed and Masked AutoEncoder (MixMAE), a simple but efficient pretraining method that is applicable to various hierarchical Vision Transformers.

Ranked #2 on Image Classification on Places205

Image Classification Object Detection +2

122

Paper
Code

Enhancing Quality of Pose-varied Face Restoration with Local Weak Feature Sensing and GAN Prior

no code implementations • 28 May 2022 • Kai Hu, Yu Liu, Renhe Liu, Wei Lu, Gang Yu, Bin Fu

In the asymmetric codec, we adopt a mixed multi-path residual block (MMRB) to gradually extract weak texture features of input images, which can better preserve the original facial features and avoid excessive fantasy.

Blind Face Restoration Super-Resolution

Paper
Add Code

Extending regionalization algorithms to explore spatial process heterogeneity

2 code implementations • 19 Jun 2022 • Hao Guo, Andre Python, Yu Liu

In spatial regression models, spatial heterogeneity may be considered with either continuous or discrete specifications.

regression

Paper
Code

DeStripe: A Self2Self Spatio-Spectral Graph Neural Network with Unfolded Hessian for Stripe Artifact Removal in Light-sheet Microscopy

no code implementations • 27 Jun 2022 • Yu Liu, Kurt Weiss, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng

Light-sheet fluorescence microscopy (LSFM) is a cutting-edge volumetric imaging technique that allows for three-dimensional imaging of mesoscopic samples with decoupled illumination and detection paths.

Denoising

Paper
Add Code

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP

2 code implementations • 12 Jul 2022 • Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu

Finally, we integrate configurable operators and DSMs into a unified search space and search with a Reinforcement Learning-based search algorithm to fully explore the optimal combination of the operators.

Ranked #12 on Neural Architecture Search on ImageNet

Image Classification Neural Architecture Search

Paper
Code

TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers

1 code implementation • 18 Jul 2022 • Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu

In this paper, we propose a novel data augmentation technique TokenMix to improve the performance of vision transformers.

Data Augmentation

Paper
Code

Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer

1 code implementation • 28 Jul 2022 • Hao Shao, Letian Wang, RuoBing Chen, Hongsheng Li, Yu Liu

Large-scale deployment of autonomous vehicles has been continually delayed due to safety concerns.

Ranked #2 on Autonomous Driving on CARLA Leaderboard

Autonomous Driving CARLA longest6 +3

458

Paper
Code

Rethinking Robust Representation Learning Under Fine-grained Noisy Faces

no code implementations • 8 Aug 2022 • Bingqi Ma, Guanglu Song, Boxiao Liu, Yu Liu

To better understand this, we reformulate the noise type of each class in a more fine-grained manner as N-identities|K^C-clusters.

Face Recognition Representation Learning

Paper
Add Code

Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization

1 code implementation • 18 Aug 2022 • Xizhe Xue, Dongdong Yu, Lingqiao Liu, Yu Liu, Satoshi Tsutsui, Ying Li, Zehuan Yuan, Ping Song, Mike Zheng Shou

Based on the single-stage instance segmentation framework, we propose a regularization model to predict foreground pixels and use its relation to instance segmentation to construct a cross-task consistency loss.

Autonomous Driving Object +3

Paper
Code

Unifying Visual Perception by Dispersible Points Learning

1 code implementation • 18 Aug 2022 • Jianming Liang, Guanglu Song, Biao Leng, Yu Liu

The method, called UniHead, views different visual perception tasks as the dispersible points learning via the transformer encoder architecture.

Instance Segmentation Object +5

Paper
Code

Towards Robust Face Recognition with Comprehensive Search

no code implementations • 29 Aug 2022 • Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li

To eliminate the bias of single-aspect research and provide an overall understanding of the face recognition model design, we first carefully design the search space for each aspect, then a comprehensive search method is introduced to jointly search optimal data cleaning, architecture, and loss function design.

Face Recognition Robust Face Recognition

Paper
Add Code

EEG-based Emotion Recognition via Efficient Convolutional Neural Network and Contrastive Learning

no code implementations • IEEE Sensors Journal 2022 • Chang Li, Xuejuan Lin, Yu Liu, Rencheng Song, Juan Cheng, Xun Chen

To achieve a simple and effective model with supervised learning, we propose an efficient CNN and contrastive learning (ECNN-C) method for EEG-based emotion recognition.

Contrastive Learning EEG +1

Paper
Add Code

SOM-Net: Unrolling the Subspace-based Optimization for Solving Full-wave Inverse Scattering Problems

no code implementations • 8 Sep 2022 • Yu Liu, Hao Zhao, Rencheng Song, Xudong Chen, Chang Li, Xun Chen

The final output of the SOM-Net is the full predicted induced current, from which the scattered field and the permittivity image can also be deduced analytically.

Rolling Shutter Correction

Paper
Add Code

DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models

no code implementations • 16 Oct 2022 • Yueqin Yin, Lianghua Huang, Yu Liu, Kaiqi Huang

In this work, we first design a group of mechanisms to simulate generative artifacts of popular generators (i. e., GANs, autoregressive models, and diffusion models), given real images.

Image Generation Image Restoration

Paper
Add Code

Improving Object-centric Learning with Query Optimization

2 code implementations • 17 Oct 2022 • Baoxiong Jia, Yu Liu, Siyuan Huang

The ability to decompose complex natural scenes into meaningful object-centric abstractions lies at the core of human perception and reasoning.

Image Segmentation Object +3

Paper
Code

Large-batch Optimization for Dense Visual Predictions

1 code implementation • 20 Oct 2022 • Zeyue Xue, Jianming Liang, Guanglu Song, Zhuofan Zong, Liang Chen, Yu Liu, Ping Luo

To address this challenge, we propose a simple yet effective algorithm, named Adaptive Gradient Variance Modulator (AGVM), which can train dense visual predictors with very large batch size, enabling several benefits more appealing than prior arts.

Instance Segmentation object-detection +3

Paper
Code

PolyBuilding: Polygon Transformer for End-to-End Building Extraction

no code implementations • 3 Nov 2022 • Yuan Hu, Zhibin Wang, Zhou Huang, Yu Liu

Given a set of polygon queries, the model learns the relations among them and encodes context information from the image to predict the final set of building polygons with fixed vertex numbers.

Paper
Add Code

Channel Tracking for RIS-aided mmWave Communications Under High Mobility Scenarios

no code implementations • 7 Nov 2022 • Yu Liu, Ming Chen, Cunhua Pan, Yijin Pan, Yinlu Wang, Yaoming Huang, Tianyang Cao, Jiangzhou Wang

The emerging reconfigurable intelligent surface (RIS) technology is promising for applications in the millimeter wave (mmWave) communication systems to effectively compensate for propagation loss or tackle the blockage issue.

Vocal Bursts Intensity Prediction

Paper
Add Code

Semantic Encoder Guided Generative Adversarial Face Ultra-Resolution Network

no code implementations • 18 Nov 2022 • Xiang Wang, Yimin Yang, Qixiang Pang, Xiao Lu, Yu Liu, Shan Du

In this paper, we propose a novel face super-resolution method, namely Semantic Encoder guided Generative Adversarial Face Ultra-Resolution Network (SEGA-FURN) to ultra-resolve an unaligned tiny LR face image to its HR counterpart with multiple ultra-upscaling factors (e. g., 4x and 8x).

Image Super-Resolution

Paper
Add Code

Real-World Image Super Resolution via Unsupervised Bi-directional Cycle Domain Transfer Learning based Generative Adversarial Network

no code implementations • 19 Nov 2022 • Xiang Wang, Yimin Yang, Zhichang Guo, Zhili Zhou, Yu Liu, Qixiang Pang, Shan Du

First, the UBCDTN is able to produce an approximated real-like LR image through transferring the LR image from an artificially degraded domain to the real-world LR image domain.

Generative Adversarial Network Image Super-Resolution +1

Paper
Add Code

DETRs with Collaborative Hybrid Assignments Training

3 code implementations • ICCV 2023 • Zhuofan Zong, Guanglu Song, Yu Liu

This new training scheme can easily enhance the encoder's learning ability in end-to-end detectors by training the multiple parallel auxiliary heads supervised by one-to-many label assignments such as ATSS and Faster RCNN.

Ranked #1 on Object Detection on LVIS v1.0 val (using extra training data)

Instance Segmentation Object Detection +1

27,765

Paper
Code

Unsupervised Domain Adaptation GAN Inversion for Image Editing

no code implementations • 22 Nov 2022 • Siyu Xing, Chen Gong, Hewei Guo, Xiao-Yu Zhang, Xinwen Hou, Yu Liu

In this paper, we resolve this problem by introducing Unsupervised Domain Adaptation (UDA) into the Inversion process, namely UDA-Inversion, for both high-quality and low-quality image inversion and editing.

Image Reconstruction Unsupervised Domain Adaptation

Paper
Add Code

Teach-DETR: Better Training DETR with Teachers

1 code implementation • 22 Nov 2022 • Linjiang Huang, Kaixin Lu, Guanglu Song, Liang Wang, Si Liu, Yu Liu, Hongsheng Li

In this paper, we present a novel training scheme, namely Teach-DETR, to learn better DETR-based detectors from versatile teacher detectors.

Paper
Code

Dimensionality-Varying Diffusion Process

no code implementations • CVPR 2023 • Han Zhang, Ruili Feng, Zhantao Yang, Lianghua Huang, Yu Liu, Yifei Zhang, Yujun Shen, Deli Zhao, Jingren Zhou, Fan Cheng

Diffusion models, which learn to reverse a signal destruction process to generate new data, typically require the signal at each step to have the same dimension.

Image Generation

Paper
Add Code

ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency

1 code implementation • 29 Nov 2022 • Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang

In the learning phase, each agent minimizes the TD error that is dependent on how the subsequent agents have reacted to their chosen action.

Ranked #1 on SMAC on SMAC 3s5z_vs_3s6z

Decision Making Q-Learning +2

179

Paper
Code

Hyperbolic Hierarchical Contrastive Hashing

no code implementations • 17 Dec 2022 • Rukai Wei, Yu Liu, Jingkuan Song, Yanzhao Xie, Ke Zhou

To exploit the hierarchical semantic structures in hyperbolic space, we designed the hierarchical contrastive learning algorithm, including hierarchical instance-wise and hierarchical prototype-wise contrastive learning.

Contrastive Learning Retrieval

Paper
Add Code

Masked Autoencoders Are Stronger Knowledge Distillers

no code implementations • ICCV 2023 • Shanshan Lao, Guanglu Song, Boxiao Liu, Yu Liu, Yujiu Yang

In MKD, random patches of the input image are masked, and the corresponding missing feature is recovered by forcing it to imitate the output of the teacher.

Knowledge Distillation object-detection +2

Paper
Add Code

Deep Active Contours for Real-time 6-DoF Object Tracking

no code implementations • ICCV 2023 • Long Wang, Shen Yan, Jianan Zhen, Yu Liu, Maojun Zhang, Guofeng Zhang, Xiaowei Zhou

Specifically, given an initial pose, we project the object model to the image plane to obtain the initial contour and use a lightweight network to predict how the contour should move to match the true object boundary, which provides the gradients to optimize the object pose.

Computational Efficiency Object +1

Paper
Add Code

UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors

no code implementations • ICCV 2023 • Shanshan Lao, Guanglu Song, Boxiao Liu, Yu Liu, Yujiu Yang

Bridging this semantic gap now requires case-by-case algorithm design which is time-consuming and heavily relies on experienced adjustment.

Knowledge Distillation

Paper
Add Code

Generating Dynamic Kernels via Transformers for Lane Detection

1 code implementation • ICCV 2023 • Ziye Chen, Yu Liu, Mingming Gong, Bo Du, Guoqi Qian, Kate Smith-Miles

While such methods reduce the reliance on specific knowledge, the kernels computed from the key locations fail to capture the lane line's global structure due to its long and thin structure, leading to inaccurate detection of lane lines with complex topologies.

Ranked #1 on Lane Detection on CurveLanes

Lane Detection

Paper
Code

Urban Visual Intelligence: Studying Cities with AI and Street-level Imagery

no code implementations • 2 Jan 2023 • Fan Zhang, Arianna Salazar Miranda, Fábio Duarte, Lawrence Vale, Gary Hack, Min Chen, Yu Liu, Michael Batty, Carlo Ratti

The visual dimension of cities has been a fundamental subject in urban studies, since the pioneering work of scholars such as Sitte, Lynch, Arnheim, and Jacobs.

Paper
Add Code

Centralized Cooperative Exploration Policy for Continuous Control Tasks

1 code implementation • 6 Jan 2023 • Chao Li, Chen Gong, Qiang He, Xinwen Hou, Yu Liu

To explicitly encourage exploration in continuous control tasks, we propose CCEP (Centralized Cooperative Exploration Policy), which utilizes underestimation and overestimation of value functions to maintain the capacity of exploration.

Continuous Control

Paper
Code

Dimension Reduction and MARS

1 code implementation • 11 Feb 2023 • Yu Liu, Degui Li, Yingcun Xia

The multivariate adaptive regression spline (MARS) is one of the popular estimation methods for nonparametric multivariate regressions.

Dimensionality Reduction regression

Paper
Code

SLOTH: Structured Learning and Task-based Optimization for Time Series Forecasting on Hierarchies

no code implementations • 11 Feb 2023 • Fan Zhou, Chen Pan, Lintao Ma, Yu Liu, Shiyu Wang, James Zhang, Xinxin Zhu, Xuanwei Hu, Yunhua Hu, Yangfei Zheng, Lei Lei, Yun Hu

Moreover, unlike most previous reconciliation methods which either rely on strong assumptions or focus on coherent constraints only, we utilize deep neural optimization networks, which not only achieve coherency without any assumptions, but also allow more flexible and realistic constraints to achieve task-based targets, e. g., lower under-estimation penalty and meaningful decision-making loss to facilitate the subsequent downstream tasks.

Decision Making Multivariate Time Series Forecasting +1

Paper
Add Code

Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior

no code implementations • 13 Feb 2023 • Shen Yan, Xiaoya Cheng, Yuxiang Liu, Juelin Zhu, Rouwan Wu, Yu Liu, Maojun Zhang

Despite the significant progress in 6-DoF visual localization, researchers are mostly driven by ground-level benchmarks.

Pose Estimation Visual Localization

Paper
Add Code

Composer: Creative and Controllable Image Synthesis with Composable Conditions

6 code implementations • 20 Feb 2023 • Lianghua Huang, Di Chen, Yu Liu, Yujun Shen, Deli Zhao, Jingren Zhou

Recent large-scale generative models learned on big data are capable of synthesizing incredible images yet suffer from limited controllability.

Image Colorization Image-to-Image Translation +3

6,039

Paper
Code

Deep Learning for Video-Text Retrieval: a Review

no code implementations • 24 Feb 2023 • Cunjuan Zhu, Qi Jia, Wei Chen, Yanming Guo, Yu Liu

Video-Text Retrieval (VTR) aims to search for the most relevant video related to the semantics in a given sentence, and vice versa.

Retrieval Sentence +2

Paper
Add Code

Knowledge-infused Contrastive Learning for Urban Imagery-based Socioeconomic Prediction

1 code implementation • 25 Feb 2023 • Yu Liu, Xin Zhang, Jingtao Ding, Yanxin Xi, Yong Li

To address such issues, in this paper, we propose a Knowledge-infused Contrastive Learning (KnowCL) model for urban imagery-based socioeconomic prediction.

Contrastive Learning Representation Learning

Paper
Code

Comparison and Analysis of Cognitive Load under 2D/3D Visual Stimuli

no code implementations • 25 Feb 2023 • Yu Liu, Chen Song, Yunpeng Yin, Herui Shi, Jinglin Sun, Han Wang, Peiguang Jing

However, for videos that involve calculation tasks (P > 0. 05), the difference in cognitive load induced by 2D and 3D stimuli is not as pronounced.

EEG Experimental Design

Paper
Add Code

Cones: Concept Neurons in Diffusion Models for Customized Generation

1 code implementation • 9 Mar 2023 • Zhiheng Liu, Ruili Feng, Kai Zhu, Yifei Zhang, Kecheng Zheng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao

Concatenating multiple clusters of concept neurons can vividly generate all related concepts in a single image.

6,039

Paper
Code

Presentation Attack Detection with Advanced CNN Models for Noncontact-based Fingerprint Systems

no code implementations • 9 Mar 2023 • Sandip Purnapatra, Conor Miller-Lynch, Stephen Miner, Yu Liu, Keivan Bahmani, Soumyabrata Dey, Stephanie Schuckers

Touch-based fingerprint biometrics is one of the most popular biometric modalities with applications in several fields.

Paper
Add Code

Deep Learning-based Eye-Tracking Analysis for Diagnosis of Alzheimer's Disease Using 3D Comprehensive Visual Stimuli

no code implementations • 13 Mar 2023 • Fangyu Zuo, Peiguang Jing, Jinglin Sun, Jizhong, Duan, Yong Ji, Yu Liu

To better analyze the differences in visual attention between AD patients and normals, we first conduct a 3D comprehensive visual task on a non-invasive eye-tracking system to collect visual attention heatmaps.

Paper
Add Code

GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding

1 code implementation • ICCV 2023 • Jihao Liu, Tai Wang, Boxiao Liu, Qihang Zhang, Yu Liu, Hongsheng Li

In this paper, we propose Geometry Enhanced Masked Image Modeling (GeoMIM) to transfer the knowledge of the LiDAR model in a pretrain-finetune paradigm for improving the multi-view camera-based 3D detection.

3D Object Detection object-detection +1

Paper
Code

End-to-End Personalized Next Location Recommendation via Contrastive User Preference Modeling

no code implementations • 22 Mar 2023 • Yan Luo, Ye Liu, Fu-Lai Chung, Yu Liu, Chang Wen Chen

History encoder is designed to model mobility patterns from historical check-in sequences, while query generator explicitly learns user preferences to generate user-specific intention queries.

Paper
Add Code

TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs

no code implementations • 29 Mar 2023 • Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan

On the other hand, there are also many existing models and systems (symbolic-based or neural-based) that can do some domain-specific tasks very well.

Code Generation Common Sense Reasoning +1

Paper
Add Code

Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring

no code implementations • 2 Apr 2023 • Runzhe Wan, Yu Liu, James McQueen, Doug Hains, Rui Song

With the growing needs of online A/B testing to support the innovation in industry, the opportunity cost of running an experiment becomes non-negligible.

Decision Making reinforcement-learning

Paper
Add Code

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

1 code implementation • ICCV 2023 • Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, Yu Liu

The HoP approach is straightforward: given the current timestamp t, we generate a pseudo Bird's-Eye View (BEV) feature of timestamp t-k from its adjacent frames and utilize this feature to predict the object set at timestamp t-k. Our approach is motivated by the observation that enforcing the detector to capture both the spatial location and temporal motion of objects occurring at historical timestamps can lead to more accurate BEV feature learning.

Ranked #3 on 3D Object Detection on nuScenes Camera Only

3D Object Detection Object

171

Paper
Code

Long-term Visual Localization with Mobile Sensors

no code implementations • CVPR 2023 • Shen Yan, Yu Liu, Long Wang, Zehong Shen, Zhen Peng, Haomin Liu, Maojun Zhang, Guofeng Zhang, Xiaowei Zhou

Despite the remarkable advances in image matching and pose estimation, image-based localization of a camera in a temporally-varying outdoor environment is still a challenging problem due to huge appearance disparity between query and reference images caused by illumination, seasonal and structural changes.

Image-Based Localization Pose Estimation +1

Paper
Add Code

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors

1 code implementation • 8 May 2023 • Letian Wang, Jie Liu, Hao Shao, Wenshuo Wang, RuoBing Chen, Yu Liu, Steven L. Waslander

Inspired by this, we propose ASAP-RL, an efficient reinforcement learning algorithm for autonomous driving that simultaneously leverages motion skills and expert priors.

Autonomous Driving reinforcement-learning

Paper
Code

ReasonNet: End-to-End Driving with Temporal and Global Reasoning

no code implementations • CVPR 2023 • Hao Shao, Letian Wang, RuoBing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu

The large-scale deployment of autonomous vehicles is yet to come, and one of the major remaining challenges lies in urban dense traffic scenarios.

Ranked #1 on Autonomous Driving on CARLA Leaderboard

Autonomous Driving

Paper
Add Code

Weakly Supervised AUC Optimization: A Unified Partial AUC Approach

no code implementations • 23 May 2023 • Zheng Xie, Yu Liu, Hao-Yuan He, Ming Li, Zhi-Hua Zhou

Since acquiring perfect supervision is usually difficult, real-world machine learning tasks often confront inaccurate, incomplete, or inexact supervision, collectively referred to as weak supervision.

Paper
Add Code

AUC Optimization from Multiple Unlabeled Datasets

no code implementations • 25 May 2023 • Zheng Xie, Yu Liu, Ming Li

In this paper, we study the problem of building an AUC (area under ROC curve) optimization model from multiple unlabeled datasets, which maximizes the pairwise ranking ability of the classifier.

Weakly-supervised Learning

Paper
Add Code

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

no code implementations • NeurIPS 2023 • Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo

Text-to-image generation has recently witnessed remarkable achievements.

Ranked #11 on Text-to-Image Generation on MS COCO

Text-to-Image Generation

Paper
Add Code

Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising

1 code implementation • 29 May 2023 • Fu-Yun Wang, Wenshuo Chen, Guanglu Song, Han-Jia Ye, Yu Liu, Hongsheng Li

To address this challenge, we introduce a novel paradigm dubbed as Gen-L-Video, capable of extending off-the-shelf short video diffusion models for generating and editing videos comprising hundreds of frames with diverse semantic segments without introducing additional training, all while preserving content consistency.

Denoising Image Generation +2

259

Paper
Code

Cones 2: Customizable Image Synthesis with Multiple Subjects

1 code implementation • 30 May 2023 • Zhiheng Liu, Yifei Zhang, Yujun Shen, Kecheng Zheng, Kai Zhu, Ruili Feng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao

Synthesizing images with user-specified subjects has received growing attention due to its practical applications.

Image Generation

484

Paper
Code

Video Diffusion Models with Local-Global Context Guidance

1 code implementation • 5 Jun 2023 • Siyuan Yang, Lu Zhang, Yu Liu, Zhizhuo Jiang, You He

We construct a local-global context guidance strategy to capture the multi-perceptual embedding of the past fragment to boost the consistency of future prediction.

Future prediction Unconditional Video Generation +1

Paper
Code

Learning Search-Space Specific Heuristics Using Neural Networks

no code implementations • 6 Jun 2023 • Yu Liu, Ryo Kuroiwa, Alex Fukunaga

We propose and evaluate a system which learns a neuralnetwork heuristic function for forward search-based, satisficing classical planning.

regression

Paper
Add Code

Eliminating Lipschitz Singularities in Diffusion Models

no code implementations • 20 Jun 2023 • Zhantao Yang, Ruili Feng, Han Zhang, Yujun Shen, Kai Zhu, Lianghua Huang, Yifei Zhang, Yu Liu, Deli Zhao, Jingren Zhou, Fan Cheng

Diffusion models, which employ stochastic differential equations to sample images through integrals, have emerged as a dominant class of generative models.

Paper
Add Code

OpenSiteRec: An Open Dataset for Site Recommendation

no code implementations • 3 Jul 2023 • Xinhang Li, Xiangyu Zhao, Yejing Wang, Yu Liu, Yong Li, Cheng Long, Yong Zhang, Chunxiao Xing

As a representative information retrieval task, site recommendation, which aims at predicting the optimal sites for a brand or an institution to open new branches in an automatic data-driven way, is beneficial and crucial for brand development in modern business.

Benchmarking Information Retrieval +1

Paper
Add Code

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

2 code implementations • 7 Jul 2023 • Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Yu Liu, Kai Chen, Ping Luo

Before sending to LLM, the reference is replaced by RoI features and interleaved with language embeddings as a sequence.

Ranked #1 on Visual Question Answering (VQA) on VCR (Q-AR) test

Attribute Common Sense Reasoning +4

452

Paper
Code

AnyDoor: Zero-shot Object-level Image Customization

2 code implementations • 18 Jul 2023 • Xi Chen, Lianghua Huang, Yu Liu, Yujun Shen, Deli Zhao, Hengshuang Zhao

This work presents AnyDoor, a diffusion-based image generator with the power to teleport target objects to new scenes at user-specified locations in a harmonious way.

Object Virtual Try-on

6,039

Paper
Code

A Physics-Informed Data-Driven Fault Location Method for Transmission Lines Using Single-Ended Measurements with Field Data Validation

no code implementations • 19 Jul 2023 • Yiqi Xing, Yu Liu, Dayou Lu, Xinchen Zou, Xuming He

This procedure merges the gap between simulation and practical power systems, and at the same time considers the uncertainty of system and fault parameters in practice.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.