API-Net: Robust Generative Classifier via a Single Discriminator

1 code implementation ECCV 2020 Xinshuai Dong, Hong Liu, Rongrong Ji, Liujuan Cao, Qixiang Ye, Jianzhuang Liu, Qi Tian

On the contrary, a discriminative classifier only models the conditional distribution of labels given inputs, but benefits from effective optimization owing to its succinct structure.

Robust classification

Virtual Adversarial Training for Semi-supervised Breast Mass Classification

no code implementations 25 Jan 2022 Xuxin Chen, Ximin Wang, Ke Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen Qiu

This study aims to develop a novel computer-aided diagnosis (CAD) scheme for mammographic breast mass classification using semi-supervised learning.

Deep Facial Synthesis: A New Challenge

4 code implementations 31 Dec 2021 Deng-Ping Fan, Ziling Huang, Peng Zheng, Hong Liu, Xuebin Qin, Luc van Gool

The goal of this paper is to conduct a comprehensive study on the facial sketch synthesis (FSS) problem.

Image-to-Image Translation Style Transfer

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking

1 code implementation 14 Dec 2021 Yidi Li, Hong Liu, Hao Tang

Multi-modal fusion is proven to be an effective method to improve the accuracy and robustness of speaker tracking, especially in complex scenarios.

Self-Supervised Learning

Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition

1 code implementation 7 Dec 2021 Tianyu Guo, Hong Liu, Zhan Chen, Mengyuan Liu, Tao Wang, Runwei Ding

In this paper, to make better use of the movement patterns introduced by extreme augmentations, a Contrastive Learning framework utilizing Abundant Information Mining for self-supervised action Representation (AimCLR) is proposed.

Contrastive Learning Representation Learning +2

Pose-guided Feature Disentangling for Occluded Person Re-identification Based on Transformer

1 code implementation 5 Dec 2021 Tao Wang, Hong Liu, Pinhao Song, Tianyu Guo, Wei Shi

Therefore, we propose a transformer-based Pose-guided Feature Disentangling (PFD) method that uses pose information to clearly disentangle semantic components (e.g., human body or joint parts) and selectively match non-occluded parts correspondingly.

Person Re-Identification

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

1 code implementation 24 Nov 2021 Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc van Gool

Estimating 3D human poses from monocular videos is a challenging task due to depth ambiguity and self-occlusion.

3D Human Pose Estimation

Improving Camouflaged Object Detection with the Uncertainty of Pseudo-edge Labels

1 code implementation 29 Oct 2021 Nobukatsu Kajiura, Hong Liu, Shin'ichi Satoh

This framework consists of three key components, i.e., a pseudo-edge generator, a pseudo-map generator, and an uncertainty-aware refinement module.

Object Detection

User Multi-Interest Modeling for Behavioral Cognition

no code implementations 20 Oct 2021 Bei Yang, Ke Liu, Xiaoxiao Xu, Renjun Xu, Qinghui Sun, Hong Liu, Huan Xu

Representation modeling based on user behavior sequences is an important direction in user cognition.

Contrastive Learning Dimensionality Reduction

Self-supervised Learning is More Robust to Dataset Imbalance

no code implementations 11 Oct 2021 Hong Liu, Jeff Z. HaoChen, Adrien Gaidon, Tengyu Ma

Third, inspired by the theoretical insights, we devise a re-weighted regularization technique that consistently improves the SSL representation quality on imbalanced datasets with several evaluation criteria, closing the small gap between balanced and imbalanced datasets with the same number of examples.

Self-Supervised Learning

Sound Event Detection Transformer: An Event-based End-to-End Model for Sound Event Detection

no code implementations 5 Oct 2021 Zhirong Ye, Xiangdong Wang, Hong Liu, Yueliang Qian, Rui Tao, Long Yan, Kazushige Ouchi

A critical issue with the frame-based model is that it pursues the best frame-level prediction rather than the best event-level prediction.

Audio Tagging Boundary Detection +4

Interest-oriented Universal User Representation via Contrastive Learning

no code implementations 18 Sep 2021 Qinghui Sun, Jie Gu, Bei Yang, Xiaoxiao Xu, Renjun Xu, Shangde Gao, Hong Liu, Huan Xu

Universal user representation has attracted considerable interest recently, since it frees us from the cumbersome work of training a specific model for each downstream application.

Contrastive Learning Representation Learning +1

Variational Latent-State GPT for Semi-supervised Task-Oriented Dialog Systems

no code implementations 9 Sep 2021 Hong Liu, Yucheng Cai, Zhenru Lin, Zhijian Ou, Yi Huang, Junlan Feng

In this paper, we propose the Variational Latent-State GPT model (VLS-GPT), which is the first to combine the strengths of the two approaches.

Efficient Transformer for Single Image Super-Resolution

no code implementations 25 Aug 2021 Zhisheng Lu, Hong Liu, Juncheng Li, Linlin Zhang

However, because of the heavy computational cost and high GPU memory usage of the vision Transformer, the network cannot be made too deep.

Image Super-Resolution

Towards Robustness Against Natural Language Word Substitutions

1 code implementation ICLR 2021 Xinshuai Dong, Anh Tuan Luu, Rongrong Ji, Hong Liu

Robustness against word substitutions has a well-defined and widely acceptable form, i.e., using semantically similar words as substitutions, and thus it is considered a fundamental stepping-stone towards broader robustness in natural language processing.

Natural Language Inference Sentiment Analysis

Recent advances and clinical applications of deep learning in medical image analysis

no code implementations 27 May 2021 Xuxin Chen, Ximin Wang, Ke Zhang, Roy Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen Qiu

Deep learning has received extensive research interest in developing new medical image processing algorithms, and deep learning based models have been remarkably successful in a variety of medical imaging tasks to support disease detection and diagnosis.

Image Registration Lesion Classification

Achieving Domain Generalization in Underwater Object Detection by Image Stylization and Domain Mixup

no code implementations 6 Apr 2021 Pinhao Song, Linhui Dai, Peipei Yuan, Hong Liu, Runwei Ding

The performance of existing underwater object detection methods degrades seriously when facing the domain shift problem caused by complicated underwater environments.

Data Augmentation Domain Generalization +2

Stability from graph symmetrisation arguments with applications to inducibility

no code implementations 19 Dec 2020 Hong Liu, Oleg Pikhurko, Maryam Sharifzadeh, Katherine Staden

We present a sufficient condition for the stability property of extremal graph problems that can be solved via Zykov's symmetrisation.


Multi-Scale Cascading Network with Compact Feature Learning for RGB-Infrared Person Re-Identification

no code implementations 12 Dec 2020 Can Zhang, Hong Liu, Wei Guo, Mang Ye

RGB-Infrared person re-identification (RGB-IR Re-ID) aims to match persons across heterogeneous images captured by visible and thermal cameras, which is of great significance for surveillance systems under poor lighting conditions.

Person Re-Identification

Extremal density for sparse minors and subdivisions

no code implementations 3 Dec 2020 John Haslegrave, JaeHoon Kim, Hong Liu

We prove an asymptotically tight bound on the extremal density guaranteeing subdivisions of bounded-degree bipartite graphs with a mild separability condition.

Combinatorics 05C83, 05C35

Learning to Adapt to Evolving Domains

1 code implementation NeurIPS 2020 Hong Liu, Mingsheng Long, Jianmin Wang, Yu Wang

(2) Since the target data arrive online, the agent should also maintain competence on previous target domains, i.e., to adapt without forgetting.

Meta-Learning Transfer Learning +1

Meta-learning Transferable Representations with a Single Target Domain

no code implementations 3 Nov 2020 Hong Liu, Jeff Z. HaoChen, Colin Wei, Tengyu Ma

Recent works found that fine-tuning and joint training---two popular approaches for transfer learning---do not always improve accuracy on downstream tasks.

Meta-Learning Representation Learning +1

Anti-Bandit Neural Architecture Search for Model Defense

1 code implementation ECCV 2020 Hanlin Chen, Baochang Zhang, Song Xue, Xuan Gong, Hong Liu, Rongrong Ji, David Doermann

Deep convolutional neural networks (DCNNs) have dominated as the best performers in machine learning, but can be challenged by adversarial attacks.

Denoising Neural Architecture Search

Two-stage growth mode for lift-off mechanism in oblique shock-wave/jet interaction

no code implementations 11 Jul 2020 Bin Yu, Miaosheng He, Bin Zhang, Hong Liu

Based on the objective coordinate system in the frame of the oblique shock structure, it is found that the nature of the three-dimensional lift-off structure of a shock-induced streamwise vortex is inherently and precisely controlled by a two-stage growth mode of the structure kinetics of a shock bubble interaction (SBI).

Fluid Dynamics

Bi-directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification

1 code implementation 1 Jun 2020 Hanrong Ye, Hong Liu, Fanyang Meng, Xia Li

As an angularly discriminative feature space is important for classifying the human images based on their embedding vectors, in this paper, we propose a novel ranking loss function, named Bi-directional Exponential Angular Triplet Loss, to help learn an angularly separable common feature space by explicitly constraining the included angles between embedding vectors.

Person Re-Identification

Projection & Probability-Driven Black-Box Attack

1 code implementation CVPR 2020 Jie Li, Rongrong Ji, Hong Liu, Jianzhuang Liu, Bineng Zhong, Cheng Deng, Qi Tian

To reduce the solution space, we first model the adversarial perturbation optimization problem as a process of recovering frequency-sparse perturbations with compressed sensing, under the setting that random noise in the low-frequency space is more likely to be adversarial.

Online Initialization and Extrinsic Spatial-Temporal Calibration for Monocular Visual-Inertial Odometry

no code implementations 12 Apr 2020 Weibo Huang, Hong Liu, Weiwei Wan

To compensate for the impact of time offset, our method includes two short-term motion interpolation algorithms for the camera and IMU pose estimation.

Pose Estimation

Spatial Pyramid Based Graph Reasoning for Semantic Segmentation

no code implementations CVPR 2020 Xia Li, Yibo Yang, Qijie Zhao, Tiancheng Shen, Zhouchen Lin, Hong Liu

The convolution operation suffers from a limited receptive field, while global modeling is fundamental to dense prediction tasks, such as semantic segmentation.

Semantic Segmentation

A Survey on 3D Skeleton-Based Action Recognition Using Learning Method

no code implementations 14 Feb 2020 Bin Ren, Mengyuan Liu, Runwei Ding, Hong Liu

3D skeleton-based action recognition, owing to the latent advantages of skeleton, has been an active topic in computer vision.

Action Recognition Graph Convolutional Network +1

Asymmetric Generative Adversarial Networks for Image-to-Image Translation

1 code implementation 14 Dec 2019 Hao Tang, Dan Xu, Hong Liu, Nicu Sebe

In this paper, we analyze the limitations of existing symmetric GAN models in asymmetric translation tasks, and propose an AsymmetricGAN model with translation and reconstruction generators of unequal sizes and different parameter-sharing strategies, to adapt to the asymmetric needs of both unsupervised and supervised image-to-image translation tasks.

Image-to-Image Translation Translation

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation

1 code implementation 12 Dec 2019 Hao Tang, Hong Liu, Nicu Sebe

The proposed model consists of a single generator and a discriminator taking a conditional image and the target controllable structure as input.

Facial Expression Translation Gesture-to-Gesture Translation +2

AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks

2 code implementations 27 Nov 2019 Hao Tang, Hong Liu, Dan Xu, Philip H. S. Torr, Nicu Sebe

State-of-the-art methods in image-to-image translation are capable of learning a mapping from a source domain to a target domain with unpaired image data.

Image-to-Image Translation Translation

An End-to-end Approach for Lexical Stress Detection based on Transformer

no code implementations 6 Nov 2019 Yong Ruan, Xiangdong Wang, Hong Liu, Zhigang Ou, Yun Gao, Jianfeng Cheng, Yueliang Qian

For this, we train a transformer model using audio feature sequences and their phoneme sequences with lexical stress marks.

General Classification

Universal Adversarial Perturbation via Prior Driven Uncertainty Approximation

no code implementations ICCV 2019 Hong Liu, Rongrong Ji, Jie Li, Baochang Zhang, Yue Gao, Yongjian Wu, Feiyue Huang

Deep learning models have shown their vulnerabilities to universal adversarial perturbations (UAP), which are quasi-imperceptible.

Towards Understanding the Transferability of Deep Representations

no code implementations 26 Sep 2019 Hong Liu, Mingsheng Long, Jian-Min Wang, Michael I. Jordan

3) The feasibility of transferability is related to the similarity of both input and label.

Guided Learning Convolution System for DCASE 2019 Task 4

1 code implementation 11 Sep 2019 Liwei Lin, Xiangdong Wang, Hong Liu, Yueliang Qian

In this paper, we describe in detail the system we submitted to DCASE2019 task 4: sound event detection (SED) in domestic environments.

Event Detection Sound Event Detection

Identifying Illicit Accounts in Large Scale E-payment Networks -- A Graph Representation Learning Approach

no code implementations 13 Jun 2019 Da Sun Handason Tam, Wing Cheong Lau, Bin Hu, Qiu Fang Ying, Dah Ming Chiu, Hong Liu

In the context of e-payment transaction graphs, the resultant node and edge embeddings can effectively characterize the user-background as well as the financial transaction patterns of individual account holders.

Graph Embedding Graph Mining +2

Guided learning for weakly-labeled semi-supervised sound event detection

1 code implementation 6 Jun 2019 Liwei Lin, Xiangdong Wang, Hong Liu, Yueliang Qian

Instead of designing a single model by considering a trade-off between the two sub-targets, we design a teacher model aiming at audio tagging to guide a student model aiming at boundary detection to learn using the unlabeled data.

Audio Tagging Boundary Detection +3

Separate to Adapt: Open Set Domain Adaptation via Progressive Separation

no code implementations CVPR 2019 Hong Liu, Zhangjie Cao, Mingsheng Long, Jianmin Wang, Qiang Yang

While several methods have been proposed to address OSDA, none of them takes into account the openness of the target domain, which is measured by the proportion of unknown classes in all target classes.

Domain Adaptation

Specialized Decision Surface and Disentangled Feature for Weakly-Supervised Polyphonic Sound Event Detection

1 code implementation 24 May 2019 Liwei Lin, Xiangdong Wang, Hong Liu, Yueliang Qian

In this paper, a special decision surface for the weakly-supervised sound event detection (SED) and a disentangled feature (DF) for the multi-label problem in polyphonic SED are proposed.

Event Detection Multi-Label Classification +2

Dual-branch residual network for lung nodule segmentation

no code implementations 21 May 2019 Haichao Cao, Hong Liu, Enmin Song, Chih-Cheng Hung, Guangzhi Ma, Xiangyang Xu, Renchao Jin, Jianguo Lu

Experimental results show that the DB-ResNet achieves superior segmentation performance with an average Dice score of 82.74% on the dataset.

Computed Tomography (CT) Lung Nodule Segmentation
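
For readers unfamiliar with the metric quoted in the entry above, the Dice score between a predicted and a reference binary mask is 2|A ∩ B| / (|A| + |B|). The following is a minimal sketch of that standard definition only; the function name `dice_score` and the flat 0/1 list representation are illustrative assumptions and are not taken from the DB-ResNet paper or its evaluation code.

```python
def dice_score(pred, target):
    """Dice coefficient between two binary masks given as flat 0/1 sequences.

    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of elements")
    # Count positions where both masks are foreground.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground size of both masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        # Convention (an assumption): two empty masks count as a perfect match.
        return 1.0
    return 2.0 * intersection / total


if __name__ == "__main__":
    predicted = [1, 1, 0, 0]
    reference = [1, 0, 0, 0]
    print(dice_score(predicted, reference))
```

A reported figure such as an "average Dice score" is typically the mean of this quantity over all test cases, although the exact averaging protocol used in the paper is not stated in this listing.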

A novel algorithm for segmentation of leukocytes in peripheral blood

no code implementations 21 May 2019 Haichao Cao, Hong Liu, Enmin Song

First, the nucleus of leukocyte was separated by using the stepwise averaging method.

Hadamard Matrix Guided Online Hashing

1 code implementation 11 May 2019 Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Shen Chen, Qi Tian

We then treat the learning of hash functions as a set of binary classification problems to fit the assigned target code.
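
The idea of fitting an assigned target code bit by bit can be illustrated with a small sketch. It assumes, based only on the paper's title, that each class is assigned one row of a Hadamard matrix as its target code; the Sylvester construction and the names `hadamard` and `assign_target_codes` are illustrative choices, not the authors' implementation.

```python
def hadamard(n):
    """Build an n x n Hadamard matrix (entries +1/-1) by Sylvester's construction.

    n must be a power of two.
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = [[1]]
    while len(h) < n:
        # H_{2k} = [[H_k, H_k], [H_k, -H_k]]
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def assign_target_codes(num_classes, code_length):
    """Give each class a distinct Hadamard row as its target binary code.

    Assumption for illustration: class c simply takes row c.
    """
    if num_classes > code_length:
        raise ValueError("need at least as many rows as classes")
    h = hadamard(code_length)
    return {c: h[c] for c in range(num_classes)}


if __name__ == "__main__":
    codes = assign_target_codes(num_classes=3, code_length=4)
    for label, code in codes.items():
        print(label, code)
```

Under this assumption, bit k of the hash code becomes its own binary classification problem: a sample of class c has label `codes[c][k]`. Because distinct Hadamard rows are mutually orthogonal, any two target codes differ in exactly half of their bits.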

Two-Stage Convolutional Neural Network Architecture for Lung Nodule Detection

no code implementations 9 May 2019 Haichao Cao, Hong Liu, Enmin Song, Guangzhi Ma, Xiangyang Xu, Renchao Jin, Tengying Liu, Chih-Cheng Hung

The CNN architecture in the first stage is based on the improved UNet segmentation network to establish an initial detection of lung nodules.

Computed Tomography (CT) Data Augmentation +3

Supervised Online Hashing via Hadamard Codebook Learning

1 code implementation 28 Apr 2019 Mingbao Lin, Rongrong Ji, Hong Liu, Yongjian Liu

Notably, the proposed HCOH can be embedded with supervised labels and is not limited to a predefined category number.

Semantic Similarity Semantic Textual Similarity

Multi-view Vector-valued Manifold Regularization for Multi-label Image Classification

no code implementations 8 Apr 2019 Yong Luo, DaCheng Tao, Chang Xu, Chao Xu, Hong Liu, Yonggang Wen

In computer vision, image datasets used for classification are naturally associated with multiple labels and comprised of multiple views, because each image may contain several objects (e.g., pedestrian, bicycle and tree) and is properly characterized by multiple visual features (e.g., color, texture and shape).

General Classification Multi-Label Image Classification

Towards Optimal Discrete Online Hashing with Balanced Similarity

1 code implementation 29 Jan 2019 Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Yongjian Wu, Yunsheng Wu

In this paper, we propose a novel supervised online hashing method, termed Balanced Similarity for Online Discrete Hashing (BSODH), to solve the above problems in a unified framework.

Fast and Robust Dynamic Hand Gesture Recognition via Key Frames Extraction and Feature Fusion

1 code implementation 15 Jan 2019 Hao Tang, Hong Liu, Wei Xiao, Nicu Sebe

Gesture recognition is a hot topic in computer vision and pattern recognition, which plays a vitally important role in natural human-computer interface.

Hand Gesture Recognition Hand-Gesture Recognition

Towards Visual Feature Translation

1 code implementation CVPR 2019 Jie Hu, Rongrong Ji, Hong Liu, Shengchuan Zhang, Cheng Deng, Qi Tian

In this paper, we make the first attempt towards visual feature translation to break through the barrier of using features across different visual search systems.


An objective-adaptive refinement criterion based on modified ridge extraction method for finite-time Lyapunov exponent (FTLE) calculation

no code implementations 13 Nov 2018 Haotian Hang, Bin Yu, Yang Xiang, Bin Zhang, Hong Liu

High-accuracy, high-efficiency calculation of the finite-time Lyapunov exponent (FTLE) has long been a research focus, and adaptive refinement is one class of methods in this field.

Fluid Dynamics

Video Logo Retrieval based on local Features

1 code implementation 11 Aug 2018 Bochen Guan, Hanrong Ye, Hong Liu, William A. Sethares

Estimation of the frequency and duration of logos in videos is important and challenging in the advertisement industry as a way of estimating the impact of ad purchases.

Image Retrieval Video Retrieval

Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining

no code implementations ECCV 2018 Xia Li, Jianlong Wu, Zhouchen Lin, Hong Liu, Hongbin Zha

In heavy rain, rain streaks have various directions and shapes, which can be regarded as the accumulation of multiple rain streak layers.

Single Image Deraining

CerfGAN: A Compact, Effective, Robust, and Fast Model for Unsupervised Multi-Domain Image-to-Image Translation

no code implementations 28 May 2018 Xiao Liu, Shengchuan Zhang, Hong Liu, Xin Liu, Cheng Deng, Rongrong Ji

In principle, CerfGAN contains a novel component, i.e., a multi-class discriminator (MCD), which gives the model an extremely powerful ability to match multiple translation mappings.

Face Hallucination Image-to-Image Translation +2

Bone marrow cells detection: A technique for the microscopic image analysis

no code implementations 5 May 2018 Haichao Cao, Hong Liu, Enmin Song

The localization of BMC is achieved using a color-transformation-enhanced BMC sample image and the stepwise averaging method (SAM).

General Classification

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation

1 code implementation CVPR 2018 Dan Xu, Wei Wang, Hao Tang, Hong Liu, Nicu Sebe, Elisa Ricci

Recent works have shown the benefit of integrating Conditional Random Fields (CRFs) models into deep architectures for improving pixel-level prediction tasks.

Monocular Depth Estimation

A Bidirectional Adaptive Bandwidth Mean Shift Strategy for Clustering

no code implementations 22 Dec 2017 Fanyang Meng, Hong Liu, Yongsheng Liang, Wei Liu, Jihong Pei

The bandwidth of a kernel function is a crucial parameter in the mean shift algorithm.
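
To show why the bandwidth matters, here is a minimal sketch of the standard fixed-bandwidth mean shift update with a Gaussian kernel on one-dimensional data. It is not the bidirectional adaptive strategy proposed in the paper; the function name `mean_shift_mode` and the toy data are assumptions made for illustration.

```python
import math


def mean_shift_mode(start, data, bandwidth, max_iters=200, tol=1e-6):
    """Follow the mean shift update from `start` until it stops moving.

    Each step replaces x by the Gaussian-weighted mean of the data,
    with weights exp(-((x - d) / bandwidth)^2 / 2).
    """
    x = float(start)
    for _ in range(max_iters):
        weights = [math.exp(-(((x - d) / bandwidth) ** 2) / 2.0) for d in data]
        new_x = sum(w * d for w, d in zip(weights, data)) / sum(weights)
        if abs(new_x - x) < tol:
            return new_x
        x = new_x
    return x


if __name__ == "__main__":
    # Two well-separated toy clusters around 0 and 5 (hypothetical data).
    points = [0.0, 0.1, -0.1, 5.0, 5.1, 4.9]
    for bw in (0.5, 10.0):
        modes = [round(mean_shift_mode(s, points, bw), 2) for s in (0.2, 4.8)]
        print("bandwidth", bw, "modes", modes)
```

With a small bandwidth, the two starting points settle on separate modes near each cluster. With a very large bandwidth, the kernel smooths the clusters together and both starts drift to a single mode between them, which is the sensitivity that motivates adaptive bandwidth schemes.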

Robust 3D Action Recognition through Sampling Local Appearances and Global Distributions

no code implementations 4 Dec 2017 Mengyuan Liu, Hong Liu, Chen Chen

Then, motion and shape cues are jointly used to generate robust and distinctive spatial-temporal interest points (STIPs): motion-based STIPs and shape-based STIPs.

3D Action Recognition

Cross-Modality Binary Code Learning via Fusion Similarity Hashing

no code implementations CVPR 2017 Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang, Baochang Zhang

In this paper, we propose a hashing scheme, termed Fusion Similarity Hashing (FSH), which explicitly embeds the graph-based fusion similarity across modalities into a common Hamming space.

Two-Stream 3D Convolutional Neural Network for Skeleton-Based Action Recognition

no code implementations 23 May 2017 Hong Liu, Juanhui Tu, Mengyuan Liu

Extensive experiments on the SmartHome dataset and the large-scale NTU RGB-D dataset demonstrate that our method outperforms most RNN-based methods, which verifies the complementary property between spatial and temporal information and the robustness to noise.

Skeleton Based Action Recognition

Ordinal Constrained Binary Code Learning for Nearest Neighbor Search

no code implementations 19 Nov 2016 Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang

Given a large-scale training data set, it is very expensive to embed such ranking tuples in binary code learning.

Small Data Image Classification

Orientation Driven Bag of Appearances for Person Re-identification

2 code implementations 9 May 2016 Liqian Ma, Hong Liu, Liang Hu, Can Wang, Qianru Sun

Experimental results on three public datasets and two proposed datasets demonstrate the superiority of the proposed approach, indicating the effectiveness of body structure and orientation information for improving re-identification performance.

Person Re-Identification

