Search Results for author: Jian Zhao

Found 94 papers, 34 papers with code

Unsupervised Domain Adaptation with Noise Resistible Mutual-Training for Person Re-identification

no code implementations ECCV 2020 Fang Zhao, Shengcai Liao, Guo-Sen Xie, Jian Zhao, Kaihao Zhang, Ling Shao

On the other hand, mutual instance selection further selects reliable and informative instances for training according to the peer-confidence and relationship disagreement of the networks.

Clustering Person Re-Identification +2

Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis

no code implementations11 Mar 2024 Zijian Chen, Mei Wang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Xingchen Cui, Jian Zhao

2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose.

Depth Estimation Face Recognition

PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation

no code implementations28 Feb 2024 Haoyu Xie, Changqi Wang, Jian Zhao, Yang Liu, Jun Dan, Chong Fu, Baigui Sun

To address this issue, we propose a robust contrastive-based S4 framework, termed the Probabilistic Representation Contrastive Learning (PRCL) framework to enhance the robustness of the unsupervised training process.

Contrastive Learning Semi-Supervised Semantic Segmentation

EmoWear: Exploring Emotional Teasers for Voice Message Interaction on Smartwatches

no code implementations11 Feb 2024 Pengcheng An, Jiawen Zhu, Zibo Zhang, Yifei Yin, Qingyuan Ma, Che Yan, Linghao Du, Jian Zhao

We introduce EmoWear, a smartwatch voice messaging system enabling users to apply 30 animation teasers on message bubbles to reflect emotions.

Retrieval

Not All Data Matters: An End-to-End Adaptive Dataset Pruning Framework for Enhancing Model Performance and Efficiency

no code implementations9 Dec 2023 Suorong Yang, Hongchao Yang, Suhan Guo, Furao Shen, Jian Zhao

AdaPruner can still significantly enhance model performance even after pruning up to 10-30\% of the training data.

DanZero+: Dominating the GuanDan Game through Reinforcement Learning

1 code implementation5 Dec 2023 Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li

The utilization of artificial intelligence (AI) in card games has been a well-explored subject within AI research for an extensive period.

Card Games reinforcement-learning

NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields

no code implementations29 Nov 2023 Xiaoliang Liu, Furao Shen, Feng Han, Jian Zhao, Changhai Nie

Face recognition (FR) technology plays a crucial role in various applications, but its vulnerability to adversarial attacks poses significant security concerns.

Adversarial Attack Face Recognition

RADAP: A Robust and Adaptive Defense Against Diverse Adversarial Patches on Face Recognition

no code implementations29 Nov 2023 Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie

RADAP employs innovative techniques, such as FCutout and F-patch, which use Fourier space sampling masks to improve the occlusion robustness of the FR model and the performance of the patch segmenter.

Face Recognition

A Simple Geometric-Aware Indoor Positioning Interpolation Algorithm Based on Manifold Learning

no code implementations27 Nov 2023 Suorong Yang, Geng Zhang, Jian Zhao, Furao Shen

Interpolation methodologies have been widely used within the domain of indoor positioning systems.

Panda or not Panda? Understanding Adversarial Attacks with Interactive Visualization

no code implementations22 Nov 2023 Yuzhe You, Jarvis Tse, Jian Zhao

Adversarial machine learning (AML) studies attacks that can fool machine learning algorithms into generating incorrect outcomes as well as the defenses against worst-case attacks to strengthen model robustness.

Image Classification

UniParser: Multi-Human Parsing with Unified Correlation Representation Learning

1 code implementation13 Oct 2023 Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao

Multi-human parsing is an image segmentation task necessitating both instance-level and fine-grained category-level information.

Image Segmentation Multi-Human Parsing +2

Adversarial Attacks on Video Object Segmentation with Hard Region Discovery

no code implementations25 Sep 2023 Ping Li, Yu Zhang, Li Yuan, Jian Zhao, Xianghua Xu, Xiaoqin Zhang

Particularly, the gradients from the segmentation model are exploited to discover the easily confused region, in which it is difficult to identify the pixel-wise objects from the background in a frame.

Autonomous Driving Object +5

3D Implicit Transporter for Temporally Consistent Keypoint Discovery

1 code implementation ICCV 2023 Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao

To address this issue, the Transporter method was introduced for 2D data, which reconstructs the target frame from the source frame to incorporate both spatial and temporal information.

Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning

no code implementations29 Aug 2023 Zhuo Zhou, Wenxuan Liu, Danni Xu, Zheng Wang, Jian Zhao

HID presents a unique challenge in that hidden intentions lack the obvious visual representations to distinguish them from normal intentions.

Intent Detection

Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System

1 code implementation27 Jun 2023 Xue-Feng Zhu, Tianyang Xu, Jian Zhao, Jia-Wei Liu, Kai Wang, Gang Wang, Jianan Li, Qiang Wang, Lei Jin, Zheng Zhu, Junliang Xing, Xiao-Jun Wu

Still, previous works have simplified such an anti-UAV task as a tracking problem, where the prior information of UAVs is always provided; such a scheme fails in real-world anti-UAV tasks (i. e. complex scenes, indeterminate-appear and -reappear UAVs, and real-time UAV surveillance).

GIMM: InfoMin-Max for Automated Graph Contrastive Learning

no code implementations27 May 2023 Xin Xiong, Furao Shen, Xiangyu Wang, Jian Zhao

Many GCL methods with automated data augmentation face the risk of insufficient information as they fail to preserve the essential information necessary for the downstream task.

Contrastive Learning Data Augmentation +2

A Fast and Robust Camera-IMU Online Calibration Method For Localization System

no code implementations14 May 2023 Xiaowen Tao, Pengxiang Meng, Bing Zhu, Jian Zhao

Autonomous driving has spurred the development of sensor fusion techniques, which combine data from multiple sensors to improve system performance.

Autonomous Driving Decision Making +2

Single-stage Multi-human Parsing via Point Sets and Center-based Offsets

1 code implementation22 Apr 2023 Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao

We instead present a high-performance Single-stage Multi-human Parsing (SMP) deep architecture that decouples the multi-human parsing problem into two fine-grained sub-problems, i. e., locating the human body and parts.

Multi-Human Parsing

Gradient Attention Balance Network: Mitigating Face Recognition Racial Bias via Gradient Attention

no code implementations5 Apr 2023 Linzhi Huang, Mei Wang, Jiahao Liang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Jian Zhao

Specifically, we use the gradient attention map (GAM) of the face recognition network to track the sensitive facial regions and make the GAMs of different races tend to be consistent through adversarial learning.

Face Recognition

MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID

1 code implementation CVPR 2023 Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao

Neural Architecture Search (NAS) has been increasingly appealing to the society of object Re-Identification (ReID), for that task-specific architectures significantly improve the retrieval performance.

Image Classification Neural Architecture Search +3

Heterogeneous Diversity Driven Active Learning for Multi-Object Tracking

no code implementations ICCV 2023 Rui Li, Baopeng Zhang, Jun Liu, Wei Liu, Jian Zhao, Zhu Teng

HD-AMOT defines the diversified informative representation by encoding the geometric and semantic information, and formulates the frame inference strategy as a Markov decision process to learn an optimal sampling policy based on the designed informative representation.

Active Learning Multi-Object Tracking

AdvMask: A Sparse Adversarial Attack Based Data Augmentation Method for Image Classification

no code implementations29 Nov 2022 Suorong Yang, Jinqiao Li, Jian Zhao, Furao Shen

The experimental results on various datasets and CNN models verify that the proposed method outperforms other previous data augmentation methods in image classification tasks.

Adversarial Attack Classification +2

Neural Dependencies Emerging from Learning Massive Categories

no code implementations CVPR 2023 Ruili Feng, Kecheng Zheng, Kai Zhu, Yujun Shen, Jian Zhao, Yukun Huang, Deli Zhao, Jingren Zhou, Michael Jordan, Zheng-Jun Zha

Through investigating the properties of the problem solution, we confirm that neural dependency is guaranteed by a redundant logit covariance matrix, which condition is easily met given massive categories, and that neural dependency is highly sparse, implying that one category correlates to only a few others.

Image Classification

DanZero: Mastering GuanDan Game with Reinforcement Learning

no code implementations31 Oct 2022 Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li

We compare it with 8 baseline AI programs which are based on heuristic rules and the results reveal the outstanding performance of DanZero.

Card Games reinforcement-learning +1

AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression

1 code implementation8 Oct 2022 Yabo Xiao, Xiaojuan Wang, Dongdong Yu, Kai Su, Lei Jin, Mei Song, Shuicheng Yan, Jian Zhao

With the proposed body representation, we further deliver a compact single-stage multi-person pose regression network, termed as AdaptivePose.

3D Multi-Person Pose Estimation Human Detection +2

AugRmixAT: A Data Processing and Training Method for Improving Multiple Robustness and Generalization Performance

no code implementations21 Jul 2022 Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie

In this paper, we propose a new data processing and training method, called AugRmixAT, which can simultaneously improve the generalization ability and multiple robustness of neural network models.

Adversarial Robustness

RSTAM: An Effective Black-Box Impersonation Attack on Face Recognition using a Mobile and Compact Printer

no code implementations25 Jun 2022 Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie

Furthermore, we propose a random meta-optimization strategy for ensembling several pre-trained face models to generate more general adversarial masks.

Face Recognition

RandoMix: A mixed sample data augmentation method with multiple mixed modes

no code implementations18 May 2022 Xiaoliang Liu, Furao Shen, Jian Zhao, Changhai Nie

Data augmentation plays a crucial role in enhancing the robustness and performance of machine learning models across various domains.

Data Augmentation

Multi-Target Active Object Tracking with Monte Carlo Tree Search and Target Motion Modeling

no code implementations7 May 2022 Zheng Chen, Jian Zhao, Mingyu Yang, Wengang Zhou, Houqiang Li

In this work, we are dedicated to multi-target active object tracking (AOT), where there are multiple targets as well as multiple cameras in the environment.

Multi-agent Reinforcement Learning Object Tracking

LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning

no code implementations5 May 2022 Mingyu Yang, Jian Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, Houqiang Li

In this way, agents dealing with the same subtask share their learning of specific abilities and different subtasks correspond to different specific abilities.

Multi-agent Reinforcement Learning reinforcement-learning +3

Infographics Wizard: Flexible Infographics Authoring and Design Exploration

1 code implementation21 Apr 2022 Anjul Tyagi, Jian Zhao, Pushkar Patel, Swasti Khurana, Klaus Mueller

With the help of designers, we propose a semi-automated infographic framework for general structured and flow-based infographic design generation.

Image Data Augmentation for Deep Learning: A Survey

no code implementations19 Apr 2022 Suorong Yang, Weikang Xiao, Mengchen Zhang, Suhan Guo, Jian Zhao, Furao Shen

By improving the quantity and diversity of training data, data augmentation has become an inevitable part of deep learning model training with image data.

Data Augmentation Image Classification +3

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

1 code implementation6 Apr 2022 Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li

Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games.

Thin-Plate Spline Motion Model for Image Animation

1 code implementation CVPR 2022 Jian Zhao, HUI ZHANG

Firstly, we propose thin-plate spline motion estimation to produce a more flexible optical flow, which warps the feature maps of the source image to the feature domain of the driving image.

Face Reenactment Image Animation +2

AutoAdversary: A Pixel Pruning Method for Sparse Adversarial Attack

no code implementations18 Mar 2022 Jinqiao Li, Xiaotao Liu, Jian Zhao, Furao Shen

A special branch of adversarial examples, namely sparse adversarial examples, can fool the target DNNs by perturbing only a few pixels.

Adversarial Attack Network Pruning

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

1 code implementation16 Mar 2022 Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu, Houqiang Li

In this way, CTDS balances the full utilization of global observation during training and the feasibility of decentralized execution for online inference.

Multi-agent Reinforcement Learning reinforcement-learning +3

MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning

2 code implementations CVPR 2022 Shiming Chen, Ziming Hong, Guo-Sen Xie, Wenhan Yang, Qinmu Peng, Kai Wang, Jian Zhao, Xinge You

Prior works either simply align the global features of an image with its associated class semantic vector or utilize unidirectional attention to learn the limited latent semantic representations, which could not effectively discover the intrinsic semantic knowledge e. g., attribute semantics) between visual and attribute features.

Attribute Transfer Learning +1

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

no code implementations9 Feb 2022 Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li

In cooperative multi-agent systems, agents jointly take actions and receive a team reward instead of individual rewards.

Single-Stage Is Enough: Multi-Person Absolute 3D Pose Estimation

no code implementations CVPR 2022 Lei Jin, Chenyang Xu, Xiaojuan Wang, Yabo Xiao, Yandong Guo, Xuecheng Nie, Jian Zhao

The existing multi-person absolute 3D pose estimation methods are mainly based on two-stage paradigm, i. e., top-down or bottom-up, leading to redundant pipelines with high computation cost.

3D Pose Estimation Depth Estimation +1

TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning

1 code implementation16 Dec 2021 Shiming Chen, Ziming Hong, Wenjin Hou, Guo-Sen Xie, Yibing Song, Jian Zhao, Xinge You, Shuicheng Yan, Ling Shao

Analogously, VAT uses the similar feature augmentation encoder to refine the visual features, which are further applied in visual$\rightarrow$attribute decoder to learn visual-based attribute features.

Attribute Zero-Shot Learning

EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In-Situ Code Search and Recommendation

no code implementations15 Dec 2021 Xingjun Li, Yizhi Zhang, Justin Leung, Chengnian Sun, Jian Zhao

This paper presents EDAssistant, a JupyterLab extension that supports EDA with in-situ search of example notebooks and recommendation of useful APIs, powered by novel interactive visualization of search results.

Code Search

Multi-scale fusion self attention mechanism

no code implementations29 Sep 2021 Qibin Li, Nianmin Yao, Jian Zhao, Yanan Zhang

Based on the traditional attention mechanism, multi-scale fusion self attention extracts phrase information at different scales by setting convolution kernels at different levels, and calculates the corresponding attention matrix at different scales, so that the model can better extract phrase level information.

Relation Extraction

User-Centric Semi-Automated Infographics Authoring and Recommendation

no code implementations26 Aug 2021 Anjul Tyagi, Jian Zhao, Pushkar Patel, Swasti Khurana, Klaus Mueller

Based on the framework, we also propose an interactive tool, \name{}, for assisting novice designers with creating high-quality infographics from an input in a markdown format by offering recommendations of different design components of infographics.

The 2nd Anti-UAV Workshop & Challenge: Methods and Results

no code implementations23 Aug 2021 Jian Zhao, Gang Wang, Jianan Li, Lei Jin, Nana Fan, Min Wang, Xiaojuan Wang, Ting Yong, Yafeng Deng, Yandong Guo, Shiming Ge, Guodong Guo

The 2nd Anti-UAV Workshop \& Challenge aims to encourage research in developing novel and accurate methods for multi-scale object tracking.

Object Tracking

Face.evoLVe: A High-Performance Face Recognition Library

1 code implementation19 Jul 2021 Qingzhong Wang, Pengfei Zhang, Haoyi Xiong, Jian Zhao

In this paper, we develop face. evoLVe -- a comprehensive library that collects and implements a wide range of popular deep learning-based methods for face recognition.

Face Alignment Face Recognition +1

Rethinking Sampling Strategies for Unsupervised Person Re-identification

2 code implementations7 Jul 2021 Xumeng Han, Xuehui Yu, Guorong Li, Jian Zhao, Gang Pan, Qixiang Ye, Jianbin Jiao, Zhenjun Han

While extensive research has focused on the framework design and loss function, this paper shows that sampling strategy plays an equally important role.

Pseudo Label Representation Learning +1

Interactive Dimensionality Reduction for Comparative Analysis

1 code implementation29 Jun 2021 Takanori Fujiwara, Xinhai Wei, Jian Zhao, Kwan-Liu Ma

However, existing DR methods provide limited capability and flexibility for such comparative analysis as each method is designed only for a narrow analysis target, such as identifying factors that most differentiate groups.

Contrastive Learning Dimensionality Reduction

SASICM A Multi-Task Benchmark For Subtext Recognition

no code implementations13 Jun 2021 Hua Yan, Feng Han, Junyi An, Weikang Xiao, Jian Zhao, Furao Shen

The F1 score of SASICMBERT, whose pretrained model is BERT, is 65. 12%, which is 0. 75% higher than that of SASICMg.

Image-to-Video Generation via 3D Facial Dynamics

no code implementations31 May 2021 Xiaoguang Tu, Yingtian Zou, Jian Zhao, Wenjie Ai, Jian Dong, Yuan YAO, Zhikang Wang, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

Video generation from a single face image is an interesting problem and usually tackled by utilizing Generative Adversarial Networks (GANs) to integrate information from the input face image and a sequence of sparse facial landmarks.

Image to Video Generation Video Prediction

Joint Face Image Restoration and Frontalization for Recognition

no code implementations12 May 2021 Xiaoguang Tu, Jian Zhao, Qiankun Liu, Wenjie Ai, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

First, MDFR is a well-designed encoder-decoder architecture which extracts feature representation from an input face image with arbitrary low-quality factors and restores it to a high-quality counterpart.

Face Recognition Image Restoration

Faster and Simpler Siamese Network for Single Object Tracking

no code implementations7 May 2021 Shaokui Jiang, Baile Xu, Jian Zhao, Furao Shen

With the development of the deep network and the release for a series of large scale datasets for single object tracking, siamese networks have been proposed and perform better than most of the traditional methods.

Object object-detection +2

IC Networks: Remodeling the Basic Unit for Convolutional Neural Networks

no code implementations6 Feb 2021 Junyi An, Fengshan Liu, Jian Zhao, Furao Shen

Inspired by the elastic collision model in physics, we present a general structure which can be integrated into the existing CNNs to improve their performance.

Anti-UAV: A Large Multi-Modal Benchmark for UAV Tracking

1 code implementation21 Jan 2021 Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li, Jian Zhao, Guodong Guo, Zhenjun Han

The releasing of such a large-scale dataset could be a useful initial step in research of tracking UAVs.

IC Neuron: An Efficient Unit to Construct Neural Networks

no code implementations23 Nov 2020 Junyi An, Fengshan Liu, Jian Zhao, Furao Shen

We believe that the IC neuron can be a basic unit to build network structures.

Effective Fusion Factor in FPN for Tiny Object Detection

no code implementations4 Nov 2020 Yuqi Gong, Xuehui Yu, Yao Ding, Xiaoke Peng, Jian Zhao, Zhenjun Han

We propose a novel concept, fusion factor, to control information that deep layers deliver to shallow layers, for adapting FPN to tiny object detection.

Object object-detection +1

OLALA: Object-Level Active Learning for Efficient Document Layout Annotation

1 code implementation5 Oct 2020 Zejiang Shen, Jian Zhao, Melissa Dell, YaoLiang Yu, Weining Li

Document images often have intricate layout structures, with numerous content regions (e. g. texts, figures, tables) densely arranged on each page.

Active Learning Object +1

The 1st Tiny Object Detection Challenge:Methods and Results

1 code implementation16 Sep 2020 Xuehui Yu, Zhenjun Han, Yuqi Gong, Nan Jiang, Jian Zhao, Qixiang Ye, Jie Chen, Yuan Feng, Bin Zhang, Xiaodi Wang, Ying Xin, Jingwei Liu, Mingyuan Mao, Sheng Xu, Baochang Zhang, Shumin Han, Cheng Gao, Wei Tang, Lizuo Jin, Mingbo Hong, Yuchao Yang, Shuiwang Li, Huan Luo, Qijun Zhao, Humphrey Shi

The 1st Tiny Object Detection (TOD) Challenge aims to encourage research in developing novel and accurate methods for tiny object detection in images which have wide views, with a current focus on tiny person detection.

Human Detection Object +2

A Visual Analytics Framework for Contrastive Network Analysis

no code implementations1 Aug 2020 Takanori Fujiwara, Jian Zhao, Francine Chen, Kwan-Liu Ma

A common network analysis task is comparison of two networks to identify unique characteristics in one network with respect to the other.

BIG-bench Machine Learning Contrastive Learning +1

Network Comparison with Interpretable Contrastive Network Representation Learning

3 code implementations25 May 2020 Takanori Fujiwara, Jian Zhao, Francine Chen, Yao-Liang Yu, Kwan-Liu Ma

This analysis task could be greatly assisted by contrastive learning, which is an emerging analysis approach to discover salient patterns in one dataset relative to another.

Contrastive Learning Representation Learning

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild

no code implementations7 Apr 2020 Zhecan Wang, Jian Zhao, Cheng Lu, Han Huang, Fan Yang, Lianji Li, Yandong Guo

To better demonstrate the advantage of our methods, we further propose a new benchmark dataset with the most rich distribution of head-gaze combination reflecting real-world scenarios.

Gaze Estimation

Temporal Convolutional Attention-based Network For Sequence Modeling

1 code implementation28 Feb 2020 Hongyan Hao, Yan Wang, Siqiao Xue, Yudi Xia, Jian Zhao, Furao Shen

So we propose an exploratory architecture referred to Temporal Convolutional Attention-based Network (TCAN) which combines temporal convolutional network and attention mechanism.

Pairwise Interactive Graph Attention Network for Context-Aware Recommendation

no code implementations18 Nov 2019 Yahui Liu, Furao Shen, Jian Zhao

PIGAT introduces the attention mechanism to consider the importance of each interacted user/item to both the user and the item, which captures user interests, item attractions and their influence on the recommendation context.

Graph Attention Recommendation Systems

SANVis: Visual Analytics for Understanding Self-Attention Networks

no code implementations13 Sep 2019 Cheonbok Park, Inyoup Na, Yongjang Jo, Sungbok Shin, Jaehyo Yoo, Bum Chul Kwon, Jian Zhao, Hyungjong Noh, Yeonsoo Lee, Jaegul Choo

Attention networks, a deep neural network architecture inspired by humans' attention mechanism, have seen significant success in image captioning, machine translation, and many other applications.

Image Captioning Machine Translation +2

Super Interaction Neural Network

1 code implementation29 May 2019 Yang Yao, Xu Zhang, Baile Xu, Furao Shen, Jian Zhao

Recent studies have demonstrated that the convolutional networks heavily rely on the quality and quantity of generated features.

Cross-Resolution Face Recognition via Prior-Aided Face Hallucination and Residual Knowledge Distillation

no code implementations26 May 2019 Hanyang Kong, Jian Zhao, Xiaoguang Tu, Junliang Xing, ShengMei Shen, Jiashi Feng

Recent deep learning based face recognition methods have achieved great performance, but it still remains challenging to recognize very low-resolution query face like 28x28 pixels when CCTV camera is far from the captured subject.

Face Hallucination Face Recognition +4

Label Mapping Neural Networks with Response Consolidation for Class Incremental Learning

no code implementations20 May 2019 Xu Zhang, Yang Yao, Baile Xu, Lekun Mao, Furao Shen, Jian Zhao, QIngwei Lin

In this paper, it is the first time to discuss the difficulty without support of old classes in class incremental learning, which is called as softmax suppression problem.

Class Incremental Learning Incremental Learning +1

"Tom" pet robot applied to urban autism

no code implementations14 May 2019 Xingqian Li, Chenwei Lou, Jian Zhao, HuaPeng Wei, Hongwei Zhao

The consequent urban autism problem has become more and more serious.

Operation-aware Neural Networks for User Response Prediction

4 code implementations2 Apr 2019 Yi Yang, Baile Xu, Furao Shen, Jian Zhao

Many deep models are proposed to automatically learn high-order feature interactions.

Multi-Prototype Networks for Unconstrained Set-based Face Recognition

no code implementations13 Feb 2019 Jian Zhao, Jianshu Li, Xiaoguang Tu, Fang Zhao, Yuan Xin, Junliang Xing, Hengzhu Liu, Shuicheng Yan, Jiashi Feng

In this paper, we study the challenging unconstrained set-based face recognition problem where each subject face is instantiated by a set of media (images and videos) instead of a single image.

Face Recognition

Learning Generalizable and Identity-Discriminative Representations for Face Anti-Spoofing

1 code implementation17 Jan 2019 Xiaoguang Tu, Jian Zhao, Mei Xie, Guodong Du, Hengsheng Zhang, Jianshu Li, Zheng Ma, Jiashi Feng

Face anti-spoofing (a. k. a presentation attack detection) has drawn growing attention due to the high-security demand in face authentication systems.

Domain Adaptation Face Anti-Spoofing +1

Dynamic Conditional Networks for Few-Shot Learning

no code implementations ECCV 2018 Fang Zhao, Jian Zhao, Shuicheng Yan, Jiashi Feng

This paper proposes a novel Dynamic Conditional Convolutional Network (DCCN) to handle conditional few-shot learning, i. e, only a few training samples are available for each condition.

Face Generation Few-Shot Learning +3

Object Relation Detection Based on One-shot Learning

no code implementations16 Jul 2018 Li Zhou, Jian Zhao, Jianshu Li, Li Yuan, Jiashi Feng

Detecting the relations among objects, such as "cat on sofa" and "person ride horse", is a crucial task in image understanding, and beneficial to bridging the semantic gap between images and natural language.

Object One-Shot Learning +1

Weakly Supervised Phrase Localization With Multi-Scale Anchored Transformer Network

no code implementations CVPR 2018 Fang Zhao, Jianshu Li, Jian Zhao, Jiashi Feng

In this paper, we propose a novel weakly supervised model, Multi-scale Anchored Transformer Network (MATN), to accurately localize free-form textual phrases with only image-level supervision.

Region Proposal

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing

2 code implementations10 Apr 2018 Jian Zhao, Jianshu Li, Yu Cheng, Li Zhou, Terence Sim, Shuicheng Yan, Jiashi Feng

Despite the noticeable progress in perceptual tasks like detection, instance segmentation and human parsing, computers still perform unsatisfactorily on visually understanding humans in crowded scenes, such as group behavior analysis, person re-identification and autonomous driving, etc.

Autonomous Driving Clustering +6

Integrated Face Analytics Networks through Cross-Dataset Hybrid Training

no code implementations16 Nov 2017 Jianshu Li, Shengtao Xiao, Fang Zhao, Jian Zhao, Jianan Li, Jiashi Feng, Shuicheng Yan, Terence Sim

Specifically, iFAN achieves an overall F-score of 91. 15% on the Helen dataset for face parsing, a normalized mean error of 5. 81% on the MTFL dataset for facial landmark localization and an accuracy of 45. 73% on the BNU dataset for emotion recognition with a single model.

Face Alignment Face Parsing +1

Multiple-Human Parsing in the Wild

2 code implementations19 May 2017 Jianshu Li, Jian Zhao, Yunchao Wei, Congyan Lang, Yidong Li, Terence Sim, Shuicheng Yan, Jiashi Feng

To address the multi-human parsing problem, we introduce a new multi-human parsing (MHP) dataset and a novel multi-human parsing model named MH-Parser.

Multi-Human Parsing

A Good Practice Towards Top Performance of Face Recognition: Transferred Deep Feature Fusion

1 code implementation3 Apr 2017 Lin Xiong, Jayashree Karlekar, Jian Zhao, Yi Cheng, Yan Xu, Jiashi Feng, Sugiri Pranata, ShengMei Shen

In this paper, we propose a unified learning framework named Transferred Deep Feature Fusion (TDFF) targeting at the new IARPA Janus Benchmark A (IJB-A) face recognition dataset released by NIST face challenge.

Face Recognition Transfer Learning

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild

no code implementations27 Dec 2016 Fang Zhao, Jiashi Feng, Jian Zhao, Wenhan Yang, Shuicheng Yan

The first one, named multi-scale spatial LSTM encoder, reads facial patches of various scales sequentially to output a latent representation, and occlusion-robustness is achieved owing to the fact that the influence of occlusion is only upon some of the patches.

Face Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.