Search Results for author: Junliang Xing

Found 73 papers, 30 papers with code

Reflective Policy Optimization

1 code implementation 6 Jun 2024 Yaozhong Gan, Renye Yan, Zhe Wu, Junliang Xing

On-policy reinforcement learning methods, like Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO), often demand extensive data per update, leading to sample inefficiency.


Transductive Off-policy Proximal Policy Optimization

no code implementations 6 Jun 2024 Yaozhong Gan, Renye Yan, Xiaoyang Tan, Zhe Wu, Junliang Xing

Proximal Policy Optimization (PPO) is a popular model-free reinforcement learning algorithm, esteemed for its simplicity and efficacy.

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

no code implementations CVPR 2024 Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai

In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed DriveWorld, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.

3D Object Detection Motion Forecasting +4

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent

1 code implementation 22 Apr 2024 Hang Xu, Kai Li, Bingyun Liu, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng

Counterfactual regret minimization (CFR) is a family of algorithms for effectively solving imperfect-information games.


A Parallel Attention Network for Cattle Face Recognition

no code implementations 29 Mar 2024 Jiayu Li, Xuechao Zou, Shiying Wang, Ben Chen, Junliang Xing, Pin Tao

Thus, we create the first large-scale cattle face recognition dataset, ICRWE, for wild environments.

Face Recognition Position

SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement

1 code implementation CVPR 2024 Tao Wang, Lei Jin, Zheng Wang, Jianshu Li, Liang Li, Fang Zhao, Yu Cheng, Li Yuan, Li Zhou, Junliang Xing, Jian Zhao

To leverage this quality information we propose a motion refinement network termed SynSP to achieve a Synergy of Smoothness and Precision in the sequence refinement tasks.

Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing

no code implementations 22 Dec 2023 Jinmin He, Kai Li, Yifan Zang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng

Multi-task reinforcement learning endeavors to accomplish a set of different tasks with a single policy.

PanBench: Towards High-Resolution and High-Performance Pansharpening

no code implementations 20 Nov 2023 Shiying Wang, Xuechao Zou, Kai Li, Junliang Xing, Pin Tao

Pansharpening, a pivotal task in remote sensing, involves integrating low-resolution multispectral images with high-resolution panchromatic images to synthesize an image that is both high-resolution and retains multispectral information.

Change Detection Land Cover Classification +1

UniParser: Multi-Human Parsing with Unified Correlation Representation Learning

1 code implementation 13 Oct 2023 Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao

Multi-human parsing is an image segmentation task necessitating both instance-level and fine-grained category-level information.

Image Segmentation Multi-Human Parsing +2

ARFA: An Asymmetric Receptive Field Autoencoder Model for Spatiotemporal Prediction

no code implementations 1 Sep 2023 Wenxuan Zhang, Xuechao Zou, Li Wu, Xiaoying Wang, Jianqiang Huang, Junliang Xing

Additionally, we construct the RainBench, a large-scale radar echo dataset for precipitation prediction, to address the scarcity of meteorological data in the domain.

Decoder Weather Forecasting

High-Fidelity Lake Extraction via Two-Stage Prompt Enhancement: Establishing a Novel Baseline and Benchmark

1 code implementation 16 Aug 2023 Ben Chen, Xuechao Zou, Kai Li, Yu Zhang, Junliang Xing, Pin Tao

Lake extraction from remote sensing imagery is a complex challenge due to the varied lake shapes and data noise.


Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space

1 code implementation 11 Aug 2023 Haoyu Wang, Haozhe Wu, Junliang Xing, Jia Jia

Creating realistic 3D facial animation is crucial for various applications in the movie production and gaming industry, especially with the burgeoning demand in the metaverse.

motion retargeting Optical Flow Estimation

Semantics2Hands: Transferring Hand Motion Semantics between Avatars

1 code implementation 11 Aug 2023 Zijie Ye, Jia Jia, Junliang Xing

Human hands, the primary means of non-verbal communication, convey intricate semantics in various scenarios.

Anatomy motion retargeting

Speech-Driven 3D Face Animation with Composite and Regional Facial Movements

1 code implementation 10 Aug 2023 Haozhe Wu, Songtao Zhou, Jia Jia, Junliang Xing, Qi Wen, Xiang Wen

This paper emphasizes the importance of considering both the composite and regional natures of facial movements in speech-driven 3D face animation.

3D Face Animation

DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images

1 code implementation 8 Aug 2023 Xuechao Zou, Kai Li, Junliang Xing, Yu Zhang, Shiying Wang, Lei Jin, Pin Tao

Optical satellite images are a critical data source; however, cloud cover often compromises their quality, hindering image applications and analysis.

Cloud Removal Image Generation

Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System

1 code implementation 27 Jun 2023 Xue-Feng Zhu, Tianyang Xu, Jian Zhao, Jia-Wei Liu, Kai Wang, Gang Wang, Jianan Li, Qiang Wang, Lei Jin, Zheng Zhu, Junliang Xing, Xiao-Jun Wu

Still, previous works have simplified such an anti-UAV task as a tracking problem, where the prior information of UAVs is always provided; such a scheme fails in real-world anti-UAV tasks (i.e., complex scenes, indeterminate-appear and -reappear UAVs, and real-time UAV surveillance).

Shuffled Autoregression For Motion Interpolation

no code implementations 10 Jun 2023 Shuo Huang, Jia Jia, Zongxin Yang, Wei Wang, Haozhe Wu, Yi Yang, Junliang Xing

However, motion interpolation is a more complex problem that takes isolated poses (e.g., only one start pose and one end pose) as input.

Motion Interpolation

Single-stage Multi-human Parsing via Point Sets and Center-based Offsets

1 code implementation 22 Apr 2023 Jiaming Chu, Lei Jin, Junliang Xing, Jian Zhao

We instead present a high-performance Single-stage Multi-human Parsing (SMP) deep architecture that decouples the multi-human parsing problem into two fine-grained sub-problems, i.e., locating the human body and parts.

Multi-Human Parsing

PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-performance Cloud Removal from Multi-temporal Satellite Imagery

2 code implementations 29 Mar 2023 Xuechao Zou, Kai Li, Junliang Xing, Pin Tao, Yachao Cui

Satellite imagery analysis plays a pivotal role in remote sensing; however, information loss due to cloud cover significantly impedes its application.

Cloud Detection Cloud Removal

MMTSA: Multimodal Temporal Segment Attention Network for Efficient Human Activity Recognition

no code implementations 14 Oct 2022 Ziqi Gao, Yuntao Wang, Jianguo Chen, Junliang Xing, Shwetak Patel, Xin Liu, Yuanchun Shi

The efficiency evaluation on an edge device showed that MMTSA achieved significantly better accuracy, lower computational load, and lower inference latency than SOTA methods.

Human Activity Recognition

Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game

no code implementations ICLR 2022 Haobo Fu, Weiming Liu, Shuang Wu, Yijia Wang, Tao Yang, Kai Li, Junliang Xing, Bin Li, Bo Ma, Qiang Fu, Yang Wei

The deep policy gradient method has demonstrated promising results in many large-scale games, where the agent learns purely from its own experience.

counterfactual Policy Gradient Methods

RefineCap: Concept-Aware Refinement for Image Captioning

no code implementations 8 Sep 2021 Yekun Chai, Shuo Jin, Junliang Xing

Automatically translating images to texts involves image scene understanding and language modeling.

Decoder Descriptive +4

Cooperative Multi-Agent Reinforcement Learning with Sequential Credit Assignment

1 code implementation NeurIPS 2021 Yifan Zang, Jinmin He, Kai Li, Lily Cao, Haobo Fu, Qiang Fu, Junliang Xing

In this paper, we propose a cooperative MARL method with sequential credit assignment (SeCA) that deduces each agent's contribution to the team's success one by one to learn better cooperation.

counterfactual Multi-agent Reinforcement Learning +4

L2E: Learning to Exploit Your Opponent

no code implementations 18 Feb 2021 Zhe Wu, Kai Li, Enmin Zhao, Hang Xu, Meng Zhang, Haobo Fu, Bo An, Junliang Xing

In this work, we propose a novel Learning to Exploit (L2E) framework for implicit opponent modeling.

Anti-UAV: A Large Multi-Modal Benchmark for UAV Tracking

1 code implementation 21 Jan 2021 Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li, Jian Zhao, Guodong Guo, Zhenjun Han

The release of such a large-scale dataset could be a useful initial step in research on tracking UAVs.

Temperature Regret Matching for Imperfect-Information Games

no code implementations 1 Jan 2021 Enmin Zhao, Kai Li, Junliang Xing

Regret matching (RM) plays a crucial role in CFR and its variants to approach Nash equilibrium.


OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research

no code implementations 11 Dec 2020 Kai Li, Hang Xu, Enmin Zhao, Zhe Wu, Junliang Xing

Owing to the unremitting efforts of a few institutes, significant progress has recently been made in designing superhuman AIs in No-limit Texas Hold'em (NLTH), the primary testbed for large-scale imperfect-information game research.

DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identification

no code implementations 12 Nov 2020 Jiangtao Kong, Yu Cheng, Benjia Zhou, Kai Li, Junliang Xing

To obtain a high-performance vehicle ReID model, we present a novel Distance Shrinking with Angular Marginalizing (DSAM) loss function to perform hybrid learning in both the Original Feature Space (OFS) and the Feature Angular Space (FAS) using the local verification and the global identification information.

Person Re-Identification Vehicle Re-Identification

RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation

1 code implementation 11 Dec 2019 Shaoru Wang, Yongchao Gong, Junliang Xing, Lichao Huang, Chang Huang, Weiming Hu

To reciprocate these two tasks, we design a two-stream structure to learn features on both the object level (i.e., bounding boxes) and the pixel level (i.e., instance masks) jointly.

Instance Segmentation Object +5

Relational Learning for Joint Head and Human Detection

1 code implementation 24 Sep 2019 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Head and human detection have been rapidly improved with the development of deep convolutional neural networks.

Head Detection Human Detection +1

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

no code implementations 15 Sep 2019 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians.

Data Augmentation Occlusion Handling +2

Cross-Resolution Face Recognition via Prior-Aided Face Hallucination and Residual Knowledge Distillation

no code implementations 26 May 2019 Hanyang Kong, Jian Zhao, Xiaoguang Tu, Junliang Xing, ShengMei Shen, Jiashi Feng

Recent deep learning based face recognition methods have achieved great performance, but it remains challenging to recognize very low-resolution query faces, such as 28x28 pixels, when the CCTV camera is far from the captured subject.

Face Hallucination Face Recognition +4

Domain Adaptive Attention Learning for Unsupervised Person Re-Identification

no code implementations 25 May 2019 Yangru Huang, Peixi Peng, Yi Jin, Yidong Li, Junliang Xing, Shiming Ge

In this approach, a domain adaptive attention model is learned to separate the feature map into domain-shared part and domain-specific part.

Domain Adaptation General Classification +2

Multi-Prototype Networks for Unconstrained Set-based Face Recognition

no code implementations 13 Feb 2019 Jian Zhao, Jianshu Li, Xiaoguang Tu, Fang Zhao, Yuan Xin, Junliang Xing, Hengzhu Liu, Shuicheng Yan, Jiashi Feng

In this paper, we study the challenging unconstrained set-based face recognition problem where each subject face is instantiated by a set of media (images and videos) instead of a single image.

Face Recognition

SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks

13 code implementations CVPR 2019 Bo Li, Wei Wu, Qiang Wang, Fangyi Zhang, Junliang Xing, Junjie Yan

Moreover, we propose a new model architecture to perform depth-wise and layer-wise aggregations, which not only further improves the accuracy but also reduces the model size.

Translation Visual Object Tracking +1

Cooperative Multi-Agent Policy Gradients with Sub-optimal Demonstration

no code implementations 5 Dec 2018 Peixi Peng, Junliang Xing

To learn the multi-agent cooperation effectively and tackle the sub-optimality of demonstration, a self-improving learning method is proposed: On the one hand, the centralized state-action values are initialized by the demonstration and updated by the learned decentralized policy to improve the sub-optimality.

Representation based and Attention augmented Meta learning

no code implementations 19 Nov 2018 Yunxiao Qin, Chenxu Zhao, Zezheng Wang, Junliang Xing, Jun Wan, Zhen Lei

The RAML method aims to give the meta learner the ability to leverage previously learned knowledge to reduce the dimension of the original input data by expressing it as high-level representations, and to help the meta learner perform well.

Few-Shot Learning

Temporal-Spatial Mapping for Action Recognition

no code implementations 11 Sep 2018 Xiaolin Song, Cuiling Lan, Wen-Jun Zeng, Junliang Xing, Jingyu Yang, Xiaoyan Sun

We propose a video level 2D feature representation by transforming the convolutional features of all frames to a 2D feature map, referred to as VideoMap.

Action Recognition Image Classification +3

Selective Refinement Network for High Performance Face Detection

3 code implementations 7 Sep 2018 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

In particular, the SRN consists of two modules: the Selective Two-step Classification (STC) module and the Selective Two-step Regression (STR) module.

Face Detection General Classification +2

Visual Tracking via Spatially Aligned Correlation Filters Network

no code implementations ECCV 2018 Mengdan Zhang, Qiang Wang, Junliang Xing, Jin Gao, Peixi Peng, Weiming Hu, Steve Maybank

Correlation filters based trackers rely on a periodic assumption of the search sample to efficiently distinguish the target from the background.

Visual Tracking

Pose Partition Networks for Multi-Person Pose Estimation

no code implementations ECCV 2018 Xuecheng Nie, Jiashi Feng, Junliang Xing, Shuicheng Yan

This paper proposes a novel Pose Partition Network (PPN) to address the challenging multi-person pose estimation problem.

Human Detection Multi-Person Pose Estimation

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking

2 code implementations CVPR 2018 Qiang Wang, Zhu Teng, Junliang Xing, Jin Gao, Weiming Hu, Stephen Maybank

The RASNet model reformulates the correlation filter within a Siamese tracking framework, and introduces different kinds of attention mechanisms to adapt the model without updating the model online.

Object Tracking Representation Learning +1

Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation

no code implementations CVPR 2018 Kai Li, Junliang Xing, Chi Su, Weiming Hu, Yundong Zhang, Stephen Maybank

First, a novel cost-sensitive multi-task loss function is designed to learn transferable aging features by training on the source population.

Age Estimation

View Adaptive Neural Networks for High Performance Skeleton-based Human Action Recognition

2 code implementations 20 Apr 2018 Pengfei Zhang, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jianru Xue, Nanning Zheng

In order to alleviate the effects of view variations, this paper introduces a novel view adaptation scheme, which automatically determines the virtual observation viewpoints in a learning based data driven manner.

Action Recognition Skeleton Based Action Recognition +1

Pose-driven Deep Convolutional Model for Person Re-identification

no code implementations ICCV 2017 Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian

Our deep architecture explicitly leverages the human part cues to alleviate the pose variations and learn robust feature representations from both the global image and different local parts.

Person Re-Identification

Human Pose Estimation using Global and Local Normalization

no code implementations ICCV 2017 Ke Sun, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Dong Liu, Jingdong Wang

We present a two-stage normalization scheme, human body normalization and limb normalization, to make the distribution of the relative joint locations compact, resulting in easier learning of convolutional spatial models and more accurate pose estimation.

Pose Estimation

A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection

1 code implementation CVPR 2017 Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou

At the global stage, given an image with a rough face detection result, the full face region is firstly re-initialized by a supervised spatial transformer network to a canonical shape state and then trained to regress a coarse landmark estimation.

Face Detection Facial Landmark Detection +1

Generative Partition Networks for Multi-Person Pose Estimation

1 code implementation 21 May 2017 Xuecheng Nie, Jiashi Feng, Junliang Xing, Shuicheng Yan

This paper proposes a new Generative Partition Network (GPN) to address the challenging multi-person pose estimation problem.

Ranked #1 on Multi-Person Pose Estimation on WAF (AP metric)

Human Detection Keypoint Detection +1

DCFNet: Discriminant Correlation Filters Network for Visual Tracking

5 code implementations 13 Apr 2017 Qiang Wang, Jin Gao, Junliang Xing, Mengdan Zhang, Weiming Hu

In this work, we present an end-to-end lightweight network architecture, namely DCFNet, to learn the convolutional features and perform the correlation tracking process simultaneously.

Object Tracking Visual Tracking

View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data

1 code implementation ICCV 2017 Pengfei Zhang, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jianru Xue, Nanning Zheng

Rather than re-positioning the skeletons based on a human defined prior criterion, we design a view adaptive recurrent neural network (RNN) with LSTM architecture, which enables the network itself to adapt to the most suitable observation viewpoints from end to end.

Action Recognition Skeleton Based Action Recognition +1

A Graphical Social Topology Model for Multi-Object Tracking

no code implementations 14 Feb 2017 Shan Gao, Xiaogang Chen, Qixiang Ye, Junliang Xing, Arjan Kuijper, Xiangyang Ji

Inspired by the social affinity property of moving objects, we propose a Graphical Social Topology (GST) model, which estimates the group dynamics by jointly modeling the group structure and the states of objects using a topological representation.

Multi-Object Tracking Object

Tensor Power Iteration for Multi-Graph Matching

no code implementations CVPR 2016 Xinchu Shi, Haibin Ling, Weiming Hu, Junliang Xing, Yanning Zhang

Due to its wide range of applications, matching between two graphs has been extensively studied and remains an active topic.

Graph Matching

Deep Attributes Driven Multi-Camera Person Re-identification

no code implementations 11 May 2016 Chi Su, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian

We also propose a semi-supervised attribute learning framework which progressively boosts the accuracy of attributes using only a limited amount of labeled data.

Attribute Metric Learning +1

Co-occurrence Feature Learning for Skeleton based Action Recognition using Regularized Deep LSTM Networks

no code implementations 24 Mar 2016 Wentao Zhu, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Yanghao Li, Li Shen, Xiaohui Xie

Skeleton based action recognition distinguishes human actions using the trajectories of skeleton joints, which provide a very good representation for describing actions.

Action Recognition Skeleton Based Action Recognition +1

Local Subspace Collaborative Tracking

no code implementations ICCV 2015 Lin Ma, Xiaoqin Zhang, Weiming Hu, Junliang Xing, Jiwen Lu, Jie Zhou

To address this, this paper presents a local subspace collaborative tracking method for robust visual tracking, where multiple linear and nonlinear subspaces are learned to better model the nonlinear relationship of object appearances.

Object Object Tracking +1

Shape Driven Kernel Adaptation in Convolutional Neural Network for Robust Facial Traits Recognition

no code implementations CVPR 2015 Shaoxin Li, Junliang Xing, Zhiheng Niu, Shiguang Shan, Shuicheng Yan

Comprehensive experiments on WebFace, Morph II and MultiPIE databases well validate the effectiveness of the proposed kernel adaptation method and tree-structured convolutional architecture for facial traits recognition tasks, including identity, age and gender classification.

Age And Gender Classification Gender Classification +1

Multiple Object Tracking: A Literature Review

no code implementations 26 Sep 2014 Wenhan Luo, Junliang Xing, Anton Milan, Xiaoqin Zhang, Wei Liu, Tae-Kyun Kim

We inspect the recent advances in various aspects and propose some interesting directions for future research.

Multiple Object Tracking Object

Multi-target Tracking with Motion Context in Tensor Power Iteration

no code implementations CVPR 2014 Xinchu Shi, Haibin Ling, Weiming Hu, Chunfeng Yuan, Junliang Xing

In this paper, we model interactions between neighbor targets by pair-wise motion context, and further encode such context into the global association optimization.

Towards Multi-view and Partially-Occluded Face Alignment

no code implementations CVPR 2014 Junliang Xing, Zhiheng Niu, Junshi Huang, Weiming Hu, Shuicheng Yan

During each training stage, the SRD model learns a relational dictionary to capture consistent relationships between face appearance and shape, which are respectively modeled by the pose-indexed image features and the shape displacements for current estimated landmarks.

Face Alignment
