Search Results for author: Peng Wang

Found 205 papers, 65 papers with code

PCEE-BERT: Accelerating BERT Inference via Patient and Confident Early Exiting

1 code implementation Findings (NAACL) 2022 Zhen Zhang, Wei Zhu, Jinfan Zhang, Peng Wang, Rize Jin, Tae-Sun Chung

In this work, we propose Patient and Confident Early Exiting BERT (PCEE-BERT), an off-the-shelf sample-dependent early exiting method that can work with different PLMs and can also work along with popular model compression methods.

Model Compression Pretrained Language Models

A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model

no code implementations ICML 2020 Peng Wang, Zirui Zhou, Anthony Man-Cho So

In this paper, we focus on the problem of exactly recovering the communities in a binary symmetric SBM, where a graph of $n$ vertices is partitioned into two equal-sized communities and the vertices are connected with probability $p = \alpha\log(n)/n$ within communities and $q = \beta\log(n)/n$ across communities for some $\alpha>\beta>0$.

Stochastic Block Model
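
The binary symmetric SBM described in the snippet above can be sampled directly from its definition. The following is a minimal sketch, not the authors' code: the function name `sample_binary_symmetric_sbm`, the `seed` parameter, and the +1/-1 label convention are assumptions made for illustration, and it presumes $n$ is even and large enough that $p, q \le 1$.

```python
import math
import random


def sample_binary_symmetric_sbm(n, alpha, beta, seed=0):
    """Sample a graph from a binary symmetric SBM (illustrative sketch).

    Vertices 0..n/2-1 get label +1 and vertices n/2..n-1 get label -1.
    Each pair is connected independently with probability
    p = alpha*log(n)/n inside a community and q = beta*log(n)/n across.
    """
    if n % 2 != 0:
        raise ValueError("n must be even for two equal-sized communities")
    rng = random.Random(seed)  # hypothetical seeding choice, for reproducibility
    p = alpha * math.log(n) / n
    q = beta * math.log(n) / n
    labels = [1 if i < n // 2 else -1 for i in range(n)]
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            prob = p if labels[i] == labels[j] else q
            if rng.random() < prob:
                edges.append((i, j))
    return labels, edges


if __name__ == "__main__":
    labels, edges = sample_binary_symmetric_sbm(200, alpha=10.0, beta=2.0)
    within = sum(1 for i, j in edges if labels[i] == labels[j])
    print("edges:", len(edges), "within-community:", within)
```

The pairwise loop is quadratic in $n$ and is meant only to make the model concrete; it says nothing about the recovery algorithm the paper proposes.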

Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance

no code implementations 16 Dec 2022 Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

To enhance the discriminative power of features, we propose a batch clustering based loss to encourage a clustering branch to generate distinct normal and abnormal clusters based on a batch of data.

Anomaly Detection Video Anomaly Detection

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

1 code implementation 8 Dec 2022 Jinze Bai, Rui Men, Hao Yang, Xuancheng Ren, Kai Dang, Yichang Zhang, Xiaohuan Zhou, Peng Wang, Sinan Tan, An Yang, Zeyu Cui, Yu Han, Shuai Bai, Wenbin Ge, Jianxin Ma, Junyang Lin, Jingren Zhou, Chang Zhou

As a starting point, we provide presets of 7 different modalities and 23 highly-diverse example tasks in OFASys, with which we also develop a first-in-kind, single model, OFA+, that can handle text, image, speech, video, and motion data.

Multi-Task Learning

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion

no code implementations 5 Dec 2022 Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Ruiqi Wu, Shizhou Zhang, Peng Wang, Yanning Zhang

The key insight of this design is that the cross-attention mechanism in the transformer could be an ideal solution to align the discriminative texture clues from the original image with the canonical view image, which could compensate for the low-quality texture information of the canonical view image.

Domain Generalization Generalizable Person Re-identification +1

NeuralUDF: Learning Unsigned Distance Fields for Multi-view Reconstruction of Surfaces with Arbitrary Topologies

no code implementations 25 Nov 2022 Xiaoxiao Long, Cheng Lin, Lingjie Liu, YuAn Liu, Peng Wang, Christian Theobalt, Taku Komura, Wenping Wang

In this paper, we propose to represent surfaces as the Unsigned Distance Function (UDF) and develop a new volume rendering scheme to learn the neural UDF representation.

Neural Rendering

BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields

no code implementations 23 Nov 2022 Peng Wang, Lingzhe Zhao, Ruijie Ma, Peidong Liu

Neural Radiance Fields (NeRF) have received considerable attention recently, due to their impressive capability in photo-realistic 3D reconstruction and novel view synthesis, given a set of posed camera images.

3D Reconstruction Novel View Synthesis

Semantic Guided Level-Category Hybrid Prediction Network for Hierarchical Image Classification

no code implementations 22 Nov 2022 Peng Wang, Jingzhou Chen, Yuntao Qian

Hierarchical classification (HC) assigns each object with multiple labels organized into a hierarchical structure.

Image Classification Word Embeddings

FIRES: Fast Imaging and 3D Reconstruction of Archaeological Sherds

no code implementations 13 Nov 2022 Jiepeng Wang, Congyi Zhang, Peng Wang, Xin Li, Peter J. Cobb, Christian Theobalt, Wenping Wang

Extensive validation in labs and testing in excavation sites demonstrated that our FIRES system provides the first fast, accurate, portable, and cost-effective solution for the task of imaging and 3D reconstruction of sherds in archaeological excavations.

3D Reconstruction

Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold

no code implementations 19 Sep 2022 Can Yaras, Peng Wang, Zhihui Zhu, Laura Balzano, Qing Qu

When training overparameterized deep networks for classification tasks, it has been widely observed that the learned features exhibit a so-called "neural collapse" phenomenon.

Multi-class Classification Representation Learning +1

Levenshtein OCR

2 code implementations 8 Sep 2022 Cheng Da, Peng Wang, Cong Yao

A novel scene text recognizer based on Vision-Language Transformer (VLT) is presented.

Imitation Learning Optical Character Recognition +1

Multi-Granularity Prediction for Scene Text Recognition

2 code implementations 8 Sep 2022 Peng Wang, Cheng Da, Cong Yao

In this work, we first draw inspiration from the recent progress in Vision Transformer (ViT) to construct a conceptually simple yet powerful vision STR model, which is built upon ViT and outperforms previous state-of-the-art models for scene text recognition, including both pure vision models and language-augmented methods.

Language Modelling Optical Character Recognition +1

Instance Image Retrieval by Learning Purely From Within the Dataset

no code implementations 12 Aug 2022 Zhongyan Zhang, Lei Wang, Yang Wang, Luping Zhou, Jianjia Zhang, Peng Wang, Fang Chen

Although achieving promising results, this approach is restricted by two issues: 1) the domain gap between benchmark datasets and the dataset of a given retrieval task; 2) the required auxiliary dataset cannot be readily obtained.

Image Retrieval Retrieval +1

Prompt Tuning for Generative Multimodal Pretrained Models

1 code implementation 4 Aug 2022 Hao Yang, Junyang Lin, An Yang, Peng Wang, Chang Zhou, Hongxia Yang

Prompt tuning has become a new paradigm for model tuning and it has demonstrated success in natural language pretraining and even vision pretraining.

Image Captioning Visual Entailment +1

One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning

no code implementations 31 Jul 2022 Zhipeng Zhang, Zhimin Wei, Zhongzhen Huang, Rui Niu, Peng Wang

However, one unsolved issue of these models is that the number of reasoning steps needs to be pre-defined and fixed before inference, ignoring the varying complexity of expressions.

Referring Expression Referring Expression Comprehension +2

Progressively-connected Light Field Network for Efficient View Synthesis

no code implementations 10 Jul 2022 Peng Wang, YuAn Liu, Guying Lin, Jiatao Gu, Lingjie Liu, Taku Komura, Wenping Wang

ProLiF encodes a 4D light field, which allows rendering a large batch of rays in one training step for image- or patch-level losses.

Novel View Synthesis

NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors

no code implementations 27 Jun 2022 Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang

The key idea of NeuRIS is to integrate estimated normal of indoor scenes as a prior in a neural rendering framework for reconstructing large texture-less shapes and, importantly, to do this in an adaptive manner to also enable the reconstruction of irregular shapes with fine details.

3D Reconstruction Neural Rendering

SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views

1 code implementation 12 Jun 2022 Xiaoxiao Long, Cheng Lin, Peng Wang, Taku Komura, Wenping Wang

We introduce SparseNeuS, a novel neural rendering based method for the task of surface reconstruction from multi-view images.

Neural Rendering Surface Reconstruction

Convergence and Recovery Guarantees of the K-Subspaces Method for Subspace Clustering

1 code implementation 11 Jun 2022 Peng Wang, Huikang Liu, Anthony Man-Cho So, Laura Balzano

The K-subspaces (KSS) method is a generalization of the K-means method for subspace clustering.

Fast-Spanning Ant Colony Optimisation (FaSACO) for Mobile Robot Coverage Path Planning

no code implementations 31 May 2022 Christopher Carr, Peng Wang

Bio-inspired algorithms such as Ant Colony Optimisation (ACO) have been exploited to solve the problem because they can utilise heuristic information to mitigate the path planning complexity.

VoGE: A Differentiable Volume Renderer using Gaussian Ellipsoids for Analysis-by-Synthesis

1 code implementation 30 May 2022 Angtian Wang, Peng Wang, Jian Sun, Adam Kortylewski, Alan Yuille

Differentiable rendering allows the application of computer graphics on vision tasks, e.g., object pose and shape fitting, via analysis-by-synthesis, where gradients at occluded regions are important when inverting the rendering process.

Pose Estimation

Balanced control between performance and saturation for constrained nonlinear systems

no code implementations 10 May 2022 Peng Wang, Haibin Wang, Shuzhi Sam Ge, Xiaobing Zhang

This paper addresses the balanced control between performance and saturation for a class of constrained nonlinear systems, including the branches: balanced command filtered backstepping (BCFB) and balanced performance control (BPC).

Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information

no code implementations 7 May 2022 Zhipeng Zhang, Xinglin Hou, Kai Niu, Zhongzhen Huang, Tiezheng Ge, Yuning Jiang, Qi Wu, Peng Wang

Therefore, we present a dataset, E-MMAD (e-commercial multimodal multi-structured advertisement copywriting), which requires and supports much more detailed information in text generation.

Text Generation Video Captioning

Dual-Level Decoupled Transformer for Video Captioning

no code implementations 6 May 2022 Yiqi Gao, Xinglin Hou, Wei Suo, Mengyang Sun, Tiezheng Ge, Yuning Jiang, Peng Wang

As for the latter, "couple" means treating the generation of visual semantic and syntax-related words equally.

Video Captioning

CapOnImage: Context-driven Dense-Captioning on Image

no code implementations 27 Apr 2022 Yiqi Gao, Xinglin Hou, Yuanmeng Zhang, Tiezheng Ge, Yuning Jiang, Peng Wang

Existing image captioning systems are dedicated to generating narrative captions for images, which are spatially detached from the image in presentation.

Dense Captioning Image Captioning

Pushing the Performance Limit of Scene Text Recognizer without Human Annotation

1 code implementation CVPR 2022 Caiyuan Zheng, Hui Li, Seon-Min Rhee, Seungju Han, Jae-Joon Han, Peng Wang

A robust consistency regularization based semi-supervised framework is proposed for STR, which can effectively solve the instability issue due to domain inconsistency between synthetic and real images.

Scene Text Recognition

NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night

1 code implementation CVPR 2022 Xueqing Deng, Peng Wang, Xiaochen Lian, Shawn Newsam

Notably, NightLab contains models at two levels of granularity, i.e., image and regional, and each level is composed of light adaptation and segmentation modules.

Self-Driving Cars Semantic Segmentation

DistPro: Searching A Fast Knowledge Distillation Process via Meta Optimization

no code implementations 12 Apr 2022 Xueqing Deng, Dawei Sun, Shawn Newsam, Peng Wang

Specifically, given a pair of student and teacher networks, DistPro first sets up a rich set of KD connections from the transmitting layers of the teacher to the receiving layers of the student; meanwhile, various transforms are also proposed for comparing feature maps along each pathway for the distillation.

Knowledge Distillation Meta-Learning

Self-Contrastive Learning based Semi-Supervised Radio Modulation Classification

no code implementations 29 Mar 2022 Dongxin Liu, Peng Wang, Tianshi Wang, Tarek Abdelzaher

This paper presents a semi-supervised learning framework that is new in being designed for automatic modulation classification (AMC).

Classification Contrastive Learning

Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization

1 code implementation CVPR 2022 Wei Dong, Junsheng Wu, Yi Luo, ZongYuan Ge, Peng Wang

In this work, we present a simple-yet-effective self-supervised node representation learning strategy via directly maximizing the mutual information between the hidden representations of nodes and their neighbourhood, which can be theoretically justified by its link to graph smoothing.

Node Classification Representation Learning

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding

no code implementations ACL 2022 Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, ShiLiang Pu, Fei Wu

To achieve effective grounding under a limited annotation budget, we investigate one-shot video grounding, and learn to ground natural language in all video frames with solely one frame labeled, in an end-to-end manner.

Representation Learning Video Grounding

Exact Community Recovery over Signed Graphs

no code implementations 22 Feb 2022 Xiaolu Wang, Peng Wang, Anthony Man-Cho So

Signed graphs encode similarity and dissimilarity relationships among different entities with positive and negative edges.

Stochastic Block Model

Relation Regularized Scene Graph Generation

no code implementations 22 Feb 2022 Yuyu Guo, Lianli Gao, Jingkuan Song, Peng Wang, Nicu Sebe, Heng Tao Shen, Xuelong Li

Inspired by this observation, in this article, we propose a relation regularized network (R2-Net), which can predict whether there is a relationship between two objects and encode this relation into object feature refinement and better SGG.

Graph Classification Graph Generation +4

Graph-based Extractive Explainer for Recommendations

no code implementations 20 Feb 2022 Peng Wang, Renqin Cai, Hongning Wang

Explanations in a recommender system assist users in making informed decisions among a set of recommended items.

Recommendation Systems

Adaptive Graph Convolutional Networks for Weakly Supervised Anomaly Detection in Videos

no code implementations 14 Feb 2022 Congqi Cao, Xin Zhang, Shizhou Zhang, Peng Wang, Yanning Zhang

For weakly supervised anomaly detection, most existing work is limited to the problem of inadequate video representation due to the inability of modeling long-term contextual information.

Association Graph Learning +1

Negative-ResNet: Noisy Ambulatory Electrocardiogram Signal Classification Scheme

no code implementations 25 Jan 2022 Zijiao Chen, Zihuai Lin, Peng Wang, Ming Ding

With recently successful applications of deep learning in computer vision and general signal processing, deep learning has shown many unique advantages in medical signal processing.

Classification

Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification

no code implementations CVPR 2022 Jingzhou Chen, Peng Wang, Jian Liu, Yuntao Qian

Hierarchical multi-granularity classification (HMC) assigns hierarchical multi-granularity labels to each object and focuses on encoding the label hierarchy, e.g., ["Albatross", "Laysan Albatross"] from coarse-to-fine levels.

Fine-Grained Image Classification

Multi-Domain Joint Training for Person Re-Identification

no code implementations 6 Jan 2022 Lu Yang, Lingqiao Liu, Yunlong Wang, Peng Wang, Yanning Zhang

Our discovery is that training with such an adaptive model can better benefit from more training samples.

Person Re-Identification

Robust Security Analysis Based on Random Geometry Theory for Satellite-Terrestrial-Vehicle Network

no code implementations 28 Dec 2021 Xudong Li, Ye Fan, Rugui Yao, Peng Wang, Nan Qi, Xiaoya Zuo

Driven by B5G and 6G technologies, multi-network fusion is an indispensable tendency for future communications.

Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition

no code implementations 22 Nov 2021 Peng Wang, Jun Wen, Chenyang Si, Yuntao Qian, Liang Wang

Finally, in the Information Fuser, we explore varied strategies to combine the Sequence Reconstructor and Contrastive Motion Learner, and propose to capture postures and motions simultaneously via a knowledge-distillation based fusion strategy that transfers the motion learning from the Contrastive Motion Learner to the Sequence Reconstructor.

Action Recognition Contrastive Learning +4

LoS-Map Construction for Proactive Relay of Opportunity Selection in 6G V2X Systems

no code implementations 15 Nov 2021 Francesco Linsalata, Silvia Mura, Marouan Mizmizi, Maurizio Magarini, Peng Wang, Majid Nasiri Khormuji, Alberto Perotti, Umberto Spagnolini

Recent advances in Vehicle-to-Everything (V2X) technology and the upcoming sixth-generation (6G) network will dawn a new era for vehicular services with enhanced communication capabilities.

Autonomous Vehicles

Spatial-Interference Aware Cooperative Resource Allocation for 5G NR Sidelink Communications

no code implementations 15 Nov 2021 Silvia Mura, Francesco Linsalata, Marouan Mizmizi, Maurizio Magarini, Majid Nasiri Khormuji, Peng Wang, Alberto Perotti, Umberto Spagnolini

Distributed resource allocation (RA) schemes have been introduced in cellular vehicle-to-everything (C-V2X) standard for vehicle-to-vehicle (V2V) sidelink (SL) communications to share the limited spectrum (sub-6GHz) efficiently.

NAS-FCOS: Efficient Search for Object Detection Architectures

1 code implementation 24 Oct 2021 Ning Wang, Yang Gao, Hao Chen, Peng Wang, Zhi Tian, Chunhua Shen, Yanning Zhang

Neural Architecture Search (NAS) has shown great potential in effectively reducing manual effort in network design by automatically discovering optimal architectures.

Neural Architecture Search object-detection +1

StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis

1 code implementation ICLR 2022 Jiatao Gu, Lingjie Liu, Peng Wang, Christian Theobalt

We perform volume rendering only to produce a low-resolution feature map and progressively apply upsampling in 2D to address the first issue.

Image Generation

A deep learning pipeline for localization, differentiation, and uncertainty estimation of liver lesions using multi-phasic and multi-sequence MRI

no code implementations 17 Oct 2021 Peng Wang, YuHsuan Wu, Bolin Lai, Xiao-Yun Zhou, Le Lu, Wendi Liu, Huabang Zhou, Lingyun Huang, Jing Xiao, Adam P. Harrison, Ningyang Jia, Heping Hu

Results: the proposed CAD solution achieves a mean F1 score of 0.62, outperforming the abdominal radiologist (0.47), matching the junior hepatology radiologist (0.61), and underperforming the senior hepatology radiologist (0.68).

Specificity

Space-and-time-synchronized simultaneous vehicle tracking/formation using cascaded prescribed-time control

no code implementations 11 Sep 2021 Peng Wang, Ziyin Chen, Xiaobing Zhang

In this paper, we present a space-and-time-synchronized control method with application to the simultaneous tracking/formation.

Continual Neural Mapping: Learning An Implicit Scene Representation from Sequential Observations

no code implementations ICCV 2021 Zike Yan, Yuxin Tian, Xuesong Shi, Ping Guo, Peng Wang, Hongbin Zha

We introduce an experience replay approach to tackle an exemplary task of continual neural mapping: approximating a continuous signed distance function (SDF) from sequential depth images as a scene geometry representation.

Continual Learning

Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation

no code implementations 5 Aug 2021 Yiming Li, Tao Kong, Ruihang Chu, Yifeng Li, Peng Wang, Lei LI

In a unified framework, we jointly predict the feasible 6-DoF grasp poses, instance semantic segmentation, and collision information.

Multi-Task Learning Pose Estimation +1

Neural Rays for Occlusion-aware Image-based Rendering

1 code implementation CVPR 2022 YuAn Liu, Sida Peng, Lingjie Liu, Qianqian Wang, Peng Wang, Christian Theobalt, Xiaowei Zhou, Wenping Wang

On such a 3D point, these generalization methods will include inconsistent image features from invisible views, which interfere with the radiance field construction.

Neural Rendering Novel View Synthesis +1

AdaXpert: Adapting Neural Architecture for Growing Data

1 code implementation 1 Jul 2021 Shuaicheng Niu, Jiaxiang Wu, Guanghui Xu, Yifan Zhang, Yong Guo, Peilin Zhao, Peng Wang, Mingkui Tan

To address this, we present a neural architecture adaptation method, namely Adaptation eXpert (AdaXpert), to efficiently adjust previous architectures on the growing data.

Investigation of Bare-bones Algorithms from Quantum Perspective: A Quantum Dynamical Global Optimizer

no code implementations 26 Jun 2021 Peng Wang, Gang Xin, Fang Wang

Correspondingly, the basic search behaviour is derived, which constitutes the basic iterative process of a simple optimization system.

NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction

4 code implementations NeurIPS 2021 Peng Wang, Lingjie Liu, YuAn Liu, Christian Theobalt, Taku Komura, Wenping Wang

In NeuS, we propose to represent a surface as the zero-level set of a signed distance function (SDF) and develop a new volume rendering method to train a neural SDF representation.

Novel View Synthesis Surface Reconstruction

HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers

1 code implementation CVPR 2021 Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo

Last, we proposed an efficient fine-grained search strategy to train HR-NAS, which effectively explores the search space, and finds optimal architectures given various tasks and computation resources.

Image Classification Neural Architecture Search +2

Fastening the Initial Access in 5G NR Sidelink for 6G V2X Networks

no code implementations 10 Jun 2021 Marouan Mizmizi, Francesco Linsalata, Mattia Brambilla, Filippo Morandi, Kai Dong, Maurizio Magarini, Monica Nicoli, Majid Nasiri Khormuji, Peng Wang, Renaud Alexandre Pitaval, Umberto Spagnolini

The ever-increasing demand for intelligent, automated, and connected mobility solutions pushes for the development of an innovative sixth Generation (6G) of cellular networks.

Quantization

Generative Adversarial Networks: A Survey Towards Private and Secure Applications

no code implementations 7 Jun 2021 Zhipeng Cai, Zuobin Xiong, Honghui Xu, Peng Wang, Wei Li, Yi Pan

Generative Adversarial Networks (GAN) have promoted a variety of applications in computer vision, natural language processing, etc.

Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation

no code implementations Findings (ACL) 2021 Peng Wang, Junyang Lin, An Yang, Chang Zhou, Yichang Zhang, Jingren Zhou, Hongxia Yang

Experimental results demonstrate that our method outperforms the previous state-of-the-art methods in both automatic and human evaluation, especially on coverage and faithfulness.

Table-to-Text Generation

Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention

no code implementations 5 May 2021 Wei Suo, Mengyang Sun, Peng Wang, Qi Wu

Referring Expression Comprehension (REC) has become one of the most important tasks in visual reasoning, since it is an essential step for many vision-and-language tasks such as visual question answering.

Question Answering Referring Expression +3

Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads

no code implementations 30 Apr 2021 Chenyu Gao, Qi Zhu, Peng Wang, Qi Wu

Based on this observation, we design a dynamic chopping module that can automatically remove heads and layers of the VisualBERT at an instance level when dealing with different questions.

Question Answering Visual Question Answering +1

CAT: Cross-Attention Transformer for One-Shot Object Detection

no code implementations 30 Apr 2021 Weidong Lin, Yuyan Deng, Yang Gao, Ning Wang, Jinghao Zhou, Lingqiao Liu, Lei Zhang, Peng Wang

Given a query patch from a novel class, one-shot object detection aims to detect all instances of that class in a target image through the semantic similarity comparison.

object-detection One-Shot Object Detection +2

Center Prediction Loss for Re-identification

no code implementations 30 Apr 2021 Lu Yang, Yunlong Wang, Lingqiao Liu, Peng Wang, Lu Chi, Zehuan Yuan, Changhu Wang, Yanning Zhang

In this paper, we propose a new loss based on center predictivity, that is, a sample must be positioned in a location of the feature space such that from it we can roughly predict the location of the center of same-class samples.

PURE: Passive mUlti-peRson idEntification via Deep Footstep Separation and Recognition

no code implementations 15 Apr 2021 Chao Cai, Ruinan Jin, Peng Wang, Liyuan Ye, Hongbo Jiang, Jun Luo

Recently, passive behavioral biometrics (e.g., gesture or footstep) have become promising complements to conventional user identification methods (e.g., face or fingerprint) under special situations, yet existing sensing technologies require lengthy measurement traces and cannot identify multiple users at the same time.

Person Identification

Residual Gaussian Process: A Tractable Nonparametric Bayesian Emulator for Multi-fidelity Simulations

no code implementations 8 Apr 2021 Wei W. Xing, Akeel A. Shah, Peng Wang, Shandian Zhe, Qian Fu, Robert M. Kirby

The resulting model is equipped with a closed-form solution for the predictive posterior, making it applicable to advanced, high-dimensional tasks that require uncertainty estimation.

Active Learning

An Adversarial Human Pose Estimation Network Injected with Graph Structure

no code implementations 29 Mar 2021 Lei Tian, Guoqiang Liang, Peng Wang, Chunhua Shen

Because human keypoints can be invisible in images due to illumination, occlusion and overlap, most current human pose estimation methods are likely to produce unreasonable human pose predictions.

Pose Estimation Pose Prediction

Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification

no code implementations CVPR 2021 Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang

Learning discriminative image representations plays a vital role in long-tailed image classification because it can ease the classifier learning in imbalanced cases.

Classification Contrastive Learning +4

Hetero-Modal Learning and Expansive Consistency Constraints for Semi-Supervised Detection from Multi-Sequence Data

no code implementations 24 Mar 2021 Bolin Lai, YuHsuan Wu, Xiao-Yun Zhou, Peng Wang, Le Lu, Lingyun Huang, Mei Han, Jing Xiao, Heping Hu, Adam P. Harrison

Lesion detection serves a critical role in early diagnosis and has been well explored in recent years due to methodological advances and increased data availability.

Lesion Detection

Instance and Pair-Aware Dynamic Networks for Re-Identification

no code implementations 9 Mar 2021 Bingliang Jiao, Xin Tan, Jinghao Zhou, Lu Yang, Yunlong Wang, Peng Wang

The proposed model is composed of three main branches where a self-guided dynamic branch is constructed to strengthen instance-specific features, focusing on every single image.

Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification

no code implementations 9 Mar 2021 Lu Yang, Hongbang Liu, Jinghao Zhou, Lingqiao Liu, Lei Zhang, Peng Wang, Yanning Zhang

Learning cross-view consistent feature representation is the key for accurate vehicle Re-identification (ReID), since the visual appearance of vehicles changes significantly under different viewpoints.

Vehicle Re-Identification

Scalable Learning With a Structural Recurrent Neural Network for Short-Term Traffic Prediction

1 code implementation 3 Mar 2021 Youngjoo Kim, Peng Wang, Lyudmila Mihaylova

With the real traffic speed data measured in the city of Santander, we demonstrate the proposed SRNN outperforms the image-based approaches using the capsule network (CapsNet) by 14.1% and the convolutional neural network (CNN) by 5.87%, respectively, in terms of root mean squared error (RMSE).

Semantic Similarity Semantic Textual Similarity +2

M6: A Chinese Multimodal Pretrainer

no code implementations 1 Mar 2021 Junyang Lin, Rui Men, An Yang, Chang Zhou, Ming Ding, Yichang Zhang, Peng Wang, Ang Wang, Le Jiang, Xianyan Jia, Jie Zhang, Jianwei Zhang, Xu Zou, Zhikang Li, Xiaodong Deng, Jie Liu, Jinbao Xue, Huiling Zhou, Jianxin Ma, Jin Yu, Yong Li, Wei Lin, Jingren Zhou, Jie Tang, Hongxia Yang

In this work, we construct the largest dataset for multimodal pretraining in Chinese, which consists of over 1.9TB images and 292GB texts that cover a wide range of domains.

Image Generation

Derive Lovelock Gravity from String Theory in Cosmological Background

no code implementations 24 Dec 2020 Peng Wang, Houwen Wu, Haitang Yang, Shuxuan Ying

It was proved more than three decades ago, that the first order $\alpha'$ correction of string effective theory could be written as the Gauss-Bonnet term, which is the quadratic term of Lovelock gravity.

High Energy Physics - Theory General Relativity and Quantum Cosmology High Energy Physics - Phenomenology

Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data Augmentation

1 code implementation 15 Dec 2020 Feixiang Lu, Zongdai Liu, Hui Miao, Peng Wang, Liangjun Zhang, Ruigang Yang, Dinesh Manocha, Bin Zhou

For autonomous driving, the dynamics and states of vehicle parts such as doors, the trunk, and the bonnet can provide meaningful semantic information and interaction states, which are essential to ensuring the safety of the self-driving vehicle.

Autonomous Driving Data Augmentation +3

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps

1 code implementation 9 Dec 2020 Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu

Texts appearing in daily scenes that can be recognized by OCR (Optical Character Recognition) tools contain significant information, such as street name, product brand and prices.

Image Captioning Optical Character Recognition +2

Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning

1 code implementation 7 Dec 2020 Haokui Zhang, Ying Li, Yenan Jiang, Peng Wang, Qiang Shen, Chunhua Shen

In contrast to previous approaches, we do not impose restrictions on the source data sets: they do not have to be collected by the same sensors as the target data sets.

Classification General Classification +1

Quantum Dynamics of Optimization Problems

no code implementations 6 Dec 2020 Peng Wang, Gang Xin, Yuwei Jiao

The mathematical relationship between the objective function and the wave function is established, and the quantum interpretation of the optimization problem is realized.

Ferryman as SemEval-2020 Task 5: Optimized BERT for Detecting Counterfactuals

no code implementations SEMEVAL 2020 Weilong Chen, Yan Zhuang, Peng Wang, Feng Hong, Yan Wang, Yanru Zhang

The main purpose of this article is to state the effect of using different methods and models for counterfactual determination and detection of causal knowledge.

Counterfactual Detection

AprilE: Attention with Pseudo Residual Connection for Knowledge Graph Embedding

no code implementations COLING 2020 Yuzhang Liu, Peng Wang, Yingtai Li, Yizhan Shao, Zhongkai Xu

To address this issue, we propose a novel model, AprilE, which employs triple-level self-attention and pseudo residual connection to model relational patterns.

Knowledge Graph Embedding

WikiAsp: A Dataset for Multi-domain Aspect-based Summarization

1 code implementation 16 Nov 2020 Hiroaki Hayashi, Prashant Budania, Peng Wang, Chris Ackerson, Raj Neervannan, Graham Neubig

In this paper, we propose WikiAsp, a large-scale dataset for multi-domain aspect-based summarization that attempts to spur research in the direction of open-domain aspect-based summarization.

Few-shot Action Recognition with Implicit Temporal Alignment and Pair Similarity Optimization

no code implementations13 Oct 2020 Congqi Cao, Yajuan Li, Qinyi Lv, Peng Wang, Yanning Zhang

Few-shot learning aims to recognize instances from novel classes with few labeled samples, which has great value in research and application.

Few-Shot action recognition Few Shot Action Recognition +3

Disentangled Neural Architecture Search

no code implementations24 Sep 2020 Xinyue Zheng, Peng Wang, Qigang Wang, Zhongchao Shi

However, existing methods rely heavily on a black-box controller to search architectures, which suffers from the serious problem of lacking interpretability.

Neural Architecture Search

Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses

1 code implementation17 Jul 2020 Miao Liao, Sibo Zhang, Peng Wang, Hao Zhu, Xinxin Zuo, Ruigang Yang

In this paper, we propose a novel approach to convert given speech audio to a photo-realistic speaking video of a specific person, where the output video has synchronized, realistic, and expressive rich body dynamics.

Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks

no code implementations ECCV 2020 Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, Yinjie Lei

Most existing crowd counting systems rely on the availability of the object location annotation which can be expensive to obtain.

Crowd Counting

ODE-CNN: Omnidirectional Depth Extension Networks

no code implementations3 Jul 2020 Xinjing Cheng, Peng Wang, Yanqi Zhou, Chenye Guan, Ruigang Yang

Omnidirectional 360° camera proliferates rapidly for autonomous robots since it significantly enhances the perception ability by widening the field of view (FoV).

Non-Convex Exact Community Recovery in Stochastic Block Model

1 code implementation29 Jun 2020 Peng Wang, Zirui Zhou, Anthony Man-Cho So

Community detection in graphs that are generated according to stochastic block models (SBMs) has received much attention lately.

Community Detection Stochastic Block Model

A Robust Attentional Framework for License Plate Recognition in the Wild

no code implementations6 Jun 2020 Linjiang Zhang, Peng Wang, Hui Li, Zhen Li, Chunhua Shen, Yanning Zhang

On the other hand, the 2D attentional based license plate recognizer with an Xception-based CNN encoder is capable of recognizing license plates with different patterns under various scenarios accurately and robustly.

Image Generation License Plate Recognition

Structured Multimodal Attentions for TextVQA

2 code implementations1 Jun 2020 Chenyu Gao, Qi Zhu, Peng Wang, Hui Li, Yuliang Liu, Anton Van Den Hengel, Qi Wu

In this paper, we propose an end-to-end structured multimodal attention (SMA) neural network to mainly solve the first two issues above.

Graph Attention Optical Character Recognition +3

Vid2Curve: Simultaneous Camera Motion Estimation and Thin Structure Reconstruction from an RGB Video

no code implementations7 May 2020 Peng Wang, Lingjie Liu, Nenglun Chen, Hung-Kuo Chu, Christian Theobalt, Wenping Wang

We propose the first approach that simultaneously estimates camera motion and reconstructs the geometry of complex 3D thin structures in high quality from a color video captured by a handheld camera.

Motion Estimation Occlusion Handling +1

Challenge Closed-book Science Exam: A Meta-learning Based Question Answering System

no code implementations26 Apr 2020 Xinyue Zheng, Peng Wang, Qigang Wang, Zhongchao Shi

Prior work in standardized science exams requires support from large text corpus, such as targeted science corpus from Wikipedia or SimpleWikipedia.

Language Modelling Meta-Learning +2

Anisotropic Convolutional Networks for 3D Semantic Scene Completion

1 code implementation CVPR 2020 Jie Li, Kai Han, Peng Wang, Yu Liu, Xia Yuan

In contrast to the standard 3D convolution that is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wisely.

3D Semantic Scene Completion from a single RGB image

TEDL: A Text Encryption Method Based on Deep Learning

1 code implementation9 Mar 2020 Xiang Li, Peng Wang

Firstly, both communication parties establish a word vector table by training a deep learning model according to specified hyperparameters.

Toward Interpretability of Dual-Encoder Models for Dialogue Response Suggestions

no code implementations2 Mar 2020 Yitong Li, Dianqi Li, Sushant Prakash, Peng Wang

To improve the interpretability in the dual encoder models, we design a novel regularization loss to minimize the mutual information between unimportant words and desired labels, in addition to the original attention method, so that important words are emphasized while unimportant words are de-emphasized.

Word Embeddings

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

1 code implementation CVPR 2020 Shizhe Chen, Qin Jin, Peng Wang, Qi Wu

From the ASG, we propose a novel ASG2Caption model, which is able to recognise user intentions and semantics in the graph, and therefore generate desired captions according to the graph structure.

Image Captioning

Using Sampled Network Data With The Autologistic Actor Attribute Model

2 code implementations30 Jan 2020 Alex D. Stivala, H. Colin Gallagher, David A. Rolls, Peng Wang, Garry L. Robins

Social science research increasingly benefits from statistical methods for understanding the structured nature of social life, including for social network data.

Social and Information Networks Methodology

Real-time Segmentation and Facial Skin Tones Grading

1 code implementation30 Dec 2019 Ling Luo, Dingyu Xue, Xinglong Feng, Yichun Yu, Peng Wang

Modern approaches for semantic segmentation usually pay too much attention to the accuracy of the model, and therefore it is strongly recommended to introduce cumbersome backbones, which brings heavy computation burden and memory footprint.

To Balance or Not to Balance: A Simple-yet-Effective Approach for Learning with Long-Tailed Distributions

no code implementations10 Dec 2019 Jun-Jie Zhang, Lingqiao Liu, Peng Wang, Chunhua Shen

Such imbalanced distribution causes a great challenge for learning a deep neural network, which can be boiled down into a dilemma: on the one hand, we prefer to increase the exposure of tail class samples to avoid the excessive dominance of head classes in the classifier training.

Auxiliary Learning Self-Supervised Learning

AutoRemover: Automatic Object Removal for Autonomous Driving Videos

1 code implementation28 Nov 2019 Rong Zhang, Wei Li, Peng Wang, Chenye Guan, Jin Fang, Yuhang Song, Jinhui Yu, Baoquan Chen, Weiwei Xu, Ruigang Yang

To deal with shadows, we build up an autonomous driving shadow dataset and design a deep neural network to detect shadows automatically.

Autonomous Driving Video Inpainting

CSPN++: Learning Context and Resource Aware Convolutional Spatial Propagation Networks for Depth Completion

no code implementations13 Nov 2019 Xinjing Cheng, Peng Wang, Chenye Guan, Ruigang Yang

In this paper, we propose CSPN++, which further improves its effectiveness and efficiency by learning adaptive convolutional kernel sizes and the number of iterations for the propagation, thus the context and computational resources needed at each pixel could be dynamically assigned upon requests.

Depth Completion Stereo-LiDAR Fusion

Discriminative and Robust Online Learning for Siamese Visual Tracking

1 code implementation6 Sep 2019 Jinghao Zhou, Peng Wang, Haoyang Sun

The problem of visual object tracking has traditionally been handled by variant tracking paradigms, either learning a model of the object's appearance exclusively online or matching the object with the target in an offline-trained embedding space.

Visual Object Tracking Visual Tracking

Efficient Automatic Meta Optimization Search for Few-Shot Learning

no code implementations6 Sep 2019 Xinyue Zheng, Peng Wang, Qigang Wang, Zhongchao Shi, Feiyu Xu

NAS automatically generates and evaluates meta-learner's architecture for few-shot learning problems, while the meta-learner uses meta-learning algorithm to optimize its parameters based on the distribution of learning tasks.

Few-Shot Learning Neural Architecture Search

Person Re-identification in Aerial Imagery

1 code implementation14 Aug 2019 Shizhou Zhang, Qi Zhang, Yifei Yang, Xing Wei, Peng Wang, Bingliang Jiao, Yanning Zhang

Our method can learn a discriminative and compact feature representation for ReID in aerial imagery and can be trained in an end-to-end fashion efficiently.

object-detection Object Detection +1

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

no code implementations29 Jul 2019 Damien Teney, Peng Wang, Jiewei Cao, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

One of the primary challenges faced by deep learning is the degree to which current methods exploit superficial statistics and dataset bias, rather than learning to generalise over the specific representations they have experienced.

Visual Reasoning

EPNAS: Efficient Progressive Neural Architecture Search

no code implementations7 Jul 2019 Yanqi Zhou, Peng Wang, Sercan Arik, Haonan Yu, Syed Zawad, Feng Yan, Greg Diamos

In this paper, we propose Efficient Progressive Neural Architecture Search (EPNAS), a neural architecture search (NAS) that efficiently handles large search space through a novel progressive search policy with performance prediction based on REINFORCE (Williams, 1992).

Neural Architecture Search

A Performance Evaluation of Correspondence Grouping Methods for 3D Rigid Data Matching

no code implementations5 Jul 2019 Jiaqi Yang, Ke Xian, Peng Wang, Yanning Zhang

Seeking consistent point-to-point correspondences between 3D rigid data (point clouds, meshes, or depth maps) is a fundamental problem in 3D computer vision.

3D Object Recognition Point Cloud Registration +1

Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching

no code implementations29 Jun 2019 Jiaqi Yang, Siwen Quan, Peng Wang, Yanning Zhang

The outcomes present interesting findings that may shed new light on this community and provide complementary perspectives to existing evaluations on the topic of local geometric feature description.

Object Recognition Point Cloud Registration +1

Towards End-to-End Text Spotting in Natural Scenes

no code implementations14 Jun 2019 Peng Wang, Hui Li, Chunhua Shen

Text spotting in natural scene images is of great importance for many image understanding tasks.

Image Cropping Text Spotting

Vehicle Re-identification in Aerial Imagery: Dataset and Approach

no code implementations ICCV 2019 Peng Wang, Bingliang Jiao, Lu Yang, Yifei Yang, Shizhou Zhang, Wei Wei, Yanning Zhang

It is capable of explicitly detecting discriminative parts for each specific vehicle and significantly outperforms the evaluated baselines and state-of-the-art vehicle ReID approaches.

Vehicle Re-Identification

Pixel-aware Deep Function-mixture Network for Spectral Super-Resolution

no code implementations24 Mar 2019 Lei Zhang, Zhiqiang Lang, Peng Wang, Wei Wei, Shengcai Liao, Ling Shao, Yanning Zhang

To address this problem, we propose a pixel-aware deep function-mixture network for SSR, which is composed of a new class of modules, termed function-mixture (FM) blocks.

Spectral Super-Resolution Super-Resolution

Structural Recurrent Neural Network for Traffic Speed Prediction

1 code implementation18 Feb 2019 Youngjoo Kim, Peng Wang, Lyudmila Mihaylova

We use a graph of a vehicular road network with recurrent neural networks (RNNs) to infer the interaction between adjacent road segments as well as the temporal dynamics.

Time Series Traffic Prediction

Flash: Efficient Dynamic Routing for Offchain Networks

2 code implementations14 Feb 2019 Peng Wang, Hong Xu, Xin Jin, Tao Wang

Mice payments are directly sent by looking up a routing table with a few precomputed paths to reduce probing overhead.

Networking and Internet Architecture

RPC: A Large-Scale Retail Product Checkout Dataset

no code implementations22 Jan 2019 Xiu-Shen Wei, Quan Cui, Lei Yang, Peng Wang, Lingqiao Liu

The main challenge of this problem comes from the large scale and the fine-grained nature of the product categories as well as the difficulty for collecting training images that reflect the realistic checkout scenarios due to continuous update of the products.

Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks

no code implementations CVPR 2019 Peng Wang, Qi Wu, Jiewei Cao, Chunhua Shen, Lianli Gao, Anton Van Den Hengel

Being composed of node attention component and edge attention component, the proposed graph attention mechanism explicitly represents inter-object relationships, and properties with a flexibility and power impossible with competing approaches.

Graph Attention Referring Expression +1

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

no code implementations CVPR 2019 Xibin Song, Peng Wang, Dingfu Zhou, Rui Zhu, Chenye Guan, Yuchao Dai, Hao Su, Hongdong Li, Ruigang Yang

Specifically, we first segment each car with a pre-trained Mask R-CNN, and then regress towards its 3D pose and shape based on a deformable 3D car model with or without using semantic keypoints.

3D Car Instance Understanding Autonomous Driving

Visual Question Answering as Reading Comprehension

no code implementations CVPR 2019 Hui Li, Peng Wang, Chunhua Shen, Anton Van Den Hengel

In contrast to struggling on multimodal feature fusion, in this paper, we propose to unify all the input information by natural language so as to convert VQA into a machine reading comprehension problem.

Common Sense Reasoning General Knowledge +3

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks

no code implementations24 Nov 2018 Haokui Zhang, Ying Li, Peng Wang, Yu Liu, Chunhua Shen

Different from RGB videos, depth data in RGB-D videos provide key complementary information for tristimulus visual data which potentially could achieve accuracy improvement for action recognition.

Action Recognition

Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding

1 code implementation14 Oct 2018 Chenxu Luo, Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia, Alan Yuille

Performance on the five tasks of depth estimation, optical flow estimation, odometry, moving object segmentation and scene flow estimation shows that our approach outperforms other SoTA methods.

Depth Estimation Optical Flow Estimation +2

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

1 code implementation8 Oct 2018 Yang Wang, Zhenheng Yang, Peng Wang, Yi Yang, Chenxu Luo, Wei Xu

Then the whole scene is decomposed into moving foreground and static background by comparing the estimated optical flow and rigid flow derived from the depth and ego-motion.

Motion Estimation Optical Flow Estimation

Learning Depth with Convolutional Spatial Propagation Network

1 code implementation4 Oct 2018 Xinjing Cheng, Peng Wang, Ruigang Yang

In this paper, we propose a simple yet effective convolutional spatial propagation network (CSPN) to learn the affinity matrix for various depth estimation tasks.

Depth Completion Depth Estimation +3

Towards Effective Deep Embedding for Zero-Shot Learning

no code implementations30 Aug 2018 Lei Zhang, Peng Wang, Lingqiao Liu, Chunhua Shen, Wei Wei, Yanning Zhang, Anton Van Den Hengel

Towards this goal, we present a simple but effective two-branch network to simultaneously map semantic descriptions and visual samples into a joint space, on which visual embeddings are forced to regress to their class-level semantic embeddings and the embeddings crossing classes are required to be distinguishable by a trainable classifier.

Zero-Shot Learning

A Capsule Network for Traffic Speed Prediction in Complex Road Networks

1 code implementation23 Jul 2018 Youngjoo Kim, Peng Wang, Yifei Zhu, Lyudmila Mihaylova

Traffic flow data from induction loop sensors are essentially a time series, which is also spatially related to traffic in different road segments.

Time Series Forecasting

Every Pixel Counts: Unsupervised Geometry Learning with Holistic 3D Motion Understanding

no code implementations27 Jun 2018 Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia

The four types of information, i.e., 2D flow, camera pose, segment mask and depth maps, are integrated into a differentiable holistic 3D motion parser (HMP), where per-pixel 3D motion for rigid background and moving objects are recovered.

Depth And Camera Motion Optical Flow Estimation +1

Adaptive Importance Learning for Improving Lightweight Image Super-resolution Network

no code implementations5 Jun 2018 Lei Zhang, Peng Wang, Chunhua Shen, Lingqiao Liu, Wei Wei, Yanning Zhang, Anton Van Den Hengel

In this study, we revisit this problem from an orthogonal view, and propose a novel learning strategy to maximize the pixel-wise fitting capacity of a given lightweight network architecture.

Image Super-Resolution Single Image Super Resolution

DeLS-3D: Deep Localization and Segmentation with a 3D Semantic Map

1 code implementation CVPR 2018 Peng Wang, Ruigang Yang, Binbin Cao, Wei Xu, Yuanqing Lin

The uniqueness of our design is a sensor fusion scheme which integrates camera videos, motion sensors (GPS/IMU), and a 3D semantic map in order to achieve robustness and efficiency of the system.

Autonomous Driving Pose Estimation +1

Piecewise classifier mappings: Learning fine-grained learners for novel categories with few examples

no code implementations11 May 2018 Xiu-Shen Wei, Peng Wang, Lingqiao Liu, Chunhua Shen, Jianxin Wu

To solve this problem, we propose an end-to-end trainable deep network which is inspired by the state-of-the-art fine-grained recognition model and is tailored for the FSFG task.

Few-Shot Learning Fine-Grained Image Recognition

SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting

1 code implementation9 May 2018 Yuhang Song, Chao Yang, Yeji Shen, Peng Wang, Qin Huang, C.-C. Jay Kuo

In this paper, we focus on image inpainting task, aiming at recovering the missing area of an incomplete image given the context information.

Image Inpainting Interactive Segmentation +1

Training a Binary Weight Object Detector by Knowledge Transfer for Autonomous Driving

no code implementations17 Apr 2018 Jiaolong Xu, Peng Wang, Heng Yang, Antonio M. López

Autonomous driving has harsh requirements of small model size and energy efficiency, in order to enable the embedded system to achieve real-time on-board object detection.

Autonomous Driving object-detection +2

The ApolloScape Open Dataset for Autonomous Driving and its Application

2 code implementations16 Mar 2018 Xinyu Huang, Peng Wang, Xinjing Cheng, Dingfu Zhou, Qichuan Geng, Ruigang Yang

In this paper, we provide a sensor fusion scheme integrating camera videos, consumer-grade motion sensors (GPS/IMU), and a 3D semantic map in order to achieve robust self-localization and semantic segmentation for autonomous driving.

Autonomous Driving Instance Segmentation +2

LEGO: Learning Edge with Geometry all at Once by Watching Videos

1 code implementation CVPR 2018 Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia

In our framework, the predicted depths, normals and edges are forced to be consistent all the time.

Occlusion Aware Unsupervised Learning of Optical Flow

no code implementations CVPR 2018 Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu

Especially on KITTI dataset where abundant unlabeled samples exist, our unsupervised method outperforms its counterpart trained with supervised learning.

Optical Flow Estimation

Unsupervised Learning of Geometry with Edge-aware Depth-Normal Consistency

no code implementations10 Nov 2017 Zhenheng Yang, Peng Wang, Wei Xu, Liang Zhao, Ramakant Nevatia

Learning to reconstruct depths in a single image by watching unlabeled videos via deep convolutional network (DCN) is attracting significant attention in recent years.

Depth Estimation

Fine-grained Pattern Matching Over Streaming Time Series

no code implementations27 Oct 2017 Rong Kang, Chen Wang, Peng Wang, Yuting Ding, Jian-Min Wang

Hence, we formulate a new problem, called "fine-grained pattern matching", which allows users to specify varied granularities of matching deviation to different segments of a given pattern, and fuzzy regions for adaptive breakpoints determination between consecutive segments.

Time Series