Search Results for author: Nanning Zheng

Found 140 papers, 46 papers with code

Kernel Least Mean Square with Adaptive Kernel Size

no code implementations • 23 Jan 2014 • Badong Chen, Junli Liang, Nanning Zheng, Jose C. Principe

Kernel adaptive filters (KAF) are a class of powerful nonlinear filters developed in Reproducing Kernel Hilbert Space (RKHS).

Time Series Time Series Prediction

Paper
Add Code

Salient Object Detection: A Discriminative Regional Feature Integration Approach

no code implementations • CVPR 2013 • Huaizu Jiang, Zejian yuan, Ming-Ming Cheng, Yihong Gong, Nanning Zheng, Jingdong Wang

Our method, which is based on multi-level image segmentation, utilizes the supervised learning approach to map the regional feature vector to a saliency score.

Image Segmentation Object +4

Paper
Add Code

Generalized Correntropy for Robust Adaptive Filtering

no code implementations • 12 Apr 2015 • Badong Chen, Lei Xing, Haiquan Zhao, Nanning Zheng, José C. Príncipe

In this work, we propose a generalized correntropy that adopts the generalized Gaussian density (GGD) function as the kernel (not necessarily a Mercer kernel), and present some important properties.

Paper
Add Code

Similarity Learning on an Explicit Polynomial Kernel Feature Map for Person Re-Identification

no code implementations • CVPR 2015 • Dapeng Chen, Zejian yuan, Gang Hua, Nanning Zheng, Jingdong Wang

We follow the learning-to-rank methodology and learn a similarity function to maximize the difference between the similarity scores of matched and unmatched images for a same person.

Learning-To-Rank Patch Matching +1

Paper
Add Code

Saturation-Preserving Specular Reflection Separation

no code implementations • CVPR 2015 • Yuanliu Liu, Zejian yuan, Nanning Zheng, Yang Wu

Specular reflection generally decreases the saturation of surface colors, which will be possibly confused with other colors that have the same hue but lower saturation.

Paper
Add Code

Contour Guided Hierarchical Model for Shape Matching

no code implementations • ICCV 2015 • Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng

For the purpose, we divide the shape template into overlapped parts and model the matching through a part-based layered structure that uses the latent variable to constrain parts' deformation.

Paper
Add Code

Illumination Robust Color Naming via Label Propagation

no code implementations • ICCV 2015 • Yuanliu liu, Zejian yuan, Badong Chen, Jianru Xue, Nanning Zheng

In this paper we address the problem of inferring the color composition of the intrinsic reflectance of objects, where the shadows and highlights may change the observed color dramatically.

Image Retrieval Retrieval

Paper
Add Code

Correntropy Maximization via ADMM - Application to Robust Hyperspectral Unmixing

no code implementations • 4 Feb 2016 • Fei Zhu, Abderrahim Halimi, Paul Honeine, Badong Chen, Nanning Zheng

In hyperspectral images, some spectral bands suffer from low signal-to-noise ratio due to noisy acquisition and atmospheric effects, thus requiring robust techniques for the unmixing problem.

Hyperspectral Unmixing

Paper
Add Code

Counting Grid Aggregation for Event Retrieval and Recognition

no code implementations • 5 Apr 2016 • Zhanning Gao, Gang Hua, Dongqing Zhang, Jianru Xue, Nanning Zheng

Event retrieval and recognition in a large corpus of videos necessitates a holistic fixed-size visual representation at the video clip level that is comprehensive, compact, and yet discriminative.

Retrieval

Paper
Add Code

Person Re-Identification by Multi-Channel Parts-Based CNN With Improved Triplet Loss Function

no code implementations • CVPR 2016 • De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, Nanning Zheng

Person re-identification across cameras remains a very challenging problem, especially when there are no overlapping fields of view between cameras.

Person Re-Identification

Paper
Add Code

Similarity Learning With Spatial Constraints for Person Re-Identification

no code implementations • CVPR 2016 • Dapeng Chen, Zejian yuan, Badong Chen, Nanning Zheng

We therefore learn a novel similarity function, which consists of multiple sub-similarity measurements with each taking in charge of a subregion.

Person Re-Identification

Paper
Add Code

Kernel Risk-Sensitive Loss: Definition, Properties and Application to Robust Adaptive Filtering

no code implementations • 1 Aug 2016 • Badong Chen, Lei Xing, Bin Xu, Haiquan Zhao, Nanning Zheng, Jose C. Principe

Nonlinear similarity measures defined in kernel space, such as correntropy, can extract higher-order statistics of data and offer potentially significant performance improvement over their linear counterparts especially in non-Gaussian signal processing and machine learning.

Paper
Add Code

Robust Learning with Kernel Mean p-Power Error Loss

no code implementations • 21 Dec 2016 • Badong Chen, Lei Xing, Xin Wang, Jing Qin, Nanning Zheng

Correntropy is a second order statistical measure in kernel space, which has been successfully applied in robust learning and signal processing.

Paper
Add Code

Brain Inspired Cognitive Model with Attention for Self-Driving Cars

no code implementations • 18 Feb 2017 • Shitao Chen, Songyi Zhang, Jinghao Shang, Badong Chen, Nanning Zheng

Perception-driven approach and end-to-end system are two major vision-based frameworks for self-driving cars.

Autonomous Driving Decision Making +1

Paper
Add Code

Single Image Super-resolution via a Lightweight Residual Convolutional Neural Network

no code implementations • 23 Mar 2017 • Yudong Liang, Ze Yang, Kai Zhang, Yihui He, Jinjun Wang, Nanning Zheng

To tackle with the second problem, a lightweight CNN architecture which has carefully designed width, depth and skip connections was proposed.

Image Super-Resolution SSIM

Paper
Add Code

View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data

1 code implementation • ICCV 2017 • Pengfei Zhang, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jianru Xue, Nanning Zheng

Rather than re-positioning the skeletons based on a human defined prior criterion, we design a view adaptive recurrent neural network (RNN) with LSTM architecture, which enables the network itself to adapt to the most suitable observation viewpoints from end to end.

Ranked #6 on Skeleton Based Action Recognition on SYSU 3D

Action Recognition Skeleton Based Action Recognition +1

122

Paper
Code

Single Image Super Resolution - When Model Adaptation Matters

no code implementations • 31 Mar 2017 • Yudong Liang, Radu Timofte, Jinjun Wang, Yihong Gong, Nanning Zheng

The internal contents of the low resolution input image is neglected with deep modeling despite the earlier works showing the power of using such internal priors.

Image Super-Resolution

Paper
Add Code

Detecting Drivable Area for Self-driving Cars: An Unsupervised Approach

no code implementations • 1 May 2017 • Ziyi Liu, Siyu Yu, Xiao Wang, Nanning Zheng

Experiments show that our unsupervised approach is efficient and robust for detecting drivable area for self-driving cars.

Self-Driving Cars

Paper
Add Code

Point to Set Similarity Based Deep Feature Learning for Person Re-Identification

no code implementations • CVPR 2017 • Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, Nanning Zheng

One of the key issues for deep learning based person Re-ID is the selection of proper similarity comparison criteria, and the performance of learned features using existing criterion based on pairwise similarity is still limited, because only P2P distances are mostly considered.

Person Re-Identification

Paper
Add Code

ER3: A Unified Framework for Event Retrieval, Recognition and Recounting

no code implementations • CVPR 2017 • Zhanning Gao, Gang Hua, Dong-Qing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng

We develop a unified framework for complex event retrieval, recognition and recounting.

Language Modelling Retrieval

Paper
Add Code

Deep Feature Learning via Structured Graph Laplacian Embedding for Person Re-Identification

no code implementations • 25 Jul 2017 • De Cheng, Yihong Gong, Zhihui Li, Weiwei Shi, Alexander G. Hauptmann, Nanning Zheng

The proposed method can take full advantages of the structured distance relationships among these training samples, with the constructed complete graph.

Person Re-Identification

Paper
Add Code

Associations among Image Assessments as Cost Functions in Linear Decomposition: MSE, SSIM, and Correlation Coefficient

no code implementations • 4 Aug 2017 • Jianji Wang, Nanning Zheng, Badong Chen, Jose C. Principe

Moreover, for a target vector, the ratio of the corresponding affine parameters in the MSE-based linear decomposition scheme and the SSIM-based scheme is a constant, which is just the value of PCC between the target vector and its estimated vector.

SSIM

Paper
Add Code

Large Margin Learning in Set to Set Similarity Comparison for Person Re-identification

no code implementations • 18 Aug 2017 • Sanping Zhou, Jinjun Wang, Rui Shi, Qiqi Hou, Yihong Gong, Nanning Zheng

The class-identity term keeps the intra-class samples within each camera view gathering together, the relative distance term maximizes the distance between the intra-class class set and inter-class set across different camera views, and the regularization term smoothness the parameters of deep convolutional neural network (CNN).

Person Re-Identification Retrieval

Paper
Add Code

Deep Self-Paced Learning for Person Re-Identification

no code implementations • 7 Oct 2017 • Sanping Zhou, Jinjun Wang, Deyu Meng, Xiaomeng Xin, Yubing Li, Yihong Gong, Nanning Zheng

In this paper, we propose a novel deep self-paced learning (DSPL) algorithm to alleviate this problem, in which we apply a self-paced constraint and symmetric regularization to help the relative distance metric training the deep neural network, so as to learn the stable and discriminative features for person Re-ID.

Person Re-Identification

Paper
Add Code

Quantized Minimum Error Entropy Criterion

no code implementations • 11 Oct 2017 • Badong Chen, Lei Xing, Nanning Zheng, Jose C. Príncipe

Comparing with traditional learning criteria, such as mean square error (MSE), the minimum error entropy (MEE) criterion is superior in nonlinear and non-Gaussian signal processing and machine learning.

Quantization

Paper
Add Code

A Novel Brain Decoding Method: a Correlation Network Framework for Revealing Brain Connections

no code implementations • 1 Dec 2017 • Siyu Yu, Nanning Zheng, Yongqiang Ma, Hao Wu, Badong Chen

Analyzing the correlations of collected data from human brain activities and representing activity patterns are two problems in brain decoding based on functional magnetic resonance imaging (fMRI) signals.

Brain Decoding

Paper
Add Code

Augmented Space Linear Model

no code implementations • 1 Feb 2018 • Zhengda Qin, Badong Chen, Nanning Zheng, Jose C. Principe

In this paper, we propose a linear model called Augmented Space Linear Model (ASLM), which uses the full joint space of input and desired signal as the projection space and approaches the performance of nonlinear models.

Computational Efficiency

Paper
Add Code

Fully Convolutional Grasp Detection Network with Oriented Anchor Box

no code implementations • 6 Mar 2018 • Xinwen Zhou, Xuguang Lan, Hanbo Zhang, Zhiqiang Tian, Yang Zhang, Nanning Zheng

The feature extractor is a deep convolutional neural network.

Paper
Add Code

Feature Selective Small Object Detection via Knowledge-based Recurrent Attentive Neural Network

no code implementations • 13 Mar 2018 • Kai Yi, Zhiqiang Jian, Shitao Chen, Nanning Zheng

At present, the performance of deep neural network in general object detection is comparable to or even surpasses that of human beings.

Autonomous Driving Decision Making +4

Paper
Add Code

Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition

no code implementations • 19 Mar 2018 • Jinliang Zang, Le Wang, Ziyi Liu, Qilin Zhang, Zhenxing Niu, Gang Hua, Nanning Zheng

Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs).

Action Recognition Temporal Action Localization

Paper
Add Code

View Adaptive Neural Networks for High Performance Skeleton-based Human Action Recognition

2 code implementations • 20 Apr 2018 • Pengfei Zhang, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jianru Xue, Nanning Zheng

In order to alleviate the effects of view variations, this paper introduces a novel view adaptation scheme, which automatically determines the virtual observation viewpoints in a learning based data driven manner.

Ranked #1 on Skeleton Based Action Recognition on UWA3D

Action Recognition Skeleton Based Action Recognition +1

122

Paper
Code

Kernelized Subspace Pooling for Deep Local Descriptors

no code implementations • CVPR 2018 • Xing Wei, Yue Zhang, Yihong Gong, Nanning Zheng

Experimental results on several patch matching benchmarks show that our method outperforms the state-of-the-arts significantly.

Patch Matching

Paper
Add Code

Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks

no code implementations • CVPR 2018 • Ping Wei, Yang Liu, Tianmin Shu, Nanning Zheng, Song-Chun Zhu

We built a new video dataset of tasks, intentions, and attention.

Paper
Add Code

Discriminative Feature Learning with Foreground Attention for Person Re-Identification

no code implementations • 4 Jul 2018 • Sanping Zhou, Jinjun Wang, Deyu Meng, Yudong Liang, Yihong Gong, Nanning Zheng

Specifically, a novel foreground attentive subnetwork is designed to drive the network's attention, in which a decoder network is used to reconstruct the binary mask by using a novel local regression loss function, and an encoder network is regularized by the decoder network to focus its attention on the foreground persons.

Multi-Task Learning Person Re-Identification

Paper
Add Code

Adding Attentiveness to the Neurons in Recurrent Neural Networks

no code implementations • ECCV 2018 • Pengfei Zhang, Jianru Xue, Cuiling Lan, Wen-Jun Zeng, Zhanning Gao, Nanning Zheng

We propose adding a simple yet effective Element-wiseAttention Gate (EleAttG) to an RNN block (e. g., all RNN neurons in a network layer) that empowers the RNN neurons to have the attentiveness capability.

Ranked #98 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

ROI-based Robotic Grasp Detection for Object Overlapping Scenes

no code implementations • 30 Aug 2018 • Hanbo Zhang, Xuguang Lan, Site Bai, Xinwen Zhou, Zhiqiang Tian, Nanning Zheng

Experimental results demonstrate that ROI-GD performs much better in object overlapping scenes and at the meantime, remains comparable with state-of-the-art grasp detection algorithms on Cornell Grasp Dataset and Jacquard Dataset.

Robotics

Paper
Add Code

Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification

no code implementations • ECCV 2018 • Xing Wei, Yue Zhang, Yihong Gong, Jiawei Zhang, Nanning Zheng

The reason is that the bilinear feature matrix is sensitive to the magnitudes and correlations of local CNN feature elements which can be measured by its singular values.

Fine-Grained Image Classification Fine-Grained Visual Recognition +1

Paper
Add Code

Transductive Semi-Supervised Deep Learning using Min-Max Features

no code implementations • ECCV 2018 • Weiwei Shi, Yihong Gong, Chris Ding, Zhiheng MaXiaoyu Tao, Nanning Zheng

In this paper, we propose Transductive Semi-Supervised Deep Learning (TSSDL) method that is effective for training Deep Convolutional Neural Network (DCNN) models.

General Classification Image Classification +1

Paper
Add Code

A Real-time Robotic Grasp Approach with Oriented Anchor Box

no code implementations • 8 Sep 2018 • Hanbo Zhang, Xinwen Zhou, Xuguang Lan, Jin Li, Zhiqiang Tian, Nanning Zheng

The main component of our approach is a grasp detection network with oriented anchor boxes as detection priors.

Robotics

Paper
Add Code

A Multi-task Convolutional Neural Network for Autonomous Robotic Grasping in Object Stacking Scenes

no code implementations • 19 Sep 2018 • Hanbo Zhang, Xuguang Lan, Site Bai, Lipeng Wan, Chenjie Yang, Nanning Zheng

Autonomous robotic grasping plays an important role in intelligent robotics.

Robotics

Paper
Add Code

Consistency-aware Shading Orders Selective Fusion for Intrinsic Image Decomposition

no code implementations • 23 Oct 2018 • Yuanliu Liu, Ang Li, Zejian yuan, Badong Chen, Nanning Zheng

We propose a Consistency-aware Selective Fusion (CSF) to integrate the pairwise orders into a globally consistent order.

Intrinsic Image Decomposition

Paper
Add Code

SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory Prediction

1 code implementation • CVPR 2019 • Pu Zhang, Wanli Ouyang, Pengfei Zhang, Jianru Xue, Nanning Zheng

In order to address this issue, we propose a data-driven state refinement module for LSTM network (SR-LSTM), which activates the utilization of the current intention of neighbors, and jointly and iteratively refines the current states of all participants in the crowd through a message passing mechanism.

Pedestrian Trajectory Prediction Trajectory Prediction

110

Paper
Code

Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition

2 code implementations • CVPR 2020 • Pengfei Zhang, Cuiling Lan, Wen-Jun Zeng, Junliang Xing, Jianru Xue, Nanning Zheng

Skeleton-based human action recognition has attracted great interest thanks to the easy accessibility of the human skeleton data.

Ranked #1 on Skeleton Based Action Recognition on SYSU 3D

Action Recognition Computational Efficiency +2

176

Paper
Code

Hindsight Trust Region Policy Optimization

1 code implementation • 29 Jul 2019 • Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng

We propose \emph{Hindsight Trust Region Policy Optimization}(HTRPO), a new RL algorithm that extends the highly successful TRPO algorithm with \emph{hindsight} to tackle the challenge of sparse rewards.

Atari Games Policy Gradient Methods +1

Paper
Code

An encoding framework with brain inner state for natural image identification

no code implementations • 22 Aug 2019 • Hao Wu, Ziyu Zhu, Jiayi Wang, Nanning Zheng, Badong Chen

The framework comprises two parts: forward encoding model that deals with visual stimuli and inner state model that captures influence from intrinsic connections in the brain.

Brain Decoding

Paper
Add Code

EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks

no code implementations • 3 Sep 2019 • Pengfei Zhang, Jianru Xue, Cuiling Lan, Wen-Jun Zeng, Zhanning Gao, Nanning Zheng

For an RNN block, an EleAttG is used for adaptively modulating the input by assigning different levels of importance, i. e., attention, to each element/dimension of the input.

Ranked #3 on Skeleton Based Action Recognition on SYSU 3D

Action Recognition Gesture Recognition +1

Paper
Add Code

Learnable Tree Filter for Structure-preserving Feature Transform

1 code implementation • NeurIPS 2019 • Lin Song, Yanwei Li, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng

To this end, tree filtering modules are embedded to formulate a unified framework for semantic segmentation.

Semantic Segmentation

140

Paper
Code

BEYOND SUPERVISED LEARNING: RECOGNIZING UNSEEN ATTRIBUTE-OBJECT PAIRS WITH VISION-LANGUAGE FUSION AND ATTRACTOR NETWORKS

no code implementations • ICLR 2020 • Hui Chen, Zhixiong Nan, Nanning Zheng

This paper handles a challenging problem, unseen attribute-object pair recognition, which asks a model to simultaneously recognize the attribute type and the object type of a given image while this attribute-object pair is not included in the training set.

Attribute Object

Paper
Add Code

Visual Semantic SLAM with Landmarks for Large-Scale Outdoor Environment

1 code implementation • 4 Jan 2020 • Zirui Zhao, Yijun Mao, Yan Ding, Pengju Ren, Nanning Zheng

Semantic SLAM is an important field in autonomous driving and intelligent agents, which can enable robots to achieve high-level navigation tasks, obtain simple cognition or reasoning ability and achieve language-based human-robot-interaction.

Autonomous Driving Semantic Segmentation +1

282

Paper
Code

REGNet: REgion-based Grasp Network for End-to-end Grasp Detection in Point Clouds

1 code implementation • 28 Feb 2020 • Binglei Zhao, Hanbo Zhang, Xuguang Lan, Haoyu Wang, Zhiqiang Tian, Nanning Zheng

Reliable robotic grasping in unstructured environments is a crucial but challenging task.

Robotics

Paper
Code

Compositional Generalization by Learning Analytical Expressions

1 code implementation • NeurIPS 2020 • Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang

Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily.

Hierarchical Reinforcement Learning

359

Paper
Code

Traffic Agent Trajectory Prediction Using Social Convolution and Attention Mechanism

no code implementations • 6 Jul 2020 • Tao Yang, Zhixiong Nan, He Zhang, Shitao Chen, Nanning Zheng

In this paper, we propose a model to predict the trajectories of target agents around an autonomous vehicle.

Autonomous Driving Decision Making +1

Paper
Add Code

A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning

2 code implementations • ECCV 2020 • Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng

Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.

Generalized Zero-Shot Learning

Paper
Code

Conditional Uncorrelation and Efficient Non-approximate Subset Selection in Sparse Regression

no code implementations • 8 Sep 2020 • Jianji Wang, Qi Liu, Shupei Zhang, Nanning Zheng, Fei-Yue Wang

By the proposed method, the computational complexity is reduced from $O(\frac{1}{6}{k^3}+mk^2+mkd)$ to $O(\frac{1}{6}{k^3}+\frac{1}{2}mk^2)$ for each candidate subset in sparse regression.

regression

Paper
Add Code

Learning to Infer Unseen Attribute-Object Compositions

no code implementations • 27 Oct 2020 • Hui Chen, Zhixiong Nan, Jingjing Jiang, Nanning Zheng

The composition recognition of unseen attribute-object is critical to make machines learn to decompose and compose complex concepts like people.

Attribute Object

Paper
Add Code

Multi-agent Policy Optimization with Approximatively Synchronous Advantage Estimation

no code implementations • 7 Dec 2020 • Lipeng Wan, Xuwei Song, Xuguang Lan, Nanning Zheng

General methods for policy based multi-agent reinforcement learning to solve the challenge introduce differentiate value functions or advantage functions for individual agents.

Multi-agent Reinforcement Learning Starcraft

Paper
Add Code

End-to-End Object Detection with Fully Convolutional Network

1 code implementation • CVPR 2021 • JianFeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng

Mainstream object detectors based on the fully convolutional network has achieved impressive performance.

object-detection Object Detection

489

Paper
Code

Fine-Grained Dynamic Head for Object Detection

1 code implementation • NeurIPS 2020 • Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng

To this end, we propose a fine-grained dynamic head to conditionally select a pixel-level combination of FPN features from different scales for each instance, which further releases the ability of multi-scale feature representation.

Object object-detection +1

Paper
Code

Rethinking Learnable Tree Filter for Generic Feature Transform

1 code implementation • NeurIPS 2020 • Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Xiangyu Zhang, Hongbin Sun, Jian Sun, Nanning Zheng

The Learnable Tree Filter presents a remarkable approach to model structure-preserving relations for semantic segmentation.

Instance Segmentation object-detection +3

Paper
Code

Meta Pairwise Relationship Distillation for Unsupervised Person Re-Identification

1 code implementation • ICCV 2021 • Haoxuanye Ji, Le Wang, Sanping Zhou, Wei Tang, Nanning Zheng, Gang Hua

Unsupervised person re-identification (Re-ID) remains challenging due to the lack of ground-truth labels.

Unsupervised Person Re-Identification

Paper
Code

Practical Order Attack in Deep Ranking

no code implementations • 1 Jan 2021 • Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Xu Yinghui, Nanning Zheng, Gang Hua

The objective of this paper is to formalize and practically implement a new adversarial attack against deep ranking systems, i. e., the Order Attack, which covertly alters the relative order of a selected set of candidates according to a permutation vector predefined by the attacker, with only limited interference to other unrelated candidates.

Adversarial Attack Image Retrieval

Paper
Add Code

Towards Building A Group-based Unsupervised Representation Disentanglement Framework

1 code implementation • ICLR 2022 • Tao Yang, Xuanchi Ren, Yuwang Wang, Wenjun Zeng, Nanning Zheng

We then propose a model, based on existing VAE-based methods, to tackle the unsupervised learning problem of the framework.

Disentanglement

Paper
Code

A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN

no code implementations • 1 Mar 2021 • He Zhang, Zhixiong Nan, Tao Yang, Yifan Liu, Nanning Zheng

In autonomous driving, perceiving the driving behaviors of surrounding agents is important for the ego-vehicle to make a reasonable decision.

Autonomous Driving

Paper
Add Code

Practical Relative Order Attack in Deep Ranking

2 code implementations • ICCV 2021 • Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Yinghui Xu, Nanning Zheng, Gang Hua

In this paper, we formulate a new adversarial attack against deep ranking systems, i. e., the Order Attack, which covertly alters the relative order among a selected set of candidates according to an attacker-specified permutation, with limited interference to other unrelated candidates.

Adversarial Attack

Paper
Code

ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization

no code implementations • 28 Mar 2021 • Ziyi Liu, Le Wang, Qilin Zhang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

In this paper, we introduce an Action-Context Separation Network (ACSNet) that explicitly takes into account context for accurate action localization.

Ranked #7 on Weakly Supervised Action Localization on THUMOS’14

Video Polyp Segmentation Weakly Supervised Action Localization +2

Paper
Add Code

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

no code implementations • 30 Mar 2021 • Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua

To address this challenge, we introduce a framework that learns two feature subspaces respectively for actions and their context.

Action Recognition Weakly-supervised Temporal Action Localization +1

Paper
Add Code

REGRAD: A Large-Scale Relational Grasp Dataset for Safe and Object-Specific Robotic Grasping in Clutter

1 code implementation • 29 Apr 2021 • Hanbo Zhang, Deyu Yang, Han Wang, Binglei Zhao, Xuguang Lan, Jishiyu Ding, Nanning Zheng

In this paper, we present a new dataset named REGRAD for the learning of relationships among objects and grasps.

Robotic Grasping

Paper
Code

Adversarial Attack and Defense in Deep Ranking

1 code implementation • 7 Jun 2021 • Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Nanning Zheng, Gang Hua

In this paper, we propose two attacks against deep ranking systems, i. e., Candidate Attack and Query Attack, that can raise or lower the rank of chosen candidates by adversarial perturbations.

Adversarial Attack Adversarial Robustness

Paper
Code

Video Imprint

no code implementations • 7 Jun 2021 • Zhanning Gao, Le Wang, Nebojsa Jojic, Zhenxing Niu, Nanning Zheng, Gang Hua

In the proposed framework, a dedicated feature alignment module is incorporated for redundancy removal across frames to produce the tensor representation, i. e., the video imprint.

Language Modelling Retrieval

Paper
Add Code

Learning Algebraic Recombination for Compositional Generalization

2 code implementations • Findings (ACL) 2021 • Chenyao Liu, Shengnan An, Zeqi Lin, Qian Liu, Bei Chen, Jian-Guang Lou, Lijie Wen, Nanning Zheng, Dongmei Zhang

In this paper, we propose LeAR, an end-to-end neural model to learn algebraic recombination for compositional generalization.

Ranked #2 on Semantic Parsing on CFQ

Semantic Parsing

359

Paper
Code

Probabilistic Human Motion Prediction via A Bayesian Neural Network

no code implementations • 14 Jul 2021 • Jie Xu, Xingyu Chen, Xuguang Lan, Nanning Zheng

The experimental results show that our approach makes the interaction more efficient and safer.

Human motion prediction motion prediction

Paper
Add Code

X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question Answering

1 code implementation • 24 Jul 2021 • Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng

In this paper, we formulate OOD generalization in VQA as a compositional generalization problem and propose a graph generative modeling-based training scheme (X-GGM) to implicitly model the problem.

Attribute Out-of-Distribution Generalization +2

Paper
Code

Enriching Local and Global Contexts for Temporal Action Localization

1 code implementation • ICCV 2021 • Zixin Zhu, Wei Tang, Le Wang, Nanning Zheng, Gang Hua

We explore two existing models to be the P-Net in our experiments.

Action Classification Retrieval +2

Paper
Code

Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction

1 code implementation • ICCV 2021 • Fang Zheng, Le Wang, Sanping Zhou, Wei Tang, Zhenxing Niu, Nanning Zheng, Gang Hua

Specifically, the proposed unlimited neighborhood interaction module generates the fused-features of all agents involved in an interaction simultaneously, which is adaptive to any number of agents and any range of interaction area.

Graph Attention Trajectory Prediction

Paper
Code

Model-based Decision Making with Imagination for Autonomous Parking

1 code implementation • 25 Aug 2021 • Ziyue Feng, Yu Chen, Shitao Chen, Nanning Zheng

The proposed algorithm consists of three parts: an imaginative model for anticipating results before parking, an improved rapid-exploring random tree (RRT) for planning a feasible trajectory from a given start point to a parking lot, and a path smoothing module for optimizing the efficiency of parking tasks.

Autonomous Driving Decision Making

Paper
Code

INVIGORATE: Interactive Visual Grounding and Grasping in Clutter

no code implementations • 25 Aug 2021 • Hanbo Zhang, Yunfan Lu, Cunjun Yu, David Hsu, Xuguang Lan, Nanning Zheng

This paper presents INVIGORATE, a robot system that interacts with human through natural language and grasps a specified object in clutter.

Blocking Object +5

Paper
Add Code

LGD: Label-guided Self-distillation for Object Detection

1 code implementation • 23 Sep 2021 • Peizhen Zhang, Zijian Kang, Tong Yang, Xiangyu Zhang, Nanning Zheng, Jian Sun

Instead, we generate an instructive knowledge based only on student representations and regular labels.

Instance Segmentation Object +4

Paper
Code

Foreground-attention in neural decoding: Guiding Loop-Enc-Dec to reconstruct visual stimulus images from fMRI

no code implementations • 29 Sep 2021 • Kai Chen, Yongqiang Ma, Mingyang Sheng, Nanning Zheng

Inspired by the mechanism of human visual attention, in this paper, we propose a novel method of reconstructing visual stimulus images, which first decodes the distribution of visual attention from fMRI, and then reconstructs the visual images guided by visual attention.

Image Reconstruction

Paper
Add Code

Improved Drug-target Interaction Prediction with Intermolecular Graph Transformer

no code implementations • 14 Oct 2021 • Siyuan Liu, Yusong Wang, Tong Wang, Yifan Deng, Liang He, Bin Shao, Jian Yin, Nanning Zheng, Tie-Yan Liu

The identification of active binding drugs for target proteins (termed as drug-target interaction prediction) is the key challenge in virtual screening, which plays an essential role in drug discovery.

Drug Discovery Molecular Docking +1

Paper
Add Code

Instance-Conditional Knowledge Distillation for Object Detection

1 code implementation • NeurIPS 2021 • Zijian Kang, Peizhen Zhang, Xiangyu Zhang, Jian Sun, Nanning Zheng

Knowledge distillation has shown great success in classification, however, it is still challenging for detection.

Image Classification Knowledge Distillation +3

Paper
Code

Multi-Scale Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition

no code implementations • 7 Nov 2021 • Pengfei Zhang, Cuiling Lan, Wenjun Zeng, Junliang Xing, Jianru Xue, Nanning Zheng

Skeleton data is of low dimension.

Action Recognition Skeleton Based Action Recognition +1

Paper
Add Code

LiVLR: A Lightweight Visual-Linguistic Reasoning Framework for Video Question Answering

1 code implementation • 29 Nov 2021 • Jingjing Jiang, Ziyi Liu, Nanning Zheng

Video Question Answering (VideoQA), aiming to correctly answer the given question based on understanding multi-modal video content, is challenging due to the rich video content.

Question Answering Video Question Answering +2

Paper
Code

Co-evolution Transformer for Protein Contact Prediction

1 code implementation • NeurIPS 2021 • He Zhang, Fusong Ju, Jianwei Zhu, Liang He, Bin Shao, Nanning Zheng, Tie-Yan Liu

These methods generally derive coevolutionary features by aggregating the learned residue representations from individual sequences with equal weights, which is inconsistent with the premise that residue co-evolutions are a reflection of collective covariation patterns of numerous homologous proteins.

Paper
Code

TransVPR: Transformer-based place recognition with multi-level attention aggregation

no code implementations • CVPR 2022 • Ruotong Wang, Yanqing Shen, Weiliang Zuo, Sanping Zhou, Nanning Zheng

In addition, the output tokens from Transformer layers filtered by the fused attention mask are considered as key-patch descriptors, which are used to perform spatial matching to re-rank the candidates retrieved by the global image features.

Autonomous Driving Visual Place Recognition

Paper
Add Code

Trajectory Forecasting from Detection with Uncertainty-Aware Motion Encoding

no code implementations • 3 Feb 2022 • Pu Zhang, Lei Bai, Jianru Xue, Jianwu Fang, Nanning Zheng, Wanli Ouyang

Trajectories obtained from object detection and tracking are inevitably noisy, which could cause serious forecasting errors to predictors built on ground truth trajectories.

object-detection Object Detection +1

Paper
Add Code

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models

no code implementations • 7 Mar 2022 • Shengnan An, Yifei Li, Zeqi Lin, Qian Liu, Bei Chen, Qiang Fu, Weizhu Chen, Nanning Zheng, Jian-Guang Lou

This motivates us to propose input-tuning, which fine-tunes both the continuous prompts and the input representations, leading to a more effective way to adapt unfamiliar inputs to frozen PLMs.

Language Modelling Natural Language Understanding +1

Paper
Add Code

Test-time Batch Normalization

no code implementations • 20 May 2022 • Tao Yang, Shenglong Zhou, Yuwang Wang, Yan Lu, Nanning Zheng

Deep neural networks often suffer the data distribution shift between training and testing, and the batch statistics are observed to reflect the shift.

Domain Generalization

Paper
Add Code

Visual Concepts Tokenization

2 code implementations • 20 May 2022 • Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng

We further propose a Concept Disentangling Loss to facilitate that different concept tokens represent independent visual concepts.

Representation Learning

Paper
Code

Social Interpretable Tree for Pedestrian Trajectory Prediction

1 code implementation • 26 May 2022 • Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Fang Zheng, Nanning Zheng, Gang Hua

Understanding the multiple socially-acceptable future behaviors is an essential task for many vision applications.

Pedestrian Trajectory Prediction Trajectory Prediction

Paper
Code

Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization

no code implementations • CVPR 2022 • Kun Xia, Le Wang, Sanping Zhou, Nanning Zheng, Wei Tang

The main challenge of Temporal Action Localization is to retrieve subtle human actions from various co-occurring ingredients, e. g., context and background, in an untrimmed video.

Temporal Action Localization

Paper
Add Code

DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection

1 code implementation • 22 Jul 2022 • Jinrong Yang, Lin Song, Songtao Liu, Weixin Mao, Zeming Li, Xiaoping Li, Hongbin Sun, Jian Sun, Nanning Zheng

Many point-based 3D detectors adopt point-feature sampling strategies to drop some points for efficient inference.

3D Object Detection object-detection

Paper
Code

Correlation Information Bottleneck: Towards Adapting Pretrained Multimodal Models for Robust Visual Question Answering

1 code implementation • 14 Sep 2022 • Jingjing Jiang, Ziyi Liu, Nanning Zheng

In this paper, we aim to improve input robustness from an information bottleneck perspective when adapting pretrained VLMs to the downstream VQA task.

Adversarial Robustness Question Answering +1

Paper
Code

Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios

no code implementations • 10 Oct 2022 • Xingyu Chen, Jianru Xue, Jianwu Fang, Yuxin Pan, Nanning Zheng

In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU.

Object object-detection +1

Paper
Add Code

Could Giant Pretrained Image Models Extract Universal Representations?

no code implementations • 3 Nov 2022 • Yutong Lin, Ze Liu, Zheng Zhang, Han Hu, Nanning Zheng, Stephen Lin, Yue Cao

In this paper, we present a study of frozen pretrained models when applied to diverse and representative computer vision tasks, including object detection, semantic segmentation and video action recognition.

Ranked #3 on Action Recognition In Videos on Kinetics-400

Action Recognition In Videos Instance Segmentation +5

Paper
Add Code

Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning

no code implementations • 22 Nov 2022 • Lipeng Wan, Zeyang Liu, Xingyu Chen, Xuguang Lan, Nanning Zheng

To ensure optimal consistency, the optimal node is required to be the unique STN.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition

no code implementations • CVPR 2023 • Yanqing Shen, Sanping Zhou, Jingwen Fu, Ruotong Wang, Shitao Chen, Nanning Zheng

In this paper, we propose StructVPR, a novel training architecture for VPR, to enhance structural knowledge in RGB global features and thus improve feature stability in a constantly changing environment.

Image Retrieval Knowledge Distillation +3

Paper
Add Code

Object-fabrication Targeted Attack for Object Detection

no code implementations • 13 Dec 2022 • Xuchong Zhang, Changfeng Sun, Haoliang Han, Hang Wang, Hongbin Sun, Nanning Zheng

Evaluation results demonstrate that, the proposed object-fabrication targeted attack mode and the corresponding targeted feature space attack method show significant improvements in terms of image-specific attack, universal performance and generalization capability, compared with the previous targeted attack for object detection.

Adversarial Attack Object +2

Paper
Add Code

DETR Does Not Need Multi-Scale or Locality Design

1 code implementation • ICCV 2023 • Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu

This paper presents an improved DETR detector that maintains a "plain" nature: using a single-scale feature map and global cross-attention calculations without specific locality constraints, in contrast to previous leading DETR-based detectors that reintroduce architectural inductive biases of multi-scale and locality into the decoder.

182

Paper
Code

Inverse Compositional Learning for Weakly-supervised Relation Grounding

no code implementations • ICCV 2023 • Huan Li, Ping Wei, Zeyu Ma, Nanning Zheng

In this study, we introduce a novel approach called inverse compositional learning (ICL) for weakly-supervised video relation grounding.

Relation Video Understanding

Paper
Add Code

Dynamic Grained Encoder for Vision Transformers

1 code implementation • NeurIPS 2021 • Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng

Specifically, we propose a Dynamic Grained Encoder for vision transformers, which can adaptively assign a suitable number of queries to each spatial region.

Image Classification Language Modelling +2

Paper
Code

Does Deep Learning Learn to Abstract? A Systematic Probing Framework

1 code implementation • 23 Feb 2023 • Shengnan An, Zeqi Lin, Bei Chen, Qiang Fu, Nanning Zheng, Jian-Guang Lou

Abstraction is a desirable capability for deep learning models, which means to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context.

359

Paper
Code

BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP for Generic Natural Visual Stimulus Decoding

1 code implementation • 25 Feb 2023 • Yulong Liu, Yongqiang Ma, Wei Zhou, Guibo Zhu, Nanning Zheng

Our experiments show that this combination can boost the decoding model's performance on certain tasks like fMRI-text matching and fMRI-to-image generation.

Brain Decoding Image Generation +3

Paper
Code

MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering

1 code implementation • CVPR 2023 • Jingjing Jiang, Nanning Zheng

In this paper, we propose MixPHM, a redundancy-aware parameter-efficient tuning method that outperforms full finetuning in low-resource VQA.

Question Answering Visual Question Answering

Paper
Code

Milestones in Autonomous Driving and Intelligent Vehicles: Survey of Surveys

no code implementations • 30 Mar 2023 • Long Chen, Yuchen Li, Chao Huang, Bai Li, Yang Xing, Daxin Tian, Li Li, Zhongxu Hu, Xiaoxiang Na, Zixuan Li, Siyu Teng, Chen Lv, Jinjun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang

Interest in autonomous driving (AD) and intelligent vehicles (IVs) is growing at a rapid pace due to the convenience, safety, and economic benefits.

Autonomous Driving Ethics

Paper
Add Code

Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection

no code implementations • 6 Apr 2023 • Yuhao Huang, Sanping Zhou, Junjie Zhang, Jinpeng Dong, Nanning Zheng

Efficient representation of point clouds is fundamental for LiDAR-based 3D object detection.

3D Object Detection object-detection +1

Paper
Add Code

MMRDN: Consistent Representation for Multi-View Manipulation Relationship Detection in Object-Stacked Scenes

no code implementations • 25 Apr 2023 • Han Wang, Jiayuan Zhang, Lipeng Wan, Xingyu Chen, Xuguang Lan, Nanning Zheng

Manipulation relationship detection (MRD) aims to guide the robot to grasp objects in the right order, which is important to ensure the safety and reliability of grasping in object stacked scenes.

Position Relationship Detection

Paper
Add Code

Quadric Representations for LiDAR Odometry, Mapping and Localization

no code implementations • 27 Apr 2023 • Chao Xia, Chenfeng Xu, Patrick Rim, Mingyu Ding, Nanning Zheng, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan

Current LiDAR odometry, mapping and localization methods leverage point-wise representations of 3D scenes and achieve high accuracy in autonomous driving tasks.

Autonomous Driving

Paper
Add Code

How Do In-Context Examples Affect Compositional Generalization?

no code implementations • 8 May 2023 • Shengnan An, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Jian-Guang Lou, Dongmei Zhang

Compositional generalization--understanding unseen combinations of seen primitives--is an essential reasoning capability in human intelligence.

In-Context Learning

Paper
Add Code

Milestones in Autonomous Driving and Intelligent Vehicles Part I: Control, Computing System Design, Communication, HD Map, Testing, and Human Behaviors

no code implementations • 12 May 2023 • Long Chen, Yuchen Li, Chao Huang, Yang Xing, Daxin Tian, Li Li, Zhongxu Hu, Siyu Teng, Chen Lv, Jinjun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang

Our work is divided into 3 independent articles and the first part is a Survey of Surveys (SoS) for total technologies of AD and IVs that involves the history, summarizes the milestones, and provides the perspectives, ethics, and future research directions.

Autonomous Driving Ethics

Paper
Add Code

Skill-Based Few-Shot Selection for In-Context Learning

no code implementations • 23 May 2023 • Shengnan An, Bo Zhou, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Weizhu Chen, Jian-Guang Lou

Few-shot selection -- selecting appropriate examples for each test instance separately -- is important for in-context learning.

In-Context Learning Semantic Parsing +1

Paper
Add Code

Vector-based Representation is the Key: A Study on Disentanglement and Compositional Generalization

no code implementations • 29 May 2023 • Tao Yang, Yuwang Wang, Cuiling Lan, Yan Lu, Nanning Zheng

In this paper, we study several typical disentangled representation learning works in terms of both disentanglement and compositional generalization abilities, and we provide an important insight: vector-based representation (using a vector instead of a scalar to represent a concept) is the key to empower both good disentanglement and strong compositional generalization.

Disentanglement

Paper
Add Code

Milestones in Autonomous Driving and Intelligent Vehicles Part II: Perception and Planning

no code implementations • 3 Jun 2023 • Long Chen, Siyu Teng, Bai Li, Xiaoxiang Na, Yuchen Li, Zixuan Li, Jinjun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang

Growing interest in autonomous driving (AD) and intelligent vehicles (IVs) is fueled by their promise for enhanced safety, efficiency, and economic benefits.

Autonomous Driving Ethics

Paper
Add Code

When and Why Momentum Accelerates SGD:An Empirical Study

no code implementations • 15 Jun 2023 • Jingwen Fu, Bohan Wang, Huishuai Zhang, Zhizheng Zhang, Wei Chen, Nanning Zheng

In the comparison of SGDM and SGD with the same effective learning rate and the same batch size, we observe a consistent pattern: when $\eta_{ef}$ is small, SGDM and SGD experience almost the same empirical training losses; when $\eta_{ef}$ surpasses a certain threshold, SGDM begins to perform better than SGD.

Paper
Add Code

LongNet: Scaling Transformers to 1,000,000,000 Tokens

3 code implementations • 5 Jul 2023 • Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Nanning Zheng, Furu Wei

Scaling sequence length has become a critical demand in the era of large language models.

18,315

Paper
Code

MLF-DET: Multi-Level Fusion for Cross-Modal 3D Object Detection

no code implementations • 18 Jul 2023 • Zewei Lin, Yanqing Shen, Sanping Zhou, Shitao Chen, Nanning Zheng

In this paper, we propose a novel and effective Multi-Level Fusion network, named as MLF-DET, for high-performance cross-modal 3D object DETection, which integrates both the feature-level fusion and decision-level fusion to fully utilize the information in the image.

3D Object Detection Data Augmentation +1

Paper
Add Code

FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene

no code implementations • 27 Jul 2023 • Chengrui Wei, Meng Yang, Lei He, Nanning Zheng

It has long been an ill-posed problem to predict absolute depth maps from single images in real (unseen) indoor scenes.

3D Reconstruction Data Augmentation +1

Paper
Add Code

DETR Doesn't Need Multi-Scale or Locality Design

1 code implementation • 3 Aug 2023 • Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu

182

Paper
Code

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

1 code implementation • 8 Aug 2023 • Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo

We introduce a highly performant 3D object detector for point clouds using the DETR framework.

Ranked #2 on 3D Object Detection on ScanNetV2

3D Object Detection object-detection +1

Paper
Code

Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction

1 code implementation • 29 Aug 2023 • Wenjie Gao, Jiawei Fu, Yanqing Shen, Haodong Jing, Shitao Chen, Nanning Zheng

To enable better integration of satellite maps with existing methods, we propose a hierarchical fusion module, which includes feature-level fusion and BEV-level fusion.

Autonomous Driving Semantic Segmentation

Paper
Code

InteractionNet: Joint Planning and Prediction for Autonomous Driving with Transformers

1 code implementation • 7 Sep 2023 • Jiawei Fu, Yanqing Shen, Zhiqiang Jian, Shitao Chen, Jingmin Xin, Nanning Zheng

Planning and prediction are two important modules of autonomous driving and have experienced tremendous advancement recently.

Ranked #9 on CARLA longest6 on CARLA

Autonomous Driving CARLA longest6

Paper
Code

Generalization error bounds for iterative learning algorithms with bounded updates

no code implementations • 10 Sep 2023 • Jingwen Fu, Nanning Zheng

This paper explores the generalization characteristics of iterative learning algorithms with bounded updates for non-convex loss functions, employing information-theoretic techniques.

Paper
Add Code

Breaking through the learning plateaus of in-context learning in Transformer

no code implementations • 12 Sep 2023 • Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng

To study the mechanism behind the learning plateaus, we conceptually seperate a component within the model's internal representation that is exclusively affected by the model's weights.

In-Context Learning Representation Learning

Paper
Add Code

Overcoming the Barrier of Orbital-Free Density Functional Theory for Molecular Systems Using Deep Learning

no code implementations • 28 Sep 2023 • He Zhang, Siyuan Liu, Jiacheng You, Chang Liu, Shuxin Zheng, Ziheng Lu, Tong Wang, Nanning Zheng, Bin Shao

Orbital-free density functional theory (OFDFT) is a quantum chemistry formulation that has a lower cost scaling than the prevailing Kohn-Sham DFT, which is increasingly desired for contemporary molecular research.

Paper
Add Code

Open-Vocabulary Animal Keypoint Detection with Semantic-feature Matching

no code implementations • 8 Oct 2023 • Hao Zhang, Lumin Xu, Shenqi Lai, Wenqi Shao, Nanning Zheng, Ping Luo, Yu Qiao, Kaipeng Zhang

Current image-based keypoint detection methods for animal (including human) bodies and faces are generally divided into full-supervised and few-shot class-agnostic approaches.

Keypoint Detection

Paper
Add Code

G2-MonoDepth: A General Framework of Generalized Depth Inference from Monocular RGB+X Data

1 code implementation • 24 Oct 2023 • Haotian Wang, Meng Yang, Nanning Zheng

This paper investigates a unified task of monocular depth inference, which infers high-quality depth maps from all kinds of input raw data from various robots in unseen scenes.

Data Augmentation Depth Completion +1

Paper
Code

Closing the Gap Between the Upper Bound and the Lower Bound of Adam's Iteration Complexity

no code implementations • 27 Oct 2023 • Bohan Wang, Jingwen Fu, Huishuai Zhang, Nanning Zheng, Wei Chen

Recently, Arjevani et al. [1] established a lower bound of iteration complexity for the first-order optimization under an $L$-smooth condition and a bounded noise variance assumption.

LEMMA valid

Paper
Add Code

Learning From Mistakes Makes LLM Better Reasoner

1 code implementation • 31 Oct 2023 • Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen

To further improve their reasoning capabilities, this work explores whether LLMs can LEarn from MistAkes (LEMA), akin to the human learning process.

GSM8K Math +1

Paper
Code

FFINet: Future Feedback Interaction Network for Motion Forecasting

no code implementations • 8 Nov 2023 • Miao Kang, Shengqi Wang, Sanping Zhou, Ke Ye, Jingjing Jiang, Nanning Zheng

In this paper, we propose a novel Future Feedback Interaction Network (FFINet) to aggregate features the current observations and potential future interactions for trajectory prediction.

Motion Forecasting Position +1

Paper
Add Code

Single-Shot and Multi-Shot Feature Learning for Multi-Object Tracking

no code implementations • 17 Nov 2023 • Yizhe Li, Sanping Zhou, Zheng Qin, Le Wang, Jinjun Wang, Nanning Zheng

In this paper, we propose a simple yet effective two-stage feature learning paradigm to jointly learn single-shot and multi-shot features for different targets, so as to achieve robust data association in the tracking process.

Multi-Object Tracking

Paper
Add Code

IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance

1 code implementation • 19 Dec 2023 • Hongyi He, Longjun Liu, Haonan Zhang, Nanning Zheng

Among existing Neural Architecture Search methods, DARTS is known for its efficiency and simplicity.

Neural Architecture Search

Paper
Code

An automated framework for brain vessel centerline extraction from CTA images

1 code implementation • 13 Jan 2024 • Sijie Liu, Ruisheng Su, Jianghang Su, Jingmin Xin, Jiayi Wu, Wim van Zwam, Pieter Jan van Doormaal, Aad van der Lugt, Wiro J. Niessen, Nanning Zheng, Theo van Walsum

In this paper, we consider automatic lumen segmentation generation without additional annotation effort by physicians and more effective use of the generated lumen segmentation for improved centerline extraction performance.

Segmentation

Paper
Code

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement

no code implementations • 15 Feb 2024 • Tao Yang, Cuiling Lan, Yan Lu, Nanning Zheng

Disentangled representation learning strives to extract the intrinsic factors within observed data.

Disentanglement Image Reconstruction +1

Paper
Add Code

Dual-Space Optimization: Improved Molecule Sequence Design by Latent Prompt Transformer

no code implementations • 27 Feb 2024 • Deqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu, Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng, Ying Nian Wu

Designing molecules with desirable properties, such as drug-likeliness and high binding affinities towards protein targets, is a challenging problem.

Paper
Add Code

From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs

no code implementations • 28 Feb 2024 • Yulong Liu, Yunlong Yuan, Chunwei Wang, Jianhua Han, Yongqiang Ma, Li Zhang, Nanning Zheng, Hang Xu

In this work, we introduce a novel tool invocation pipeline designed to control massive real-world APIs.

In-Context Learning

Paper
Add Code

Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample Selection

1 code implementation • 4 Mar 2024 • Shitao Chen, Haolin Zhang, Nanning Zheng

3D object detection based on LiDAR point cloud and prior anchor boxes is a critical technology for autonomous driving environment perception and understanding.

3D Object Detection Autonomous Driving +2

Paper
Code

Common 7B Language Models Already Possess Strong Math Capabilities

no code implementations • 7 Mar 2024 • Chen Li, Weiqi Wang, Jingcheng Hu, Yixuan Wei, Nanning Zheng, Han Hu, Zheng Zhang, Houwen Peng

This paper shows that the LLaMA-2 7B model with common pre-training already exhibits strong mathematical abilities, as evidenced by its impressive accuracy of 97. 7% and 72. 0% on the GSM8K and MATH benchmarks, respectively, when selecting the best response from 256 random generations.

GSM8K Math

Paper
Add Code

See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI

no code implementations • 11 Mar 2024 • Yulong Liu, Yongqiang Ma, Guibo Zhu, Haodong Jing, Nanning Zheng

Our model integrates a high-level perception decoding pipeline and a pixel-wise reconstruction pipeline guided by high-level perceptions, simulating bottom-up and top-down processes in neuroscience.

Brain Decoding General Knowledge +1

Paper
Add Code

Exploring Hardware Friendly Bottleneck Architecture in CNN for Embedded Computing Systems

no code implementations • 11 Mar 2024 • Xing Lei, Longjun Liu, Zhiheng Zhou, Hongbin Sun, Nanning Zheng

We deploy our L-Mobilenet model to ZYNQ embedded platform for fully evaluating the performance of our design.

Paper
Add Code

Self-Consistency Training for Hamiltonian Prediction

no code implementations • 14 Mar 2024 • He Zhang, Chang Liu, Zun Wang, Xinran Wei, Siyuan Liu, Nanning Zheng, Bin Shao, Tie-Yan Liu

This merit addresses the data scarcity difficulty, and distinguishes the task from other property prediction formulations with unique benefits: (1) self-consistency training enables the model to be trained on a large amount of unlabeled data, hence substantially enhances generalization; (2) self-consistency training is more efficient than labeling data with DFT for supervised training, since it is an amortization of DFT calculation over a set of molecular structures.

Property Prediction

Paper
Add Code

Robust Noisy Label Learning via Two-Stream Sample Distillation

no code implementations • 16 Apr 2024 • Sihan Bai, Sanping Zhou, Zheng Qin, Le Wang, Nanning Zheng

Noisy label learning aims to learn robust networks under the supervision of noisy labels, which plays a critical role in deep learning.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.