1 code implementation • 8 Sep 2024 • Ning Gao, Sanping Zhou, Le Wang, Nanning Zheng
In this paper, we propose a simple yet effective semi-supervised learning framework, termed Progressive Mean Teachers (PMT), for medical image segmentation, whose goal is to generate high-fidelity pseudo labels by learning robust and diverse features in the training process.
no code implementations • 8 Jul 2024 • Shaoning Li, Mingyu Li, Yusong Wang, Xinheng He, Nanning Zheng, Jian Zhang, Pheng-Ann Heng
Investigating conformational landscapes of proteins is a crucial way to understand their biological functions and properties.
no code implementations • 20 May 2024 • Jingwen Fu, Zhizheng Zhang, Yan Lu, Nanning Zheng
Compositional Generalization (CG) embodies the ability to comprehend novel combinations of familiar concepts, representing a significant cognitive leap in human intellectual advancement.
no code implementations • CVPR 2024 • Tianci Bi, Xiaoyi Zhang, Zhizheng Zhang, Wenxuan Xie, Cuiling Lan, Yan Lu, Nanning Zheng
Significant progress has been made in scene text detection models since the rise of deep learning, but scene text layout analysis, which aims to group detected text instances as paragraphs, has not kept pace.
no code implementations • 1 May 2024 • Shaoning Li, Yusong Wang, Mingyu Li, Jian Zhang, Bin Shao, Nanning Zheng, Jian Tang
Molecular dynamics (MD) is a crucial technique for simulating biological systems, enabling the exploration of their dynamic nature and fostering an understanding of their functions and properties.
2 code implementations • 25 Apr 2024 • Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou
While many contemporary large language models (LLMs) can process lengthy input, they still struggle to fully utilize information within the long context, known as the lost-in-the-middle challenge.
no code implementations • 16 Apr 2024 • Sihan Bai, Sanping Zhou, Zheng Qin, Le Wang, Nanning Zheng
Noisy label learning aims to learn robust networks under the supervision of noisy labels, which plays a critical role in deep learning.
no code implementations • 14 Mar 2024 • He Zhang, Chang Liu, Zun Wang, Xinran Wei, Siyuan Liu, Nanning Zheng, Bin Shao, Tie-Yan Liu
Predicting the mean-field Hamiltonian matrix in density functional theory is a fundamental formulation to leverage machine learning for solving molecular science problems.
no code implementations • 11 Mar 2024 • Xing Lei, Longjun Liu, Zhiheng Zhou, Hongbin Sun, Nanning Zheng
We deploy our L-Mobilenet model to ZYNQ embedded platform for fully evaluating the performance of our design.
no code implementations • 11 Mar 2024 • Yulong Liu, Yongqiang Ma, Guibo Zhu, Haodong Jing, Nanning Zheng
Our model integrates a high-level perception decoding pipeline and a pixel-wise reconstruction pipeline guided by high-level perceptions, simulating bottom-up and top-down processes in neuroscience.
1 code implementation • 7 Mar 2024 • Chen Li, Weiqi Wang, Jingcheng Hu, Yixuan Wei, Nanning Zheng, Han Hu, Zheng Zhang, Houwen Peng
This paper shows that the LLaMA-2 7B model with common pre-training already exhibits strong mathematical abilities, as evidenced by its impressive accuracy of 97. 7% and 72. 0% on the GSM8K and MATH benchmarks, respectively, when selecting the best response from 256 random generations.
1 code implementation • 4 Mar 2024 • Shitao Chen, Haolin Zhang, Nanning Zheng
3D object detection based on LiDAR point cloud and prior anchor boxes is a critical technology for autonomous driving environment perception and understanding.
no code implementations • 28 Feb 2024 • Yulong Liu, Yunlong Yuan, Chunwei Wang, Jianhua Han, Yongqiang Ma, Li Zhang, Nanning Zheng, Hang Xu
In this work, we introduce a novel tool invocation pipeline designed to control massive real-world APIs.
no code implementations • 27 Feb 2024 • Deqian Kong, Yuhao Huang, Jianwen Xie, Edouardo Honig, Ming Xu, Shuanghong Xue, Pei Lin, Sanping Zhou, Sheng Zhong, Nanning Zheng, Ying Nian Wu
Designing molecules with desirable properties, such as drug-likeliness and high binding affinities towards protein targets, is a challenging problem.
no code implementations • 15 Feb 2024 • Tao Yang, Cuiling Lan, Yan Lu, Nanning Zheng
Disentangled representation learning strives to extract the intrinsic factors within observed data.
1 code implementation • 13 Jan 2024 • Sijie Liu, Ruisheng Su, Jianghang Su, Jingmin Xin, Jiayi Wu, Wim van Zwam, Pieter Jan van Doormaal, Aad van der Lugt, Wiro J. Niessen, Nanning Zheng, Theo van Walsum
In this paper, we consider automatic lumen segmentation generation without additional annotation effort by physicians and more effective use of the generated lumen segmentation for improved centerline extraction performance.
1 code implementation • 19 Dec 2023 • Hongyi He, Longjun Liu, Haonan Zhang, Nanning Zheng
Among existing Neural Architecture Search methods, DARTS is known for its efficiency and simplicity.
no code implementations • 17 Nov 2023 • Yizhe Li, Sanping Zhou, Zheng Qin, Le Wang, Jinjun Wang, Nanning Zheng
In this paper, we propose a simple yet effective two-stage feature learning paradigm to jointly learn single-shot and multi-shot features for different targets, so as to achieve robust data association in the tracking process.
no code implementations • 8 Nov 2023 • Miao Kang, Shengqi Wang, Sanping Zhou, Ke Ye, Jingjing Jiang, Nanning Zheng
In this paper, we propose a novel Future Feedback Interaction Network (FFINet) to aggregate features the current observations and potential future interactions for trajectory prediction.
1 code implementation • 31 Oct 2023 • Shengnan An, Zexiong Ma, Zeqi Lin, Nanning Zheng, Jian-Guang Lou, Weizhu Chen
To further improve their reasoning capabilities, this work explores whether LLMs can LEarn from MistAkes (LEMA), akin to the human learning process.
no code implementations • 27 Oct 2023 • Bohan Wang, Jingwen Fu, Huishuai Zhang, Nanning Zheng, Wei Chen
Recently, Arjevani et al. [1] established a lower bound of iteration complexity for the first-order optimization under an $L$-smooth condition and a bounded noise variance assumption.
1 code implementation • 24 Oct 2023 • Haotian Wang, Meng Yang, Nanning Zheng
This paper investigates a unified task of monocular depth inference, which infers high-quality depth maps from all kinds of input raw data from various robots in unseen scenes.
no code implementations • 8 Oct 2023 • Hao Zhang, Lumin Xu, Shenqi Lai, Wenqi Shao, Nanning Zheng, Ping Luo, Yu Qiao, Kaipeng Zhang
Current image-based keypoint detection methods for animal (including human) bodies and faces are generally divided into full-supervised and few-shot class-agnostic approaches.
no code implementations • 28 Sep 2023 • He Zhang, Siyuan Liu, Jiacheng You, Chang Liu, Shuxin Zheng, Ziheng Lu, Tong Wang, Nanning Zheng, Bin Shao
Orbital-free density functional theory (OFDFT) is a quantum chemistry formulation that has a lower cost scaling than the prevailing Kohn-Sham DFT, which is increasingly desired for contemporary molecular research.
no code implementations • 12 Sep 2023 • Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng
To study the mechanism behind the learning plateaus, we conceptually seperate a component within the model's internal representation that is exclusively affected by the model's weights.
no code implementations • 10 Sep 2023 • Jingwen Fu, Nanning Zheng
This paper explores the generalization characteristics of iterative learning algorithms with bounded updates for non-convex loss functions, employing information-theoretic techniques.
1 code implementation • 7 Sep 2023 • Jiawei Fu, Yanqing Shen, Zhiqiang Jian, Shitao Chen, Jingmin Xin, Nanning Zheng
Planning and prediction are two important modules of autonomous driving and have experienced tremendous advancement recently.
Ranked #9 on CARLA longest6 on CARLA
1 code implementation • 29 Aug 2023 • Wenjie Gao, Jiawei Fu, Yanqing Shen, Haodong Jing, Shitao Chen, Nanning Zheng
To enable better integration of satellite maps with existing methods, we propose a hierarchical fusion module, which includes feature-level fusion and BEV-level fusion.
1 code implementation • 8 Aug 2023 • Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo
We introduce a highly performant 3D object detector for point clouds using the DETR framework.
Ranked #2 on 3D Object Detection on ScanNetV2
1 code implementation • 3 Aug 2023 • Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu
This paper presents an improved DETR detector that maintains a "plain" nature: using a single-scale feature map and global cross-attention calculations without specific locality constraints, in contrast to previous leading DETR-based detectors that reintroduce architectural inductive biases of multi-scale and locality into the decoder.
no code implementations • 27 Jul 2023 • Chengrui Wei, Meng Yang, Lei He, Nanning Zheng
It has long been an ill-posed problem to predict absolute depth maps from single images in real (unseen) indoor scenes.
no code implementations • 18 Jul 2023 • Zewei Lin, Yanqing Shen, Sanping Zhou, Shitao Chen, Nanning Zheng
In this paper, we propose a novel and effective Multi-Level Fusion network, named as MLF-DET, for high-performance cross-modal 3D object DETection, which integrates both the feature-level fusion and decision-level fusion to fully utilize the information in the image.
3 code implementations • 5 Jul 2023 • Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Nanning Zheng, Furu Wei
Scaling sequence length has become a critical demand in the era of large language models.
no code implementations • 15 Jun 2023 • Jingwen Fu, Bohan Wang, Huishuai Zhang, Zhizheng Zhang, Wei Chen, Nanning Zheng
In the comparison of SGDM and SGD with the same effective learning rate and the same batch size, we observe a consistent pattern: when $\eta_{ef}$ is small, SGDM and SGD experience almost the same empirical training losses; when $\eta_{ef}$ surpasses a certain threshold, SGDM begins to perform better than SGD.
no code implementations • 3 Jun 2023 • Long Chen, Siyu Teng, Bai Li, Xiaoxiang Na, Yuchen Li, Zixuan Li, Jinjun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang
Growing interest in autonomous driving (AD) and intelligent vehicles (IVs) is fueled by their promise for enhanced safety, efficiency, and economic benefits.
no code implementations • 29 May 2023 • Tao Yang, Yuwang Wang, Cuiling Lan, Yan Lu, Nanning Zheng
In this paper, we study several typical disentangled representation learning works in terms of both disentanglement and compositional generalization abilities, and we provide an important insight: vector-based representation (using a vector instead of a scalar to represent a concept) is the key to empower both good disentanglement and strong compositional generalization.
no code implementations • 23 May 2023 • Shengnan An, Bo Zhou, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Weizhu Chen, Jian-Guang Lou
Few-shot selection -- selecting appropriate examples for each test instance separately -- is important for in-context learning.
no code implementations • 12 May 2023 • Long Chen, Yuchen Li, Chao Huang, Yang Xing, Daxin Tian, Li Li, Zhongxu Hu, Siyu Teng, Chen Lv, Jinjun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang
Our work is divided into 3 independent articles and the first part is a Survey of Surveys (SoS) for total technologies of AD and IVs that involves the history, summarizes the milestones, and provides the perspectives, ethics, and future research directions.
no code implementations • 8 May 2023 • Shengnan An, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Jian-Guang Lou, Dongmei Zhang
Compositional generalization--understanding unseen combinations of seen primitives--is an essential reasoning capability in human intelligence.
no code implementations • 27 Apr 2023 • Chao Xia, Chenfeng Xu, Patrick Rim, Mingyu Ding, Nanning Zheng, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan
Current LiDAR odometry, mapping and localization methods leverage point-wise representations of 3D scenes and achieve high accuracy in autonomous driving tasks.
no code implementations • 25 Apr 2023 • Han Wang, Jiayuan Zhang, Lipeng Wan, Xingyu Chen, Xuguang Lan, Nanning Zheng
Manipulation relationship detection (MRD) aims to guide the robot to grasp objects in the right order, which is important to ensure the safety and reliability of grasping in object stacked scenes.
no code implementations • 6 Apr 2023 • Yuhao Huang, Sanping Zhou, Junjie Zhang, Jinpeng Dong, Nanning Zheng
Efficient representation of point clouds is fundamental for LiDAR-based 3D object detection.
no code implementations • 30 Mar 2023 • Long Chen, Yuchen Li, Chao Huang, Bai Li, Yang Xing, Daxin Tian, Li Li, Zhongxu Hu, Xiaoxiang Na, Zixuan Li, Siyu Teng, Chen Lv, Jinjun Wang, Dongpu Cao, Nanning Zheng, Fei-Yue Wang
Interest in autonomous driving (AD) and intelligent vehicles (IVs) is growing at a rapid pace due to the convenience, safety, and economic benefits.
1 code implementation • CVPR 2023 • Jingjing Jiang, Nanning Zheng
In this paper, we propose MixPHM, a redundancy-aware parameter-efficient tuning method that outperforms full finetuning in low-resource VQA.
1 code implementation • 25 Feb 2023 • Yulong Liu, Yongqiang Ma, Wei Zhou, Guibo Zhu, Nanning Zheng
Our experiments show that this combination can boost the decoding model's performance on certain tasks like fMRI-text matching and fMRI-to-image generation.
1 code implementation • 23 Feb 2023 • Shengnan An, Zeqi Lin, Bei Chen, Qiang Fu, Nanning Zheng, Jian-Guang Lou
Abstraction is a desirable capability for deep learning models, which means to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context.
1 code implementation • NeurIPS 2023 • Tao Yang, Yuwang Wang, Yan Lv, Nanning Zheng
Targeting to understand the underlying explainable factors behind observations and modeling the conditional generation process on these factors, we connect disentangled representation learning to Diffusion Probabilistic Models (DPMs) to take advantage of the remarkable modeling ability of DPMs.
1 code implementation • NeurIPS 2021 • Lin Song, Songyang Zhang, Songtao Liu, Zeming Li, Xuming He, Hongbin Sun, Jian Sun, Nanning Zheng
Specifically, we propose a Dynamic Grained Encoder for vision transformers, which can adaptively assign a suitable number of queries to each spatial region.
1 code implementation • ICCV 2023 • Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu
This paper presents an improved DETR detector that maintains a "plain" nature: using a single-scale feature map and global cross-attention calculations without specific locality constraints, in contrast to previous leading DETR-based detectors that reintroduce architectural inductive biases of multi-scale and locality into the decoder.
no code implementations • ICCV 2023 • Huan Li, Ping Wei, Zeyu Ma, Nanning Zheng
In this study, we introduce a novel approach called inverse compositional learning (ICL) for weakly-supervised video relation grounding.
no code implementations • 13 Dec 2022 • Xuchong Zhang, Changfeng Sun, Haoliang Han, Hang Wang, Hongbin Sun, Nanning Zheng
Evaluation results demonstrate that, the proposed object-fabrication targeted attack mode and the corresponding targeted feature space attack method show significant improvements in terms of image-specific attack, universal performance and generalization capability, compared with the previous targeted attack for object detection.
no code implementations • CVPR 2023 • Yanqing Shen, Sanping Zhou, Jingwen Fu, Ruotong Wang, Shitao Chen, Nanning Zheng
In this paper, we propose StructVPR, a novel training architecture for VPR, to enhance structural knowledge in RGB global features and thus improve feature stability in a constantly changing environment.
no code implementations • 22 Nov 2022 • Lipeng Wan, Zeyang Liu, Xingyu Chen, Xuguang Lan, Nanning Zheng
To ensure optimal consistency, the optimal node is required to be the unique STN.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 3 Nov 2022 • Yutong Lin, Ze Liu, Zheng Zhang, Han Hu, Nanning Zheng, Stephen Lin, Yue Cao
In this paper, we present a study of frozen pretrained models when applied to diverse and representative computer vision tasks, including object detection, semantic segmentation and video action recognition.
Ranked #3 on Action Recognition In Videos on Kinetics-400
no code implementations • 10 Oct 2022 • Xingyu Chen, Jianru Xue, Jianwu Fang, Yuxin Pan, Nanning Zheng
In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU.
1 code implementation • 14 Sep 2022 • Jingjing Jiang, Ziyi Liu, Nanning Zheng
In this paper, we aim to improve input robustness from an information bottleneck perspective when adapting pretrained VLMs to the downstream VQA task.
1 code implementation • 22 Jul 2022 • Jinrong Yang, Lin Song, Songtao Liu, Weixin Mao, Zeming Li, Xiaoping Li, Hongbin Sun, Jian Sun, Nanning Zheng
Many point-based 3D detectors adopt point-feature sampling strategies to drop some points for efficient inference.
no code implementations • CVPR 2022 • Kun Xia, Le Wang, Sanping Zhou, Nanning Zheng, Wei Tang
The main challenge of Temporal Action Localization is to retrieve subtle human actions from various co-occurring ingredients, e. g., context and background, in an untrimmed video.
1 code implementation • 26 May 2022 • Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Fang Zheng, Nanning Zheng, Gang Hua
Understanding the multiple socially-acceptable future behaviors is an essential task for many vision applications.
2 code implementations • 20 May 2022 • Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng
We further propose a Concept Disentangling Loss to facilitate that different concept tokens represent independent visual concepts.
no code implementations • 20 May 2022 • Tao Yang, Shenglong Zhou, Yuwang Wang, Yan Lu, Nanning Zheng
Deep neural networks often suffer the data distribution shift between training and testing, and the batch statistics are observed to reflect the shift.
no code implementations • 7 Mar 2022 • Shengnan An, Yifei Li, Zeqi Lin, Qian Liu, Bei Chen, Qiang Fu, Weizhu Chen, Nanning Zheng, Jian-Guang Lou
This motivates us to propose input-tuning, which fine-tunes both the continuous prompts and the input representations, leading to a more effective way to adapt unfamiliar inputs to frozen PLMs.
no code implementations • 3 Feb 2022 • Pu Zhang, Lei Bai, Jianru Xue, Jianwu Fang, Nanning Zheng, Wanli Ouyang
Trajectories obtained from object detection and tracking are inevitably noisy, which could cause serious forecasting errors to predictors built on ground truth trajectories.
no code implementations • CVPR 2022 • Ruotong Wang, Yanqing Shen, Weiliang Zuo, Sanping Zhou, Nanning Zheng
In addition, the output tokens from Transformer layers filtered by the fused attention mask are considered as key-patch descriptors, which are used to perform spatial matching to re-rank the candidates retrieved by the global image features.
1 code implementation • NeurIPS 2021 • He Zhang, Fusong Ju, Jianwei Zhu, Liang He, Bin Shao, Nanning Zheng, Tie-Yan Liu
These methods generally derive coevolutionary features by aggregating the learned residue representations from individual sequences with equal weights, which is inconsistent with the premise that residue co-evolutions are a reflection of collective covariation patterns of numerous homologous proteins.
1 code implementation • 29 Nov 2021 • Jingjing Jiang, Ziyi Liu, Nanning Zheng
Video Question Answering (VideoQA), aiming to correctly answer the given question based on understanding multi-modal video content, is challenging due to the rich video content.
no code implementations • 7 Nov 2021 • Pengfei Zhang, Cuiling Lan, Wenjun Zeng, Junliang Xing, Jianru Xue, Nanning Zheng
Skeleton data is of low dimension.
1 code implementation • NeurIPS 2021 • Zijian Kang, Peizhen Zhang, Xiangyu Zhang, Jian Sun, Nanning Zheng
Knowledge distillation has shown great success in classification, however, it is still challenging for detection.
no code implementations • 14 Oct 2021 • Siyuan Liu, Yusong Wang, Tong Wang, Yifan Deng, Liang He, Bin Shao, Jian Yin, Nanning Zheng, Tie-Yan Liu
The identification of active binding drugs for target proteins (termed as drug-target interaction prediction) is the key challenge in virtual screening, which plays an essential role in drug discovery.
no code implementations • 29 Sep 2021 • Kai Chen, Yongqiang Ma, Mingyang Sheng, Nanning Zheng
Inspired by the mechanism of human visual attention, in this paper, we propose a novel method of reconstructing visual stimulus images, which first decodes the distribution of visual attention from fMRI, and then reconstructs the visual images guided by visual attention.
1 code implementation • 23 Sep 2021 • Peizhen Zhang, Zijian Kang, Tong Yang, Xiangyu Zhang, Nanning Zheng, Jian Sun
Instead, we generate an instructive knowledge based only on student representations and regular labels.
no code implementations • 25 Aug 2021 • Hanbo Zhang, Yunfan Lu, Cunjun Yu, David Hsu, Xuguang Lan, Nanning Zheng
This paper presents INVIGORATE, a robot system that interacts with human through natural language and grasps a specified object in clutter.
1 code implementation • 25 Aug 2021 • Ziyue Feng, Yu Chen, Shitao Chen, Nanning Zheng
The proposed algorithm consists of three parts: an imaginative model for anticipating results before parking, an improved rapid-exploring random tree (RRT) for planning a feasible trajectory from a given start point to a parking lot, and a path smoothing module for optimizing the efficiency of parking tasks.
1 code implementation • ICCV 2021 • Fang Zheng, Le Wang, Sanping Zhou, Wei Tang, Zhenxing Niu, Nanning Zheng, Gang Hua
Specifically, the proposed unlimited neighborhood interaction module generates the fused-features of all agents involved in an interaction simultaneously, which is adaptive to any number of agents and any range of interaction area.
1 code implementation • ICCV 2021 • Zixin Zhu, Wei Tang, Le Wang, Nanning Zheng, Gang Hua
We explore two existing models to be the P-Net in our experiments.
1 code implementation • 24 Jul 2021 • Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng
In this paper, we formulate OOD generalization in VQA as a compositional generalization problem and propose a graph generative modeling-based training scheme (X-GGM) to implicitly model the problem.
no code implementations • 14 Jul 2021 • Jie Xu, Xingyu Chen, Xuguang Lan, Nanning Zheng
The experimental results show that our approach makes the interaction more efficient and safer.
2 code implementations • Findings (ACL) 2021 • Chenyao Liu, Shengnan An, Zeqi Lin, Qian Liu, Bei Chen, Jian-Guang Lou, Lijie Wen, Nanning Zheng, Dongmei Zhang
In this paper, we propose LeAR, an end-to-end neural model to learn algebraic recombination for compositional generalization.
Ranked #2 on Semantic Parsing on CFQ
1 code implementation • 7 Jun 2021 • Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Nanning Zheng, Gang Hua
In this paper, we propose two attacks against deep ranking systems, i. e., Candidate Attack and Query Attack, that can raise or lower the rank of chosen candidates by adversarial perturbations.
no code implementations • 7 Jun 2021 • Zhanning Gao, Le Wang, Nebojsa Jojic, Zhenxing Niu, Nanning Zheng, Gang Hua
In the proposed framework, a dedicated feature alignment module is incorporated for redundancy removal across frames to produce the tensor representation, i. e., the video imprint.
1 code implementation • 29 Apr 2021 • Hanbo Zhang, Deyu Yang, Han Wang, Binglei Zhao, Xuguang Lan, Jishiyu Ding, Nanning Zheng
In this paper, we present a new dataset named REGRAD for the learning of relationships among objects and grasps.
no code implementations • 30 Mar 2021 • Ziyi Liu, Le Wang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua
To address this challenge, we introduce a framework that learns two feature subspaces respectively for actions and their context.
Action Recognition Weakly-supervised Temporal Action Localization +1
no code implementations • 28 Mar 2021 • Ziyi Liu, Le Wang, Qilin Zhang, Wei Tang, Junsong Yuan, Nanning Zheng, Gang Hua
In this paper, we introduce an Action-Context Separation Network (ACSNet) that explicitly takes into account context for accurate action localization.
Ranked #7 on Weakly Supervised Action Localization on THUMOS’14
Video Polyp Segmentation Weakly Supervised Action Localization +2
2 code implementations • ICCV 2021 • Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Yinghui Xu, Nanning Zheng, Gang Hua
In this paper, we formulate a new adversarial attack against deep ranking systems, i. e., the Order Attack, which covertly alters the relative order among a selected set of candidates according to an attacker-specified permutation, with limited interference to other unrelated candidates.
no code implementations • 1 Mar 2021 • He Zhang, Zhixiong Nan, Tao Yang, Yifan Liu, Nanning Zheng
In autonomous driving, perceiving the driving behaviors of surrounding agents is important for the ego-vehicle to make a reasonable decision.
1 code implementation • ICLR 2022 • Tao Yang, Xuanchi Ren, Yuwang Wang, Wenjun Zeng, Nanning Zheng
We then propose a model, based on existing VAE-based methods, to tackle the unsupervised learning problem of the framework.
no code implementations • 1 Jan 2021 • Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Xu Yinghui, Nanning Zheng, Gang Hua
The objective of this paper is to formalize and practically implement a new adversarial attack against deep ranking systems, i. e., the Order Attack, which covertly alters the relative order of a selected set of candidates according to a permutation vector predefined by the attacker, with only limited interference to other unrelated candidates.
1 code implementation • ICCV 2021 • Haoxuanye Ji, Le Wang, Sanping Zhou, Wei Tang, Nanning Zheng, Gang Hua
Unsupervised person re-identification (Re-ID) remains challenging due to the lack of ground-truth labels.
no code implementations • 7 Dec 2020 • Lipeng Wan, Xuwei Song, Xuguang Lan, Nanning Zheng
General methods for policy based multi-agent reinforcement learning to solve the challenge introduce differentiate value functions or advantage functions for individual agents.
1 code implementation • NeurIPS 2020 • Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Xiangyu Zhang, Hongbin Sun, Jian Sun, Nanning Zheng
The Learnable Tree Filter presents a remarkable approach to model structure-preserving relations for semantic segmentation.
1 code implementation • NeurIPS 2020 • Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng
To this end, we propose a fine-grained dynamic head to conditionally select a pixel-level combination of FPN features from different scales for each instance, which further releases the ability of multi-scale feature representation.
1 code implementation • CVPR 2021 • JianFeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng
Mainstream object detectors based on the fully convolutional network has achieved impressive performance.
no code implementations • 27 Oct 2020 • Hui Chen, Zhixiong Nan, Jingjing Jiang, Nanning Zheng
The composition recognition of unseen attribute-object is critical to make machines learn to decompose and compose complex concepts like people.
no code implementations • 8 Sep 2020 • Jianji Wang, Qi Liu, Shupei Zhang, Nanning Zheng, Fei-Yue Wang
By the proposed method, the computational complexity is reduced from $O(\frac{1}{6}{k^3}+mk^2+mkd)$ to $O(\frac{1}{6}{k^3}+\frac{1}{2}mk^2)$ for each candidate subset in sparse regression.
2 code implementations • ECCV 2020 • Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng
Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.
no code implementations • 6 Jul 2020 • Tao Yang, Zhixiong Nan, He Zhang, Shitao Chen, Nanning Zheng
In this paper, we propose a model to predict the trajectories of target agents around an autonomous vehicle.
1 code implementation • NeurIPS 2020 • Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang
Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily.
1 code implementation • 28 Feb 2020 • Binglei Zhao, Hanbo Zhang, Xuguang Lan, Haoyu Wang, Zhiqiang Tian, Nanning Zheng
Reliable robotic grasping in unstructured environments is a crucial but challenging task.
Robotics
1 code implementation • 4 Jan 2020 • Zirui Zhao, Yijun Mao, Yan Ding, Pengju Ren, Nanning Zheng
Semantic SLAM is an important field in autonomous driving and intelligent agents, which can enable robots to achieve high-level navigation tasks, obtain simple cognition or reasoning ability and achieve language-based human-robot-interaction.
no code implementations • ICLR 2020 • Hui Chen, Zhixiong Nan, Nanning Zheng
This paper handles a challenging problem, unseen attribute-object pair recognition, which asks a model to simultaneously recognize the attribute type and the object type of a given image while this attribute-object pair is not included in the training set.
1 code implementation • NeurIPS 2019 • Lin Song, Yanwei Li, Zeming Li, Gang Yu, Hongbin Sun, Jian Sun, Nanning Zheng
To this end, tree filtering modules are embedded to formulate a unified framework for semantic segmentation.
no code implementations • 3 Sep 2019 • Pengfei Zhang, Jianru Xue, Cuiling Lan, Wen-Jun Zeng, Zhanning Gao, Nanning Zheng
For an RNN block, an EleAttG is used for adaptively modulating the input by assigning different levels of importance, i. e., attention, to each element/dimension of the input.
Ranked #3 on Skeleton Based Action Recognition on SYSU 3D
no code implementations • 22 Aug 2019 • Hao Wu, Ziyu Zhu, Jiayi Wang, Nanning Zheng, Badong Chen
The framework comprises two parts: forward encoding model that deals with visual stimuli and inner state model that captures influence from intrinsic connections in the brain.
1 code implementation • 29 Jul 2019 • Hanbo Zhang, Site Bai, Xuguang Lan, David Hsu, Nanning Zheng
We propose \emph{Hindsight Trust Region Policy Optimization}(HTRPO), a new RL algorithm that extends the highly successful TRPO algorithm with \emph{hindsight} to tackle the challenge of sparse rewards.
2 code implementations • CVPR 2020 • Pengfei Zhang, Cuiling Lan, Wen-Jun Zeng, Junliang Xing, Jianru Xue, Nanning Zheng
Skeleton-based human action recognition has attracted great interest thanks to the easy accessibility of the human skeleton data.
Ranked #1 on Skeleton Based Action Recognition on SYSU 3D
1 code implementation • CVPR 2019 • Pu Zhang, Wanli Ouyang, Pengfei Zhang, Jianru Xue, Nanning Zheng
In order to address this issue, we propose a data-driven state refinement module for LSTM network (SR-LSTM), which activates the utilization of the current intention of neighbors, and jointly and iteratively refines the current states of all participants in the crowd through a message passing mechanism.
no code implementations • 23 Oct 2018 • Yuanliu Liu, Ang Li, Zejian yuan, Badong Chen, Nanning Zheng
We propose a Consistency-aware Selective Fusion (CSF) to integrate the pairwise orders into a globally consistent order.
no code implementations • 19 Sep 2018 • Hanbo Zhang, Xuguang Lan, Site Bai, Lipeng Wan, Chenjie Yang, Nanning Zheng
Autonomous robotic grasping plays an important role in intelligent robotics.
Robotics
no code implementations • 8 Sep 2018 • Hanbo Zhang, Xinwen Zhou, Xuguang Lan, Jin Li, Zhiqiang Tian, Nanning Zheng
The main component of our approach is a grasp detection network with oriented anchor boxes as detection priors.
Robotics
no code implementations • ECCV 2018 • Xing Wei, Yue Zhang, Yihong Gong, Jiawei Zhang, Nanning Zheng
The reason is that the bilinear feature matrix is sensitive to the magnitudes and correlations of local CNN feature elements which can be measured by its singular values.
Fine-Grained Image Classification Fine-Grained Visual Recognition +1
no code implementations • ECCV 2018 • Weiwei Shi, Yihong Gong, Chris Ding, Zhiheng MaXiaoyu Tao, Nanning Zheng
In this paper, we propose Transductive Semi-Supervised Deep Learning (TSSDL) method that is effective for training Deep Convolutional Neural Network (DCNN) models.
no code implementations • 30 Aug 2018 • Hanbo Zhang, Xuguang Lan, Site Bai, Xinwen Zhou, Zhiqiang Tian, Nanning Zheng
Experimental results demonstrate that ROI-GD performs much better in object overlapping scenes and at the meantime, remains comparable with state-of-the-art grasp detection algorithms on Cornell Grasp Dataset and Jacquard Dataset.
Robotics
no code implementations • ECCV 2018 • Pengfei Zhang, Jianru Xue, Cuiling Lan, Wen-Jun Zeng, Zhanning Gao, Nanning Zheng
We propose adding a simple yet effective Element-wiseAttention Gate (EleAttG) to an RNN block (e. g., all RNN neurons in a network layer) that empowers the RNN neurons to have the attentiveness capability.
Ranked #102 on Skeleton Based Action Recognition on NTU RGB+D
no code implementations • 4 Jul 2018 • Sanping Zhou, Jinjun Wang, Deyu Meng, Yudong Liang, Yihong Gong, Nanning Zheng
Specifically, a novel foreground attentive subnetwork is designed to drive the network's attention, in which a decoder network is used to reconstruct the binary mask by using a novel local regression loss function, and an encoder network is regularized by the decoder network to focus its attention on the foreground persons.
no code implementations • CVPR 2018 • Xing Wei, Yue Zhang, Yihong Gong, Nanning Zheng
Experimental results on several patch matching benchmarks show that our method outperforms the state-of-the-arts significantly.
no code implementations • CVPR 2018 • Ping Wei, Yang Liu, Tianmin Shu, Nanning Zheng, Song-Chun Zhu
We built a new video dataset of tasks, intentions, and attention.
2 code implementations • 20 Apr 2018 • Pengfei Zhang, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jianru Xue, Nanning Zheng
In order to alleviate the effects of view variations, this paper introduces a novel view adaptation scheme, which automatically determines the virtual observation viewpoints in a learning based data driven manner.
Ranked #1 on Skeleton Based Action Recognition on UWA3D
no code implementations • 19 Mar 2018 • Jinliang Zang, Le Wang, Ziyi Liu, Qilin Zhang, Zhenxing Niu, Gang Hua, Nanning Zheng
Research in human action recognition has accelerated significantly since the introduction of powerful machine learning tools such as Convolutional Neural Networks (CNNs).
no code implementations • 13 Mar 2018 • Kai Yi, Zhiqiang Jian, Shitao Chen, Nanning Zheng
At present, the performance of deep neural network in general object detection is comparable to or even surpasses that of human beings.
no code implementations • 6 Mar 2018 • Xinwen Zhou, Xuguang Lan, Hanbo Zhang, Zhiqiang Tian, Yang Zhang, Nanning Zheng
The feature extractor is a deep convolutional neural network.
no code implementations • 1 Feb 2018 • Zhengda Qin, Badong Chen, Nanning Zheng, Jose C. Principe
In this paper, we propose a linear model called Augmented Space Linear Model (ASLM), which uses the full joint space of input and desired signal as the projection space and approaches the performance of nonlinear models.
no code implementations • 1 Dec 2017 • Siyu Yu, Nanning Zheng, Yongqiang Ma, Hao Wu, Badong Chen
Analyzing the correlations of collected data from human brain activities and representing activity patterns are two problems in brain decoding based on functional magnetic resonance imaging (fMRI) signals.
no code implementations • 11 Oct 2017 • Badong Chen, Lei Xing, Nanning Zheng, Jose C. Príncipe
Comparing with traditional learning criteria, such as mean square error (MSE), the minimum error entropy (MEE) criterion is superior in nonlinear and non-Gaussian signal processing and machine learning.
no code implementations • 7 Oct 2017 • Sanping Zhou, Jinjun Wang, Deyu Meng, Xiaomeng Xin, Yubing Li, Yihong Gong, Nanning Zheng
In this paper, we propose a novel deep self-paced learning (DSPL) algorithm to alleviate this problem, in which we apply a self-paced constraint and symmetric regularization to help the relative distance metric training the deep neural network, so as to learn the stable and discriminative features for person Re-ID.
no code implementations • 18 Aug 2017 • Sanping Zhou, Jinjun Wang, Rui Shi, Qiqi Hou, Yihong Gong, Nanning Zheng
The class-identity term keeps the intra-class samples within each camera view gathering together, the relative distance term maximizes the distance between the intra-class class set and inter-class set across different camera views, and the regularization term smoothness the parameters of deep convolutional neural network (CNN).
no code implementations • 4 Aug 2017 • Jianji Wang, Nanning Zheng, Badong Chen, Jose C. Principe
Moreover, for a target vector, the ratio of the corresponding affine parameters in the MSE-based linear decomposition scheme and the SSIM-based scheme is a constant, which is just the value of PCC between the target vector and its estimated vector.
no code implementations • 25 Jul 2017 • De Cheng, Yihong Gong, Zhihui Li, Weiwei Shi, Alexander G. Hauptmann, Nanning Zheng
The proposed method can take full advantages of the structured distance relationships among these training samples, with the constructed complete graph.
no code implementations • CVPR 2017 • Zhanning Gao, Gang Hua, Dong-Qing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng
We develop a unified framework for complex event retrieval, recognition and recounting.
no code implementations • CVPR 2017 • Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, Nanning Zheng
One of the key issues for deep learning based person Re-ID is the selection of proper similarity comparison criteria, and the performance of learned features using existing criterion based on pairwise similarity is still limited, because only P2P distances are mostly considered.
no code implementations • 1 May 2017 • Ziyi Liu, Siyu Yu, Xiao Wang, Nanning Zheng
Experiments show that our unsupervised approach is efficient and robust for detecting drivable area for self-driving cars.
no code implementations • 31 Mar 2017 • Yudong Liang, Radu Timofte, Jinjun Wang, Yihong Gong, Nanning Zheng
The internal contents of the low resolution input image is neglected with deep modeling despite the earlier works showing the power of using such internal priors.
1 code implementation • ICCV 2017 • Pengfei Zhang, Cuiling Lan, Junliang Xing, Wen-Jun Zeng, Jianru Xue, Nanning Zheng
Rather than re-positioning the skeletons based on a human defined prior criterion, we design a view adaptive recurrent neural network (RNN) with LSTM architecture, which enables the network itself to adapt to the most suitable observation viewpoints from end to end.
Ranked #6 on Skeleton Based Action Recognition on SYSU 3D
no code implementations • 23 Mar 2017 • Yudong Liang, Ze Yang, Kai Zhang, Yihui He, Jinjun Wang, Nanning Zheng
To tackle with the second problem, a lightweight CNN architecture which has carefully designed width, depth and skip connections was proposed.
no code implementations • 18 Feb 2017 • Shitao Chen, Songyi Zhang, Jinghao Shang, Badong Chen, Nanning Zheng
Perception-driven approach and end-to-end system are two major vision-based frameworks for self-driving cars.
no code implementations • 21 Dec 2016 • Badong Chen, Lei Xing, Xin Wang, Jing Qin, Nanning Zheng
Correntropy is a second order statistical measure in kernel space, which has been successfully applied in robust learning and signal processing.
no code implementations • 1 Aug 2016 • Badong Chen, Lei Xing, Bin Xu, Haiquan Zhao, Nanning Zheng, Jose C. Principe
Nonlinear similarity measures defined in kernel space, such as correntropy, can extract higher-order statistics of data and offer potentially significant performance improvement over their linear counterparts especially in non-Gaussian signal processing and machine learning.
no code implementations • CVPR 2016 • Dapeng Chen, Zejian yuan, Badong Chen, Nanning Zheng
We therefore learn a novel similarity function, which consists of multiple sub-similarity measurements with each taking in charge of a subregion.
1 code implementation • CVPR 2016 • De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, Nanning Zheng
Person re-identification across cameras remains a very challenging problem, especially when there are no overlapping fields of view between cameras.
no code implementations • 5 Apr 2016 • Zhanning Gao, Gang Hua, Dongqing Zhang, Jianru Xue, Nanning Zheng
Event retrieval and recognition in a large corpus of videos necessitates a holistic fixed-size visual representation at the video clip level that is comprehensive, compact, and yet discriminative.
no code implementations • 4 Feb 2016 • Fei Zhu, Abderrahim Halimi, Paul Honeine, Badong Chen, Nanning Zheng
In hyperspectral images, some spectral bands suffer from low signal-to-noise ratio due to noisy acquisition and atmospheric effects, thus requiring robust techniques for the unmixing problem.
no code implementations • ICCV 2015 • Yuanliu liu, Zejian yuan, Badong Chen, Jianru Xue, Nanning Zheng
In this paper we address the problem of inferring the color composition of the intrinsic reflectance of objects, where the shadows and highlights may change the observed color dramatically.
no code implementations • ICCV 2015 • Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng
For the purpose, we divide the shape template into overlapped parts and model the matching through a part-based layered structure that uses the latent variable to constrain parts' deformation.
no code implementations • CVPR 2015 • Dapeng Chen, Zejian yuan, Gang Hua, Nanning Zheng, Jingdong Wang
We follow the learning-to-rank methodology and learn a similarity function to maximize the difference between the similarity scores of matched and unmatched images for a same person.
no code implementations • CVPR 2015 • Yuanliu Liu, Zejian yuan, Nanning Zheng, Yang Wu
Specular reflection generally decreases the saturation of surface colors, which will be possibly confused with other colors that have the same hue but lower saturation.
no code implementations • 12 Apr 2015 • Badong Chen, Lei Xing, Haiquan Zhao, Nanning Zheng, José C. Príncipe
In this work, we propose a generalized correntropy that adopts the generalized Gaussian density (GGD) function as the kernel (not necessarily a Mercer kernel), and present some important properties.
no code implementations • CVPR 2013 • Huaizu Jiang, Zejian yuan, Ming-Ming Cheng, Yihong Gong, Nanning Zheng, Jingdong Wang
Our method, which is based on multi-level image segmentation, utilizes the supervised learning approach to map the regional feature vector to a saliency score.
no code implementations • 23 Jan 2014 • Badong Chen, Junli Liang, Nanning Zheng, Jose C. Principe
Kernel adaptive filters (KAF) are a class of powerful nonlinear filters developed in Reproducing Kernel Hilbert Space (RKHS).