Search Results for author: Fuchun Sun

Found 87 papers, 38 papers with code

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

no code implementations • CVPR 2016 • Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun

Almost all of the current top-performing object detection networks employ region proposals to guide the search for object instances.

Object object-detection +2

Paper
Add Code

Sparse Coding and Dictionary Learning With Linear Dynamical Systems

no code implementations • CVPR 2016 • Wenbing Huang, Fuchun Sun, Lele Cao, Deli Zhao, Huaping Liu, Mehrtash Harandi

To enhance the performance of LDSs, in this paper, we address the challenging issue of performing sparse coding on the space of LDSs, where both data and dictionary atoms are LDSs.

Dictionary Learning Video Classification

Paper
Add Code

Analyzing Linear Dynamical Systems: From Modeling to Coding and Learning

no code implementations • 3 Aug 2016 • Wenbing Huang, Fuchun Sun, Lele Cao, Mehrtash Harandi

We then devise efficient algorithms to perform sparse coding and dictionary learning on the space of infinite-dimensional subspaces.

Dictionary Learning General Classification +5

Paper
Add Code

MAT: A Multimodal Attentive Translator for Image Captioning

no code implementations • 18 Feb 2017 • Chang Liu, Fuchun Sun, Changhu Wang, Feng Wang, Alan Yuille

In this way, the sequential representation of an image can be naturally translated to a sequence of words, as the target sequence of the RNN model.

Caption Generation Image Captioning +2

Paper
Add Code

RON: Reverse Connection with Objectness Prior Networks for Object Detection

1 code implementation • CVPR 2017 • Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, Ming Lu, Yurong Chen

To address (a), we design the reverse connection, which enables the network to detect objects on multi-levels of CNNs.

Object object-detection +2

355

Paper
Code

Efficient Optimization for Linear Dynamical Systems with Applications to Clustering and Sparse Coding

no code implementations • NeurIPS 2017 • Wenbing Huang, Mehrtash Harandi, Tong Zhang, Lijie Fan, Fuchun Sun, Junzhou Huang

Linear Dynamical Systems (LDSs) are fundamental tools for modeling spatio-temporal data in various disciplines.

Clustering

Paper
Add Code

Task Transfer by Preference-Based Cost Learning

no code implementations • 12 May 2018 • Mingxuan Jing, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu

The goal of task transfer in reinforcement learning is migrating the action policy of an agent to the target task from the source task.

Paper
Add Code

Learning and Inferring Movement with Deep Generative Model

no code implementations • 18 May 2018 • Mingxuan Jing, Xiaojian Ma, Fuchun Sun, Huaping Liu

Learning and inference movement is a very challenging problem due to its high dimensionality and dependency to varied environments or tasks.

Motion Planning

Paper
Add Code

Multimodal Classification with Deep Convolutional-Recurrent Neural Networks for Electroencephalography

no code implementations • 24 Jul 2018 • Chuanqi Tan, Fuchun Sun, Wenchang Zhang, Jianhua Chen, Chunfang Liu

Herein, we propose a novel approach to modeling cognitive events from EEG data by reducing it to a video classification problem, which is designed to preserve the multimodal information of EEG.

Brain Computer Interface Classification +4

Paper
Add Code

A Survey on Deep Transfer Learning

no code implementations • 6 Aug 2018 • Chuanqi Tan, Fuchun Sun, Tao Kong, Wenchang Zhang, Chao Yang, Chunfang Liu

As a new classification platform, deep learning has recently received increasing attention from researchers and has been successfully applied to many domains.

General Classification Transfer Learning

Paper
Add Code

Deep Transfer Learning for EEG-based Brain Computer Interface

no code implementations • 6 Aug 2018 • Chuanqi Tan, Fuchun Sun, Wenchang Zhang

First, we model cognitive events based on EEG data by characterizing the data using EEG optical flow, which is designed to preserve multimodal EEG information in a uniform representation.

Brain Computer Interface EEG +2

Paper
Add Code

Deep Feature Pyramid Reconfiguration for Object Detection

no code implementations • ECCV 2018 • Tao Kong, Fuchun Sun, Wenbing Huang, Huaping Liu

In this paper, we begin by investigating current feature pyramids solutions, and then reformulate the feature pyramid construction as the feature reconfiguration process.

Object object-detection +1

Paper
Add Code

Vision-based Teleoperation of Shadow Dexterous Hand using End-to-End Deep Neural Network

4 code implementations • 17 Sep 2018 • Shuang Li, Xiaojian Ma, Hongzhuo Liang, Michael Görner, Philipp Ruppel, Bing Fang, Fuchun Sun, Jianwei Zhang

In this paper, we present TeachNet, a novel neural network architecture for intuitive and markerless vision-based teleoperation of dexterous robotic hands.

Robotics

Paper
Code

PointNetGPD: Detecting Grasp Configurations from Point Sets

4 code implementations • 17 Sep 2018 • Hongzhuo Liang, Xiaojian Ma, Shuang Li, Michael Görner, Song Tang, Bin Fang, Fuchun Sun, Jianwei Zhang

In this paper, we propose an end-to-end grasp evaluation model to address the challenging problem of localizing robot grasp configurations directly from the point cloud.

Robotics

301

Paper
Code

Consistent Optimization for Single-Shot Object Detection

no code implementations • 19 Jan 2019 • Tao Kong, Fuchun Sun, Huaping Liu, Yuning Jiang, Jianbo Shi

We present consistent optimization for single stage object detection.

Object object-detection +1

Paper
Add Code

FoveaBox: Beyond Anchor-based Object Detector

7 code implementations • 8 Apr 2019 • Tao Kong, Fuchun Sun, Huaping Liu, Yuning Jiang, Lei LI, Jianbo Shi

In FoveaBox, an instance is assigned to adjacent feature levels to make the model more accurate. We demonstrate its effectiveness on standard benchmarks and report extensive experimental analysis.

Ranked #82 on Object Detection on COCO test-dev (APM metric)

Object object-detection +1

27,765

Paper
Code

Attention-based Transfer Learning for Brain-computer Interface

no code implementations • 25 Apr 2019 • Chuanqi Tan, Fuchun Sun, Tao Kong, Bin Fang, Wenchang Zhang

Different functional areas of the human brain play different roles in brain activity, which has not been paid sufficient research attention in the brain-computer interface (BCI) field.

Brain Computer Interface Classification +3

Paper
Add Code

Deep Point-wise Prediction for Action Temporal Proposal

1 code implementation • 17 Sep 2019 • Luxuan Li, Tao Kong, Fuchun Sun, Huaping Liu

Detecting actions in videos is an important yet challenging task.

Action Recognition Temporal Action Proposal Generation

Paper
Code

Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement

no code implementations • NeurIPS 2019 • Chao Yang, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu, Junzhou Huang, Chuang Gan

This paper studies Learning from Observations (LfO) for imitation learning with access to state-only demonstrations.

Imitation Learning

Paper
Add Code

Regularized Adversarial Sampling and Deep Time-aware Attention for Click-Through Rate Prediction

no code implementations • 3 Nov 2019 • Yikai Wang, Liang Zhang, Quanyu Dai, Fuchun Sun, Bo Zhang, Yang He, Weipeng Yan, Yongjun Bao

In deep CTR models, exploiting users' historical data is essential for learning users' behaviors and interests.

Click-Through Rate Prediction

Paper
Add Code

Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance

no code implementations • 16 Nov 2019 • Mingxuan Jing, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Chao Yang, Bin Fang, Huaping Liu

In this paper, we study Reinforcement Learning from Demonstrations (RLfD) that improves the exploration efficiency of Reinforcement Learning (RL) by providing expert demonstrations.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

FoveaBox: Beyound Anchor-based Object Detection

no code implementations • ICLR 2020 • Tao Kong, Fuchun Sun, Huaping Liu, Yuning Jiang, Lei LI, Jianbo Shi

While almost all state-of-the-art object detectors utilize predefined anchors to enumerate possible locations, scales and aspect ratios for the search of the objects, their performance and generalization ability are also limited to the design of anchors.

Object object-detection +1

Paper
Add Code

Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation

2 code implementations • CVPR 2020 • Runfa Chen, Wenbing Huang, Binghui Huang, Fuchun Sun, Bin Fang

The proposed architecture, termed as NICE-GAN, exhibits two advantageous patterns over previous approaches: First, it is more compact since no independent encoding component is required; Second, this plug-in encoder is directly trained by the adversary loss, making it more informative and trained more effectively if a multi-scale discriminator is applied.

Translation Unsupervised Image-To-Image Translation

232

Paper
Code

Robust Robotic Pouring using Audition and Haptics

1 code implementation • 29 Feb 2020 • Hongzhuo Liang, Chuangchuang Zhou, Shuang Li, Xiaojian Ma, Norman Hendrich, Timo Gerkmann, Fuchun Sun, Marcus Stoffel, Jianwei Zhang

Both network training results and robot experiments demonstrate that MP-Net is robust against noise and changes to the task and environment.

Paper
Code

MQA: Answering the Question via Robotic Manipulation

1 code implementation • 10 Mar 2020 • Yuhong Deng, Di Guo, Xiaofeng Guo, Naifu Zhang, Huaping Liu, Fuchun Sun

In this paper, we propose a novel task, Manipulation Question Answering (MQA), where the robot performs manipulation actions to change the environment in order to answer a given question.

Imitation Learning Question Answering +1

Paper
Code

A Mobile Robot Hand-Arm Teleoperation System by Vision and IMU

1 code implementation • 11 Mar 2020 • Shuang Li, Jiaxi Jiang, Philipp Ruppel, Hongzhuo Liang, Xiaojian Ma, Norman Hendrich, Fuchun Sun, Jianwei Zhang

In this paper, we present a multimodal mobile teleoperation system that consists of a novel vision-based hand pose regression network (Transteleop) and an IMU-based arm tracking method.

Anatomy Image-to-Image Translation +1

Paper
Code

Towards Embodied Scene Description

no code implementations • 30 Apr 2020 • Sinan Tan, Huaping Liu, Di Guo, Xin-Yu Zhang, Fuchun Sun

Embodiment is an important characteristic for all intelligent agents (creatures and robots), while existing scene description tasks mainly focus on analyzing images passively and the semantic understanding of the scenario is separated from the interaction between the agent and the environment.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Resolution Switchable Networks for Runtime Efficient Image Recognition

1 code implementation • ECCV 2020 • Yikai Wang, Fuchun Sun, Duo Li, Anbang Yao

We propose a general method to train a single convolutional neural network which is capable of switching image resolutions at inference.

Knowledge Distillation Quantization

Paper
Code

A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning

2 code implementations • ECCV 2020 • Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng

Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.

Generalized Zero-Shot Learning

Paper
Code

Tackling Over-Smoothing for General Graph Convolutional Networks

no code implementations • 22 Aug 2020 • Wenbing Huang, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

Increasing the depth of GCN, which is expected to permit more expressivity, is shown to incur performance detriment especially on node classification.

Node Classification

Paper
Add Code

Unsupervised Representation Learning by InvariancePropagation

1 code implementation • 7 Oct 2020 • Feng Wang, Huaping Liu, Di Guo, Fuchun Sun

In this paper, we propose Invariance Propagation to focus on learning representations invariant to category-level variations, which are provided by different instances from the same category.

Contrastive Learning Representation Learning +1

Paper
Code

Deep Multimodal Fusion by Channel Exchanging

1 code implementation • NeurIPS 2020 • Yikai Wang, Wenbing Huang, Fuchun Sun, Tingyang Xu, Yu Rong, Junzhou Huang

Deep multimodal fusion by using multiple sources of data for classification or regression has exhibited a clear advantage over the unimodal counterpart on various applications.

Image-to-Image Translation Semantic Segmentation +1

276

Paper
Code

Fault-Aware Robust Control via Adversarial Reinforcement Learning

no code implementations • 17 Nov 2020 • Fan Yang, Chao Yang, Di Guo, Huaping Liu, Fuchun Sun

Robots have limited adaptation ability compared to humans and animals in the case of damage.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Elastic Interaction of Particles for Robotic Tactile Simulation

no code implementations • 23 Nov 2020 • Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun

At its core, EIP models the tactile sensor as a group of coordinated particles, and the elastic theory is applied to regulate the deformation of particles during the contact process.

Paper
Add Code

Adversarial Option-Aware Hierarchical Imitation Learning

1 code implementation • 10 Jun 2021 • Mingxuan Jing, Wenbing Huang, Fuchun Sun, Xiaojian Ma, Tao Kong, Chuang Gan, Lei LI

In particular, we propose an Expectation-Maximization(EM)-style algorithm: an E-step that samples the options of expert conditioned on the current learned policy, and an M-step that updates the low- and high-level policies of agent simultaneously to minimize the newly proposed option-occupancy measurement between the expert and the agent.

Imitation Learning

Paper
Code

Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion

1 code implementation • 11 Aug 2021 • Yikai Wang, Fuchun Sun, Ming Lu, Anbang Yao

We propose a compact and effective framework to fuse multimodal features at multiple layers in a single network.

Ranked #42 on Semantic Segmentation on NYU Depth v2

Representation Learning Semantic Segmentation +1

Paper
Code

Elastic Tactile Simulation Towards Tactile-Visual Perception

2 code implementations • 11 Aug 2021 • Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun, Chang Li

By contrast, EIP models the tactile sensor as a group of coordinated particles, and the elastic property is applied to regulate the deformation of particles during contact.

Paper
Code

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

no code implementations • 20 Sep 2021 • Xinzhu Liu, Di Guo, Huaping Liu, Fuchun Sun

In this paper, we propose the multi-agent visual semantic navigation, in which multiple agents collaborate with others to find multiple target objects.

Efficient Exploration

Paper
Add Code

Constrained Graph Mechanics Networks

no code implementations • ICLR 2022 • Wenbing Huang, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

In this manner, the geometrical constraints are implicitly and naturally encoded in the forward kinematics.

Paper
Add Code

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

1 code implementation • ICCV 2021 • Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao

In the low-bit quantization field, training Binary Neural Networks (BNNs) is the extreme solution to ease the deployment of deep models on resource-constrained devices, having the lowest storage cost and significantly cheaper bit-wise operations compared to 32-bit floating-point counterparts.

Quantization

Paper
Code

Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction

1 code implementation • 4 Dec 2021 • Yikai Wang, Fuchun Sun, Wenbing Huang, Fengxiang He, DaCheng Tao

For the application of dense image prediction, the validity of CEN is tested by four different scenarios: multimodal fusion, cycle multimodal fusion, multitask learning, and multimodal multitask learning.

Ranked #7 on Semantic Segmentation on LLRGBD-synthetic

Semantic Segmentation

276

Paper
Code

Bootstrapping Informative Graph Augmentation via A Meta Learning Approach

1 code implementation • 11 Jan 2022 • Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Fuchun Sun, Changwen Zheng

To this end, we propose a novel approach to learning a graph augmenter that can generate an augmentation with uniformity and informativeness.

Contrastive Learning Informativeness +2

Paper
Code

Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation

no code implementations • 26 Jan 2022 • Sinan Tan, Mengmeng Ge, Di Guo, Huaping Liu, Fuchun Sun

In the Vision-and-Language Navigation task, the embodied agent follows linguistic instructions and navigates to a specific goal.

Representation Learning Test unseen +1

Paper
Add Code

Sim2Real Object-Centric Keypoint Detection and Description

no code implementations • 1 Feb 2022 • Chengliang Zhong, Chao Yang, Jinshan Qi, Fuchun Sun, Huaping Liu, Xiaodong Mu, Wenbing Huang

Keypoint detection and description play a central role in computer vision.

6D Pose Estimation Contrastive Learning +3

Paper
Add Code

Sound Adversarial Audio-Visual Navigation

1 code implementation • ICLR 2022 • Yinfeng Yu, Wenbing Huang, Fuchun Sun, Changan Chen, Yikai Wang, Xiaohong Liu

In this work, we design an acoustically complex environment in which, besides the target sound, there exists a sound attacker playing a zero-sum game with the agent.

Navigate Visual Navigation

Paper
Code

Adversarial Texture for Fooling Person Detectors in the Physical World

1 code implementation • CVPR 2022 • Zhanhao Hu, Siyuan Huang, Xiaopei Zhu, Fuchun Sun, Bo Zhang, Xiaolin Hu

Experiments showed that these clothes could fool person detectors in the physical world.

Paper
Code

Equivariant Graph Mechanics Networks with Constraints

1 code implementation • 12 Mar 2022 • Wenbing Huang, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

The core of GMN is that it represents, by generalized coordinates, the forward kinematics information (positions and velocities) of a structural object.

Paper
Code

Smoothing Matters: Momentum Transformer for Domain Adaptive Semantic Segmentation

1 code implementation • 15 Mar 2022 • Runfa Chen, Yu Rong, Shangmin Guo, Jiaqi Han, Fuchun Sun, Tingyang Xu, Wenbing Huang

After the great success of Vision Transformer variants (ViTs) in computer vision, it has also demonstrated great potential in domain adaptive semantic segmentation.

Ranked #7 on Semantic Segmentation on SYNTHIA-to-Cityscapes

Pseudo Label Segmentation +2

Paper
Code

Multimodal Token Fusion for Vision Transformers

11 code implementations • journal 2022 • Yikai Wang, Xinghao Chen, Lele Cao, Wenbing Huang, Fuchun Sun, Yunhe Wang

Many adaptations of transformers have emerged to address the single-modal vision tasks, where self-attention modules are stacked to handle input sources like images.

Ranked #1 on Semantic Segmentation on SUN-RGBD (using extra training data)

3D Object Detection Image-to-Image Translation +2

835

Paper
Code

SNAKE: Shape-aware Neural 3D Keypoint Field

1 code implementation • 3 Jun 2022 • Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang

Detecting 3D keypoints from point clouds is important for shape reconstruction, while this work investigates the dual question: can shape reconstruction benefit 3D keypoint detection?

Keypoint Detection

208

Paper
Code

Similarity-aware Positive Instance Sampling for Graph Contrastive Pre-training

no code implementations • NeurIPS 2021 • Xueyi Liu, Yu Rong, Tingyang Xu, Fuchun Sun, Wenbing Huang, Junzhou Huang

To remedy this issue, we propose to select positive graph instances directly from existing graphs in the training set, which ultimately maintains the legality and similarity to the target graphs.

Contrastive Learning Graph Classification +1

Paper
Add Code

Robust Causal Graph Representation Learning against Confounding Effects

1 code implementation • 18 Aug 2022 • Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Bing Xu, Changwen Zheng, Fuchun Sun

This observation reveals that there exist confounders in graphs, which may interfere with the model learning semantic information, and current graph representation learning methods have not eliminated their influence.

Graph Representation Learning

Paper
Code

Disentangle and Remerge: Interventional Knowledge Distillation for Few-Shot Object Detection from A Conditional Causal Perspective

1 code implementation • 26 Aug 2022 • Jiangmeng Li, Yanan Zhang, Wenwen Qiang, Lingyu Si, Chengbo Jiao, Xiaohui Hu, Changwen Zheng, Fuchun Sun

To understand the reasons behind this phenomenon, we revisit the learning paradigm of knowledge distillation on the few-shot object detection task from the causal theoretic standpoint, and accordingly, develop a Structural Causal Model.

Few-Shot Learning Few-Shot Object Detection +4

Paper
Code

Pay Self-Attention to Audio-Visual Navigation

no code implementations • 4 Oct 2022 • Yinfeng Yu, Lele Cao, Fuchun Sun, Xiaohong Liu, Liejun Wang

Audio-visual embodied navigation, as a hot research topic, aims training a robot to reach an audio target using egocentric visual (from the sensors mounted on the robot) and audio (emitted from the target) input.

Visual Navigation

Paper
Add Code

Bridged Transformer for Vision and Point Cloud 3D Object Detection

2 code implementations • CVPR 2022 • Yikai Wang, TengQi Ye, Lele Cao, Wenbing Huang, Fuchun Sun, Fengxiang He, DaCheng Tao

Recently, there is a trend of leveraging multiple sources of input data, such as complementing the 3D point cloud with 2D images that often have richer color and fewer noises.

3D Object Detection Object +1

Paper
Code

When to Update Your Model: Constrained Model-based Reinforcement Learning

1 code implementation • 15 Oct 2022 • Tianying Ji, Yu Luo, Fuchun Sun, Mingxuan Jing, Fengxiang He, Wenbing Huang

Our follow-up derived bounds reveal the relationship between model shifts and performance improvement.

Model-based Reinforcement Learning reinforcement-learning +1

Paper
Code

Recognizing Object by Components with Human Prior Knowledge Enhances Adversarial Robustness of Deep Neural Networks

1 code implementation • 4 Dec 2022 • Xiao Li, Ziqi Wang, Bo Zhang, Fuchun Sun, Xiaolin Hu

The first stage of ROCK corresponds to the process of decomposing objects into parts in human vision.

Adversarial Robustness Inductive Bias +2

Paper
Code

A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal

1 code implementation • 12 Dec 2022 • Ke Liang, Lingyuan Meng, Meng Liu, Yue Liu, Wenxuan Tu, Siwei Wang, Sihang Zhou, Xinwang Liu, Fuchun Sun

According to the graph types, existing KGR models can be roughly divided into three categories, i. e., static models, temporal models, and multi-modal models.

General Knowledge Knowledge Graph Embedding +3

955

Paper
Code

Timestamp-Supervised Action Segmentation from the Perspective of Clustering

1 code implementation • 22 Dec 2022 • Dazhao Du, Enhan Li, Lingyu Si, Fanjiang Xu, Fuchun Sun

Most existing methods generate pseudo-labels for all frames in each video to train the segmentation model.

Action Segmentation Clustering +2

Paper
Code

Introducing Expertise Logic into Graph Representation Learning from A Causal Perspective

no code implementations • 20 Jan 2023 • Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Xingzhe Su, Fengge Wu, Changwen Zheng, Fuchun Sun

By further observing the ramifications of introducing expertise logic into graph representation learning, we conclude that leading the GNNs to learn human expertise can improve the model performance.

Graph Representation Learning Knowledge Graphs

Paper
Add Code

A Survey on Causal Reinforcement Learning

no code implementations • 10 Feb 2023 • Yan Zeng, Ruichu Cai, Fuchun Sun, Libo Huang, Zhifeng Hao

While Reinforcement Learning (RL) achieves tremendous success in sequential decision-making problems of many domains, it still faces key challenges of data inefficiency and the lack of interpretability.

Decision Making reinforcement-learning +1

Paper
Add Code

Deep Reinforcement Learning for Robotic Pushing and Picking in Cluttered Environment

no code implementations • 21 Feb 2023 • Yuhong Deng, Xiaofeng Guo, Yixuan Wei, Kai Lu, Bin Fang, Di Guo, Huaping Liu, Fuchun Sun

A composite robotic hand composed of a suction cup and a gripper is designed for grasping the object stably.

Object reinforcement-learning +2

Paper
Add Code

Intriguing Property and Counterfactual Explanation of GAN for Remote Sensing Image Generation

no code implementations • 9 Mar 2023 • Xingzhe Su, Wenwen Qiang, Jie Hu, Fengge Wu, Changwen Zheng, Fuchun Sun

Based on this SCM, we theoretically prove that the quality of generated images is positively correlated with the amount of feature information.

counterfactual Counterfactual Explanation +1

Paper
Add Code

Compacting Binary Neural Networks by Sparse Kernel Selection

no code implementations • CVPR 2023 • Yikai Wang, Wenbing Huang, Yinpeng Dong, Fuchun Sun, Anbang Yao

Binary Neural Network (BNN) represents convolution weights with 1-bit values, which enhances the efficiency of storage and computation.

Binarization

Paper
Add Code

SE-Bridge: Speech Enhancement with Consistent Brownian Bridge

no code implementations • 23 May 2023 • Zhibin Qiu, Mengfan Fu, Fuchun Sun, Gulila Altenbek, Hao Huang

Our experiments on multiple datasets demonstrate the effectiveness of SE-Bridge in SE.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Subequivariant Graph Reinforcement Learning in 3D Environments

1 code implementation • 30 May 2023 • Runfa Chen, Jiaqi Han, Fuchun Sun, Wenbing Huang

Learning a shared policy that guides the locomotion of different agents is of core interest in Reinforcement Learning (RL), which leads to the study of morphology-agnostic RL.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Manifold Constraint Regularization for Remote Sensing Image Generation

no code implementations • 31 May 2023 • Xingzhe Su, Changwen Zheng, Wenwen Qiang, Fengge Wu, Junsuo Zhao, Fuchun Sun, Hui Xiong

This study identifies a previously overlooked issue: GANs exhibit a heightened susceptibility to overfitting on remote sensing images. To address this challenge, this paper analyzes the characteristics of remote sensing images and proposes manifold constraint regularization, a novel approach that tackles overfitting of GANs on remote sensing images for the first time.

Image Generation

Paper
Add Code

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

no code implementations • 5 Jun 2023 • Tianying Ji, Yu Luo, Fuchun Sun, Xianyuan Zhan, Jianwei Zhang, Huazhe Xu

Learning high-quality Q-value functions plays a key role in the success of many modern off-policy deep reinforcement learning (RL) algorithms.

Continuous Control Reinforcement Learning (RL)

Paper
Add Code

Structure-Aware DropEdge Towards Deep Graph Convolutional Networks

no code implementations • 21 Jun 2023 • Jiaqi Han, Wenbing Huang, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

Regarding the layer-dependent sampler, we interestingly find that increasingly sampling edges from the bottom layer yields superior performance than the decreasing counterpart as well as DropEdge.

Node Classification

Paper
Add Code

A Dimensional Structure based Knowledge Distillation Method for Cross-Modal Learning

no code implementations • 28 Jun 2023 • Lingyu Si, Hongwei Dong, Wenwen Qiang, Junzhi Yu, Wenlong Zhai, Changwen Zheng, Fanjiang Xu, Fuchun Sun

To address this issue, in this paper, we discover the correlation between feature discriminability and dimensional structure (DS) by analyzing and observing features extracted from simple and hard tasks.

Knowledge Distillation

Paper
Add Code

Towards Task Sampler Learning for Meta-Learning

1 code implementation • 18 Jul 2023 • Jingyao Wang, Wenwen Qiang, Xingzhe Su, Changwen Zheng, Fuchun Sun, Hui Xiong

We obtain three conclusions: (i) there is no universal task sampling strategy that can guarantee the optimal performance of meta-learning models; (ii) over-constraining task diversity may incur the risk of under-fitting or over-fitting during training; and (iii) the generalization performance of meta-learning models are affected by task diversity, task entropy, and task difficulty.

Few-Shot Learning General Knowledge

Paper
Code

Towards the Sparseness of Projection Head in Self-Supervised Learning

no code implementations • 18 Jul 2023 • Zeen Song, Xingzhe Su, Jingyao Wang, Wenwen Qiang, Changwen Zheng, Fuchun Sun

In recent years, self-supervised learning (SSL) has emerged as a promising approach for extracting valuable representations from unlabeled data.

Contrastive Learning Self-Supervised Learning

Paper
Add Code

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation

no code implementations • 16 Aug 2023 • Kai Li, Runxuan Yang, Fuchun Sun, Xiaolin Hu

Recent research has made significant progress in designing fusion modules for audio-visual speech separation.

Speech Separation

Paper
Add Code

Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos

no code implementations • ICCV 2023 • Yikai Wang, Yinpeng Dong, Fuchun Sun, Xiao Yang

The key idea of our method, Root Pose Decomposition (RPD), is to maintain a per-frame root pose transformation, meanwhile building a dense field with local transformations to rectify the root pose.

3D Reconstruction Object

Paper
Add Code

PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification

no code implementations • 5 Oct 2023 • Feihong He, Gang Li, Lingyu Si, Leilei Yan, Fanzhang Li, Fuchun Sun

In particular, our method achieves 97. 07% and 90. 88% on 5-way 5-shot and 5-way 1-shot tasks of miniImageNet, which surpasses the state-of-the-art results with accuracy of 7. 27% and 8. 72%, respectively.

Classification Contrastive Learning +2

Paper
Add Code

Measuring Acoustics with Collaborative Multiple Agents

no code implementations • 9 Oct 2023 • Yinfeng Yu, Changan Chen, Lele Cao, Fangkai Yang, Fuchun Sun

As humans, we hear sound every second of our life.

Paper
Add Code

AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation

no code implementations • 6 Dec 2023 • Xinzhou Wang, Yikai Wang, Junliang Ye, Zhengyi Wang, Fuchun Sun, Pengkun Liu, Ling Wang, Kai Sun, Xintong Wang, Bin He

Extensive experiments demonstrate the capability of our method in generating high-flexibility text-guided 3D models from the monocular video, while also showing improved reconstruction performance over existing non-rigid reconstruction methods.

3D Generation Denoising +1

Paper
Add Code

UIEDP:Underwater Image Enhancement with Diffusion Prior

no code implementations • 11 Dec 2023 • Dazhao Du, Enhan Li, Lingyu Si, Fanjiang Xu, Jianwei Niu, Fuchun Sun

To address this issue, we propose UIE with Diffusion Prior (UIEDP), a novel framework treating UIE as a posterior distribution sampling process of clear images conditioned on degraded underwater inputs.

Image Generation No-Reference Image Quality Assessment +1

Paper
Add Code

Hierarchical Topology Isomorphism Expertise Embedded Graph Contrastive Learning

1 code implementation • 21 Dec 2023 • Jiangmeng Li, Yifan Jin, Hang Gao, Wenwen Qiang, Changwen Zheng, Fuchun Sun

To this end, we propose a novel hierarchical topology isomorphism expertise embedded graph contrastive learning, which introduces knowledge distillations to empower GCL models to learn the hierarchical topology isomorphism expertise, including the graph-tier and subgraph-tier.

Contrastive Learning Graph Representation Learning +1

Paper
Code

BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction

1 code implementation • 25 Jan 2024 • Jiangmeng Li, Fei Song, Yifan Jin, Wenwen Qiang, Changwen Zheng, Fuchun Sun, Hui Xiong

From the perspective of distribution analyses, we disclose that the intrinsic issues behind the phenomenon are the over-multitudinous conceptual knowledge contained in PLMs and the abridged knowledge for target downstream domains, which jointly result in that PLMs mis-locate the knowledge distributions corresponding to the target domains in the universal knowledge embedding space.

Domain Adaptation

Paper
Code

Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy

no code implementations • 7 Feb 2024 • Ruichu Cai, Siyang Huang, Jie Qiao, Wei Chen, Yan Zeng, Keli Zhang, Fuchun Sun, Yang Yu, Zhifeng Hao

As a key component to intuitive cognition and reasoning solutions in human intelligence, causal knowledge provides great potential for reinforcement learning (RL) agents' interpretability towards decision-making by helping reduce the searching space.

Decision Making Reinforcement Learning (RL)

Paper
Add Code

ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

no code implementations • 22 Feb 2024 • Tianying Ji, Yongyuan Liang, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu

The varying significance of distinct primitive behaviors during the policy learning process has been overlooked by prior model-free RL algorithms.

Continuous Control Efficient Exploration

Paper
Add Code

Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning

no code implementations • 27 Feb 2024 • Zhaoxun Ju, Chao Yang, Hongbo Wang, Yu Qiao, Fuchun Sun

Language-conditioned robot behavior plays a vital role in executing complex tasks by associating human commands or instructions with perception and actions.

Imitation Learning Quantization

Paper
Add Code

Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding

1 code implementation • 15 Mar 2024 • Pengkun Liu, Yikai Wang, Fuchun Sun, Jiafang Li, Hang Xiao, Hongxiang Xue, Xinzhou Wang

As a result, with a single image CLIP embedding, Isotropic3D is capable of generating multi-view mutually consistent images and also a 3D model with more symmetrical and neat content, well-proportioned geometry, rich colored texture, and less distortion compared with existing image-to-3D methods while still preserving the similarity to the reference image to a large extent.

3D Generation Image to 3D +1

Paper
Code

Equivariant Local Reference Frames for Unsupervised Non-rigid Point Cloud Shape Correspondence

no code implementations • 1 Apr 2024 • Ling Wang, Runfa Chen, Yikai Wang, Fuchun Sun, Xinzhou Wang, Sun Kai, Guangyuan Fu, Jianwei Zhang, Wenbing Huang

Based on the assumption of local rigidity, one solution for reducing complexity is to decompose the overall shape into independent local regions using Local Reference Frames (LRFs) that are invariant to SE(3) transformations.

Paper
Add Code

Intriguing Properties of Positional Encoding in Time Series Forecasting

1 code implementation • 16 Apr 2024 • Jianqi Zhang, Jingyao Wang, Wenwen Qiang, Fanjiang Xu, Changwen Zheng, Fuchun Sun, Hui Xiong

Motivated by these findings, we introduce two new PEs: Temporal Position Encoding (T-PE) for temporal tokens and Variable Positional Encoding (V-PE) for variable tokens.

Time Series Time Series Forecasting

Paper
Code

Multi-Agent Embodied Question Answering in Interactive Environments

no code implementations • ECCV 2020 • Sinan Tan, Weilai Xiang, Huaping Liu, Di Guo, Fuchun Sun

We investigate a new AI task --- Multi-Agent Interactive Question Answering --- where several agents explore the scene jointly in interactive environments to answer a question.

3D Reconstruction Embodied Question Answering +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.