Search Results for author: Fuchun Sun

Found 61 papers, 26 papers with code

Multi-Agent Embodied Question Answering in Interactive Environments

no code implementations ECCV 2020 Sinan Tan, Weilai Xiang, Huaping Liu, Di Guo, Fuchun Sun

We investigate a new AI task --- Multi-Agent Interactive Question Answering --- where several agents explore the scene jointly in interactive environments to answer a question.

3D Reconstruction Embodied Question Answering +1

Introducing Expertise Logic into Graph Representation Learning from A Causal Perspective

no code implementations20 Jan 2023 Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Xingzhe Su, Fengge Wu, Changwen Zheng, Fuchun Sun

Following the theoretical guidance, we innovatively introduce the auxiliary causal logic learning paradigm to improve the model to learn the expertise logic causally related to the graph representation learning task.

Graph Representation Learning Knowledge Graphs

Timestamp-Supervised Action Segmentation in the Perspective of Clustering

no code implementations22 Dec 2022 Dazhao Du, Enhan Li, Lingyu Si, Fanjiang Xu, Fuchun Sun

We find the frames near the boundaries of action segments are in the transition region between two consecutive actions and have unclear semantics, which we call ambiguous intervals.

Action Segmentation Pseudo Label

Reasoning over Different Types of Knowledge Graphs: Static, Temporal and Multi-Modal

1 code implementation12 Dec 2022 Ke Liang, Lingyuan Meng, Meng Liu, Yue Liu, Wenxuan Tu, Siwei Wang, Sihang Zhou, Xinwang Liu, Fuchun Sun

The early works in this domain mainly focus on static KGR and tend to directly apply general knowledge graph embedding models to the reasoning task.

General Knowledge Knowledge Graph Embedding +3

Pay Self-Attention to Audio-Visual Navigation

no code implementations4 Oct 2022 Yinfeng Yu, Lele Cao, Fuchun Sun, Xiaohong Liu, Liejun Wang

Audio-visual embodied navigation, as a hot research topic, aims training a robot to reach an audio target using egocentric visual (from the sensors mounted on the robot) and audio (emitted from the target) input.

Visual Navigation

Bridged Transformer for Vision and Point Cloud 3D Object Detection

no code implementations CVPR 2022 Yikai Wang, TengQi Ye, Lele Cao, Wenbing Huang, Fuchun Sun, Fengxiang He, DaCheng Tao

Recently, there is a trend of leveraging multiple sources of input data, such as complementing the 3D point cloud with 2D images that often have richer color and fewer noises.

3D Object Detection object-detection

Disentangle and Remerge: Interventional Knowledge Distillation for Few-Shot Object Detection from A Conditional Causal Perspective

no code implementations26 Aug 2022 Jiangmeng Li, Yanan Zhang, Wenwen Qiang, Lingyu Si, Chengbo Jiao, Xiaohui Hu, Changwen Zheng, Fuchun Sun

To understand the reasons behind this phenomenon, we revisit the learning paradigm of knowledge distillation on the few-shot object detection task from the causal theoretic standpoint, and accordingly, develop a Structural Causal Model.

Few-Shot Learning Few-Shot Object Detection +2

Robust Causal Graph Representation Learning against Confounding Effects

no code implementations18 Aug 2022 Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Bing Xu, Changwen Zheng, Fuchun Sun

This observation reveals that there exist confounders in graphs, which may interfere with the model learning semantic information, and current graph representation learning methods have not eliminated their influence.

Graph Representation Learning

Similarity-aware Positive Instance Sampling for Graph Contrastive Pre-training

no code implementations NeurIPS 2021 Xueyi Liu, Yu Rong, Tingyang Xu, Fuchun Sun, Wenbing Huang, Junzhou Huang

To remedy this issue, we propose to select positive graph instances directly from existing graphs in the training set, which ultimately maintains the legality and similarity to the target graphs.

Contrastive Learning Graph Classification +1

SNAKE: Shape-aware Neural 3D Keypoint Field

1 code implementation3 Jun 2022 Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang

Detecting 3D keypoints from point clouds is important for shape reconstruction, while this work investigates the dual question: can shape reconstruction benefit 3D keypoint detection?

Keypoint Detection

Multimodal Token Fusion for Vision Transformers

4 code implementations CVPR 2022 Yikai Wang, Xinghao Chen, Lele Cao, Wenbing Huang, Fuchun Sun, Yunhe Wang

Many adaptations of transformers have emerged to address the single-modal vision tasks, where self-attention modules are stacked to handle input sources like images.

3D Object Detection Image-to-Image Translation +2

Smoothing Matters: Momentum Transformer for Domain Adaptive Semantic Segmentation

1 code implementation15 Mar 2022 Runfa Chen, Yu Rong, Shangmin Guo, Jiaqi Han, Fuchun Sun, Tingyang Xu, Wenbing Huang

After the great success of Vision Transformer variants (ViTs) in computer vision, it has also demonstrated great potential in domain adaptive semantic segmentation.

Pseudo Label Synthetic-to-Real Translation +1

Equivariant Graph Mechanics Networks with Constraints

1 code implementation12 Mar 2022 Wenbing Huang, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

The core of GMN is that it represents, by generalized coordinates, the forward kinematics information (positions and velocities) of a structural object.

Sound Adversarial Audio-Visual Navigation

1 code implementation ICLR 2022 Yinfeng Yu, Wenbing Huang, Fuchun Sun, Changan Chen, Yikai Wang, Xiaohong Liu

In this work, we design an acoustically complex environment in which, besides the target sound, there exists a sound attacker playing a zero-sum game with the agent.

Navigate Visual Navigation

Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation

no code implementations26 Jan 2022 Sinan Tan, Mengmeng Ge, Di Guo, Huaping Liu, Fuchun Sun

In the Vision-and-Language Navigation task, the embodied agent follows linguistic instructions and navigates to a specific goal.

Representation Learning Test unseen +1

Bootstrapping Informative Graph Augmentation via A Meta Learning Approach

1 code implementation11 Jan 2022 Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Fuchun Sun, Changwen Zheng

To this end, we propose a novel approach to learning a graph augmenter that can generate an augmentation with uniformity and informativeness.

Contrastive Learning Informativeness +2

Channel Exchanging Networks for Multimodal and Multitask Dense Image Prediction

1 code implementation4 Dec 2021 Yikai Wang, Fuchun Sun, Wenbing Huang, Fengxiang He, DaCheng Tao

For the application of dense image prediction, the validity of CEN is tested by four different scenarios: multimodal fusion, cycle multimodal fusion, multitask learning, and multimodal multitask learning.

Semantic Segmentation

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

1 code implementation ICCV 2021 Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao

In the low-bit quantization field, training Binary Neural Networks (BNNs) is the extreme solution to ease the deployment of deep models on resource-constrained devices, having the lowest storage cost and significantly cheaper bit-wise operations compared to 32-bit floating-point counterparts.

Quantization

Constrained Graph Mechanics Networks

no code implementations ICLR 2022 Wenbing Huang, Jiaqi Han, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

In this manner, the geometrical constraints are implicitly and naturally encoded in the forward kinematics.

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

no code implementations20 Sep 2021 Xinzhu Liu, Di Guo, Huaping Liu, Fuchun Sun

In this paper, we propose the multi-agent visual semantic navigation, in which multiple agents collaborate with others to find multiple target objects.

Efficient Exploration

Elastic Tactile Simulation Towards Tactile-Visual Perception

1 code implementation11 Aug 2021 Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun, Chang Li

By contrast, EIP models the tactile sensor as a group of coordinated particles, and the elastic property is applied to regulate the deformation of particles during contact.

Adversarial Option-Aware Hierarchical Imitation Learning

1 code implementation10 Jun 2021 Mingxuan Jing, Wenbing Huang, Fuchun Sun, Xiaojian Ma, Tao Kong, Chuang Gan, Lei LI

In particular, we propose an Expectation-Maximization(EM)-style algorithm: an E-step that samples the options of expert conditioned on the current learned policy, and an M-step that updates the low- and high-level policies of agent simultaneously to minimize the newly proposed option-occupancy measurement between the expert and the agent.

Imitation Learning

Elastic Interaction of Particles for Robotic Tactile Simulation

no code implementations23 Nov 2020 Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun

At its core, EIP models the tactile sensor as a group of coordinated particles, and the elastic theory is applied to regulate the deformation of particles during the contact process.

Deep Multimodal Fusion by Channel Exchanging

1 code implementation NeurIPS 2020 Yikai Wang, Wenbing Huang, Fuchun Sun, Tingyang Xu, Yu Rong, Junzhou Huang

Deep multimodal fusion by using multiple sources of data for classification or regression has exhibited a clear advantage over the unimodal counterpart on various applications.

Image-to-Image Translation Semantic Segmentation +1

Unsupervised Representation Learning by InvariancePropagation

1 code implementation7 Oct 2020 Feng Wang, Huaping Liu, Di Guo, Fuchun Sun

In this paper, we propose Invariance Propagation to focus on learning representations invariant to category-level variations, which are provided by different instances from the same category.

Contrastive Learning Representation Learning +1

Tackling Over-Smoothing for General Graph Convolutional Networks

no code implementations22 Aug 2020 Wenbing Huang, Yu Rong, Tingyang Xu, Fuchun Sun, Junzhou Huang

Increasing the depth of GCN, which is expected to permit more expressivity, is shown to incur performance detriment especially on node classification.

Node Classification

A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning

2 code implementations ECCV 2020 Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng

Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.

Generalized Zero-Shot Learning

Resolution Switchable Networks for Runtime Efficient Image Recognition

1 code implementation ECCV 2020 Yikai Wang, Fuchun Sun, Duo Li, Anbang Yao

We propose a general method to train a single convolutional neural network which is capable of switching image resolutions at inference.

Knowledge Distillation Quantization

Towards Embodied Scene Description

no code implementations30 Apr 2020 Sinan Tan, Huaping Liu, Di Guo, Xin-Yu Zhang, Fuchun Sun

Embodiment is an important characteristic for all intelligent agents (creatures and robots), while existing scene description tasks mainly focus on analyzing images passively and the semantic understanding of the scenario is separated from the interaction between the agent and the environment.

Imitation Learning reinforcement-learning +1

A Mobile Robot Hand-Arm Teleoperation System by Vision and IMU

1 code implementation11 Mar 2020 Shuang Li, Jiaxi Jiang, Philipp Ruppel, Hongzhuo Liang, Xiaojian Ma, Norman Hendrich, Fuchun Sun, Jianwei Zhang

In this paper, we present a multimodal mobile teleoperation system that consists of a novel vision-based hand pose regression network (Transteleop) and an IMU-based arm tracking method.

Anatomy Image-to-Image Translation +1

MQA: Answering the Question via Robotic Manipulation

1 code implementation10 Mar 2020 Yuhong Deng, Di Guo, Xiaofeng Guo, Naifu Zhang, Huaping Liu, Fuchun Sun

In this paper, we propose a novel task, Manipulation Question Answering (MQA), where the robot performs manipulation actions to change the environment in order to answer a given question.

Imitation Learning Question Answering +1

Robust Robotic Pouring using Audition and Haptics

1 code implementation29 Feb 2020 Hongzhuo Liang, Chuangchuang Zhou, Shuang Li, Xiaojian Ma, Norman Hendrich, Timo Gerkmann, Fuchun Sun, Marcus Stoffel, Jianwei Zhang

Both network training results and robot experiments demonstrate that MP-Net is robust against noise and changes to the task and environment.

Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation

2 code implementations CVPR 2020 Runfa Chen, Wenbing Huang, Binghui Huang, Fuchun Sun, Bin Fang

The proposed architecture, termed as NICE-GAN, exhibits two advantageous patterns over previous approaches: First, it is more compact since no independent encoding component is required; Second, this plug-in encoder is directly trained by the adversary loss, making it more informative and trained more effectively if a multi-scale discriminator is applied.

Translation Unsupervised Image-To-Image Translation

FoveaBox: Beyound Anchor-based Object Detection

no code implementations ICLR 2020 Tao Kong, Fuchun Sun, Huaping Liu, Yuning Jiang, Lei LI, Jianbo Shi

While almost all state-of-the-art object detectors utilize predefined anchors to enumerate possible locations, scales and aspect ratios for the search of the objects, their performance and generalization ability are also limited to the design of anchors.

object-detection Object Detection

Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance

no code implementations16 Nov 2019 Mingxuan Jing, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Chao Yang, Bin Fang, Huaping Liu

In this paper, we study Reinforcement Learning from Demonstrations (RLfD) that improves the exploration efficiency of Reinforcement Learning (RL) by providing expert demonstrations.

reinforcement-learning reinforcement Learning

Attention-based Transfer Learning for Brain-computer Interface

no code implementations25 Apr 2019 Chuanqi Tan, Fuchun Sun, Tao Kong, Bin Fang, Wenchang Zhang

Different functional areas of the human brain play different roles in brain activity, which has not been paid sufficient research attention in the brain-computer interface (BCI) field.

Classification EEG +2

FoveaBox: Beyond Anchor-based Object Detector

6 code implementations8 Apr 2019 Tao Kong, Fuchun Sun, Huaping Liu, Yuning Jiang, Lei LI, Jianbo Shi

In FoveaBox, an instance is assigned to adjacent feature levels to make the model more accurate. We demonstrate its effectiveness on standard benchmarks and report extensive experimental analysis.

Ranked #93 on Object Detection on COCO test-dev (APM metric)

object-detection Object Detection

Vision-based Teleoperation of Shadow Dexterous Hand using End-to-End Deep Neural Network

4 code implementations17 Sep 2018 Shuang Li, Xiaojian Ma, Hongzhuo Liang, Michael Görner, Philipp Ruppel, Bing Fang, Fuchun Sun, Jianwei Zhang

In this paper, we present TeachNet, a novel neural network architecture for intuitive and markerless vision-based teleoperation of dexterous robotic hands.

Robotics

PointNetGPD: Detecting Grasp Configurations from Point Sets

4 code implementations17 Sep 2018 Hongzhuo Liang, Xiaojian Ma, Shuang Li, Michael Görner, Song Tang, Bin Fang, Fuchun Sun, Jianwei Zhang

In this paper, we propose an end-to-end grasp evaluation model to address the challenging problem of localizing robot grasp configurations directly from the point cloud.

Robotics

Deep Feature Pyramid Reconfiguration for Object Detection

no code implementations ECCV 2018 Tao Kong, Fuchun Sun, Wenbing Huang, Huaping Liu

In this paper, we begin by investigating current feature pyramids solutions, and then reformulate the feature pyramid construction as the feature reconfiguration process.

object-detection Object Detection

Deep Transfer Learning for EEG-based Brain Computer Interface

no code implementations6 Aug 2018 Chuanqi Tan, Fuchun Sun, Wenchang Zhang

First, we model cognitive events based on EEG data by characterizing the data using EEG optical flow, which is designed to preserve multimodal EEG information in a uniform representation.

EEG Optical Flow Estimation +1

A Survey on Deep Transfer Learning

no code implementations6 Aug 2018 Chuanqi Tan, Fuchun Sun, Tao Kong, Wenchang Zhang, Chao Yang, Chunfang Liu

As a new classification platform, deep learning has recently received increasing attention from researchers and has been successfully applied to many domains.

General Classification Transfer Learning

Multimodal Classification with Deep Convolutional-Recurrent Neural Networks for Electroencephalography

no code implementations24 Jul 2018 Chuanqi Tan, Fuchun Sun, Wenchang Zhang, Jianhua Chen, Chunfang Liu

Herein, we propose a novel approach to modeling cognitive events from EEG data by reducing it to a video classification problem, which is designed to preserve the multimodal information of EEG.

Classification EEG +3

Learning and Inferring Movement with Deep Generative Model

no code implementations18 May 2018 Mingxuan Jing, Xiaojian Ma, Fuchun Sun, Huaping Liu

Learning and inference movement is a very challenging problem due to its high dimensionality and dependency to varied environments or tasks.

Motion Planning

Task Transfer by Preference-Based Cost Learning

no code implementations12 May 2018 Mingxuan Jing, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu

The goal of task transfer in reinforcement learning is migrating the action policy of an agent to the target task from the source task.

MAT: A Multimodal Attentive Translator for Image Captioning

no code implementations18 Feb 2017 Chang Liu, Fuchun Sun, Changhu Wang, Feng Wang, Alan Yuille

In this way, the sequential representation of an image can be naturally translated to a sequence of words, as the target sequence of the RNN model.

Image Captioning Machine Translation +1

Analyzing Linear Dynamical Systems: From Modeling to Coding and Learning

1 code implementation3 Aug 2016 Wenbing Huang, Fuchun Sun, Lele Cao, Mehrtash Harandi

We then devise efficient algorithms to perform sparse coding and dictionary learning on the space of infinite-dimensional subspaces.

Dictionary Learning General Classification +4

Sparse Coding and Dictionary Learning With Linear Dynamical Systems

no code implementations CVPR 2016 Wenbing Huang, Fuchun Sun, Lele Cao, Deli Zhao, Huaping Liu, Mehrtash Harandi

To enhance the performance of LDSs, in this paper, we address the challenging issue of performing sparse coding on the space of LDSs, where both data and dictionary atoms are LDSs.

Dictionary Learning Video Classification

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

no code implementations CVPR 2016 Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun

Almost all of the current top-performing object detection networks employ region proposals to guide the search for object instances.

object-detection Object Detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.