Search Results for author: Hao Dong

Found 135 papers, 59 papers with code

CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation

no code implementations CVPR 2025 Yuxing Long, Jiyao Zhang, Mingjie Pan, Tianshu Wu, Taewhan Kim, Hao Dong

Furthermore, we propose the first manual-based manipulation planning model ManualPlan to set up a group of baselines for the CheckManual benchmark.

Common Sense Reasoning Question Answering

BiAssemble: Learning Collaborative Affordance for Bimanual Geometric Assembly

no code implementations6 Jun 2025 Yan Shen, Ruihai Wu, Yubin Ke, Xinyuan Song, Zeyi Li, Xiaoqi Li, Hongwei Fan, Haoran Lu, Hao Dong

Shape assembly, the process of combining parts into a complete whole, is a crucial robotic skill with broad real-world applications.

Diversity

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping

no code implementations30 May 2025 Mingxu Zhang, Xiaoqi Li, Jiahui Xu, Kaichen Zhou, Hojin Bae, Yan Shen, Chuyan Xiong, Jiaming Liu, Hao Dong

To address this, leveraging the power of single view 3D object reconstruction approaches, we propose a training free framework SR3D that enables robotic grasping of transparent and specular objects from a single view observation.

3D Object Reconstruction 3D Reconstruction +3

To Trust Or Not To Trust Your Vision-Language Model's Prediction

1 code implementation29 May 2025 Hao Dong, Moru Liu, Jian Liang, Eleni Chatzi, Olga Fink

Vision-Language Models (VLMs) have demonstrated strong capabilities in aligning visual and textual modalities, enabling a wide range of applications in multimodal understanding and generation.

Transfer Learning

SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams

no code implementations26 May 2025 Zhuoheng Gao, Yihao Li, Jiyao Zhang, Rui Zhao, Tong Wu, Hao Tang, Zhaofei Yu, Hao Dong, Guozhang Chen, Tiejun Huang

To address this gap, we propose SpikeStereoNet, a brain-inspired framework and the first to estimate stereo depth directly from raw spike streams.

Stereo Depth Estimation

GCAL: Adapting Graph Models to Evolving Domain Shifts

1 code implementation22 May 2025 Ziyue Qiao, Qianyi Cai, Hao Dong, Jiawei Gu, Pengyang Wang, Meng Xiao, Xiao Luo, Hui Xiong

This paper addresses the challenge of graph domain adaptation on evolving, multiple out-of-distribution (OOD) graphs.

Bilevel Optimization Domain Adaptation +2

Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

2 code implementations22 May 2025 Moru Liu, Hao Dong, Jessica Kelly, Olga Fink, Mario Trapp

Out-of-distribution (OOD) detection and segmentation are crucial for deploying machine learning models in safety-critical applications such as autonomous driving and robot-assisted surgery.

Autonomous Driving Out-of-Distribution Detection +1

Disentangled Multi-span Evolutionary Network against Temporal Knowledge Graph Reasoning

no code implementations20 May 2025 Hao Dong, Ziyue Qiao, Zhiyuan Ning, Qi Hao, Yi Du, Pengyang Wang, Yuanchun Zhou

However, they still have limitations: 1) In modeling subgraph semantic evolution, they usually neglect the internal structural interactions between subgraphs, which are actually crucial for encoding TKGs.

Knowledge Graphs

DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy

1 code implementation16 May 2025 Yuran Wang, Ruihai Wu, Yue Chen, Jiarui Wang, Jiaqi Liang, Ziyu Zhu, Haoran Geng, Jitendra Malik, Pieter Abbeel, Hao Dong

To improve generalization across diverse garment shapes and deformations, we propose a Hierarchical gArment-manipuLation pOlicy (HALO).

Reinforcement Learning (RL)

Improving Unsupervised Task-driven Models of Ventral Visual Stream via Relative Position Predictivity

1 code implementation13 May 2025 Dazhong Rong, Hao Dong, Xing Gao, Jiyu Wei, Di Hong, Yaoyao Hao, Qinming He, Yueming Wang

Based on the concept that ventral visual stream (VVS) mainly functions for object recognition, current unsupervised task-driven methods model VVS by contrastive learning, and have achieved good brain similarity.

Contrastive Learning Object +3

3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment

no code implementations3 May 2025 Xiaoqi Li, Jiaming Liu, Nuowei Han, Liang Heng, Yandong Guo, Hao Dong, Yang Liu

The 3D weakly-supervised visual grounding task aims to localize oriented 3D boxes in point clouds based on natural language descriptions without requiring annotations to guide model learning.

Sentence Visual Grounding

Boosting Universal LLM Reward Design through the Heuristic Reward Observation Space Evolution

no code implementations10 Apr 2025 Zen Kit Heng, Zimeng Zhao, Tianhao Wu, Yuanfei Wang, Mingdong Wu, Yangang Wang, Hao Dong

Large Language Models (LLMs) are emerging as promising tools for automated reinforcement learning (RL) reward design, owing to their robust capabilities in commonsense reasoning and code generation.

Code Generation Reinforcement Learning (RL)

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

no code implementations CVPR 2025 Mingju Gao, Yike Pan, Huan-ang Gao, Zongzheng Zhang, Wenyi Li, Hao Dong, Hao Tang, Li Yi, Hao Zhao

As interest grows in world models that predict future states from current observations and actions, accurately modeling part-level dynamics has become increasingly relevant for various applications.

4D reconstruction

Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach

no code implementations18 Mar 2025 Tianshu Wu, Jiyao Zhang, Shiqian Liang, Zhengxiao Han, Hao Dong

Recent learning-based robot pose estimation methods, while advancing online calibration, struggle with cross-robot generalization and require the robot to be fully visible.

Robot Pose Estimation Zero-shot Generalization

GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation

no code implementations CVPR 2025 Ruihai Wu, Ziyu Zhu, Yuran Wang, Yue Chen, Jiarui Wang, Hao Dong

Cluttered garments manipulation poses significant challenges due to the complex, deformable nature of garments and intricate garment relations.

Retrieval

AdaManip: Adaptive Articulated Object Manipulation Environments and Policy Learning

no code implementations16 Feb 2025 Yuanfei Wang, Xiaojie Zhang, Ruihai Wu, Yu Li, Yan Shen, Mingdong Wu, Zhaofeng He, Yizhou Wang, Hao Dong

To enhance the diversity and complexity of adaptive manipulation mechanisms, we build a novel articulated object manipulation environment and equip it with 9 categories of objects.

Imitation Learning

CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World

no code implementations12 Feb 2025 Yankai Fu, Qiuxuan Feng, Ning Chen, Zichen Zhou, Mengzhen Liu, Mingdong Wu, Tianxing Chen, Shanyu Rong, Jiaming Liu, Hao Dong, Shanghang Zhang

However, obtaining high-quality 3D representations presents two key problems: (1) the quality of point clouds captured by a single-view camera is significantly affected by factors such as camera resolution, positioning, and occlusions caused by the dexterous hand; (2) the global point clouds lack crucial contact information and spatial correspondences, which are necessary for fine-grained dexterous manipulation tasks.

6D Pose Estimation Imitation Learning

Interactive Symbolic Regression through Offline Reinforcement Learning: A Co-Design Framework

1 code implementation5 Feb 2025 Yuan Tian, Wenqi Zhou, Michele Viscione, Hao Dong, David Kammer, Olga Fink

Symbolic Regression (SR) holds great potential for uncovering underlying mathematical and physical relationships from observed data.

Equation Discovery regression +2

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

5 code implementations30 Jan 2025 Hao Dong, Moru Liu, Kaiyang Zhou, Eleni Chatzi, Juho Kannala, Cyrill Stachniss, Olga Fink

Besides, the recent advent of large-scale pre-trained multimodal foundation models, such as CLIP, has inspired works leveraging these models to enhance adaptation and generalization performances or adapting them to downstream tasks.

Action Recognition Domain Generalization +2

An Automatic Sound and Complete Abstraction Method for Generalized Planning with Baggable Types

1 code implementation25 Jan 2025 Hao Dong, Zheyuan Shi, Hemeng Zeng, Yongmei Liu

We define a class of domains called proper baggable domains, and show that for such domains, the BQNP problem got by our automatic method is a sound and complete abstraction for a generalized planning problem whose instances share the same bags with the given instance but the sizes of the bags might be different.

Unseen Visual Anomaly Generation

no code implementations CVPR 2025 Han Sun, Yunkang Cao, Hao Dong, Olga Fink

Visual anomaly detection (AD) presents significant challenges due to the scarcity of anomalous data samples.

Anomaly Detection Image Generation

ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?

no code implementations13 Dec 2024 Taewhan Kim, Hojin Bae, Zeming Li, Xiaoqi Li, Iaroslav Ponomarenko, Ruihai Wu, Hao Dong

Visual actionable affordance has emerged as a transformative approach in robotics, focusing on perceiving interaction areas prior to manipulation.

Robot Manipulation

PGSO: Prompt-based Generative Sequence Optimization Network for Aspect-based Sentiment Analysis

no code implementations1 Dec 2024 Hao Dong, Wei Wei

The rule-based approach relies on handcraft priority of dependency relation to reorder the context, while the score-based algorithm dynamically regulates the contextual sequence by calculating word position scores using neural network.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Recall and Refine: A Simple but Effective Source-free Open-set Domain Adaptation Framework

1 code implementation19 Nov 2024 Ismail Nejjar, Hao Dong, Olga Fink

Open-set Domain Adaptation (OSDA) aims to adapt a model from a labeled source domain to an unlabeled target domain, where novel classes - also referred to as target-private unknown classes - are present.

Source-Free Domain Adaptation

Is Precise Recovery Necessary? A Task-Oriented Imputation Approach for Time Series Forecasting on Variable Subset

no code implementations15 Nov 2024 Qi Hao, Runchang Liang, Yue Gao, Hao Dong, Wei Fan, Lu Jiang, Pengyang Wang

Variable Subset Forecasting (VSF) refers to a unique scenario in multivariate time series forecasting, where available variables in the inference phase are only a subset of the variables in the training phase.

Imputation Multivariate Time Series Forecasting +1

DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection

2 code implementations CVPR 2025 Shawn Li, Huixian Gong, Hao Dong, Tiankai Yang, Zhengzhong Tu, Yue Zhao

Extensive experiments on two tasks, five datasets, and nine base OOD algorithms demonstrate that DPU significantly improves OOD detection performance, setting a new state-of-the-art in multimodal OOD detection, with improvements of up to 80 percent in Far-OOD detection.

Optical Flow Estimation Out-of-Distribution Detection +1

ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy

no code implementations6 Nov 2024 Chenrui Tie, Yue Chen, Ruihai Wu, Boxuan Dong, Zeyi Li, Chongkai Gao, Hao Dong

We theoretically extend equivariant Markov kernels and simplify the condition of equivariant diffusion process, thereby significantly improving training efficiency for trajectory-level SE(3) equivariant diffusion policy in an end-to-end manner.

Imitation Learning Robot Manipulation

GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation

1 code implementation2 Nov 2024 Haoran Lu, Ruihai Wu, Yitong Li, Sijie Li, Ziyu Zhu, Chuanruo Ning, Yan Shen, Longzan Luo, Yuanpei Chen, Hao Dong

Recent successes in reinforcement learning and vision-based methods offer promising avenues for learning garment manipulation.

Imitation Learning

RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations

1 code implementation1 Oct 2024 Kaichen Zhou, Yang Cao, Taewhan Kim, Hao Zhao, Hao Dong, Kai Ming Ting, Ye Zhu

To address this gap, we introduce the Realistic Anomaly Detection (RAD) dataset, the first multi-view RGB-based anomaly detection dataset specifically collected using a real robot arm, providing unique and realistic data scenarios.

Anomaly Detection Data Augmentation +1

CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving

no code implementations16 Aug 2024 Shihan Peng, Hanyu Zhou, Hao Dong, Zhiwei Shi, Haoyue Liu, Yuxing Duan, Yi Chang, Luxin Yan

In this work, we introduce hybrid coaxial event-frame devices to build the multimodal system, and propose a coaxial stereo event camera (CoSEC) dataset for autonomous driving.

Autonomous Driving Optical Flow Estimation

EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning

no code implementations4 Aug 2024 Yue Chen, Chenrui Tie, Ruihai Wu, Hao Dong

Humans perceive and interact with the world with the awareness of equivariance, facilitating us in manipulating different objects in diverse poses.

TARGO: Benchmarking Target-driven Object Grasping under Occlusions

no code implementations8 Jul 2024 Yan Xia, Ran Ding, Ziyuan Qin, Guanqi Zhan, Kaichen Zhou, Long Yang, Hao Dong, Daniel Cremers

3) We also generate a large-scale training dataset via a scalable pipeline, which can be used to boost the performance of grasping under occlusion and generalized to the real world.

Benchmarking Object +1

Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision

1 code implementation1 Jul 2024 Hao Dong, Eleni Chatzi, Olga Fink

In this work, we introduce a novel approach to address Multimodal Open-Set Domain Generalization (MM-OSDG) for the first time, utilizing self-supervision.

Domain Generalization Meta-Learning

Human-centered In-building Embodied Delivery Benchmark

1 code implementation25 Jun 2024 Zhuoqun Xu, Yang Liu, Xiaoqi Li, Jiyao Zhang, Hao Dong

This environment also includes autonomous human characters and robots with grasping and mobility capabilities, as well as a large number of interactive items.

Make Graph Neural Networks Great Again: A Generic Integration Paradigm of Topology-Free Patterns for Traffic Speed Prediction

1 code implementation24 Jun 2024 Yicheng Zhou, Pengfei Wang, Hao Dong, Denghui Zhang, Dingqi Yang, Yanjie Fu, Pengyang Wang

To tackle this challenge, we propose a generic model for enabling the current GNN-based methods to preserve topology-free patterns.

SpatialBot: Precise Spatial Understanding with Vision Language Models

1 code implementation19 Jun 2024 Wenxiao Cai, Iaroslav Ponomarenko, Jianhao Yuan, Xiaoqi Li, Wankou Yang, Hao Dong, Bo Zhao

Vision Language Models (VLMs) have achieved impressive performance in 2D image understanding, however they are still struggling with spatial understanding which is the foundation of Embodied AI.

Spatial Reasoning

AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation

no code implementations17 Jun 2024 Chuyan Xiong, Chengyu Shen, Xiaoqi Li, Kaichen Zhou, Jeremy Liu, Ruiping Wang, Hao Dong

The ability to reflect on and correct failures is crucial for robotic systems to interact stably with real-life objects. Observing the generalization and reasoning capabilities of Multimodal Large Language Models (MLLMs), previous approaches have aimed to utilize these models to enhance robotic systems accordingly. However, these methods typically focus on high-level planning corrections using an additional MLLM, with limited utilization of failed samples to correct low-level contact poses which is particularly prone to occur during articulated object manipulation. To address this gap, we propose an Autonomous Interactive Correction (AIC) MLLM, which makes use of previous low-level interaction experiences to correct SE(3) pose predictions for articulated object.

Pose Prediction Test-time Adaptation

GFPack++: Improving 2D Irregular Packing by Learning Gradient Field with Attention

no code implementations9 Jun 2024 Tianyang Xue, Lin Lu, Yang Liu, Mingdong Wu, Hao Dong, Yanbin Zhang, Renmin Han, Baoquan Chen

The difficulty of learning from teacher packing is to capture the complex geometric relationships among packing examples, which include the spatial (position, orientation) relationships of objects, their geometric features, and container boundary conditions.

Combinatorial Optimization

InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment

no code implementations7 Jun 2024 Yuxing Long, Wenzhe Cai, Hongcheng Wang, Guanqi Zhan, Hao Dong

To reach this goal, we introduce Dynamic Chain-of-Navigation (DCoN) to unify the planning process for different types of navigation instructions.

Navigate

Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking

no code implementations6 Jun 2024 Jiyao Zhang, Weiyao Huang, Bo Peng, Mingdong Wu, Fei Hu, Zijian Chen, Bo Zhao, Hao Dong

6D Object Pose Estimation is a crucial yet challenging task in computer vision, suffering from a significant lack of large-scale datasets.

6D Pose Estimation using RGB Benchmarking +3

Anomaly Anything: Promptable Unseen Visual Anomaly Generation

no code implementations3 Jun 2024 Han Sun, Yunkang Cao, Hao Dong, Olga Fink

Visual anomaly detection (AD) presents significant challenges due to the scarcity of anomalous data samples.

Anomaly Detection Image Generation

Learning Manipulation by Predicting Interaction

1 code implementation1 Jun 2024 Jia Zeng, Qingwen Bu, Bangjun Wang, Wenke Xia, Li Chen, Hao Dong, Haoming Song, Dong Wang, Di Hu, Ping Luo, Heming Cui, Bin Zhao, Xuelong Li, Yu Qiao, Hongyang Li

To this end, we propose a general pre-training pipeline that learns Manipulation by Predicting the Interaction (MPI) and enhances the visual representation. Given a pair of keyframes representing the initial and final states, along with language instructions, our algorithm predicts the transition frame and detects the interaction object, respectively.

Representation Learning

MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities

1 code implementation27 May 2024 Hao Dong, Yue Zhao, Eleni Chatzi, Olga Fink

Extensive experiments on MultiOOD demonstrate that training with A2D and NP-Mix improves existing OOD detection algorithms by a large margin.

Autonomous Driving Out-of-Distribution Detection

UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence

no code implementations CVPR 2024 Ruihai Wu, Haoran Lu, Yiyan Wang, YuBo Wang, Hao Dong

Garment manipulation (e. g., unfolding, folding and hanging clothes) is essential for future robots to accomplish home-assistant tasks, while highly challenging due to the diversity of garment configurations, geometries and deformations.

Diversity

SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network

1 code implementation27 Mar 2024 Yuxuan Wan, Kaichen Zhou, jinhong Chen, Hao Dong

To support research in this area, we present the LEGO Error Correction Assembly Dataset (LEGO-ECA), comprising manual images for assembly steps and instances of assembly failures.

NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation

no code implementations13 Mar 2024 ran Xu, Yan Shen, Xiaoqi Li, Ruihai Wu, Hao Dong

To address these challenges, we introduce a comprehensive benchmark, NrVLM, comprising 15 distinct manipulation tasks, containing over 4500 episodes meticulously annotated with fine-grained language instructions.

Robot Manipulation

JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection

no code implementations12 Mar 2024 Hanyu Zhou, Zhiwei Shi, Hao Dong, Shihan Peng, Yi Chang, Luxin Yan

In spatial reasoning stage, we project the compensated events into the same image coordinate, discretize the timestamp of events to obtain a time image that can reflect the motion confidence, and further segment the moving object through adaptive threshold on the time image.

Motion Compensation Moving Object Detection +3

Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making

2 code implementations7 Feb 2024 Yuan Tian, Wenqi Zhou, Hao Dong, David S. Kammer, Olga Fink

Our results demonstrate that Sym-Q excels not only in recovering underlying mathematical structures but also uniquely learns to efficiently refine the output expression based on reward signals, thereby discovering underlying expressions.

Decision Making regression +2

SplatMesh: Interactive 3D Segmentation and Editing Using Mesh-Based Gaussian Splatting

no code implementations26 Dec 2023 Kaichen Zhou, Lanqing Hong, Xinhai Chang, Yingji Zhong, Enze Xie, Hao Dong, Zhihao LI, Yongxin Yang, Zhenguo Li, Wei zhang

A key challenge in fine-grained 3D-based interactive editing is the absence of an efficient representation that balances diverse modifications with high-quality view synthesis under a given memory constraint.

Interactive Segmentation Segmentation

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

no code implementations CVPR 2024 Xiaoqi Li, Mingxu Zhang, Yiran Geng, Haoran Geng, Yuxing Long, Yan Shen, Renrui Zhang, Jiaming Liu, Hao Dong

By fine-tuning the injected adapters, we preserve the inherent common sense and reasoning ability of the MLLMs while equipping them with the ability for manipulation.

Common Sense Reasoning Language Modeling +6

Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers

1 code implementation19 Dec 2023 Ruiyuan Zhang, Jiaxiang Liu, Zexi Li, Hao Dong, Jie Fu, Chao Wu

Therefore, there is a need to develop a scalable framework for geometric fracture assembly without relying on semantic information.

3D Assembly

Learning Part Motion of Articulated Objects Using Spatially Continuous Neural Implicit Representations

no code implementations21 Nov 2023 Yushi Du, Ruihai Wu, Yan Shen, Hao Dong

More importantly, while many methods could only model a certain kind of joint motion (such as the revolution in the clockwise order), our proposed framework is generic to different kinds of joint motions in that transformation matrix can model diverse kinds of joint motions in the space.

NNG-Mix: Improving Semi-supervised Anomaly Detection with Pseudo-anomaly Generation

1 code implementation20 Nov 2023 Hao Dong, Gaëtan Frusque, Yue Zhao, Eleni Chatzi, Olga Fink

While AD is typically treated as an unsupervised learning task due to the high cost of label annotation, it is more practical to assume access to a small set of labeled anomaly samples from domain experts, as is the case for semi-supervised anomaly detection.

Data Augmentation Fault Detection +4

SimMMDG: A Simple and Effective Framework for Multi-modal Domain Generalization

1 code implementation NeurIPS 2023 Hao Dong, Ismail Nejjar, Han Sun, Eleni Chatzi, Olga Fink

In real-world scenarios, achieving domain generalization (DG) presents significant challenges as models are required to generalize to unknown target distributions.

Contrastive Learning Diversity +1

SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation

no code implementations25 Oct 2023 Qianxu Wang, Haotong Zhang, Congyue Deng, Yang You, Hao Dong, Yixin Zhu, Leonidas Guibas

Central to SparseDFF is a feature refinement network, optimized with a contrastive loss between views and a point-pruning mechanism for feature continuity.

One-Shot Learning

Learning Gradient Fields for Scalable and Generalizable Irregular Packing

no code implementations18 Oct 2023 Tianyang Xue, Mingdong Wu, Lin Lu, Haoxuan Wang, Hao Dong, Baoquan Chen

In this work, we delve deeper into a novel machine learning-based approach that formulates the packing problem as conditional generative modeling.

Collision Avoidance Layout Design +1

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection

no code implementations13 Oct 2023 Xiaoqi Li, Yanzi Wang, Yan Shen, Ponomarenko Iaroslav, Haoran Lu, Qianxu Wang, Boshi An, Jiaming Liu, Hao Dong

This framework is designed to capture multiple perspectives of the target object and infer depth information to complement its geometry.

Object Robot Manipulation

Improving Compositional Text-to-image Generation with Large Vision-Language Models

no code implementations10 Oct 2023 Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris Metaxas

However, compositional text-to-image models frequently encounter difficulties in generating high-quality images that accurately align with input texts describing multiple objects, variable attributes, and intricate spatial relationships.

Attribute Text to Image Generation +1

Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions

no code implementations20 Sep 2023 Yuxing Long, Xiaoqi Li, Wenzhe Cai, Hao Dong

The performances on the representative VLN task R2R show that our method surpasses the leading zero-shot VLN model by a large margin on all metrics.

Language Modelling Large Language Model

Semi-supervised Domain Adaptation in Graph Transfer Learning

no code implementations19 Sep 2023 Ziyue Qiao, Xiao Luo, Meng Xiao, Hao Dong, Yuanchun Zhou, Hui Xiong

To deal with the domain shift, we add adaptive shift parameters to each of the source nodes, which are trained in an adversarial manner to align the cross-domain distributions of node embedding, thus the node classifier trained on labeled source nodes can be transferred to the target nodes.

GRAPH DOMAIN ADAPTATION Semi-supervised Domain Adaptation +2

Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects

no code implementations NeurIPS 2023 Chuanruo Ning, Ruihai Wu, Haoran Lu, Kaichun Mo, Hao Dong

Our framework explicitly estimates the geometric similarity across different categories, identifying local areas that differ from shapes in the training categories for efficient exploration while concurrently transferring affordance knowledge to similar parts of the objects.

Efficient Exploration Few-Shot Learning

Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly

1 code implementation ICCV 2023 Ruihai Wu, Chenrui Tie, Yushi Du, Yan Zhao, Hao Dong

Shape assembly aims to reassemble parts (or fragments) into a complete object, which is a common task in our daily life.

Disentanglement

GraspGF: Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping

no code implementations12 Sep 2023 Tianhao Wu, Mingdong Wu, Jiyao Zhang, Yunchong Gan, Hao Dong

In this paper, we propose a novel task called human-assisting dexterous grasping that aims to train a policy for controlling a robotic hand's fingers to assist users in grasping objects.

Score-PA: Score-based 3D Part Assembly

1 code implementation8 Sep 2023 Junfeng Cheng, Mingdong Wu, Ruiyuan Zhang, Guanqi Zhan, Chao Wu, Hao Dong

In this paper, we formulate this task from a novel generative perspective, introducing the Score-based 3D Part Assembly framework (Score-PA) for 3D part assembly.

Diversity

Temporal Inductive Path Neural Network for Temporal Knowledge Graph Reasoning

no code implementations6 Sep 2023 Hao Dong, Pengyang Wang, Meng Xiao, Zhiyuan Ning, Pengfei Wang, Yuanchun Zhou

Subsequently, we utilize the defined query-aware temporal paths on a history temporal graph to model historical path information related to queries for reasoning.

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

1 code implementation24 Aug 2023 Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao

However, the prior pre-training stage not only introduces excessive time overhead, but also incurs a significant domain gap on `unseen' classes.

3D Semantic Segmentation Few-shot 3D semantic segmentation +1

GenPose: Generative Category-level Object Pose Estimation via Diffusion Models

no code implementations18 Jun 2023 Jiyao Zhang, Mingdong Wu, Hao Dong

Object pose estimation plays a vital role in embodied AI and computer vision, enabling intelligent agents to comprehend and interact with their surroundings.

6D Pose Estimation 6D Pose Estimation using RGBD +2

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

1 code implementation18 May 2023 Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li

This paper presents Instruct2Act, a framework that utilizes Large Language Models to map multi-modal instructions to sequential actions for robotic manipulation tasks.

Language Modeling Language Modelling +4

Personalize Segment Anything Model with One Shot

1 code implementation4 May 2023 Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Xianzheng Ma, Hao Dong, Peng Gao, Hongsheng Li

Driven by large-data pre-training, Segment Anything Model (SAM) has been demonstrated as a powerful and promptable framework, revolutionizing the segmentation models.

model Personalized Segmentation +6

Adaptive Path-Memory Network for Temporal Knowledge Graph Reasoning

1 code implementation25 Apr 2023 Hao Dong, Zhiyuan Ning, Pengyang Wang, Ziyue Qiao, Pengfei Wang, Yuanchun Zhou, Yanjie Fu

Temporal knowledge graph (TKG) reasoning aims to predict the future missing facts based on historical information and has gained increasing research interest recently.

Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks

no code implementations29 Mar 2023 Haoqi Yuan, Chi Zhang, Hongcheng Wang, Feiyang Xie, Penglin Cai, Hao Dong, Zongqing Lu

Our method outperforms baselines by a large margin and is the most sample-efficient demonstration-free RL method to solve Minecraft Tech Tree tasks.

Minecraft Multi-Task Learning +2

Learning Foresightful Dense Visual Affordance for Deformable Object Manipulation

no code implementations ICCV 2023 Ruihai Wu, Chuanruo Ning, Hao Dong

In this paper, we study deformable object manipulation using dense visual affordance, with generalization towards diverse states, and propose a novel kind of foresightful dense affordance, which avoids local optima by estimating states' values for long-term manipulation.

Deformable Object Manipulation Object

Resilient Binary Neural Network

1 code implementation2 Feb 2023 Sheng Xu, Yanjing Li, Teli Ma, Mingbao Lin, Hao Dong, Baochang Zhang, Peng Gao, Jinhu Lv

In this paper, we introduce a Resilient Binary Neural Network (ReBNN) to mitigate the frequent oscillation for better BNNs' training.

GFPose: Learning 3D Human Pose Prior with Gradient Fields

1 code implementation CVPR 2023 Hai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang

During the denoising process, GFPose implicitly incorporates pose priors in gradients and unifies various discriminative and generative tasks in an elegant framework.

Denoising Monocular 3D Human Pose Estimation +1

SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation

1 code implementation28 Nov 2022 Hao Dong, Weihao Gu, Xianjing Zhang, Jintao Xu, Rui Ai, Huimin Lu, Juho Kannala, Xieyuanli Chen

However, current works are based on raw data or network feature-level fusion and only consider short-range HD map generation, limiting their deployment to realistic autonomous driving applications.

Autonomous Driving Depth Estimation

MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library

1 code implementation11 Oct 2022 Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Xiaodan Liang, Zhihui Li, Xiaojun Chang, Yaodong Yang

A significant challenge facing researchers in the area of multi-agent reinforcement learning (MARL) pertains to the identification of a library that can offer fast and compatible development for multi-agent tasks and algorithm combinations, while obviating the need to consider compatibility issues.

Multi-agent Reinforcement Learning reinforcement-learning +2

Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors

1 code implementation27 Sep 2022 Hao Dong, Xieyuanli Chen, Mihai Dusmanu, Viktor Larsson, Marc Pollefeys, Cyrill Stachniss

A distinctive representation of image patches in form of features is a key component of many computer vision and robotics tasks, such as image matching, image retrieval, and visual localization.

Dimensionality Reduction Image Retrieval +2

End-to-End Affordance Learning for Robotic Manipulation

1 code implementation26 Sep 2022 Yiran Geng, Boshi An, Haoran Geng, Yuanpei Chen, Yaodong Yang, Hao Dong

Such contact prediction process then leads to an end-to-end affordance learning framework that can generalize over different types of manipulation tasks.

Reinforcement Learning (RL)

Collaboration of Pre-trained Models Makes Better Few-shot Learner

no code implementations25 Sep 2022 Renrui Zhang, Bohao Li, Wei zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao

In this paper, we propose CoMo, a Collaboration of pre-trained Models that incorporates diverse prior knowledge from various pre-training paradigms for better few-shot learning.

Few-Shot Learning Representation Learning

Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification

no code implementations16 Sep 2022 Meng Xiao, Ziyue Qiao, Yanjie Fu, Hao Dong, Yi Du, Pengyang Wang, Hui Xiong, Yuanchun Zhou

Specifically, we first propose a hierarchical transformer to extract the textual semantic information of proposals.

Classification

Estimation of Average Derivatives of Latent Regressors: With an Application to Inference on Buffer-Stock Saving

no code implementations13 Sep 2022 Hao Dong, Yuya Sasaki

Based on the proposed estimator, we construct a formal test on the sub-unity of the marginal propensity to consume out of permanent income (MPCP) under a nonparametric consumption model and a permanent-transitory model of income dynamics with nonparametric distribution.

Unity

TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification

no code implementations2 Sep 2022 Mingdong Wu, Fangwei Zhong, Yulong Xia, Hao Dong

For object rearrangement, the TarGF can be used in two ways: 1) For model-based planning, we can cast the target gradient into a reference control and output actions with a distributed path planner; 2) For model-free reinforcement learning, the TarGF is not only used for estimating the likelihood-change as a reward but also provides suggested actions in residual policy learning.

Imitation Learning Object +3

Online Pole Segmentation on Range Images for Long-term LiDAR Localization in Urban Environments

1 code implementation15 Aug 2022 Hao Dong, Xieyuanli Chen, Simo Särkkä, Cyrill Stachniss

We further use the extracted poles as pseudo labels to train a deep neural network for online range image-based pole segmentation.

Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects

1 code implementation7 Aug 2022 Qiyu Dai, Jiyao Zhang, Qiwei Li, Tianhao Wu, Hao Dong, Ziyuan Liu, Ping Tan, He Wang

Commercial depth sensors usually generate noisy and missing depths, especially on specular and transparent objects, which poses critical issues to downstream depth or point cloud-based tasks.

Pose Estimation Transparent objects

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

no code implementations2 Aug 2022 Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang

The necessity for cooperation among intelligent machines has popularised cooperative multi-agent reinforcement learning (MARL) in the artificial intelligence (AI) research community.

MuJoCo Multi-agent Reinforcement Learning

Scalable Model-based Policy Optimization for Decentralized Networked Systems

no code implementations13 Jul 2022 Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple tasks.

Traffic Signal Control

Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain

1 code implementation8 Jul 2022 Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei zhang, He Chen

Currently, under supervised learning, a model pretrained by a large-scale nature scene dataset and then fine-tuned on a few specific task labeling data is the paradigm that has dominated the knowledge transfer learning.

Land Cover Classification object-detection +3

DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation

no code implementations5 Jul 2022 Yan Zhao, Ruihai Wu, Zhehuan Chen, Yourong Zhang, Qingnan Fan, Kaichun Mo, Hao Dong

It is essential yet challenging for future home-assistant robots to understand and manipulate diverse 3D objects in daily human environments.

3D geometry

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

1 code implementation17 Jun 2022 Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Yiran Geng, Hao Dong, Zongqing Lu, Song-Chun Zhu, Yaodong Yang

In this study, we propose the Bimanual Dexterous Hands Benchmark (Bi-DexHands), a simulator that involves two dexterous hands with tens of bimanual manipulation tasks and thousands of target objects.

Few-Shot Learning Offline RL +3

Who Should Review Your Proposal? Interdisciplinary Topic Path Detection for Research Proposals

no code implementations7 Mar 2022 Meng Xiao, Ziyue Qiao, Yanjie Fu, Hao Dong, Yi Du, Pengyang Wang, Dong Li, Yuanchun Zhou

After extracting the semantic and interdisciplinary knowledge, we design a level-wise prediction component to fuse the two types of knowledge representations and detect interdisciplinary topic paths for each proposal.

GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning

no code implementations4 Mar 2022 Tianhao Wu, Fangwei Zhong, Yiran Geng, Hongchen Wang, Yongjian Zhu, Yizhou Wang, Hao Dong

we formulate the dynamic grasping problem as a 'move-and-grasp' game, where the robot is to pick up the object on the mover and the adversarial mover is to find a path to escape it.

Object reinforcement-learning +2

AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-shot Interactions

no code implementations1 Dec 2021 Yian Wang, Ruihai Wu, Kaichun Mo, Jiaqi Ke, Qingnan Fan, Leonidas Guibas, Hao Dong

Perceiving and interacting with 3D articulated objects, such as cabinets, doors, and faucets, pose particular challenges for future home-assistant robots performing daily tasks in human environments.

Friction

Fast and Flexible Human Pose Estimation with HyperPose

1 code implementation26 Aug 2021 Yixiao Guo, Jiawei Liu, Guo Li, Luo Mai, Hao Dong

When it comes to customising these algorithms for real-world applications, none of the existing libraries can offer both the flexibility of developing custom pose estimation algorithms and the high-performance of executing these algorithms on commodity devices.

Pose Estimation

VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects

no code implementations ICLR 2022 Ruihai Wu, Yan Zhao, Kaichun Mo, Zizheng Guo, Yian Wang, Tianhao Wu, Qingnan Fan, Xuelin Chen, Leonidas Guibas, Hao Dong

In this paper, we propose object-centric actionable visual priors as a novel perception-interaction handshaking point that the perception system outputs more actionable guidance than kinematic structure estimation, by predicting dense geometry-aware, interaction-aware, and task-aware visual action affordance and trajectory proposals.

Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning

1 code implementation19 Apr 2021 Jie Ren, Yewen Li, Zihan Ding, Wei Pan, Hao Dong

However, grasping distinguishable skills for some tasks with non-unique optima can be essential for further improving its learning efficiency and performance, which may lead to a multimodal policy represented as a mixture-of-experts (MOE).

Deep Reinforcement Learning Mixture-of-Experts +3

Product semantics translation from brain activity via adversarial learning

no code implementations29 Mar 2021 Pan Wang, Zhifeng Gong, Shuo Wang, Hao Dong, Jialu Fan, Ling Li, Peter Childs, Yike Guo

To modify a design semantic of a given product from personalised brain activity via adversarial learning, in this work, we propose a deep generative transformation model to modify product semantics from the brain signal.

EEG Translation

LightCAKE: A Lightweight Framework for Context-Aware Knowledge Graph Embedding

no code implementations22 Feb 2021 Zhiyuan Ning, Ziyue Qiao, Hao Dong, Yi Du, Yuanchun Zhou

Knowledge graph embedding (KGE) models learn to project symbolic entities and relations into a continuous vector space based on the observed triplets.

Knowledge Graph Embedding Knowledge Graphs

P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding

no code implementations24 Dec 2020 Yunze Liu, Li Yi, Shanghang Zhang, Qingnan Fan, Thomas Funkhouser, Hao Dong

Self-supervised representation learning is a critical problem in computer vision, as it provides a way to pretrain feature extractors on large unlabeled datasets that can be used as an initialization for more efficient and effective training on downstream tasks.

Contrastive Learning Representation Learning +1

End-to-End Object Detection with Adaptive Clustering Transformer

1 code implementation18 Nov 2020 Minghang Zheng, Peng Gao, Renrui Zhang, Kunchang Li, Xiaogang Wang, Hongsheng Li, Hao Dong

In this paper, a novel variant of transformer named Adaptive Clustering Transformer(ACT) has been proposed to reduce the computation cost for high-resolution input.

Clustering Object +2

Bilateral Asymmetry Guided Counterfactual Generating Network for Mammogram Classification

no code implementations30 Sep 2020 Chu-ran Wang, Jing Li, Fandong Zhang, Xinwei Sun, Hao Dong, Yizhou Yu, Yizhou Wang

Mammogram benign or malignant classification with only image-level labels is challenging due to the absence of lesion annotations.

Classification counterfactual +1

The DongNiao International Birds 10000 Dataset

no code implementations21 Sep 2020 Jian Mei, Hao Dong

DongNiao International Birds 10000 (DIB-10K) is a challenging image dataset which has more than 10 thousand different types of birds.

BIG-bench Machine Learning Image Classification

Lyapunov-Based Reinforcement Learning for Decentralized Multi-Agent Control

no code implementations20 Sep 2020 Qingrui Zhang, Hao Dong, Wei Pan

More importantly, the existing multi-agent reinforcement learning (MARL) algorithms cannot ensure the closed-loop stability of a multi-agent system from a control-theoretic perspective, so the learned control polices are highly possible to generate abnormal or dangerous behaviors in real applications.

Deep Reinforcement Learning Multi-agent Reinforcement Learning +2

Efficient Reinforcement Learning Development with RLzoo

1 code implementation18 Sep 2020 Zihan Ding, Tianyang Yu, Yanhua Huang, Hongming Zhang, Guo Li, Quancheng Guo, Luo Mai, Hao Dong

RLzoo provides developers with (i) high-level yet flexible APIs for prototyping DRL agents, and further customising the agents for best performance, (ii) a model zoo where users can import a wide range of DRL agents and easily compare their performance, and (iii) an algorithm that can automatically construct DRL agents with custom components (which are critical to improve agent's performance in custom applications).

Deep Reinforcement Learning reinforcement-learning +1

Generative 3D Part Assembly via Dynamic Graph Learning

3 code implementations NeurIPS 2020 Jialei Huang, Guanqi Zhan, Qingnan Fan, Kaichun Mo, Lin Shao, Baoquan Chen, Leonidas Guibas, Hao Dong

Analogous to buying an IKEA furniture, given a set of 3D parts that can assemble a single shape, an intelligent agent needs to perceive the 3D part geometry, reason to propose pose estimations for the input parts, and finally call robotic planning and control routines for actuation.

Graph Learning Graph Neural Network +2

Role-Wise Data Augmentation for Knowledge Distillation

1 code implementation ICLR 2020 Jie Fu, Xue Geng, Zhijian Duan, Bohan Zhuang, Xingdi Yuan, Adam Trischler, Jie Lin, Chris Pal, Hao Dong

To our knowledge, existing methods overlook the fact that although the student absorbs extra knowledge from the teacher, both models share the same input data -- and this data is the only medium by which the teacher's knowledge can be demonstrated.

Data Augmentation Knowledge Distillation

DLGAN: Disentangling Label-Specific Fine-Grained Features for Image Manipulation

1 code implementation22 Nov 2019 Guanqi Zhan, Yihao Zhao, Bingchan Zhao, Haoqi Yuan, Baoquan Chen, Hao Dong

By mapping the discrete label-specific attribute features into a continuous prior distribution, we leverage the advantages of both discrete labels and reference images to achieve image manipulation in a hybrid fashion.

Attribute Image Manipulation +1

Gear Training: A new way to implement high-performance model-parallel training

no code implementations11 Jun 2018 Hao Dong, Shuai Li, Dongchang Xu, Yi Ren, Di Zhang

The training of Deep Neural Networks usually needs tremendous computing resources.

Generative Creativity: Adversarial Learning for Bionic Design

no code implementations19 May 2018 Simiao Yu, Hao Dong, Pan Wang, Chao Wu, Yike Guo

Bionic design refers to an approach of generative creativity in which a target object (e. g. a floor lamp) is designed to contain features of biological source objects (e. g. flowers), resulting in creative biologically-inspired design.

Dropping Activation Outputs with Localized First-layer Deep Network for Enhancing User Privacy and Data Security

no code implementations20 Nov 2017 Hao Dong, Chao Wu, Zhen Wei, Yike Guo

However, current architecture of deep networks suffers the privacy issue that users need to give out their data to the model (typically hosted in a server or a cluster on Cloud) for training or prediction.

Anomaly Detection Decision Making +1

TensorLayer: A Versatile Library for Efficient Deep Learning Development

2 code implementations26 Jul 2017 Hao Dong, Akara Supratak, Luo Mai, Fangde Liu, Axel Oehmichen, Simiao Yu, Yike Guo

Deep learning has enabled major advances in the fields of computer vision, natural language processing, and multimedia among many others.

Deep Learning Management

Semantic Image Synthesis via Adversarial Learning

2 code implementations ICCV 2017 Hao Dong, Simiao Yu, Chao Wu, Yike Guo

In this paper, we propose a way of synthesizing realistic images directly with natural language description, which has many useful applications, e. g. intelligent image manipulation.

Image Manipulation

Deep De-Aliasing for Fast Compressive Sensing MRI

no code implementations19 May 2017 Simiao Yu, Hao Dong, Guang Yang, Greg Slabaugh, Pier Luigi Dragotti, Xujiong Ye, Fangde Liu, Simon Arridge, Jennifer Keegan, David Firmin, Yike Guo

Fast Magnetic Resonance Imaging (MRI) is highly in demand for many clinical applications in order to reduce the scanning cost and improve the patient experience.

Compressive Sensing De-aliasing +1

Automatic Brain Tumor Detection and Segmentation Using U-Net Based Fully Convolutional Networks

no code implementations10 May 2017 Hao Dong, Guang Yang, Fangde Liu, Yuanhan Mo, Yike Guo

In this context, a reliable fully automatic segmentation method for the brain tumor segmentation is necessary for an efficient measurement of the tumor extent.

Brain Tumor Segmentation Diagnostic +3

I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation

no code implementations20 Mar 2017 Hao Dong, Jingqing Zhang, Douglas McIlwraith, Yike Guo

We demonstrate that %the capability of our method to understand the sentence descriptions, so as to I2T2I can generate better multi-categories images using MSCOCO than the state-of-the-art.

Caption Generation Data Augmentation +6

DeepSleepNet: a Model for Automatic Sleep Stage Scoring based on Raw Single-Channel EEG

8 code implementations12 Mar 2017 Akara Supratak, Hao Dong, Chao Wu, Yike Guo

This demonstrated that, without changing the model architecture and the training algorithm, our model could automatically learn features for sleep stage scoring from different raw single-channel EEGs from different datasets without utilizing any hand-engineered features.

EEG Sleep Stage Detection

Unsupervised Image-to-Image Translation with Generative Adversarial Networks

no code implementations10 Jan 2017 Hao Dong, Paarth Neekhara, Chao Wu, Yike Guo

It's useful to automatically transform an image from its original form to some synthetic form (style, partial contents, etc.

Form Translation +1

Mixed Neural Network Approach for Temporal Sleep Stage Classification

no code implementations15 Oct 2016 Hao Dong, Akara Supratak, Wei Pan, Chao Wu, Paul M. Matthews, Yike Guo

Use of this recording configuration with neural network deconvolution promises to make clinically indicated home sleep studies practical.

Classification EEG +1

DropNeuron: Simplifying the Structure of Deep Neural Networks

1 code implementation23 Jun 2016 Wei Pan, Hao Dong, Yike Guo

We proposed regularisers which support a simple mechanism of dropping neurons during a network training process.

Cannot find the paper you are looking for? You can Submit a new open access paper.