no code implementations • 2 Jul 2023 • Hao-Shu Fang, Hongjie Fang, Zhenyu Tang, Jirong Liu, Chenxi Wang, JunBo Wang, Haoyi Zhu, Cewu Lu
A key challenge in robotic manipulation in open domains is how to acquire diverse and generalizable skills for robots.
no code implementations • CVPR 2023 • Jirong Liu, Ruo Zhang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Chenxi Wang, Sheng Xu, Hengxu Yan, Cewu Lu
Reactive grasping, which enables the robot to successfully grasp dynamic moving objects, is of great interest in robotics.
1 code implementation • 14 Nov 2022 • Yong-Lu Li, Hongwei Fan, Zuoyu Qiu, Yiming Dou, Liang Xu, Hao-Shu Fang, Peiyang Guo, Haisheng Su, Dongliang Wang, Wei Wu, Cewu Lu
In daily HOIs, humans often interact with a variety of objects, e.g., holding and touching dozens of household items in cleaning.
7 code implementations • 7 Nov 2022 • Hao-Shu Fang, Jiefeng Li, Hongyang Tang, Chao Xu, Haoyi Zhu, Yuliang Xiu, Yong-Lu Li, Cewu Lu
Accurate whole-body multi-person pose estimation and tracking is an important yet challenging topic in computer vision.
1 code implementation • 11 Oct 2022 • Haoyi Zhu, Hao-Shu Fang, Cewu Lu
In this paper, we focus on a rarely discussed but important setting: can we train one model that can represent multiple scenes, given insufficient 360$^\circ$ views and RGB-D images?
no code implementations • 23 Jun 2022 • Minghao Gou, Haolin Pan, Hao-Shu Fang, Ziyuan Liu, Cewu Lu, Ping Tan
In this paper, we propose a new task that enables and facilitates algorithms to estimate the 6D pose of novel objects during testing.
1 code implementation • 17 Feb 2022 • Hongjie Fang, Hao-Shu Fang, Sheng Xu, Cewu Lu
However, most current grasping algorithms fail in this case, since they rely heavily on the depth image, and ordinary depth sensors usually fail to produce accurate depth information for transparent objects owing to the reflection and refraction of light.
Ranked #1 on Transparent Object Depth Estimation on TransCG
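The transparent-object entry above hinges on a depth-completion step before grasping: sensor pixels that failed on transparent surfaces are replaced by a learned prediction. A minimal sketch of that idea, where the constant-plane `toy_model` is an illustrative stand-in for the paper's learned network (all names here are hypothetical):

```python
import numpy as np

def complete_depth(rgb, raw_depth, fill_model):
    """Keep valid sensor readings; fill failed pixels (depth == 0,
    common on transparent surfaces) with the model's prediction."""
    pred = fill_model(rgb, raw_depth)   # dense depth prediction
    mask = raw_depth > 0                # where the sensor succeeded
    return np.where(mask, raw_depth, pred)

# Toy stand-in for the learned completion model: a constant plane.
toy_model = lambda rgb, d: np.full_like(d, 0.5)

rgb = np.zeros((2, 2, 3))
raw = np.array([[0.4, 0.0],
                [0.0, 0.6]])            # zeros = transparent region
dense = complete_depth(rgb, raw, toy_model)
```

The completed depth map `dense` can then be handed to any depth-based grasp detector, which is the point of the approach: the grasping stage itself stays unchanged.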
3 code implementations • 14 Feb 2022 • Yong-Lu Li, Xinpeng Liu, Xiaoqian Wu, Yizhuo Li, Zuoyu Qiu, Liang Xu, Yue Xu, Hao-Shu Fang, Cewu Lu
Human activity understanding is of widespread interest in artificial intelligence and spans diverse applications like health care and behavior analysis.
no code implementations • CVPR 2022 • Jianhua Sun, YuXuan Li, Liang Chai, Hao-Shu Fang, Yong-Lu Li, Cewu Lu
Human trajectory prediction task aims to analyze human future movements given their past status, which is a crucial step for many autonomous systems such as self-driving cars and social robots.
no code implementations • 23 Mar 2021 • Hanwen Cao, Hao-Shu Fang, Wenhai Liu, Cewu Lu
Meanwhile, we propose a method to predict numerous suction poses from an RGB-D image of a cluttered scene and demonstrate its superiority over several previous methods.
1 code implementation • ICCV 2021 • Jianhua Sun, YuXuan Li, Hao-Shu Fang, Cewu Lu
Multimodal prediction results are essential for the trajectory prediction task, as there is no single correct answer for the future.
1 code implementation • 3 Mar 2021 • Minghao Gou, Hao-Shu Fang, Zhanda Zhu, Sheng Xu, Chenxi Wang, Cewu Lu
In the first stage, an encoder-decoder-style convolutional neural network, Angle-View Net (AVN), is proposed to predict the SO(3) orientation of the gripper at every location of the image.
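The per-pixel orientation idea above can be sketched as follows: the network outputs, at every pixel, logits over a discretized set of approach views and in-plane angles, and the argmax bin gives the gripper orientation at that location. The shapes and the random logits below are illustrative stand-ins, not the actual AVN architecture:

```python
import numpy as np

V, A, H, W = 6, 4, 8, 8                       # views, angles, image size

rng = np.random.default_rng(0)
logits = rng.standard_normal((V * A, H, W))   # stand-in for the AVN output

best = logits.argmax(axis=0)                  # best (view, angle) bin per pixel
view_idx = best // A                          # which approach view
angle_idx = best % A                          # which in-plane rotation angle
```

Decoding the bin index back into a continuous SO(3) rotation is then a simple lookup into the discretization, which is why the dense per-pixel formulation stays cheap.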
1 code implementation • ICCV 2021 • Chenxi Wang, Hao-Shu Fang, Minghao Gou, Hongjie Fang, Jin Gao, Cewu Lu
To quickly detect graspness in practice, we develop a neural network named graspness model to approximate the searching process.
Ranked #3 on Robotic Grasping on GraspNet-1Billion
1 code implementation • 2 Oct 2020 • Hao-Shu Fang, Yichen Xie, Dian Shao, Cewu Lu
On the other hand, existing one-stage methods mainly focus on the union regions of interactions, which introduce unnecessary visual information as disturbances to HOI detection.
Ranked #15 on Human-Object Interaction Detection on V-COCO
no code implementations • 2 Oct 2020 • Yichen Xie, Hao-Shu Fang, Dian Shao, Yong-Lu Li, Cewu Lu
Human-object interaction (HOI) detection requires a large amount of annotated data.
Ranked #68 on Domain Generalization on PACS
1 code implementation • CVPR 2020 • Hao-Shu Fang, Chenxi Wang, Minghao Gou, Cewu Lu
In this work, we contribute a large-scale grasp pose detection dataset with a unified evaluation system.
Ranked #6 on Robotic Grasping on GraspNet-1Billion
2 code implementations • CVPR 2020 • Yong-Lu Li, Liang Xu, Xinpeng Liu, Xijie Huang, Yue Xu, Shiyi Wang, Hao-Shu Fang, Ze Ma, Mingyang Chen, Cewu Lu
In light of this, we propose a new path: infer human part states first and then reason out the activities based on part-level semantics.
Ranked #3 on Human-Object Interaction Detection on HICO
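The part-state entry above is a two-step inference: first predict a state for each body part, then reason out the activity from the combination of part-level semantics. A minimal sketch of that second step; the part names, states, and rules below are illustrative, not HAKE's actual vocabulary:

```python
def infer_activity(part_states, rules):
    """Return the first activity whose required part states all hold."""
    for activity, required in rules.items():
        if all(part_states.get(p) == s for p, s in required.items()):
            return activity
    return "unknown"

# Hypothetical part-level rules mapping part states to activities.
rules = {
    "read": {"hand": "hold", "head": "look_at"},
    "ride": {"hand": "hold", "hip": "sit_on"},
}

activity = infer_activity({"hand": "hold", "head": "look_at"}, rules)
```

In the actual system the reasoning step is learned rather than rule-based, but the interface is the same: part-level semantics in, activity label out.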
no code implementations • 31 Dec 2019 • Hao-Shu Fang, Chenxi Wang, Minghao Gou, Cewu Lu
Object grasping is critical for many applications, which is also a challenging computer vision problem.
3 code implementations • ICCV 2019 • Hao-Shu Fang, Jianhua Sun, Runzhong Wang, Minghao Gou, Yong-Lu Li, Cewu Lu
With the guidance of such map, we boost the performance of R101-Mask R-CNN on instance segmentation from 35.7 mAP to 37.9 mAP without modifying the backbone or network structure.
Ranked #78 on Instance Segmentation on COCO test-dev
no code implementations • ICCV 2019 • Jinkun Cao, Hongyang Tang, Hao-Shu Fang, Xiaoyong Shen, Cewu Lu, Yu-Wing Tai
Therefore, the easily available human pose dataset, which is of a much larger scale than our labeled animal dataset, provides important prior knowledge to boost up the performance on animal pose estimation.
4 code implementations • 13 Apr 2019 • Yong-Lu Li, Liang Xu, Xinpeng Liu, Xijie Huang, Yue Xu, Mingyang Chen, Ze Ma, Shiyi Wang, Hao-Shu Fang, Cewu Lu
To address these issues and promote activity understanding, we build a large-scale Human Activity Knowledge Engine (HAKE) based on human body part states.
Ranked #2 on Human-Object Interaction Detection on HICO (using extra training data)
6 code implementations • 4 Dec 2018 • Zelin Zhao, Gao Peng, Haoyu Wang, Hao-Shu Fang, Chengkun Li, Cewu Lu
In this paper, we present an accurate and effective solution for 6D pose estimation from an RGB image.
Ranked #17 on 6D Pose Estimation using RGB on LineMOD
3 code implementations • CVPR 2019 • Jiefeng Li, Can Wang, Hao Zhu, Yihuan Mao, Hao-Shu Fang, Cewu Lu
In this paper, we propose a novel and efficient method to tackle the problem of pose estimation in the crowd and a new dataset to better evaluate algorithms.
Ranked #6 on Multi-Person Pose Estimation on OCHuman
3 code implementations • CVPR 2019 • Yong-Lu Li, Siyuan Zhou, Xijie Huang, Liang Xu, Ze Ma, Hao-Shu Fang, Yan-Feng Wang, Cewu Lu
Because interactiveness generalizes, the interactiveness network is a transferable knowledge learner that can be combined with any HOI detection model to achieve desirable results.
Ranked #29 on Human-Object Interaction Detection on V-COCO
1 code implementation • ECCV 2018 • Hao-Shu Fang, Jinkun Cao, Yu-Wing Tai, Cewu Lu
We propose a new pairwise body-part attention model which can learn to focus on crucial parts and their correlations for HOI recognition.
Ranked #5 on Human-Object Interaction Detection on HICO
1 code implementation • CVPR 2018 • Hao-Shu Fang, Guansong Lu, Xiaolin Fang, Jianwen Xie, Yu-Wing Tai, Cewu Lu
In this paper, we present a novel method to generate synthetic human part segmentation data using easily obtained human keypoint annotations.
Ranked #4 on Human Part Segmentation on PASCAL-Part (using extra training data)
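The entry above turns keypoint annotations into part-segmentation labels. The paper's generator is learned; the purely geometric stand-in below only illustrates the input/output relationship, rendering one limb as the pixels near the segment between its two keypoints:

```python
import numpy as np

def limb_mask(h, w, p0, p1, radius):
    """Boolean mask of pixels within `radius` of segment p0-p1
    (a coarse geometric proxy for one body-part region)."""
    ys, xs = np.mgrid[0:h, 0:w]
    p0, p1 = np.asarray(p0, float), np.asarray(p1, float)
    d = p1 - p0
    # Project each pixel onto the segment, clamp, measure distance.
    t = ((xs - p0[0]) * d[0] + (ys - p0[1]) * d[1]) / (d @ d)
    t = np.clip(t, 0.0, 1.0)
    cx, cy = p0[0] + t * d[0], p0[1] + t * d[1]
    return np.hypot(xs - cx, ys - cy) <= radius

mask = limb_mask(16, 16, (2, 2), (12, 12), radius=2.0)
```

Stacking one such mask per limb gives a coarse part label map from keypoints alone, which is the kind of keypoints-to-parts mapping the learned generator refines.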
no code implementations • 17 Oct 2017 • Hao-Shu Fang, Yuanlu Xu, Wenguan Wang, Xiaobai Liu, Song-Chun Zhu
In this paper, we propose a pose grammar to tackle the problem of 3D human pose estimation.
Ranked #1 on 3D Absolute Human Pose Estimation on Human3.6M (Average MPJPE (mm) metric)
14 code implementations • ICCV 2017 • Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu
In this paper, we propose a novel regional multi-person pose estimation (RMPE) framework to facilitate pose estimation in the presence of inaccurate human bounding boxes.
Ranked #1 on Pose Estimation on UAV-Human
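The RMPE entry above describes a top-down pipeline: a person detector proposes (possibly inaccurate) boxes, each crop is fed to a single-person pose estimator, and the predicted keypoints are mapped back to image coordinates. A minimal sketch of that data flow, with a centre-point toy estimator standing in for the learned single-person model (all function names are illustrative):

```python
import numpy as np

def crop(image, box):
    x0, y0, x1, y1 = box
    return image[y0:y1, x0:x1]

def to_image_coords(keypoints, box):
    x0, y0, _, _ = box
    return keypoints + np.array([x0, y0])   # undo the crop offset

def toy_pose(crop_img):
    """Toy single-person estimator: one keypoint at the crop centre."""
    h, w = crop_img.shape[:2]
    return np.array([[w / 2, h / 2]])       # (x, y) in crop coordinates

image = np.zeros((100, 100, 3))
boxes = [(10, 20, 50, 80), (40, 10, 90, 70)]
poses = [to_image_coords(toy_pose(crop(image, b)), b) for b in boxes]
```

RMPE's contribution sits inside this skeleton: its symmetric spatial transformer makes the single-person stage robust to the inaccurate boxes the detector produces.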