Search Results for author: Xiaoxue Chen

Found 16 papers, 14 papers with code

Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail

no code implementations • 18 Mar 2024 • Mingjin Chen, JunHao Chen, Xiaojun Ye, Huan-ang Gao, Xiaoxue Chen, Zhaoxin Fan, Hao Zhao

In this paper, we propose a new method called Ultraman for fast reconstruction of textured 3D human models from a single image.

3D Human Reconstruction Texture Synthesis

NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields

1 code implementation • 22 Sep 2023 • Xiaoxue Chen, Junchen Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

In this paper, we introduce the refractive-reflective field.

3D Reconstruction Object

ECT: Fine-grained Edge Detection with Learned Cause Tokens

1 code implementation • 6 Aug 2023 • Shaocong Xu, Xiaoxue Chen, Yuhang Zheng, Guyue Zhou, Yurong Chen, Hongbin Zha, Hao Zhao

To address these three issues, we propose a two-stage transformer-based network sequentially predicting generic edges and fine-grained edges, which has a global receptive field thanks to the attention mechanism.

Edge Detection

STRAP: Structured Object Affordance Segmentation with Point Supervision

1 code implementation • 17 Apr 2023 • Leiyao Cui, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yixin Zhu

By label affinity, we refer to affordance segmentation as a multi-label prediction problem: A plate can be both holdable and containable.

Object Scene Understanding

DPF: Learning Dense Prediction Fields with Weak Supervision

1 code implementation • CVPR 2023 • Xiaoxue Chen, Yuhang Zheng, Yupeng Zheng, Qiang Zhou, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

We showcase the effectiveness of DPFs using two substantially different tasks: high-level semantic parsing and low-level intrinsic image decomposition.

Intrinsic Image Decomposition Scene Understanding +1

From Semi-supervised to Omni-supervised Room Layout Estimation Using Point Clouds

1 code implementation • 31 Jan 2023 • Huan-ang Gao, Beiwen Tian, Pengfei Li, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yurong Chen, Hongbin Zha

But adapting this scheme to the state-of-the-art (SOTA) solution for PC-based layout estimation is not straightforward.

Motion Planning Pseudo Label +2

INT2: Interactive Trajectory Prediction at Intersections

1 code implementation • ICCV 2023 • Zhijie Yan, Pengfei Li, Zheng Fu, Shaocong Xu, Yongliang Shi, Xiaoxue Chen, Yuhang Zheng, Yang Li, Tianyu Liu, Chuxuan Li, Nairui Luo, Xu Gao, Yilun Chen, Zuoxu Wang, Yifeng Shi, Pengfei Huang, Zhengxiao Han, Jirui Yuan, Jiangtao Gong, Guyue Zhou, Hang Zhao, Hao Zhao

One of the most challenging problems in motion forecasting is interactive trajectory prediction, whose goal is to jointly forecast the future trajectories of interacting agents.

Motion Forecasting Trajectory Prediction

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

1 code implementation • 19 Oct 2022 • Pengfei Li, Beiwen Tian, Yongliang Shi, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

As such, we study the challenging problem of task oriented detection, which aims to find objects that best afford an action indicated by verbs like "sit comfortably on".

Instance Segmentation Referring Expression +2

Understanding Embodied Reference with Touch-Line Transformer

1 code implementation • 11 Oct 2022 • Yang Li, Xiaoxue Chen, Hao Zhao, Jiangtao Gong, Guyue Zhou, Federico Rossano, Yixin Zhu

Human studies have revealed that objects referred to or pointed to do not lie on the elbow-wrist line, a common misconception; instead, they lie on the so-called virtual touch line.

Distance-Aware Occlusion Detection with Focused Attention

1 code implementation • 23 Aug 2022 • Yang Li, Yucheng Tu, Xiaoxue Chen, Hao Zhao, Guyue Zhou

In this work, (1) we propose a novel three-decoder architecture as the infrastructure for focused attention; (2) we use the generalized intersection box prediction task to effectively guide our model to focus on occlusion-specific regions; (3) our model achieves a new state-of-the-art performance on distance-aware relationship detection.

Human-Object Interaction Detection Relationship Detection +1

SNAKE: Shape-aware Neural 3D Keypoint Field

1 code implementation • 3 Jun 2022 • Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang

Detecting 3D keypoints from point clouds is important for shape reconstruction, while this work investigates the dual question: can shape reconstruction benefit 3D keypoint detection?

Keypoint Detection

Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing

1 code implementation • CVPR 2022 • Xiaoxue Chen, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Multi-task indoor scene understanding is widely considered as an intriguing formulation, as the affinity of different tasks may lead to improved performance.

Ranked #51 on Semantic Segmentation on NYU Depth v2

Attribute Scene Understanding +2

PQ-Transformer: Jointly Parsing 3D Objects and Layouts from Point Clouds

1 code implementation • 12 Sep 2021 • Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Such a scheme has two limitations: 1) Storing and running several networks for different tasks is expensive for typical robotic platforms.

Object Detection +2

Text Recognition in the Wild: A Survey

1 code implementation • 7 May 2020 • Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang

This paper aims to (1) summarize the fundamental problems and the state-of-the-art associated with scene text recognition; (2) introduce new insights and ideas; (3) provide a comprehensive review of publicly available resources; (4) point out directions for future work.

Scene Text Recognition
