Search Results for author: Hao Zhao

Found 38 papers, 24 papers with code

Simulate Bumblebee and Extend It to Support LE Coded PHY in BLE version 5

no code implementations17 May 2023 Hao Zhao

Subsequently, we extend Bumblebee to support LE Coded PHY in BLE version 5 and conduct experiments to verify its performance.

DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection

1 code implementation25 Apr 2023 Huan-ang Gao, Beiwen Tian, Pengfei Li, Hao Zhao, Guyue Zhou

While this paradigm is natural for image-level or pixel-level prediction, adapting it to the detection problem is challenged by the issue of proposal matching.

3D Object Detection object-detection +1

Delving into Shape-aware Zero-shot Semantic Segmentation

1 code implementation CVPR 2023 Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou

Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy.

Image Segmentation Semantic Segmentation

STRAP: Structured Object Affordance Segmentation with Point Supervision

1 code implementation17 Apr 2023 Leiyao Cui, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yixin Zhu

By label affinity, we refer to affordance segmentation as a multi-label prediction problem: A plate can be both holdable and containable.

Scene Understanding

DPF: Learning Dense Prediction Fields with Weak Supervision

1 code implementation CVPR 2023 Xiaoxue Chen, Yuhang Zheng, Yupeng Zheng, Qiang Zhou, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

We showcase the effectiveness of DPFs using two substantially different tasks: high-level semantic parsing and low-level intrinsic image decomposition.

Intrinsic Image Decomposition Scene Understanding +1

LODE: Locally Conditioned Eikonal Implicit Scene Completion from Sparse LiDAR

1 code implementation27 Feb 2023 Pengfei Li, Ruowen Zhao, Yongliang Shi, Hao Zhao, Jirui Yuan, Guyue Zhou, Ya-Qin Zhang

In this paper, we propose a novel Eikonal formulation that conditions the implicit representation on localized shape priors which function as dense boundary value constraints, and demonstrate it works on SemanticKITTI and SemanticPOSS.

Autonomous Driving Representation Learning

ADAPT: Action-aware Driving Caption Transformer

1 code implementation1 Feb 2023 Bu Jin, Xinyu Liu, Yupeng Zheng, Pengfei Li, Hao Zhao, Tong Zhang, Yuhang Zheng, Guyue Zhou, Jingjing Liu

To bridge the gap, we propose an end-to-end transformer-based architecture, ADAPT (Action-aware Driving cAPtion Transformer), which provides user-friendly natural language narrations and reasoning for each decision making step of autonomous vehicular control and action.

Autonomous Driving Decision Making

SC-wLS: Towards Interpretable Feed-forward Camera Re-localization

1 code implementation23 Oct 2022 Xin Wu, Hao Zhao, Shunkai Li, Yingdian Cao, Hongbin Zha

Visual re-localization aims to recover camera poses in a known environment, which is vital for applications like robotics or augmented reality.

regression

VIBUS: Data-efficient 3D Scene Parsing with VIewpoint Bottleneck and Uncertainty-Spectrum Modeling

1 code implementation20 Oct 2022 Beiwen Tian, Liyi Luo, Hao Zhao, Guyue Zhou

In the first stage, we perform self-supervised representation learning on unlabeled points with the proposed Viewpoint Bottleneck loss function.

Representation Learning Scene Parsing

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

1 code implementation19 Oct 2022 Pengfei Li, Beiwen Tian, Yongliang Shi, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

As such, we study the challenging problem of task oriented detection, which aims to find objects that best afford an action indicated by verbs like sit comfortably on.

Instance Segmentation Referring Expression +2

Planning Assembly Sequence with Graph Transformer

1 code implementation11 Oct 2022 Lin Ma, Jiangtao Gong, Hao Xu, Hao Chen, Hao Zhao, Wenbing Huang, Guyue Zhou

In this paper, we present a graph-transformer based framework for the ASP problem which is trained and demonstrated on a self-collected ASP database.

Understanding Embodied Reference with Touch-Line Transformer

1 code implementation11 Oct 2022 Yang Li, Xiaoxue Chen, Hao Zhao, Jiangtao Gong, Guyue Zhou, Federico Rossano, Yixin Zhu

Human studies have revealed that objects referred to or pointed to do not lie on the elbow-wrist line, a common misconception; instead, they lie on the so-called virtual touch line.

City-scale Incremental Neural Mapping with Three-layer Sampling and Panoptic Representation

no code implementations28 Sep 2022 Yongliang Shi, Runyi Yang, Pengfei Li, Zirui Wu, Hao Zhao, Guyue Zhou

Neural implicit representations are drawing a lot of attention from the robotics community recently, as they are expressive, continuous and compact.

LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF

1 code implementation18 Sep 2022 Zhenxin Zhu, Yuantao Chen, Zirui Wu, Chao Hou, Yongliang Shi, Chuxuan Li, Pengfei Li, Hao Zhao, Guyue Zhou

In this paper, we present LATITUDE: Global Localization with Truncated Dynamic Low-pass Filter, which introduces a two-stage localization mechanism in city-scale NeRF.

Pose Prediction

SOM-Net: Unrolling the Subspace-based Optimization for Solving Full-wave Inverse Scattering Problems

no code implementations8 Sep 2022 Yu Liu, Hao Zhao, Rencheng Song, Xudong Chen, Chang Li, Xun Chen

The final output of the SOM-Net is the full predicted induced current, from which the scattered field and the permittivity image can also be deduced analytically.

Rolling Shutter Correction

Distance-Aware Occlusion Detection with Focused Attention

1 code implementation23 Aug 2022 Yang Li, Yucheng Tu, Xiaoxue Chen, Hao Zhao, Guyue Zhou

In this work, (1) we propose a novel three-decoder architecture as the infrastructure for focused attention; 2) we use the generalized intersection box prediction task to effectively guide our model to focus on occlusion-specific regions; 3) our model achieves a new state-of-the-art performance on distance-aware relationship detection.

Human-Object Interaction Detection Relationship Detection +1

Language-guided Semantic Style Transfer of 3D Indoor Scenes

1 code implementation16 Aug 2022 Bu Jin, Beiwen Tian, Hao Zhao, Guyue Zhou

We address the new problem of language-guided semantic style transfer of 3D indoor scenes.

Style Transfer

Model-Driven Based Deep Unfolding Equalizer for Underwater Acoustic OFDM Communications

no code implementations10 Jul 2022 Hao Zhao, Cui Yang, Yalu Xu, Fei Ji, Miaowen Wen, Yankun Chen

Each layer of UDNet is designed according to the classical minimum mean square error (MMSE) equalizer.

SNAKE: Shape-aware Neural 3D Keypoint Field

1 code implementation3 Jun 2022 Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang

Detecting 3D keypoints from point clouds is important for shape reconstruction, while this work investigates the dual question: can shape reconstruction benefit 3D keypoint detection?

Keypoint Detection

High-Fidelity Human Avatars From a Single RGB Camera

no code implementations CVPR 2022 Hao Zhao, Jinsong Zhang, Yu-Kun Lai, Zerong Zheng, Yingdi Xie, Yebin Liu, Kun Li

To cope with the complexity of textures and generate photo-realistic results, we propose a reference-based neural rendering network and exploit a bottom-up sharpening-guided fine-tuning strategy to obtain detailed textures.

Neural Rendering Vocal Bursts Intensity Prediction

Transferable End-to-end Room Layout Estimation via Implicit Encoding

no code implementations21 Dec 2021 Hao Zhao, Rene Ranftl, Yurong Chen, Hongbin Zha

Here we propose an end-to-end method that directly predicts parametric layouts from an input panorama image.

Room Layout Estimation

Semi-supervised Implicit Scene Completion from Sparse LiDAR

1 code implementation29 Nov 2021 Pengfei Li, Yongliang Shi, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Recent advances show that semi-supervised implicit representation learning can be achieved through physical constraints like Eikonal equations.

Representation Learning

Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing

1 code implementation CVPR 2022 Xiaoxue Chen, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Multi-task indoor scene understanding is widely considered as an intriguing formulation, as the affinity of different tasks may lead to improved performance.

Scene Understanding Semantic Segmentation +1

PQ-Transformer: Jointly Parsing 3D Objects and Layouts from Point Clouds

1 code implementation12 Sep 2021 Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Such a scheme has two limitations: 1) Storing and running several networks for different tasks are expensive for typical robotic platforms.

object-detection Object Detection +2

Constrained R-CNN: A general image manipulation detection model

no code implementations19 Nov 2019 Chao Yang, Huizhou Li, Fangting Lin, Bin Jiang, Hao Zhao

Finally, the coarse localization information guides the model to further learn the finer local features and segment out the tampered region.

General Classification Image Forensics +3

Deeply-supervised Knowledge Synergy

1 code implementation CVPR 2019 Dawei Sun, Anbang Yao, Aojun Zhou, Hao Zhao

Convolutional Neural Networks (CNNs) have become deeper and more complicated compared with the pioneering AlexNet.

General Classification Image Classification

A Closed-form Solution to Universal Style Transfer

2 code implementations ICCV 2019 Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen, Feng Xu, Li Zhang

Although plenty of methods have been proposed, a theoretical analysis of feature transform is still missing.

Style Transfer

SnapQuant: A Probabilistic and Nested Parameterization for Binary Networks

no code implementations27 Sep 2018 Kuan Wang, Hao Zhao, Anbang Yao, Aojun Zhou, Dawei Sun, Yurong Chen

During the training phase, we generate binary weights on-the-fly since what we actually maintain is the policy network, and all the binary weights are used in a burn-after-reading style.

Network Sketching: Exploiting Binary Structure in Deep CNNs

no code implementations CVPR 2017 Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen

Convolutional neural networks (CNNs) with deep architectures have substantially advanced the state-of-the-art in computer vision tasks.

Cannot find the paper you are looking for? You can Submit a new open access paper.