no code implementations • 17 May 2023 • Hao Zhao
Subsequently, we extend Bumblebee to support LE Coded PHY in BLE version 5 and conduct experiments to verify its performance.
1 code implementation • 25 Apr 2023 • Huan-ang Gao, Beiwen Tian, Pengfei Li, Hao Zhao, Guyue Zhou
While this paradigm is natural for image-level or pixel-level prediction, adapting it to the detection problem is challenged by the issue of proposal matching.
1 code implementation • CVPR 2023 • Xinyu Liu, Beiwen Tian, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Hao Zhao, Guyue Zhou
Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy.
1 code implementation • 17 Apr 2023 • Leiyao Cui, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yixin Zhu
By label affinity, we refer to affordance segmentation as a multi-label prediction problem: A plate can be both holdable and containable.
1 code implementation • CVPR 2023 • Xiaoxue Chen, Yuhang Zheng, Yupeng Zheng, Qiang Zhou, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
We showcase the effectiveness of DPFs using two substantially different tasks: high-level semantic parsing and low-level intrinsic image decomposition.
1 code implementation • 27 Feb 2023 • Pengfei Li, Ruowen Zhao, Yongliang Shi, Hao Zhao, Jirui Yuan, Guyue Zhou, Ya-Qin Zhang
In this paper, we propose a novel Eikonal formulation that conditions the implicit representation on localized shape priors which function as dense boundary value constraints, and demonstrate it works on SemanticKITTI and SemanticPOSS.
1 code implementation • 2 Feb 2023 • Yupeng Zheng, Chengliang Zhong, Pengfei Li, Huan-ang Gao, Yuhang Zheng, Bu Jin, Ling Wang, Hao Zhao, Guyue Zhou, Qichao Zhang, Dongbin Zhao
By fitting a bridge-shaped curve to the illumination map distribution, both regions are suppressed and two tasks are bridged naturally.
1 code implementation • 1 Feb 2023 • Bu Jin, Xinyu Liu, Yupeng Zheng, Pengfei Li, Hao Zhao, Tong Zhang, Yuhang Zheng, Guyue Zhou, Jingjing Liu
To bridge the gap, we propose an end-to-end transformer-based architecture, ADAPT (Action-aware Driving cAPtion Transformer), which provides user-friendly natural language narrations and reasoning for each decision making step of autonomous vehicular control and action.
1 code implementation • 31 Jan 2023 • Huan-ang Gao, Beiwen Tian, Pengfei Li, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yurong Chen, Hongbin Zha
But adapting this scheme to the state-of-the-art (SOTA) solution for PC-based layout estimation is not straightforward.
no code implementations • 14 Nov 2022 • Zirui Wu, Yuantao Chen, Runyi Yang, Zhenxin Zhu, Chao Hou, Yongliang Shi, Hao Zhao, Guyue Zhou
Large-scale radiance fields are promising mapping tools for smart transportation applications like autonomous driving or drone delivery.
1 code implementation • 23 Oct 2022 • Xin Wu, Hao Zhao, Shunkai Li, Yingdian Cao, Hongbin Zha
Visual re-localization aims to recover camera poses in a known environment, which is vital for applications like robotics or augmented reality.
1 code implementation • 20 Oct 2022 • Beiwen Tian, Liyi Luo, Hao Zhao, Guyue Zhou
In the first stage, we perform self-supervised representation learning on unlabeled points with the proposed Viewpoint Bottleneck loss function.
1 code implementation • 19 Oct 2022 • Pengfei Li, Beiwen Tian, Yongliang Shi, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
As such, we study the challenging problem of task oriented detection, which aims to find objects that best afford an action indicated by verbs like sit comfortably on.
1 code implementation • 11 Oct 2022 • Lin Ma, Jiangtao Gong, Hao Xu, Hao Chen, Hao Zhao, Wenbing Huang, Guyue Zhou
In this paper, we present a graph-transformer based framework for the ASP problem which is trained and demonstrated on a self-collected ASP database.
1 code implementation • 11 Oct 2022 • Yang Li, Xiaoxue Chen, Hao Zhao, Jiangtao Gong, Guyue Zhou, Federico Rossano, Yixin Zhu
Human studies have revealed that objects referred to or pointed to do not lie on the elbow-wrist line, a common misconception; instead, they lie on the so-called virtual touch line.
no code implementations • 28 Sep 2022 • Yongliang Shi, Runyi Yang, Pengfei Li, Zirui Wu, Hao Zhao, Guyue Zhou
Neural implicit representations are drawing a lot of attention from the robotics community recently, as they are expressive, continuous and compact.
1 code implementation • 18 Sep 2022 • Zhenxin Zhu, Yuantao Chen, Zirui Wu, Chao Hou, Yongliang Shi, Chuxuan Li, Pengfei Li, Hao Zhao, Guyue Zhou
In this paper, we present LATITUDE: Global Localization with Truncated Dynamic Low-pass Filter, which introduces a two-stage localization mechanism in city-scale NeRF.
no code implementations • 8 Sep 2022 • Yu Liu, Hao Zhao, Rencheng Song, Xudong Chen, Chang Li, Xun Chen
The final output of the SOM-Net is the full predicted induced current, from which the scattered field and the permittivity image can also be deduced analytically.
1 code implementation • 23 Aug 2022 • Yang Li, Yucheng Tu, Xiaoxue Chen, Hao Zhao, Guyue Zhou
In this work, (1) we propose a novel three-decoder architecture as the infrastructure for focused attention; 2) we use the generalized intersection box prediction task to effectively guide our model to focus on occlusion-specific regions; 3) our model achieves a new state-of-the-art performance on distance-aware relationship detection.
Human-Object Interaction Detection
Relationship Detection
+1
1 code implementation • 16 Aug 2022 • Bu Jin, Beiwen Tian, Hao Zhao, Guyue Zhou
We address the new problem of language-guided semantic style transfer of 3D indoor scenes.
no code implementations • 10 Jul 2022 • Hao Zhao, Cui Yang, Yalu Xu, Fei Ji, Miaowen Wen, Yankun Chen
Each layer of UDNet is designed according to the classical minimum mean square error (MMSE) equalizer.
1 code implementation • 3 Jun 2022 • Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang
Detecting 3D keypoints from point clouds is important for shape reconstruction, while this work investigates the dual question: can shape reconstruction benefit 3D keypoint detection?
no code implementations • CVPR 2022 • Hao Zhao, Jinsong Zhang, Yu-Kun Lai, Zerong Zheng, Yingdi Xie, Yebin Liu, Kun Li
To cope with the complexity of textures and generate photo-realistic results, we propose a reference-based neural rendering network and exploit a bottom-up sharpening-guided fine-tuning strategy to obtain detailed textures.
no code implementations • 21 Dec 2021 • Hao Zhao, Rene Ranftl, Yurong Chen, Hongbin Zha
Here we propose an end-to-end method that directly predicts parametric layouts from an input panorama image.
1 code implementation • 29 Nov 2021 • Pengfei Li, Yongliang Shi, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
Recent advances show that semi-supervised implicit representation learning can be achieved through physical constraints like Eikonal equations.
1 code implementation • CVPR 2022 • Xiaoxue Chen, Tianyu Liu, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
Multi-task indoor scene understanding is widely considered as an intriguing formulation, as the affinity of different tasks may lead to improved performance.
Ranked #31 on
Semantic Segmentation
on NYU Depth v2
1 code implementation • 17 Sep 2021 • Liyi Luo, Beiwen Tian, Hao Zhao, Guyue Zhou
Semantic understanding of 3D point clouds is important for various robotics applications.
1 code implementation • 12 Sep 2021 • Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
Such a scheme has two limitations: 1) Storing and running several networks for different tasks are expensive for typical robotic platforms.
no code implementations • 24 May 2021 • Hao Zhao, Fei Ji, Quansheng Guan, Qiang Li, Shuai Wang, Hefeng Dong, Miaowen Wen
In summary, the proposed ARC/FML for OoT is a promising scheme for information exchange across water and air.
no code implementations • 17 Oct 2020 • Yunchao Wei, Shuai Zheng, Ming-Ming Cheng, Hang Zhao, LiWei Wang, Errui Ding, Yi Yang, Antonio Torralba, Ting Liu, Guolei Sun, Wenguan Wang, Luc van Gool, Wonho Bae, Junhyug Noh, Jinhwan Seo, Gunhee Kim, Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang, Chuangchuang Tan, Tao Ruan, Guanghua Gu, Shikui Wei, Yao Zhao, Mariia Dobko, Ostap Viniavskyi, Oles Dobosevych, Zhendong Wang, Zhenyuan Chen, Chen Gong, Huanqing Yan, Jun He
The purpose of the Learning from Imperfect Data (LID) workshop is to inspire and facilitate the research in developing novel approaches that would harness the imperfect data and improve the data-efficiency during training.
no code implementations • 19 Nov 2019 • Chao Yang, Huizhou Li, Fangting Lin, Bin Jiang, Hao Zhao
Finally, the coarse localization information guides the model to further learn the finer local features and segment out the tampered region.
1 code implementation • ECCV 2018 • Jiahui Zhang, Hao Zhao, Anbang Yao, Yurong Chen, Li Zhang, Hongen Liao
We introduce Spatial Group Convolution (SGC) for accelerating the computation of 3D dense prediction tasks.
Ranked #9 on
3D Semantic Scene Completion
on SemanticKITTI
1 code implementation • CVPR 2019 • Dawei Sun, Anbang Yao, Aojun Zhou, Hao Zhao
Convolutional Neural Networks (CNNs) have become deeper and more complicated compared with the pioneering AlexNet.
2 code implementations • ICCV 2019 • Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen, Feng Xu, Li Zhang
Although plenty of methods have been proposed, a theoretical analysis of feature transform is still missing.
no code implementations • 27 Sep 2018 • Kuan Wang, Hao Zhao, Anbang Yao, Aojun Zhou, Dawei Sun, Yurong Chen
During the training phase, we generate binary weights on-the-fly since what we actually maintain is the policy network, and all the binary weights are used in a burn-after-reading style.
no code implementations • ICCV 2017 • Ming Lu, Hao Zhao, Anbang Yao, Feng Xu, Yurong Chen, Li Zhang
Our method decomposes the semantic style transfer problem into feature reconstruction part and feature decoder part.
no code implementations • CVPR 2017 • Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang
In this paper, we propose an alternative method to estimate room layouts of cluttered indoor scenes.
no code implementations • CVPR 2017 • Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen
Convolutional neural networks (CNNs) with deep architectures have substantially advanced the state-of-the-art in computer vision tasks.