no code implementations • ECCV 2020 • Haoang Li, Pyojin Kim, Ji Zhao, Kyungdon Joo, Zhipeng Cai, Zhe Liu , Yun-hui Liu
In Atlanta world, given a set of image lines, we aim to cluster them by the unknown-but-sought VPs whose number is unknown.
1 code implementation • 14 Mar 2025 • Zhicheng Feng, Xieyuanli Chen, Chenghao Shi, Lun Luo, Zhichao Chen, Yun-hui Liu, Huimin Lu
In this paper, we introduce a novel image-goal navigation approach, named RFSG.
no code implementations • 23 Jan 2025 • Wailing Tang, Biqi Yang, Pheng-Ann Heng, Yun-hui Liu, Chi-Wing Fu
Few-shot Semantic Segmentation (FSS) is a challenging task that utilizes limited support images to segment associated unseen objects in query images.
no code implementations • 23 Sep 2024 • Rui Cao, Chuanxin Song, Biqi Yang, Jiangliu Wang, Pheng-Ann Heng, Yun-hui Liu
Unseen Object Instance Segmentation (UOIS) is crucial for autonomous robots operating in unstructured environments.
no code implementations • 18 Sep 2024 • Lei Cheng, Junpeng Hu, Haodong Yan, Mariia Gladkova, Tianyu Huang, Yun-hui Liu, Daniel Cremers, Haoang Li
Photometric bundle adjustment (PBA) is widely used in estimating the camera pose and 3D geometry by assuming a Lambertian world.
1 code implementation • 4 Sep 2024 • Jiaxin Guo, Jiangliu Wang, Ruofeng Wei, Di Kang, Qi Dou, Yun-hui Liu
In neural rendering, we design a base-adaptive NeRF network to exploit the uncertainty estimation for explicitly handling the photometric inconsistencies.
1 code implementation • 11 Jul 2024 • Rui Cao, Jiangliu Wang, Yun-hui Liu
Inspired by the recent success of Mamba, a state space model with linear scalability in sequence length, this paper presents SR-Mamba, a novel attention-free model specifically tailored to meet the challenges of surgical phase recognition.
1 code implementation • 3 Jul 2024 • Jiaxin Guo, Jiangliu Wang, Di Kang, Wenzhen Dong, Wenting Wang, Yun-hui Liu
To tackle this problem, in this paper, we propose the first SfM-free 3DGS-based method for surgical scene reconstruction by jointly optimizing the camera poses and scene representation.
1 code implementation • 9 Apr 2024 • Tianyu Huang, Haoang Li, Liangzu Peng, Yinlong Liu, Yun-hui Liu
Our strategy largely reduces the search space and can guarantee accuracy with only a few inlier samples, therefore enjoying an excellent trade-off between efficiency and robustness.
1 code implementation • CVPR 2024 • Tianyu Huang, Liangzu Peng, René Vidal, Yun-hui Liu
Given an input set of $3$D point pairs, the goal of outlier-robust $3$D registration is to compute some rotation and translation that align as many point pairs as possible.
1 code implementation • CVPR 2024 • Tongfan Guan, Chen Wang, Yun-hui Liu
Stereo matching is a core task for many computer vision and robotics applications.
no code implementations • 11 Mar 2024 • Jiaxin Guo, Jiangliu Wang, Zhaoshuo Li, Tongyu Jia, Qi Dou, Yun-hui Liu
Soft tissue tracking is crucial for computer-assisted interventions.
no code implementations • 8 Nov 2023 • Biqi Yang, Weiliang Tang, Xiaojie Gao, Xianzhi Li, Yun-hui Liu, Chi-Wing Fu, Pheng-Ann Heng
In large-scale storehouses, precise instance masks are crucial for robotic bin picking but are challenging to obtain.
no code implementations • 25 Sep 2023 • Jiangliu Wang, Jianbo Jiao, Yibing Song, Stephen James, Zhan Tong, Chongjian Ge, Pieter Abbeel, Yun-hui Liu
This work aims to improve unsupervised audio-visual pre-training.
1 code implementation • CVPR 2023 • XiaoYu Zhang, Yun-hui Liu
Furthermore, to reduce the influence of different spatial distributions between the mapping and query sequences, which is not considered in previous methods, we also introduce a space constraint term based on 3D discretized grids.
2 code implementations • 20 Feb 2023 • Tao Huang, Kai Chen, Bin Li, Yun-hui Liu, Qi Dou
Task automation of surgical robot has the potentials to improve surgical efficiency.
no code implementations • ICCV 2023 • Haoang Li, Jinhu Dong, Binghui Wen, Ming Gao, Tianyu Huang, Yun-hui Liu, Daniel Cremers
It abstracts the shape prior of a category, and thus can provide constraints on the overall shape of an instance.
1 code implementation • CVPR 2023 • Sheng Xie, Daochuan Wang, Yun-hui Liu
Estimating depth from four large field of view (FoV) cameras has been a difficult and understudied problem.
no code implementations • CVPR 2023 • Tianyu Huang, Haoang Li, Kejing He, Congying Sui, Bin Li, Yun-hui Liu
As to the orthographic projection problem, we propose a novel Viewing Direction-aided Positional Encoding (VDPE) strategy.
no code implementations • 3 Nov 2022 • Kai Chen, Stephen James, Congying Sui, Yun-hui Liu, Pieter Abbeel, Qi Dou
To further improve the performance of the stereo framework, StereoPose is equipped with a parallax attention module for stereo feature fusion and an epipolar loss for improving the stereo-view consistency of network predictions.
no code implementations • 14 Apr 2022 • Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-hui Liu, Pieter Abbeel, Qi Dou
To continuously improve the quality of pseudo labels, we iterate the above steps by taking the trained student model as a new teacher and re-label real data using the refined teacher model.
no code implementations • 5 Mar 2022 • Yidan Feng, Biqi Yang, Xianzhi Li, Chi-Wing Fu, Rui Cao, Kai Chen, Qi Dou, Mingqiang Wei, Yun-hui Liu, Pheng-Ann Heng
Industrial bin picking is a challenging task that requires accurate and robust segmentation of individual object instances.
no code implementations • CVPR 2022 • Wen Chen, Haoang Li, Qiang Nie, Yun-hui Liu
Given a set of putative 3D-3D point correspondences, we aim to remove outliers and estimate rigid transformation with 6 degrees of freedom (DOF).
1 code implementation • 30 Aug 2021 • Jiaqi Xu, Bin Li, Bo Lu, Yun-hui Liu, Qi Dou, Pheng-Ann Heng
Ten learning-based surgical tasks are built in the platform, which are common in the real autonomous surgical execution.
no code implementations • CVPR 2021 • Haoang Li, Kai Chen, Ji Zhao, Jiangliu Wang, Pyojin Kim, Zhe Liu, Yun-hui Liu
In contrast, we propose the first approach suitable for both structured and unstructured scenes.
no code implementations • 24 Mar 2021 • Zixu Zhao, Yueming Jin, Bo Lu, Chi-Fai Ng, Qi Dou, Yun-hui Liu, Pheng-Ann Heng
To greatly increase the label efficiency, we explore a new problem, i. e., adaptive instrument segmentation, which is to effectively adapt one source model to new robotic surgical videos from multiple target domains, only given the annotated instruments in the first frame.
no code implementations • ICCV 2021 • Haoang Li, Kai Chen, Pyojin Kim, Kuk-Jin Yoon, Zhe Liu, Kyungdon Joo, Yun-hui Liu
Based on this map, we can detect all the VPs.
no code implementations • 3 Nov 2020 • Yonghao Long, Jie Ying Wu, Bo Lu, Yueming Jin, Mathias Unberath, Yun-hui Liu, Pheng Ann Heng, Qi Dou
Automatic surgical gesture recognition is fundamentally important to enable intelligent cognitive assistance in robotic surgery.
Ranked #1 on
Action Segmentation
on JIGSAWS
2 code implementations • 31 Aug 2020 • Jiangliu Wang, Jianbo Jiao, Linchao Bao, Shengfeng He, Wei Liu, Yun-hui Liu
Specifically, given an unlabeled video clip, we compute a series of spatio-temporal statistical summaries, such as the spatial location and dominant direction of the largest motion, the spatial location and dominant color of the largest color diversity along the temporal axis, etc.
1 code implementation • ECCV 2020 • Jiangliu Wang, Jianbo Jiao, Yun-hui Liu
This paper addresses the problem of self-supervised video representation learning from a new perspective -- by video pace prediction.
1 code implementation • ECCV 2020 • Qiang Nie, Ziwei Liu, Yun-hui Liu
Learning a good 3D human pose representation is important for human pose related tasks, e. g. human 3D pose estimation and action recognition.
Ranked #14 on
Self-supervised Skeleton-based Action Recognition
on NTU RGB+D
(Accuracy (XView) metric)
no code implementations • ECCV 2020 • Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, Mingxiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yun-hui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim
To address these issues, we designed a public challenge (HANDS'19) to evaluate the abilities of current 3D hand pose estimators (HPEs) to interpolate and extrapolate the poses of a training set.
no code implementations • 12 Nov 2019 • Weiguo Zhou, Xin Jiang, Chen Chen, Sijia Mei, Yun-hui Liu
In this paper, we propose a method that takes advantage of human hand morphological topology (HMT) structure to improve the pose estimation performance.
Robotics Human-Computer Interaction
no code implementations • 19 Sep 2019 • Jie Zhao, Xin Jiang, Xiaoman Wang, Shengfan Wang, Yun-hui Liu
The proposal in this paper is verified by a simulated assembly in which a robot arm completed the assembly process including parts picking from bin and a subsequent peg-in-hole assembly.
no code implementations • 30 Apr 2019 • Zhe Liu, Chuanzhe Suo, Shunbo Zhou, Huanshu Wei, Yingtian Liu, Hesheng Wang, Yun-hui Liu
Place recognition and loop-closure detection are main challenges in the localization, mapping and navigation tasks of self-driving vehicles.
1 code implementation • CVPR 2019 • Jiangliu Wang, Jianbo Jiao, Linchao Bao, Shengfeng He, Yun-hui Liu, Wei Liu
We conduct extensive experiments with C3D to validate the effectiveness of our proposed approach.
Ranked #47 on
Self-Supervised Action Recognition
on HMDB51
no code implementations • 24 Feb 2019 • Shengfan Wang, Xin Jiang, Jie Zhao, Xiaoman Wang, Weiguo Zhou, Yun-hui Liu, Fellow IEEE
This paper presents an efficient neural network model to generate robotic grasps with high resolution images.
Robotics
2 code implementations • ICCV 2019 • Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Yingtian Liu, Peng Yin, Hesheng Wang, Yun-hui Liu
Point cloud based place recognition is still an open issue due to the difficulty in extracting local features from the raw 3D point cloud and generating the global descriptor, and it's even harder in the large-scale dynamic environments.
Ranked #5 on
3D Place Recognition
on CS-Campus3D