no code implementations • ICCV 2023 • Jiashuo Fan, Yaoyuan Liang, Leyao Liu, ShaoLun Huang, Lei Zhang
We evaluate our approach on two datasets and show that our proposed RCA-NOC approach outperforms state-of-the-art methods by a large margin, demonstrating its effectiveness in improving vision-language representation for novel object captioning.
no code implementations • 19 Jul 2023 • Leyao Liu, Tao Kong, Minzhao Zhu, Jiashuo Fan, Lu Fang
Instead of directly using the model inference way, i. e., mean-shift clustering, to generate the pseudo labels, we propose to use k-means with fixed initial seeds: the annotated points.
no code implementations • CVPR 2022 • Leyao Liu, Tian Zheng, Yun-Jou Lin, Kai Ni, Lu Fang
Based on INS-Conv, an online joint 3D semantic and instance segmentation pipeline is proposed, reaching an inference speed of 15 FPS on GPU and 10 FPS on CPU.