no code implementations • 28 Aug 2024 • Zhaoxuan Wang, Xu Han, Hongxin Liu, Xianzhi Li
Particularly, our RIDE is compatible and easy to plug into the existing one-stage and two-stage 3D detectors, and boosts both detection performance and rotation robustness.
1 code implementation • 28 Aug 2024 • Yuan Tang, Xu Han, Xianzhi Li, Qiao Yu, Jinfeng Xu, Yixue Hao, Long Hu, Min Chen
To address this task, we introduce GreenPLM, which leverages more text data to compensate for the lack of 3D data.
1 code implementation • 22 Jun 2024 • Qiao Yu, Xianzhi Li, Yuan Tang, Jinfeng Xu, Long Hu, Yixue Hao, Min Chen
Addressing this, we propose PointDreamer, a novel framework for textured mesh reconstruction from colored point cloud.
1 code implementation • 2 May 2024 • Yuan Tang, Xu Han, Xianzhi Li, Qiao Yu, Yixue Hao, Long Hu, Min Chen
Notably, MiniGPT-3D gains an 8. 12 increase on GPT-4 evaluation score for the challenging object captioning task compared to ShapeLLM-13B, while the latter costs 160 total GPU-hours on 8 A800.
Ranked #1 on 3D Object Captioning on Objaverse
1 code implementation • 23 Apr 2024 • Xu Han, Yuan Tang, Zhaoxuan Wang, Xianzhi Li
In contrast, the newly proposed Mamba model, based on state space models (SSM), outperforms Transformer in multiple areas with only linear complexity.
1 code implementation • CVPR 2024 • Jinfeng Xu, Siyuan Yang, Xianzhi Li, Yuan Tang, Yixue Hao, Long Hu, Min Chen
Existing point cloud semantic segmentation networks cannot identify unknown classes and update their knowledge, due to a closed-set and static perspective of the real world, which would induce the intelligent agent to make bad decisions.
no code implementations • 8 Nov 2023 • Biqi Yang, Weiliang Tang, Xiaojie Gao, Xianzhi Li, Yun-hui Liu, Chi-Wing Fu, Pheng-Ann Heng
In large-scale storehouses, precise instance masks are crucial for robotic bin picking but are challenging to obtain.
1 code implementation • 4 Sep 2023 • Yong Cao, Ruixue Ding, Boli Chen, Xianzhi Li, Min Chen, Daniel Hershcovich, Pengjun Xie, Fei Huang
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates, which is crucial for location-related services such as navigation maps.
5 code implementations • 1 Sep 2023 • Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng
We introduce Point-Bind, a 3D multi-modality model aligning point clouds with 2D image, language, audio, and video.
Ranked #5 on 3D Question Answering (3D-QA) on 3D MM-Vet
1 code implementation • 25 Aug 2023 • Yihao Fang, Xianzhi Li, Stephen W. Thomas, Xiaodan Zhu
Open intent detection, a crucial aspect of natural language understanding, involves the identification of previously unseen intents in user-generated text.
Ranked #1 on Open Intent Detection on StackOverflow_CG
1 code implementation • IEEE Transactions on Multimedia 2023 • Yuan Tang, Xianzhi Li, Jinfeng Xu, Qiao Yu, Long Hu, Yixue Hao, Min Chen
In our work, we present Point-LGMask, a novel method to embed both local and global contexts with multi-ratio masking, which is quite effective for self-supervised feature learning of point clouds but is unfortunately ignored by existing pre-training works.
Ranked #3 on Few-Shot 3D Point Cloud Classification on ModelNet40 5-way (10-shot) (using extra training data)
no code implementations • 12 May 2023 • Zizhang Wu, Zhuozheng Li, Zhi-Gang Fan, Yunzhe Wu, Yuanzhu Gan, Jian Pu, Xianzhi Li
During the refinement process, context-aware temporal attention (CTA) is developed to capture the global temporal-context correlations to maintain the feature consistency and estimation integrity of moving objects.
no code implementations • 10 May 2023 • Xianzhi Li, Samuel Chan, Xiaodan Zhu, Yulong Pei, Zhiqiang Ma, Xiaomo Liu, Sameena Shah
The most recent large language models(LLMs) such as ChatGPT and GPT-4 have shown exceptional capabilities of generalist models, achieving state-of-the-art performance on a wide range of NLP tasks with little or no adaptation.
Ranked #1 on Question Answering on ConvFinQA
no code implementations • 3 May 2023 • Yong Cao, Xianzhi Li, Huiwen Liu, Wen Dai, Shuai Chen, Bin Wang, Min Chen, Daniel Hershcovich
In this study, we propose a novel framework, RE-KBQA, that utilizes relations in the knowledge base to enhance entity representation and introduce additional supervision.
no code implementations • 27 Feb 2023 • Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzhi Li, Pheng-Ann Heng
In this paper, we explore how the 2D modality can benefit 3D masked autoencoding, and propose Joint-MAE, a 2D-3D joint MAE framework for self-supervised 3D point cloud pre-training.
no code implementations • 8 Dec 2022 • Zizhang Wu, Yuanzhu Gan, Xianzhi Li, Yunzhe Wu, Xiaoquan Wang, Tianhao Xu, Fan Wang
Most existing networks based on public datasets may generalize suboptimal results on these valet parking scenes, also affected by the fisheye distortion.
no code implementations • 30 Nov 2022 • Zizhang Wu, Yunzhe Wu, Jian Pu, Xianzhi Li, Xiaoquan Wang
Specifically, we leverage intermediate features and responses for knowledge distillation.
2 code implementations • 24 Nov 2022 • Jinfeng Xu, Xianzhi Li, Yuan Tang, Qiao Yu, Yixue Hao, Long Hu, Min Chen
In our work, we present CasFusionNet, a novel cascaded network for point cloud semantic scene completion by dense feature fusion.
no code implementations • 25 Oct 2022 • Xianzhi Li, Will Aitken, Xiaodan Zhu, Stephen W. Thomas
With the recent surge of NLP technologies in the financial domain, banks and other financial entities have adopted virtual agents (VA) to assist customers.
1 code implementation • Findings (NAACL) 2022 • Yong Cao, Wei Li, Xianzhi Li, Min Chen, Guangyong Chen, Long Hu, Zhengdao Li, Hwang Kai
Sign language recognition and translation first uses a recognition module to generate glosses from sign language videos and then employs a translation module to translate glosses into spoken sentences.
no code implementations • 5 Mar 2022 • Yidan Feng, Biqi Yang, Xianzhi Li, Chi-Wing Fu, Rui Cao, Kai Chen, Qi Dou, Mingqiang Wei, Yun-hui Liu, Pheng-Ann Heng
Industrial bin picking is a challenging task that requires accurate and robust segmentation of individual object instances.
1 code implementation • 28 Feb 2022 • Ruihui Li, Xianzhi Li, Tien-Tsin Wong, Chi-Wing Fu
To achieve a learnable self-embedding scheme, we design a novel framework with two jointly-trained networks: one to encode the input point set into its self-embedded sparse point set and the other to leverage the embedded information for inverting the original point set back.
1 code implementation • 10 Aug 2021 • Ruihui Li, Xianzhi Li, Ka-Hei Hui, Chi-Wing Fu
We present SP-GAN, a new unsupervised sphere-guided generative model for direct synthesis of 3D shapes in the form of point clouds.
3 code implementations • CVPR 2021 • Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu
Point clouds produced by 3D scanning are often sparse, non-uniform, and noisy.
1 code implementation • 16 Mar 2020 • Xianzhi Li, Ruihui Li, Guangyong Chen, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
Recently, many deep neural networks were designed to process 3D point clouds, but a common drawback is that rotation invariance is not ensured, leading to poor generalization to arbitrary orientations.
no code implementations • 14 Mar 2020 • Chao Huang, Ruihui Li, Xianzhi Li, Chi-Wing Fu
This paper presents a novel non-local part-aware deep neural network to denoise point clouds by exploring the inherent non-local self-similarity in 3D objects and scenes.
2 code implementations • CVPR 2020 • Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu
We present PointAugment, a new auto-augmentation framework that automatically optimizes and augments point cloud samples to enrich the data diversity when we train a classification network.
Ranked #2 on 3D Point Cloud Data Augmentation on ModelNet40
3 code implementations • ICCV 2019 • Zhiliang Zeng, Xianzhi Li, Ying Kin Yu, Chi-Wing Fu
Besides walls and rooms, we aim to recognize diverse floor plan elements, such as doors, windows and different types of rooms, in the floor layouts.
3 code implementations • ICCV 2019 • Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
Point clouds acquired from range scans are often sparse, noisy, and non-uniform.
no code implementations • 5 May 2019 • Xianzhi Li, Lequan Yu, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
This paper presents a novel approach to learn and detect distinctive regions on 3D shapes.
no code implementations • ECCV 2018 • Lequan Yu, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
In this paper, we present the first deep learning based edge-aware technique to facilitate the consolidation of point clouds.
3 code implementations • CVPR 2018 • Lequan Yu, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng
Learning and analyzing 3D point clouds with deep networks is challenging due to the sparseness and irregularity of the data.
Ranked #3 on Point Cloud Super Resolution on SHREC15