no code implementations • ECCV 2020 • Xiaofeng Yang, Guosheng Lin, Fengmao Lv, Fayao Liu
Compositional visual question answering requires reasoning over both semantic and geometry object relations.
no code implementations • 17 Feb 2025 • Chaoyue Song, Jianfeng Zhang, Xiu Li, Fan Yang, YiWen Chen, Zhongcong Xu, Jun Hao Liew, Xiaoyang Guo, Fayao Liu, Jiashi Feng, Guosheng Lin
With the explosive growth of 3D content creation, there is an increasing demand for automatically converting static 3D models into articulation-ready versions that support realistic animation.
no code implementations • 9 Sep 2024 • Chengzeng Feng, Jiacheng Wei, Cheng Chen, Yang Li, Pan Ji, Fayao Liu, Hongdong Li, Guosheng Lin
We propose Prim2Room, a novel framework for controllable room mesh generation leveraging 2D layout conditions and 3D primitive retrieval to facilitate precise 3D layout specification.
no code implementations • 5 Aug 2024 • Weide Liu, Xingxing Wang, Lu Wang, Jun Cheng, Fayao Liu, Xulei Yang
In this paper, we introduce a novel Gaussian mixture based evidential learning solution for robust stereo matching.
4 code implementations • 5 Jun 2024 • Xiaofeng Yang, Cheng Chen, Xulei Yang, Fayao Liu, Guosheng Lin
Besides the generative capabilities of diffusion priors, motivated by the unique time-symmetry properties of rectified flow models, a variant of our method can additionally perform image inversion.
no code implementations • 27 May 2024 • Zhoujie Fu, Jiacheng Wei, Wenhao Shen, Chaoyue Song, Xiaofeng Yang, Fayao Liu, Xulei Yang, Guosheng Lin
In this work, we introduce a novel approach for creating controllable dynamics in 3D-generated Gaussians using casually captured reference videos.
no code implementations • 7 May 2024 • Zhihao Wen, Yuan Fang, Pengcheng Wei, Fayao Liu, Zhenghua Chen, Min Wu
To capture the nuances of the temporal and spatial relationships and heterogeneous characteristics in an interconnected graph of sensors, we introduce a novel model named Temporal and Heterogeneous Graph Neural Networks (THGNN).
no code implementations • CVPR 2024 • Chaoyue Song, Jiacheng Wei, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu
In this paper, we address the challenge of reconstructing general articulated 3D objects from a single video.
no code implementations • CVPR 2024 • Cheng Chen, Xiaofeng Yang, Fan Yang, Chengzeng Feng, Zhoujie Fu, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu
In this paper, we present a new framework Sculpt3D that equips the current pipeline with explicit injection of 3D priors from retrieved reference objects without re-training the 2D diffusion model.
1 code implementation • CVPR 2024 • Zhaochong An, Guolei Sun, Yun Liu, Fayao Liu, Zongwei Wu, Dan Wang, Luc van Gool, Serge Belongie
The former arises from non-uniform point sampling, allowing models to distinguish the density disparities between foreground and background for easier segmentation.
Few-shot 3D Point Cloud Semantic Segmentation
Segmentation
+1
1 code implementation • 29 Feb 2024 • Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin
Lastly, to further improve the training process, FSS proposes a mesh thickness loss signal for pixel-aligned implicit models.
1 code implementation • CVPR 2024 • Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin
To this end we propose R-Cyclic Diffuser a framework that adapts Zero-1-to-3's novel approach to clothed human data by fusing it with a pixel-aligned implicit model.
no code implementations • 8 Dec 2023 • Xiaofeng Yang, YiWen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin
We propose a unified framework aimed at enhancing the diffusion priors for 3D generation tasks.
no code implementations • 30 Nov 2023 • Xiaosheng He, Fan Yang, Fayao Liu, Guosheng Lin
Many works propose to fine-tune a pre-trained GAN model.
no code implementations • 3 Nov 2023 • Shichao Dong, Fayao Liu, Guosheng Lin
Recently, large-scale pre-trained models such as Segment-Anything Model (SAM) and Contrastive Language-Image Pre-training (CLIP) have demonstrated remarkable success and revolutionized the field of computer vision.
no code implementations • 13 Sep 2023 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
In this work, we tackle the challenging problem of long-tailed image recognition.
1 code implementation • ICCV 2023 • Jieming Lou, Weide Liu, Zhuo Chen, Fayao Liu, Jun Cheng
Although existing stereo matching models have achieved continuous improvement, they often face issues related to trustworthiness due to the absence of uncertainty estimation.
1 code implementation • 17 Apr 2023 • Chaoyue Song, Jiacheng Wei, Tianyi Chen, YiWen Chen, Chuan Sheng Foo, Fayao Liu, Guosheng Lin
To solve this problem, we propose neural dual quaternion blend skinning (NeuDBS) to achieve 3D point deformation, which can perform rigid transformation without skin-collapsing artifacts.
no code implementations • 18 Jan 2023 • Xiaofeng Yang, Fayao Liu, Guosheng Lin
Current vision language pretraining models are dominated by methods using region visual features extracted from object detectors.
1 code implementation • 18 Nov 2022 • Chaoyue Song, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
With $G$ as the basic component, we propose a cross consistency learning scheme and a dual reconstruction objective to learn the pose transfer without supervision.
no code implementations • 23 Aug 2022 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
1 code implementation • ICCV 2023 • Shichao Dong, Ruibo Li, Jiacheng Wei, Fayao Liu, Guosheng Lin
Instance segmentation on 3D point clouds has been attracting increasing attention due to its wide applications, especially in scene understanding areas.
Ranked #23 on
3D Instance Segmentation
on ScanNet(v2)
no code implementations • 2 Jun 2022 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training.
Ranked #9 on
Long-tail Learning
on CIFAR-10-LT (ρ=10)
no code implementations • 6 Jan 2022 • Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin
We use the labeled image data to train a teacher model and use the trained model to generate pseudo captions on unlabeled image data.
1 code implementation • CVPR 2022 • Hanyu Shi, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
We propose a novel temporal-spatial framework for effective weakly supervised learning to generate high-quality pseudo labels from these limited annotated data.
1 code implementation • 11 Dec 2021 • Wanyue Zhang, Xun Xu, Fayao Liu, Chuan-Sheng Foo
Data augmentation is an important technique to reduce overfitting and improve learning performance, but existing works on data augmentation for 3D point cloud data are based on heuristics.
Ranked #1 on
3D Point Cloud Data Augmentation
on ModelNet40
1 code implementation • 9 Nov 2021 • Chaitanya K. Joshi, Fayao Liu, Xu Xun, Jie Lin, Chuan-Sheng Foo
Past work on distillation for GNNs proposed the Local Structure Preserving loss (LSP), which matches local structural relationships defined over edges across the student and teacher's node embeddings.
1 code implementation • NeurIPS 2021 • Chaoyue Song, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
It aims to transfer the pose of a source mesh to a target mesh and keep the identity (e. g., body shape) of the target mesh.
1 code implementation • 11 Aug 2021 • Weide Liu, Zhonghua Wu, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
To this end, we first propose a prior extractor to learn the query information from the unlabeled images with our proposed global-local contrastive learning.
no code implementations • 4 Aug 2021 • Fayao Liu, Guosheng Lin, Chuan-Sheng Foo, Chaitanya K. Joshi, Jie Lin
In this work we propose PointDisc, a point discriminative learning method to leverage self-supervisions for data-efficient 3D point cloud classification and segmentation.
no code implementations • 23 Jul 2021 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung
While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels.
no code implementations • CVPR 2021 • Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen
Scene flow in 3D point clouds plays an important role in understanding dynamic environments.
no code implementations • 21 Sep 2020 • Ruibing Jin, Guosheng Lin, Changyun Wen, Jianliang Wang, Fayao Liu
Optical flow, which expresses pixel displacement, is widely used in many computer vision tasks to provide pixel-level motion information.
no code implementations • CVPR 2020 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu
In this paper, we propose a cross-reference network (CRNet) for few-shot segmentation.
no code implementations • CVPR 2019 • Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh
It is challenging to detect curve texts due to their irregular shapes and varying sizes.
1 code implementation • CVPR 2019 • Chi Zhang, Guosheng Lin, Fayao Liu, Rui Yao, Chunhua Shen
Recent progress in semantic segmentation is driven by deep Convolutional Neural Networks and large-scale labeled image datasets.
Ranked #93 on
Few-Shot Semantic Segmentation
on PASCAL-5i (5-Shot)
no code implementations • 30 Sep 2018 • Zichuan Liu, Guosheng Lin, Wang Ling Goh, Fayao Liu, Chunhua Shen, Xiaokang Yang
In this work, we propose a novel hybrid method for scene text detection namely Correlation Propagation Network (CPN).
no code implementations • 26 Mar 2017 • Fayao Liu, Guosheng Lin, Ruizhi Qiao, Chunhua Shen
In this fashion, we easily achieve nonlinear learning of potential functions on both unary and pairwise terms in CRFs.
no code implementations • 22 Feb 2016 • Guosheng Lin, Fayao Liu, Chunhua Shen, Jianxin Wu, Heng Tao Shen
Our column generation based method can be further generalized from the triplet loss to a general structured learning based framework that allows one to directly optimize multivariate performance measures.
no code implementations • 28 Jan 2016 • Fayao Liu, Guosheng Lin, Chunhua Shen
We exemplify the usefulness of the proposed model on multi-class semantic labelling (discrete) and the robust depth estimation (continuous) problems.
no code implementations • 28 Mar 2015 • Fayao Liu, Guosheng Lin, Chunhua Shen
The deep CNN is trained on the ImageNet dataset and transferred to image segmentations here for constructing potentials of superpixels.
1 code implementation • 26 Feb 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin, Ian Reid
Therefore, here we present a deep convolutional neural field model for estimating depths from single monocular images, aiming to jointly explore the capacity of deep CNN and continuous CRF.
no code implementations • CVPR 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin
Therefore, we in this paper present a deep convolutional neural field model for estimating depths from a single image, aiming to jointly explore the capacity of deep CNN and continuous CRF.
no code implementations • 13 Apr 2014 • Fayao Liu, Chunhua Shen
In this work, we propose to learn deep convolutional image features using unsupervised and supervised learning.
no code implementations • 4 Jan 2014 • Chunhua Shen, Fayao Liu
This finding not only enables us to design new ensemble learning methods directly from kernel methods, but also makes it possible to take advantage of those highly-optimized fast linear SVM solvers for ensemble learning.
no code implementations • 7 Oct 2013 • Fayao Liu, Chunhua Shen, Ian Reid, Anton Van Den Hengel
Feature encoding with respect to an over-complete dictionary learned by unsupervised methods, followed by spatial pyramid pooling, and linear classification, has exhibited powerful strength in various vision applications.
no code implementations • 3 Oct 2013 • Fayao Liu, Luping Zhou, Chunhua Shen, Jianping Yin
In this work, we propose a novel multiple kernel learning framework to combine multi-modal features for AD classification, which is scalable and easy to implement.