no code implementations • ECCV 2020 • Yukun Su, Guosheng Lin, Jinhui Zhu, Qingyao Wu
This paper introduces a new method for recognizing violent behavior by learning contextual relationships between related people from human skeleton points.
Ranked #5 on Activity Recognition on RWF-2000
no code implementations • ECCV 2020 • Xiaofeng Yang, Guosheng Lin, Fengmao Lv, Fayao Liu
Compositional visual question answering requires reasoning over both semantic and geometry object relations.
no code implementations • ECCV 2020 • Tianyi Zhang, Guosheng Lin, Weide Liu, Jianfei Cai, Alex Kot
Finally, by training the segmentation model with the masks generated by our Splitting vs Merging strategy, we achieve the state-of-the-art weakly-supervised segmentation results on the Pascal VOC 2012 benchmark.
no code implementations • 9 Sep 2024 • Chengzeng Feng, Jiacheng Wei, Cheng Chen, Yang Li, Pan Ji, Fayao Liu, Hongdong Li, Guosheng Lin
We propose Prim2Room, a novel framework for controllable room mesh generation leveraging 2D layout conditions and 3D primitive retrieval to facilitate precise 3D layout specification.
no code implementations • 25 Aug 2024 • Shichao Dong, Ze Yang, Guosheng Lin
Beyond texture augmentation, we propose a method to automatically alter the shape of objects within 2D images.
no code implementations • 14 Aug 2024 • Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang, Guosheng Lin, Qingyao Wu
Diffusion models usher a new era of video editing, flexibly manipulating the video contents with text prompts.
1 code implementation • 3 Aug 2024 • Xingyi Li, Yizheng Wu, Jun Cen, Juewen Peng, Kewei Wang, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin
To this end, a 3D creator interface has been developed to provide users with fine-grained control over the creation process.
1 code implementation • 13 Jul 2024 • Qianxiong Xu, Guosheng Lin, Chen Change Loy, Cheng Long, Ziyue Li, Rui Zhao
Recent advancements in few-shot segmentation (FSS) have exploited pixel-by-pixel matching between query and support features, typically based on cross attention, which selectively activate query foreground (FG) features that correspond to the same-class support FG features.
1 code implementation • 24 Jun 2024 • Yizheng Wu, Zhiyu Pan, Kewei Wang, Xingyi Li, Jiahao Cui, Liwen Xiao, Guosheng Lin, Zhiguo Cao
To leverage unlabeled data, previous semi-supervised 3D instance segmentation approaches have explored self-training frameworks, which rely on high-quality pseudo labels for consistency regularization.
no code implementations • 14 Jun 2024 • Yuzhong Huang, Zhong Li, Zhang Chen, Zhiyuan Ren, Guosheng Lin, Fred Morstatter, Yi Xu
This process is achieved through the distillation of pretrained large-scale text-to-image diffusion models.
1 code implementation • 14 Jun 2024 • YiWen Chen, Tong He, Di Huang, Weicai Ye, Sijin Chen, Jiaxiang Tang, Xin Chen, Zhongang Cai, Lei Yang, Gang Yu, Guosheng Lin, Chi Zhang
Recently, 3D assets created via reconstruction and generation have matched the quality of manually crafted assets, highlighting their potential for replacement.
4 code implementations • 5 Jun 2024 • Xiaofeng Yang, Cheng Chen, Xulei Yang, Fayao Liu, Guosheng Lin
Besides the generative capabilities of diffusion priors, motivated by the unique time-symmetry properties of rectified flow models, a variant of our method can additionally perform image inversion.
no code implementations • 27 May 2024 • Zhoujie Fu, Jiacheng Wei, Wenhao Shen, Chaoyue Song, Xiaofeng Yang, Fayao Liu, Xulei Yang, Guosheng Lin
In this work, we introduce a novel approach for creating controllable dynamics in 3D-generated Gaussians using casually captured reference videos.
no code implementations • 24 May 2024 • Bingchen Yang, Haiyong Jiang, Hao Pan, Peter Wonka, Jun Xiao, Guosheng Lin
At each step, we provide two forms of geometric guidance.
no code implementations • CVPR 2024 • Chaoyue Song, Jiacheng Wei, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu
In this paper, we address the challenge of reconstructing general articulated 3D objects from a single video.
no code implementations • 9 Apr 2024 • Fan Yang, Jianfeng Zhang, Yichun Shi, Bowen Chen, Chenxu Zhang, Huichao Zhang, Xiaofeng Yang, Jiashi Feng, Guosheng Lin
Benefiting from the rapid development of 2D diffusion models, 3D content creation has made significant progress recently.
1 code implementation • CVPR 2024 • Kewei Wang, Yizheng Wu, Jun Cen, Zhiyu Pan, Xingyi Li, Zhe Wang, Zhiguo Cao, Guosheng Lin
To this end, we explore the feasibility of self-supervised motion prediction with only unlabeled LiDAR point clouds.
no code implementations • CVPR 2024 • Cheng Chen, Xiaofeng Yang, Fan Yang, Chengzeng Feng, Zhoujie Fu, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu
In this paper, we present a new framework Sculpt3D that equips the current pipeline with explicit injection of 3D priors from retrieved reference objects without re-training the 2D diffusion model.
no code implementations • CVPR 2024 • Xingyi Li, Zhiguo Cao, Yizheng Wu, Kewei Wang, Ke Xian, Zhe Wang, Guosheng Lin
To address this limitation, we present S-DyRF, a reference-based spatio-temporal stylization method for dynamic neural radiance fields.
1 code implementation • 29 Feb 2024 • Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin
Lastly, to further improve the training process, FSS proposes a mesh thickness loss signal for pixel-aligned implicit models.
no code implementations • 24 Jan 2024 • Yunfan Zhang, Hong Huang, Zhiwei Xiong, Zhiqi Shen, Guosheng Lin, Hao Wang, Nicholas Vun
The core strength of our pipeline lies in its ability to generate 3D scenes that are not only visually impressive but also exhibit features like photorealism, multi-view consistency, and diversity.
1 code implementation • CVPR 2024 • Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin
To this end we propose R-Cyclic Diffuser a framework that adapts Zero-1-to-3's novel approach to clothed human data by fusing it with a pixel-aligned implicit model.
1 code implementation • 13 Dec 2023 • Kewei Wang, Yizheng Wu, Zhiyu Pan, Xingyi Li, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin
To improve the quality of pseudo labels, we propose a novel motion selection and re-generation module.
no code implementations • 8 Dec 2023 • Xiaofeng Yang, YiWen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin
We propose a unified framework aimed at enhancing the diffusion priors for 3D generation tasks.
no code implementations • CVPR 2024 • Fan Yang, Tianyi Chen, Xiaosheng He, Zhongang Cai, Lei Yang, Si Wu, Guosheng Lin
We propose AttriHuman-3D, an editable 3D human generation model, which address the aforementioned problems with attribute decomposition and indexing.
no code implementations • 30 Nov 2023 • Xiaosheng He, Fan Yang, Fayao Liu, Guosheng Lin
Many works propose to fine-tune a pre-trained GAN model.
no code implementations • 28 Nov 2023 • Xiaojing Zhong, Xinyi Huang, Zhonghua Wu, Guosheng Lin, Qingyao Wu
To address this problem, we propose a novel Spatial Alignment and Region-Adaptive normalization method (SARA) in this paper.
no code implementations • 28 Nov 2023 • Xiaojing Zhong, Yukun Su, Zhonghua Wu, Guosheng Lin, Qingyao Wu
3D virtual try-on enjoys many potential applications and hence has attracted wide attention.
1 code implementation • CVPR 2024 • YiWen Chen, Zilong Chen, Chi Zhang, Feng Wang, Xiaofeng Yang, Yikai Wang, Zhongang Cai, Lei Yang, Huaping Liu, Guosheng Lin
3D editing plays a crucial role in many areas such as gaming and virtual reality.
no code implementations • 10 Nov 2023 • Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap
Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging.
no code implementations • 3 Nov 2023 • Shichao Dong, Fayao Liu, Guosheng Lin
Recently, large-scale pre-trained models such as Segment-Anything Model (SAM) and Contrastive Language-Image Pre-training (CLIP) have demonstrated remarkable success and revolutionized the field of computer vision.
1 code implementation • 17 Oct 2023 • Ruibo Li, Chi Zhang, Zhe Wang, Chunhua Shen, Guosheng Lin
By rigidly aligning each region with its potential counterpart in the target point cloud, we obtain a region-specific rigid transformation to generate its pseudo flow labels.
no code implementations • 13 Sep 2023 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
In this work, we tackle the challenging problem of long-tailed image recognition.
no code implementations • 4 Sep 2023 • Xianghui Yang, Guosheng Lin, Zhenghao Chen, Luping Zhou
Recent neural networks based surface reconstruction can be roughly divided into two categories, one warping templates explicitly and the other representing 3D surfaces implicitly.
no code implementations • 26 Aug 2023 • Yukun Su, Guosheng Lin, Qingyao Wu
(ii) Global-SPIL: to better learn and refine the features of the unordered and unstructured skeleton points, Global-SPIL employs the self-attention layer that operates directly on the sampled points, which can help to make the output more permutation-invariant and well-suited for our task.
1 code implementation • 22 Aug 2023 • YiWen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin
Recent strides in Text-to-3D techniques have been propelled by distilling knowledge from powerful large text-to-image diffusion models (LDMs).
1 code implementation • 20 Aug 2023 • Yanda Li, Chi Zhang, Gang Yu, Zhibin Wang, Bin Fu, Guosheng Lin, Chunhua Shen, Ling Chen, Yunchao Wei
However, these datasets often exhibit domain bias, potentially constraining the generative capabilities of the models.
Ranked #78 on Visual Question Answering on MM-Vet
no code implementations • 20 Aug 2023 • Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin
To animate the visual content, the feature point cloud is displaced based on the scene flow derived from motion estimation and the corresponding camera pose.
no code implementations • 18 Aug 2023 • Ruibing Jin, Guosheng Lin, Min Wu, Jie Lin, Zhengguo Li, XiaoLi Li, Zhenghua Chen
To address this issue, we propose an unlimited knowledge distillation (UKD) in this paper.
1 code implementation • ICCV 2023 • Qianxiong Xu, Wenting Zhao, Guosheng Lin, Cheng Long
Moreover, when calculating SCCA, we design a scaled-cosine mechanism to better utilize the support features for similarity calculation.
Ranked #8 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)
no code implementations • 3 Aug 2023 • Shichao Dong, Guosheng Lin
3D semantic scene understanding tasks have achieved great success with the emergence of deep learning, but often require a huge amount of manually annotated training data.
2 code implementations • ICCV 2023 • Yiran Wang, Min Shi, Jiaqi Li, Zihao Huang, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin
Video depth estimation aims to infer temporally consistent depth.
Ranked #14 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)
2 code implementations • 30 May 2023 • Chi Zhang, YiWen Chen, Yijun Fu, Zhenglin Zhou, Gang Yu, Billzb Wang, Bin Fu, Tao Chen, Guosheng Lin, Chunhua Shen
The recent advancements in image-text diffusion models have stimulated research interest in large-scale 3D generative models.
1 code implementation • 17 May 2023 • Youtan Yin, Zhoujie Fu, Fan Yang, Guosheng Lin
This paper proposes a novel object-removing pipeline, named OR-NeRF, that can remove objects from 3D scenes with user-given points or text prompts on a single view, achieving better performance in less time than previous works.
1 code implementation • 17 Apr 2023 • Chaoyue Song, Jiacheng Wei, Tianyi Chen, YiWen Chen, Chuan Sheng Foo, Fayao Liu, Guosheng Lin
To solve this problem, we propose neural dual quaternion blend skinning (NeuDBS) to achieve 3D point deformation, which can perform rigid transformation without skin-collapsing artifacts.
no code implementations • 28 Mar 2023 • Yunfan Zhang, Hao Wang, Guosheng Lin, Vun Chan Hua Nicholas, Zhiqi Shen, Chunyan Miao
This paper investigates an open research task of reconstructing and generating 3D point clouds.
1 code implementation • 24 Mar 2023 • Weide Liu, Zhonghua Wu, Yang Zhao, Yuming Fang, Chuan-Sheng Foo, Jun Cheng, Guosheng Lin
Current methods for few-shot segmentation (FSSeg) have mainly focused on improving the performance of novel classes while neglecting the performance of base classes.
1 code implementation • CVPR 2023 • Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap
We conduct extensive experiments to analyze each of our proposed components and show the efficacy of our framework in generating high-fidelity 3D textured and text-relevant shapes.
no code implementations • CVPR 2023 • Xingyi Li, Zhiguo Cao, Huiqiang Sun, Jianming Zhang, Ke Xian, Guosheng Lin
To animate the scene, we perform motion estimation and lift the 2D motion into the 3D scene flow.
1 code implementation • 9 Mar 2023 • Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai
Weakly-supervised point cloud segmentation with extremely limited labels is highly desirable to alleviate the expensive costs of collecting densely annotated 3D points.
2 code implementations • CVPR 2023 • Xianghui Yang, Guosheng Lin, Zhenghao Chen, Luping Zhou
Deep neural networks (DNNs) are widely applied for nowadays 3D surface reconstruction tasks and such methods can be further divided into two categories, which respectively warp templates explicitly by moving vertices or represent 3D surfaces implicitly as signed or unsigned distance functions.
no code implementations • 18 Jan 2023 • Xiaofeng Yang, Fayao Liu, Guosheng Lin
Current vision language pretraining models are dominated by methods using region visual features extracted from object detectors.
1 code implementation • ICCV 2023 • Ze Yang, Ruibo Li, Evan Ling, Chi Zhang, Yiming Wang, Dezhao Huang, Keng Teck Ma, Minhoe Hur, Guosheng Lin
To address this issue, we propose a new label-guided knowledge distillation (LGKD) loss, where the old model output is expanded and transplanted (with the guidance of the ground truth label) to form a semantically appropriate class correspondence with the new model output.
Ranked #1 on Continual Semantic Segmentation on ScanNet
no code implementations • CVPR 2023 • Ruibo Li, Hanyu Shi, Ziang Fu, Zhe Wang, Guosheng Lin
To this end, we propose a two-stage weakly supervised approach, where the segmentation model trained with the incomplete binary masks in Stage1 will facilitate the self-supervised learning of the motion prediction network in Stage2 by estimating possible moving foregrounds in advance.
no code implementations • 5 Dec 2022 • Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Ruiqi Wu, Shizhou Zhang, Peng Wang, Yanning Zhang
The key insight of this design is that the cross-attention mechanism in the transformer could be an ideal solution to align the discriminative texture clues from the original image with the canonical view image, which could compensate for the low-quality texture information of the canonical view image.
Domain Generalization Generalizable Person Re-identification +1
1 code implementation • 18 Nov 2022 • Chaoyue Song, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
With $G$ as the basic component, we propose a cross consistency learning scheme and a dual reconstruction objective to learn the pose transfer without supervision.
1 code implementation • 15 Nov 2022 • Kennard Yanting Chan, Guosheng Lin, Haiyu Zhao, Weisi Lin
We propose IntegratedPIFu, a new pixel aligned implicit model that builds on the foundation set by PIFuHD.
no code implementations • 16 Oct 2022 • Wenjie Luo, Qun Song, Zhenyu Yan, Rui Tan, Guosheng Lin
Indoor self-localization is a highly demanded system function for smartphones.
1 code implementation • 2 Oct 2022 • Hao Wang, Guosheng Lin, Ana García del Molino, Anran Wang, Jiashi Feng, Zhiqi Shen
In this paper we present a novel multi-attribute face manipulation method based on textual descriptions.
1 code implementation • 29 Sep 2022 • Xingyi Li, Chaoyi Hong, Yiran Wang, Zhiguo Cao, Ke Xian, Guosheng Lin
We study the problem of novel view synthesis of objects from a single image.
no code implementations • 23 Aug 2022 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
1 code implementation • ICCV 2023 • Shichao Dong, Ruibo Li, Jiacheng Wei, Fayao Liu, Guosheng Lin
Instance segmentation on 3D point clouds has been attracting increasing attention due to its wide applications, especially in scene understanding areas.
Ranked #22 on 3D Instance Segmentation on ScanNet(v2)
1 code implementation • 4 Aug 2022 • Xianghui Yang, Guosheng Lin, Luping Zhou
Single-view 3D object reconstruction is a fundamental and challenging computer vision task that aims at recovering 3D shapes from single-view RGB images.
no code implementations • 29 Jul 2022 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
When we do online paired data augmentation, we first generate augmented text through random token replacement, then pass the augmented text into the latent space alignment module to output the latent codes, which are finally fed to StyleGAN2 to generate the augmented images.
no code implementations • 29 Jul 2022 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
To this end, we discover the semantic meanings of StyleGAN latent space, such that we are able to produce face images of various expressions, poses, and lighting by controlling the latent codes.
no code implementations • 19 Jul 2022 • Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian
Weakly supervised point cloud segmentation, i. e. semantically segmenting a point cloud with only a few labeled points in the whole 3D scene, is highly desirable due to the heavy burden of collecting abundant dense annotations for the model training.
no code implementations • 19 Jul 2022 • Nan Song, Chi Zhang, Guosheng Lin
First, instead of learning the decision boundaries between seen classes, as is done in standard close-set classification, we reserve space for unseen classes, such that images located in these areas are recognized as the unseen classes.
no code implementations • 2 Jun 2022 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training.
Ranked #9 on Long-tail Learning on CIFAR-10-LT (ρ=10)
1 code implementation • 23 Mar 2022 • Ze Yang, Chi Zhang, Ruibo Li, Yi Xu, Guosheng Lin
Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed.
1 code implementation • 9 Mar 2022 • Yukun Su, Jingliang Deng, Ruizhou Sun, Guosheng Lin, Qingyao Wu
Besides, they fail to take full advantage of the cues among inter- and intra-feature within a group of images.
Ranked #1 on Video Salient Object Detection on FBMS-59
no code implementations • 6 Jan 2022 • Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin
We use the labeled image data to train a teacher model and use the trained model to generate pseudo captions on unlabeled image data.
no code implementations • CVPR 2022 • Ruibo Li, Chi Zhang, Guosheng Lin, Zhe Wang, Chunhua Shen
In this work, we focus on scene flow learning on point clouds in a self-supervised manner.
1 code implementation • CVPR 2022 • Hanyu Shi, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
We propose a novel temporal-spatial framework for effective weakly supervised learning to generate high-quality pseudo labels from these limited annotated data.
no code implementations • CVPR 2022 • Tao Liang, Guosheng Lin, Mingyang Wan, Tianrui Li, Guojun Ma, Fengmao Lv
Through the proposed MI2P unit, we can inject the language information into the vision backbone by attending the word-wise textual features to different visual channels, as well as inject the visual information into the language backbone by attending the channel-wise visual features to different textual words.
no code implementations • 19 Oct 2021 • Anthony Meng Huat Tiong, Junnan Li, Guosheng Lin, Boyang Li, Caiming Xiong, Steven C. H. Hoi
ICCL interpolates two images from a class-agnostic sampler and a class-aware sampler, and trains the model such that the representation of the interpolative image can be used to retrieve the centroids for both source classes.
Ranked #22 on Long-tail Learning on CIFAR-10-LT (ρ=10)
no code implementations • 4 Oct 2021 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the learned tree structures into the recipe generation and food cross-modal retrieval procedure.
1 code implementation • NeurIPS 2021 • Chaoyue Song, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin
It aims to transfer the pose of a source mesh to a target mesh and keep the identity (e. g., body shape) of the target mesh.
no code implementations • ICCV 2021 • Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen
Inspired by the recent success in Automated Machine Learning literature (AutoML), in this paper, we present Meta Navigator, a framework that attempts to solve the aforementioned limitation in few-shot learning by seeking a higher-level strategy and proffer to automate the selection from various few-shot learning designs.
no code implementations • 29 Aug 2021 • Chi Zhang, Guosheng Lin, Lvlong Lai, Henghui Ding, Qingyao Wu
First, we present a Class Activation Map Calibration (CAMC) module to improve the learning and prediction of network classifiers, by enforcing network prediction based on important image regions.
no code implementations • 19 Aug 2021 • Weide Liu, Chi Zhang, Henghui Ding, Tzu-Yi Hung, Guosheng Lin
In this work, we argue that every support pixel's information is desired to be transferred to all query pixels and propose a Correspondence Matching Network (CMNet) with an Optimal Transport Matching module to mine out the correspondence between the query and support images.
1 code implementation • 17 Aug 2021 • Weide Liu, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin
To improve the generality of the objective activation maps, we propose a region prototypical network RPNet to explore the cross-image object diversity of the training set.
no code implementations • 17 Aug 2021 • Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu
With the development of Generative Adversarial Network, image-based virtual try-on methods have made great progress.
1 code implementation • 14 Aug 2021 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
Video captioning targets interpreting the complex visual contents as text descriptions, which requires the model to fully understand video scenes including objects and their interactions.
1 code implementation • 11 Aug 2021 • Weide Liu, Zhonghua Wu, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
To this end, we first propose a prior extractor to learn the query information from the unlabeled images with our proposed global-local contrastive learning.
1 code implementation • ICCV 2021 • Zhonghua Wu, Xiangxi Shi, Guosheng Lin, Jianfei Cai
To explicitly learn meta-class representations in few-shot segmentation task, we propose a novel Meta-class Memory based few-shot segmentation method (MM-Net), where we introduce a set of learnable memory embeddings to memorize the meta-class information during the base class training and transfer to novel classes during the inference stage.
no code implementations • 5 Aug 2021 • Xin Sun, Henghui Ding, Chi Zhang, Guosheng Lin, Keck-Voon Ling
In this work, we aim to address the challenging task of open set recognition (OSR).
no code implementations • 4 Aug 2021 • Fayao Liu, Guosheng Lin, Chuan-Sheng Foo, Chaitanya K. Joshi, Jie Lin
In this work we propose PointDisc, a point discriminative learning method to leverage self-supervisions for data-efficient 3D point cloud classification and segmentation.
no code implementations • 3 Aug 2021 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
In this paper, we propose a novel unified framework of Cycle-consistent Inverse GAN (CI-GAN) for both text-to-image generation and text-guided image manipulation tasks.
no code implementations • 27 Jul 2021 • Xiangxi Shi, Zhonghua Wu, Guosheng Lin, Jianfei Cai, Shafiq Joty
Therefore, in this paper, we propose a memory-based Image Manipulation Network (MIM-Net), where a set of memories learned from images is introduced to synthesize the texture information with the guidance of the textual description.
no code implementations • 23 Jul 2021 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung
While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels.
no code implementations • CVPR 2021 • Fengmao Lv, Xiang Chen, Yanyong Huang, Lixin Duan, Guosheng Lin
In turn, it also collects the reinforced features from each modality and uses them to generate a reinforced common message.
no code implementations • CVPR 2021 • Ruibo Li, Guosheng Lin, Lihua Xie
Due to the scarcity of annotated scene flow data, self-supervised scene flow learning in point clouds has attracted increasing attention.
Self-Supervised Learning Self-supervised Scene Flow Estimation
no code implementations • CVPR 2021 • Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen
Scene flow in 3D point clouds plays an important role in understanding dynamic environments.
no code implementations • CVPR 2021 • Fan Yang, Guosheng Lin
Garment transfer shows great potential in realistic applications with the goal of transfering outfits across different people images.
1 code implementation • CVPR 2021 • Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu
First, we adopt a simple but effective decoupled learning strategy of representations and classifiers that only the classifiers are updated in each incremental session, which avoids knowledge forgetting in the representations.
Ranked #8 on Few-Shot Class-Incremental Learning on CIFAR-100
1 code implementation • ICCV 2021 • Yukun Su, Ruizhou Sun, Guosheng Lin, Qingyao Wu
Data augmentation is vital for deep learning neural networks.
no code implementations • 20 Jan 2021 • Ruibing Jin, Guosheng Lin, Changyun Wen
Online proposal sampling is an intuitive solution to these issues.
1 code implementation • 7 Jan 2021 • Sheng Yang, Weisi Lin, Guosheng Lin, Qiuping Jiang, Zichuan Liu
We present a simple yet effective progressive self-guided loss function to facilitate deep learning-based salient object detection (SOD) in images.
no code implementations • 5 Jan 2021 • Chi Zhang, Guankai Li, Guosheng Lin, Qingyao Wu, Rui Yao
Image co-segmentation is an active computer vision task that aims to segment the common objects from a set of images.
no code implementations • ICCV 2021 • Yukun Su, Guosheng Lin, Qingyao Wu
Recently, self-supervised learning (SSL) has been proved very effective and it can help boost the performance in learning representations from unlabeled data in the image domain.
no code implementations • ICCV 2021 • Tao Liang, Guosheng Lin, Lei Feng, Yan Zhang, Fengmao Lv
To this end, both the marginal distribution and the elements with high-confidence correlations are aligned over the common space of the query and key vectors which are computed from different modalities.
no code implementations • 28 Dec 2020 • Xiaoyu Chen, Chi Zhang, Guosheng Lin, Jing Han
Moreover, when we use our network to handle the long-tail problem in a fully supervised point cloud segmentation dataset, it can also effectively boost the performance of the few-shot classes.
1 code implementation • 11 Dec 2020 • Linshan Jiang, Rui Tan, Xin Lou, Guosheng Lin
This paper considers the design and implementation of a practical privacy-preserving collaborative learning scheme, in which a curious learning coordinator trains a better machine learning model based on the data samples contributed by a number of IoT objects, while the confidentiality of the raw forms of the training data is protected against the coordinator.
1 code implementation • European Conference on Computer Vision (ECCV) 2020 • Shichao Dong, Guosheng Lin, Tzu-Yi Hung
In this paper, we define a novel concept of “regional purity” as the percentage of neighboring points belonging to the same instance within a fixed-radius 3D space.
Ranked #15 on 3D Instance Segmentation on ScanNet(v2)
no code implementations • 21 Sep 2020 • Ruibing Jin, Guosheng Lin, Changyun Wen, Jianliang Wang, Fayao Liu
Optical flow, which expresses pixel displacement, is widely used in many computer vision tasks to provide pixel-level motion information.
1 code implementation • ECCV 2020 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task.
no code implementations • ECCV 2020 • Lichang Chen, Guosheng Lin, Shijie Wang, Qingyao Wu
Scene Graph, as a vital tool to bridge the gap between language domain and image domain, has been widely adopted in the cross-modality task like VQA.
no code implementations • 12 Aug 2020 • Xin Sun, Chi Zhang, Guosheng Lin, Keck-Voon Ling
A typical challenge that hinders their real-world applications is that unknown samples may be fed into the system during the testing phase, but traditional deep neural networks will wrongly recognize these unknown samples as one of the known classes.
no code implementations • 27 Jul 2020 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
Recipe generation from food images and ingredients is a challenging task, which requires the interpretation of the information from another modality.
1 code implementation • CVPR 2020 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Tzu-Yi Hung, Lihua Xie
To the best of our knowledge, this is the first method that uses cloud-level weak labels on raw 3D space to train a point cloud semantic segmentation network.
no code implementations • CVPR 2020 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu
In this paper, we propose a cross-reference network (CRNet) for few-shot segmentation.
no code implementations • CVPR 2020 • Zhonghua Wu, Qingyi Tao, Guosheng Lin, Jianfei Cai
To reduce the human labeling effort, we propose a novel webly supervised object detection (WebSOD) method for novel classes which only requires the web images without further annotations.
5 code implementations • 15 Mar 2020 • Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen
We employ the Earth Mover's Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance.
no code implementations • 19 Apr 2019 • Rui Yao, Guosheng Lin, Shixiong Xia, Jiaqi Zhao, Yong Zhou
Second, we provide a detailed discussion and overview of the technical characteristics of the different methods.
1 code implementation • 7 Apr 2019 • Sheng Yang, Guosheng Lin, Qiuping Jiang, Weisi Lin
In this work, we proposed an end-to-end dilated inception network (DINet) for visual saliency prediction.
no code implementations • CVPR 2019 • Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh
It is challenging to detect curve texts due to their irregular shapes and varying sizes.
1 code implementation • CVPR 2019 • Chi Zhang, Guosheng Lin, Fayao Liu, Rui Yao, Chunhua Shen
Recent progress in semantic segmentation is driven by deep Convolutional Neural Networks and large-scale labeled image datasets.
Ranked #86 on Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot)
no code implementations • 13 Feb 2019 • Linshan Jiang, Rui Tan, Xin Lou, Guosheng Lin
This paper considers the design and implementation of a practical privacy-preserving collaborative learning scheme, in which a curious learning coordinator trains a better machine learning model based on the data samples contributed by a number of IoT objects, while the confidentiality of the raw forms of the training data is protected against the coordinator.
no code implementations • 21 Nov 2018 • Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai
Instead, we present a novel virtual Try-On network, M2E-Try On Net, which transfers the clothes from a model image to a person image without the need of any clean product images.
no code implementations • 30 Sep 2018 • Zichuan Liu, Guosheng Lin, Wang Ling Goh, Fayao Liu, Chunhua Shen, Xiaokang Yang
In this work, we propose a novel hybrid method for scene text detection namely Correlation Propagation Network (CPN).
no code implementations • 14 Sep 2018 • Zhonghua Wu, Guosheng Lin, Jianfei Cai
We develop an iterative learning method to generate pseudo part segmentation masks from keypoint labels.
1 code implementation • CVPR 2018 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid
In this work, we focus on weak supervision, developing a method for training a high-quality pixel-level classifier for semantic segmentation, using only image-level class labels as the provided ground-truth.
no code implementations • CVPR 2018 • Huaxin Xiao, Jiashi Feng, Guosheng Lin, Yu Liu, Maojun Zhang
In this paper, we propose a novel MoNet model to deeply exploit motion cues for boosting video object segmentation performance from two aspects, i. e., frame representation learning and segmentation refinement.
no code implementations • CVPR 2018 • Zichuan Liu, Guosheng Lin, Sheng Yang, Jiashi Feng, Weisi Lin, Wang Ling Goh
MCN predicts instance-level bounding boxes by firstly converting an image into a Stochastic Flow Graph (SFG) and then performing Markov Clustering on this graph.
no code implementations • 7 Mar 2018 • Tianyi Zhang, Guosheng Lin, Jianfei Cai, Tong Shen, Chunhua Shen, Alex C. Kot
In our work, we focus on the weakly supervised semantic segmentation with image label annotations.
no code implementations • 25 May 2017 • Tong Shen, Guosheng Lin, Lingqiao Liu, Chunhua Shen, Ian Reid
Training a Fully Convolutional Network (FCN) for semantic segmentation requires a large number of masks with pixel level labelling, which involves a large amount of human labour and time for annotation.
no code implementations • 26 Mar 2017 • Fayao Liu, Guosheng Lin, Ruizhi Qiao, Chunhua Shen
In this fashion, we easily achieve nonlinear learning of potential functions on both unary and pairwise terms in CRFs.
no code implementations • 20 Feb 2017 • Rui Yao, Guosheng Lin, Qinfeng Shi, Damith Ranasinghe
We conduct extensive experiments and demonstrate that our proposed approach is able to outperform the state-of-the-arts in terms of classification and label misalignment measures on three challenging datasets: Opportunity, Hand Gesture, and our new dataset.
no code implementations • 25 Jan 2017 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid
Semantic image segmentation is a fundamental task in image understanding.
no code implementations • CVPR 2017 • Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel
In this work, we propose to model the relational information between people as a sequence prediction task.
13 code implementations • CVPR 2017 • Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid
Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation.
Ranked #13 on Semantic Segmentation on Trans10K
no code implementations • 29 Apr 2016 • Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun
Crowd counting is an important task in computer vision, which has many applications in video surveillance.
no code implementations • 10 Mar 2016 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, Ian Reid
We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions.
no code implementations • CVPR 2016 • Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian Reid
To solve the first stage, we design a large-scale high-order binary codes inference algorithm to reduce the high-order objective to a standard binary quadratic problem such that graph cuts can be used to efficiently infer the binary code which serve as the label of each training datum.
no code implementations • 22 Feb 2016 • Guosheng Lin, Fayao Liu, Chunhua Shen, Jianxin Wu, Heng Tao Shen
Our column generation based method can be further generalized from the triplet loss to a general structured learning based framework that allows one to directly optimize multivariate performance measures.
no code implementations • 28 Jan 2016 • Fayao Liu, Guosheng Lin, Chunhua Shen
We exemplify the usefulness of the proposed model on multi-class semantic labelling (discrete) and the robust depth estimation (continuous) problems.
no code implementations • NeurIPS 2015 • Guosheng Lin, Chunhua Shen, Ian Reid, Anton Van Den Hengel
The network output dimension for message estimation is the same as the number of classes, in contrast to the network output for general CNN potential functions in CRFs, which is exponential in the order of the potentials.
no code implementations • CVPR 2016 • Guosheng Lin, Chunhua Shen, Anton van dan Hengel, Ian Reid
Recent advances in semantic image segmentation have mostly been achieved by training deep convolutional neural networks (CNNs).
Ranked #54 on Semantic Segmentation on PASCAL Context
no code implementations • 28 Mar 2015 • Fayao Liu, Guosheng Lin, Chunhua Shen
The deep CNN is trained on the ImageNet dataset and transferred to image segmentations here for constructing potentials of superpixels.
1 code implementation • 26 Feb 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin, Ian Reid
Therefore, here we present a deep convolutional neural field model for estimating depths from single monocular images, aiming to jointly explore the capacity of deep CNN and continuous CRF.
no code implementations • CVPR 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin
Therefore, we in this paper present a deep convolutional neural field model for estimating depths from a single image, aiming to jointly explore the capacity of deep CNN and continuous CRF.
1 code implementation • 24 Aug 2014 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel
The proposed framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods.
no code implementations • 4 Jul 2014 • Guosheng Lin, Chunhua Shen, Jianxin Wu
Hashing has proven a valuable tool for large-scale information retrieval.
1 code implementation • CVPR 2014 • Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton Van Den Hengel, David Suter
Here we propose to use boosted decision trees for achieving non-linearity in hashing, which are fast to train and evaluate, hence more suitable for hashing with high dimensional data.
no code implementations • 23 Nov 2013 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, David Suter
Different from most existing multi-class boosting methods, which use the same set of weak learners for all the classes, we train class specified weak learners (i. e., each class has a different set of weak learners).
no code implementations • 7 Sep 2013 • Guosheng Lin, Chunhua Shen, David Suter, Anton Van Den Hengel
This framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods.
no code implementations • 14 Feb 2013 • Chunhua Shen, Guosheng Lin, Anton Van Den Hengel
Inspired by structured support vector machines (SSVM), here we propose a new boosting algorithm for structured output prediction, which we refer to as StructBoost.