Search Results for author: Guosheng Lin

Found 135 papers, 45 papers with code

Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation

no code implementations • ECCV 2020 • Tianyi Zhang, Guosheng Lin, Weide Liu, Jianfei Cai, Alex Kot

Finally, by training the segmentation model with the masks generated by our Splitting vs Merging strategy, we achieve the state-of-the-art weakly-supervised segmentation results on the Pascal VOC 2012 benchmark.

Segmentation Weakly supervised segmentation +2

Paper
Add Code

Human Interaction Learning on 3D Skeleton Point Clouds for Video Violence Recognition

no code implementations • ECCV 2020 • Yukun Su, Guosheng Lin, Jinhui Zhu, Qingyao Wu

This paper introduces a new method for recognizing violent behavior by learning contextual relationships between related people from human skeleton points.

Ranked #5 on Activity Recognition on RWF-2000

Activity Recognition

Paper
Add Code

TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering

no code implementations • ECCV 2020 • Xiaofeng Yang, Guosheng Lin, Fengmao Lv, Fayao Liu

Compositional visual question answering requires reasoning over both semantic and geometry object relations.

Object Question Answering +3

Paper
Add Code

REACTO: Reconstructing Articulated Objects from a Single Video

no code implementations • 17 Apr 2024 • Chaoyue Song, Jiacheng Wei, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu

In this paper, we address the challenge of reconstructing general articulated 3D objects from a single video.

Paper
Add Code

Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion

no code implementations • 9 Apr 2024 • Fan Yang, Jianfeng Zhang, Yichun Shi, Bowen Chen, Chenxu Zhang, Huichao Zhang, Xiaofeng Yang, Jiashi Feng, Guosheng Lin

Benefiting from the rapid development of 2D diffusion models, 3D content creation has made significant progress recently.

3D Generation

Paper
Add Code

Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations

1 code implementation • 20 Mar 2024 • Kewei Wang, Yizheng Wu, Jun Cen, Zhiyu Pan, Xingyi Li, Zhe Wang, Zhiguo Cao, Guosheng Lin

To this end, we explore the feasibility of self-supervised motion prediction with only unlabeled LiDAR point clouds.

Autonomous Driving motion prediction

Paper
Code

Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior

no code implementations • 14 Mar 2024 • Cheng Chen, Xiaofeng Yang, Fan Yang, Chengzeng Feng, Zhoujie Fu, Chuan-Sheng Foo, Guosheng Lin, Fayao Liu

In this paper, we present a new framework Sculpt3D that equips the current pipeline with explicit injection of 3D priors from retrieved reference objects without re-training the 2D diffusion model.

3D Generation Text to 3D

Paper
Add Code

S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes

no code implementations • 10 Mar 2024 • Xingyi Li, Zhiguo Cao, Yizheng Wu, Kewei Wang, Ke Xian, Zhe Wang, Guosheng Lin

To address this limitation, we present S-DyRF, a reference-based spatio-temporal stylization method for dynamic neural radiance fields.

Style Transfer

Paper
Add Code

Fine Structure-Aware Sampling: A New Sampling Training Scheme for Pixel-Aligned Implicit Models in Single-View Human Reconstruction

1 code implementation • 29 Feb 2024 • Kennard Yanting Chan, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin

Lastly, to further improve the training process, FSS proposes a mesh thickness loss signal for pixel-aligned implicit models.

Paper
Code

Style-Consistent 3D Indoor Scene Synthesis with Decoupled Objects

no code implementations • 24 Jan 2024 • Yunfan Zhang, Hong Huang, Zhiwei Xiong, Zhiqi Shen, Guosheng Lin, Hao Wang, Nicholas Vun

The core strength of our pipeline lies in its ability to generate 3D scenes that are not only visually impressive but also exhibit features like photorealism, multi-view consistency, and diversity.

Indoor Scene Synthesis

Paper
Add Code

Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

1 code implementation • 13 Dec 2023 • Kewei Wang, Yizheng Wu, Zhiyu Pan, Xingyi Li, Ke Xian, Zhe Wang, Zhiguo Cao, Guosheng Lin

To improve the quality of pseudo labels, we propose a novel motion selection and re-generation module.

Autonomous Driving Data Augmentation +2

Paper
Code

Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting

no code implementations • 8 Dec 2023 • Xiaofeng Yang, YiWen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin

We propose a unified framework aimed at enhancing the diffusion priors for 3D generation tasks.

3D Generation Denoising +1

Paper
Add Code

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing

no code implementations • 3 Dec 2023 • Fan Yang, Tianyi Chen, Xiaosheng He, Zhongang Cai, Lei Yang, Si Wu, Guosheng Lin

We propose AttriHuman-3D, an editable 3D human generation model, which address the aforementioned problems with attribute decomposition and indexing.

Attribute Disentanglement

Paper
Add Code

Few-shot Image Generation via Style Adaptation and Content Preservation

no code implementations • 30 Nov 2023 • Xiaosheng He, Fan Yang, Fayao Liu, Guosheng Lin

Many works propose to fine-tune a pre-trained GAN model.

Image Generation Image Reconstruction +1

Paper
Add Code

SARA: Controllable Makeup Transfer with Spatial Alignment and Region-Adaptive Normalization

no code implementations • 28 Nov 2023 • Xiaojing Zhong, Xinyi Huang, Zhonghua Wu, Guosheng Lin, Qingyao Wu

To address this problem, we propose a novel Spatial Alignment and Region-Adaptive normalization method (SARA) in this paper.

Paper
Add Code

DI-Net : Decomposed Implicit Garment Transfer Network for Digital Clothed 3D Human

no code implementations • 28 Nov 2023 • Xiaojing Zhong, Yukun Su, Zhonghua Wu, Guosheng Lin, Qingyao Wu

3D virtual try-on enjoys many potential applications and hence has attracted wide attention.

Virtual Try-on

Paper
Add Code

GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting

1 code implementation • 24 Nov 2023 • YiWen Chen, Zilong Chen, Chi Zhang, Feng Wang, Xiaofeng Yang, Yikai Wang, Zhongang Cai, Lei Yang, Huaping Liu, Guosheng Lin

3D editing plays a crucial role in many areas such as gaming and virtual reality.

843

Paper
Code

Learning-Based Biharmonic Augmentation for Point Cloud Classification

no code implementations • 10 Nov 2023 • Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap

Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging.

Classification Data Augmentation +1

Paper
Add Code

Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation

no code implementations • 3 Nov 2023 • Shichao Dong, Fayao Liu, Guosheng Lin

Recently, large-scale pre-trained models such as Segment-Anything Model (SAM) and Contrastive Language-Image Pre-training (CLIP) have demonstrated remarkable success and revolutionized the field of computer vision.

3D Semantic Segmentation Point Cloud Segmentation +5

Paper
Add Code

Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior

no code implementations • 17 Oct 2023 • Ruibo Li, Chi Zhang, Zhe Wang, Chunhua Shen, Guosheng Lin

By rigidly aligning each region with its potential counterpart in the target point cloud, we obtain a region-specific rigid transformation to generate its pseudo flow labels.

Motion Estimation motion prediction +2

Paper
Add Code

LCReg: Long-Tailed Image Classification with Latent Categories based Recognition

no code implementations • 13 Sep 2023 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

In this work, we tackle the challenging problem of long-tailed image recognition.

Data Augmentation Image Classification

Paper
Add Code

Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks and Zero-Curl Regularization

no code implementations • 4 Sep 2023 • Xianghui Yang, Guosheng Lin, Zhenghao Chen, Luping Zhou

Recent neural networks based surface reconstruction can be roughly divided into two categories, one warping templates explicitly and the other representing 3D surfaces implicitly.

Surface Reconstruction

Paper
Add Code

Improving Video Violence Recognition with Human Interaction Learning on 3D Skeleton Point Clouds

no code implementations • 26 Aug 2023 • Yukun Su, Guosheng Lin, Qingyao Wu

(ii) Global-SPIL: to better learn and refine the features of the unordered and unstructured skeleton points, Global-SPIL employs the self-attention layer that operates directly on the sampled points, which can help to make the output more permutation-invariant and well-suited for our task.

Action Recognition Temporal Action Localization

Paper
Add Code

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

1 code implementation • 22 Aug 2023 • YiWen Chen, Chi Zhang, Xiaofeng Yang, Zhongang Cai, Gang Yu, Lei Yang, Guosheng Lin

Recent strides in Text-to-3D techniques have been propelled by distilling knowledge from powerful large text-to-image diffusion models (LDMs).

3D Generation Text to 3D

201

Paper
Code

Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image

no code implementations • 20 Aug 2023 • Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin

To animate the visual content, the feature point cloud is displaced based on the scene flow derived from motion estimation and the corresponding camera pose.

Motion Estimation

Paper
Add Code

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data

1 code implementation • 20 Aug 2023 • Yanda Li, Chi Zhang, Gang Yu, Zhibin Wang, Bin Fu, Guosheng Lin, Chunhua Shen, Ling Chen, Yunchao Wei

However, these datasets often exhibit domain bias, potentially constraining the generative capabilities of the models.

Ranked #51 on Visual Question Answering on MM-Vet

Visual Question Answering

Paper
Code

Self-Calibrated Cross Attention Network for Few-Shot Segmentation

1 code implementation • ICCV 2023 • Qianxiong Xu, Wenting Zhao, Guosheng Lin, Cheng Long

Moreover, when calculating SCCA, we design a scaled-cosine mechanism to better utilize the support features for similarity calculation.

Ranked #7 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation

Paper
Code

Unlimited Knowledge Distillation for Action Recognition in the Dark

no code implementations • 18 Aug 2023 • Ruibing Jin, Guosheng Lin, Min Wu, Jie Lin, Zhengguo Li, XiaoLi Li, Zhenghua Chen

To address this issue, we propose an unlimited knowledge distillation (UKD) in this paper.

Action Recognition Knowledge Distillation

Paper
Add Code

Weakly Supervised 3D Instance Segmentation without Instance-level Annotations

no code implementations • 3 Aug 2023 • Shichao Dong, Guosheng Lin

3D semantic scene understanding tasks have achieved great success with the emergence of deep learning, but often require a huge amount of manually annotated training data.

3D Instance Segmentation Scene Understanding +1

Paper
Add Code

Neural Video Depth Stabilizer

3 code implementations • ICCV 2023 • Yiran Wang, Min Shi, Jiaqi Li, Zihao Huang, Zhiguo Cao, Jianming Zhang, Ke Xian, Guosheng Lin

Video depth estimation aims to infer temporally consistent depth.

Ranked #15 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Monocular Depth Estimation

571

Paper
Code

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

2 code implementations • 30 May 2023 • Chi Zhang, YiWen Chen, Yijun Fu, Zhenglin Zhou, Gang Yu, Billzb Wang, Bin Fu, Tao Chen, Guosheng Lin, Chunhua Shen

The recent advancements in image-text diffusion models have stimulated research interest in large-scale 3D generative models.

3D Generation Attribute +1

498

Paper
Code

OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields

1 code implementation • 17 May 2023 • Youtan Yin, Zhoujie Fu, Fan Yang, Guosheng Lin

This paper proposes a novel object-removing pipeline, named OR-NeRF, that can remove objects from 3D scenes with user-given points or text prompts on a single view, achieving better performance in less time than previous works.

3D scene Editing Novel View Synthesis +1

Paper
Code

MoDA: Modeling Deformable 3D Objects from Casual Videos

1 code implementation • 17 Apr 2023 • Chaoyue Song, Tianyi Chen, YiWen Chen, Jiacheng Wei, Chuan Sheng Foo, Fayao Liu, Guosheng Lin

To solve this problem, we propose neural dual quaternion blend skinning (NeuDBS) to achieve 3D point deformation, which can perform rigid transformation without skin-collapsing artifacts.

Paper
Code

StarNet: Style-Aware 3D Point Cloud Generation

no code implementations • 28 Mar 2023 • Yunfan Zhang, Hao Wang, Guosheng Lin, Vun Chan Hua Nicholas, Zhiqi Shen, Chunyan Miao

This paper investigates an open research task of reconstructing and generating 3D point clouds.

Generating 3D Point Clouds Point Cloud Generation +1

Paper
Add Code

Harmonizing Base and Novel Classes: A Class-Contrastive Approach for Generalized Few-Shot Segmentation

1 code implementation • 24 Mar 2023 • Weide Liu, Zhonghua Wu, Yang Zhao, Yuming Fang, Chuan-Sheng Foo, Jun Cheng, Guosheng Lin

Current methods for few-shot segmentation (FSSeg) have mainly focused on improving the performance of novel classes while neglecting the performance of base classes.

Generalized Few-Shot Semantic Segmentation Segmentation +1

Paper
Code

TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

1 code implementation • CVPR 2023 • Jiacheng Wei, Hao Wang, Jiashi Feng, Guosheng Lin, Kim-Hui Yap

We conduct extensive experiments to analyze each of our proposed components and show the efficacy of our framework in generating high-fidelity 3D textured and text-relevant shapes.

Paper
Code

3D Cinemagraphy from a Single Image

no code implementations • CVPR 2023 • Xingyi Li, Zhiguo Cao, Huiqiang Sun, Jianming Zhang, Ke Xian, Guosheng Lin

To animate the scene, we perform motion estimation and lift the 2D motion into the 3D scene flow.

Image Animation Motion Estimation

Paper
Add Code

Reliability-Adaptive Consistency Regularization for Weakly-Supervised Point Cloud Segmentation

1 code implementation • 9 Mar 2023 • Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai

Weakly-supervised point cloud segmentation with extremely limited labels is highly desirable to alleviate the expensive costs of collecting densely annotated 3D points.

Point Cloud Segmentation Segmentation +1

Paper
Code

Neural Vector Fields: Implicit Representation by Explicit Learning

2 code implementations • CVPR 2023 • Xianghui Yang, Guosheng Lin, Zhenghao Chen, Luping Zhou

Deep neural networks (DNNs) are widely applied for nowadays 3D surface reconstruction tasks and such methods can be further divided into two categories, which respectively warp templates explicitly by moving vertices or represent 3D surfaces implicitly as signed or unsigned distance functions.

Quantization Surface Reconstruction

Paper
Code

Effective End-to-End Vision Language Pretraining with Semantic Visual Loss

no code implementations • 18 Jan 2023 • Xiaofeng Yang, Fayao Liu, Guosheng Lin

Current vision language pretraining models are dominated by methods using region visual features extracted from object detectors.

Paper
Add Code

Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds

1 code implementation • ICCV 2023 • Ze Yang, Ruibo Li, Evan Ling, Chi Zhang, Yiming Wang, Dezhao Huang, Keng Teck Ma, Minhoe Hur, Guosheng Lin

To address this issue, we propose a new label-guided knowledge distillation (LGKD) loss, where the old model output is expanded and transplanted (with the guidance of the ground truth label) to form a semantically appropriate class correspondence with the new model output.

Ranked #1 on Continual Semantic Segmentation on ScanNet

Continual Semantic Segmentation Knowledge Distillation +1

Paper
Code

Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving

no code implementations • CVPR 2023 • Ruibo Li, Hanyu Shi, Ziang Fu, Zhe Wang, Guosheng Lin

To this end, we propose a two-stage weakly supervised approach, where the segmentation model trained with the incomplete binary masks in Stage1 will facilitate the self-supervised learning of the motion prediction network in Stage2 by estimating possible moving foregrounds in advance.

Autonomous Driving motion prediction +2

Paper
Add Code

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion

no code implementations • 5 Dec 2022 • Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Ruiqi Wu, Shizhou Zhang, Peng Wang, Yanning Zhang

The key insight of this design is that the cross-attention mechanism in the transformer could be an ideal solution to align the discriminative texture clues from the original image with the canonical view image, which could compensate for the low-quality texture information of the canonical view image.

Domain Generalization Generalizable Person Re-identification +1

Paper
Add Code

Unsupervised 3D Pose Transfer with Cross Consistency and Dual Reconstruction

1 code implementation • 18 Nov 2022 • Chaoyue Song, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin

With $G$ as the basic component, we propose a cross consistency learning scheme and a dual reconstruction objective to learn the pose transfer without supervision.

Pose Transfer

Paper
Code

IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-view Human Reconstruction

1 code implementation • 15 Nov 2022 • Kennard Yanting Chan, Guosheng Lin, Haiyu Zhao, Weisi Lin

We propose IntegratedPIFu, a new pixel aligned implicit model that builds on the foundation set by PIFuHD.

Human Parsing

Paper
Code

Indoor Smartphone SLAM with Learned Echoic Location Features

no code implementations • 16 Oct 2022 • Wenjie Luo, Qun Song, Zhenyu Yan, Rui Tan, Guosheng Lin

Indoor self-localization is a highly demanded system function for smartphones.

Contrastive Learning Simultaneous Localization and Mapping

Paper
Add Code

ManiCLIP: Multi-Attribute Face Manipulation from Text

1 code implementation • 2 Oct 2022 • Hao Wang, Guosheng Lin, Ana García del Molino, Anran Wang, Jiashi Feng, Zhiqi Shen

In this paper we present a novel multi-attribute face manipulation method based on textual descriptions.

Attribute Text-based Image Editing

Paper
Code

SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis

1 code implementation • 29 Sep 2022 • Xingyi Li, Chaoyi Hong, Yiran Wang, Zhiguo Cao, Ke Xian, Guosheng Lin

We study the problem of novel view synthesis of objects from a single image.

Novel View Synthesis

Paper
Code

CRCNet: Few-shot Segmentation with Cross-Reference and Region-Global Conditional Networks

no code implementations • 23 Aug 2022 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu

Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.

Segmentation

Paper
Add Code

Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point Supervision

1 code implementation • ICCV 2023 • Shichao Dong, Ruibo Li, Jiacheng Wei, Fayao Liu, Guosheng Lin

Instance segmentation on 3D point clouds has been attracting increasing attention due to its wide applications, especially in scene understanding areas.

Ranked #20 on 3D Instance Segmentation on ScanNet(v2)

3D Instance Segmentation Scene Understanding +3

Paper
Code

Single-view 3D Mesh Reconstruction for Seen and Unseen Categories

1 code implementation • 4 Aug 2022 • Xianghui Yang, Guosheng Lin, Luping Zhou

Single-view 3D object reconstruction is a fundamental and challenging computer vision task that aims at recovering 3D shapes from single-view RGB images.

3D Object Reconstruction

Paper
Code

Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval

no code implementations • 29 Jul 2022 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

When we do online paired data augmentation, we first generate augmented text through random token replacement, then pass the augmented text into the latent space alignment module to output the latent codes, which are finally fed to StyleGAN2 to generate the augmented images.

Cross-Modal Retrieval Data Augmentation +3

Paper
Add Code

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image

no code implementations • 29 Jul 2022 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

To this end, we discover the semantic meanings of StyleGAN latent space, such that we are able to produce face images of various expressions, poses, and lighting by controlling the latent codes.

Face Generation Face Model

Paper
Add Code

Few-shot Open-set Recognition Using Background as Unknowns

no code implementations • 19 Jul 2022 • Nan Song, Chi Zhang, Guosheng Lin

First, instead of learning the decision boundaries between seen classes, as is done in standard close-set classification, we reserve space for unseen classes, such that images located in these areas are recognized as the unseen classes.

Open Set Learning

Paper
Add Code

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation

no code implementations • 19 Jul 2022 • Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian

Weakly supervised point cloud segmentation, i. e. semantically segmenting a point cloud with only a few labeled points in the whole 3D scene, is highly desirable due to the heavy burden of collecting abundant dense annotations for the model training.

Point Cloud Segmentation Segmentation

Paper
Add Code

Long-tailed Recognition by Learning from Latent Categories

no code implementations • 2 Jun 2022 • Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training.

Ranked #9 on Long-tail Learning on CIFAR-10-LT (ρ=10)

Data Augmentation Long-tail Learning

Paper
Add Code

Efficient Few-Shot Object Detection via Knowledge Inheritance

1 code implementation • 23 Mar 2022 • Ze Yang, Chi Zhang, Ruibo Li, Yi Xu, Guosheng Lin

Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed.

Few-Shot Object Detection Object +2

Paper
Code

A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection

1 code implementation • 9 Mar 2022 • Yukun Su, Jingliang Deng, Ruizhou Sun, Guosheng Lin, Qingyao Wu

Besides, they fail to take full advantage of the cues among inter- and intra-feature within a group of images.

Ranked #1 on Video Salient Object Detection on FBMS-59

Co-Salient Object Detection object-detection +3

278

Paper
Code

Self-Training Vision Language BERTs with a Unified Conditional Model

no code implementations • 6 Jan 2022 • Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin

We use the labeled image data to train a teacher model and use the trained model to generate pseudo captions on unlabeled image data.

Paper
Add Code

RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior

no code implementations • CVPR 2022 • Ruibo Li, Chi Zhang, Guosheng Lin, Zhe Wang, Chunhua Shen

In this work, we focus on scene flow learning on point clouds in a self-supervised manner.

Motion Estimation Self-Supervised Learning

Paper
Add Code

Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Temporal Matching and Spatial Graph Propagation

1 code implementation • CVPR 2022 • Hanyu Shi, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin

We propose a novel temporal-spatial framework for effective weakly supervised learning to generate high-quality pseudo labels from these limited annotated data.

Point Cloud Segmentation Scene Understanding +2

Paper
Code

Expanding Large Pre-Trained Unimodal Models With Multimodal Information Injection for Image-Text Multimodal Classification

no code implementations • CVPR 2022 • Tao Liang, Guosheng Lin, Mingyang Wan, Tianrui Li, Guojun Ma, Fengmao Lv

Through the proposed MI2P unit, we can inject the language information into the vision backbone by attending the word-wise textual features to different visual channels, as well as inject the visual information into the language backbone by attending the channel-wise visual features to different textual words.

Paper
Add Code

Improving Tail-Class Representation with Centroid Contrastive Learning

no code implementations • 19 Oct 2021 • Anthony Meng Huat Tiong, Junnan Li, Guosheng Lin, Boyang Li, Caiming Xiong, Steven C. H. Hoi

ICCL interpolates two images from a class-agnostic sampler and a class-aware sampler, and trains the model such that the representation of the interpolative image can be used to retrieve the centroids for both source classes.

Ranked #22 on Long-tail Learning on CIFAR-10-LT (ρ=10)

Contrastive Learning Image Classification +2

Paper
Add Code

Learning Structural Representations for Recipe Generation and Food Retrieval

no code implementations • 4 Oct 2021 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

Our approach brings together several novel ideas in a systematic framework: (1) exploiting an unsupervised learning approach to obtain the sentence-level tree structure labels before training; (2) generating trees of target recipes from images with the supervision of tree structure labels learned from (1); and (3) integrating the learned tree structures into the recipe generation and food cross-modal retrieval procedure.

Cross-Modal Retrieval Image Captioning +3

Paper
Add Code

3D Pose Transfer with Correspondence Learning and Mesh Refinement

1 code implementation • NeurIPS 2021 • Chaoyue Song, Jiacheng Wei, Ruibo Li, Fayao Liu, Guosheng Lin

It aims to transfer the pose of a source mesh to a target mesh and keep the identity (e. g., body shape) of the target mesh.

3D Generation Pose Transfer

Paper
Code

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning

no code implementations • ICCV 2021 • Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen

Inspired by the recent success in Automated Machine Learning literature (AutoML), in this paper, we present Meta Navigator, a framework that attempts to solve the aforementioned limitation in few-shot learning by seeking a higher-level strategy and proffer to automate the selection from various few-shot learning designs.

AutoML Few-Shot Learning

Paper
Add Code

Calibrating Class Activation Maps for Long-Tailed Visual Recognition

no code implementations • 29 Aug 2021 • Chi Zhang, Guosheng Lin, Lvlong Lai, Henghui Ding, Qingyao Wu

First, we present a Class Activation Map Calibration (CAMC) module to improve the learning and prediction of network classifiers, by enforcing network prediction based on important image regions.

Representation Learning

Paper
Add Code

Few-shot Segmentation with Optimal Transport Matching and Message Flow

no code implementations • 19 Aug 2021 • Weide Liu, Chi Zhang, Henghui Ding, Tzu-Yi Hung, Guosheng Lin

In this work, we argue that every support pixel's information is desired to be transferred to all query pixels and propose a Correspondence Matching Network (CMNet) with an Optimal Transport Matching module to mine out the correspondence between the query and support images.

Few-Shot Semantic Segmentation Multi-Task Learning +2

Paper
Add Code

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

1 code implementation • 17 Aug 2021 • Weide Liu, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin

To improve the generality of the objective activation maps, we propose a region prototypical network RPNet to explore the cross-image object diversity of the training set.

Image Segmentation Object +2

Paper
Code

MV-TON: Memory-based Video Virtual Try-on network

no code implementations • 17 Aug 2021 • Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu

With the development of Generative Adversarial Network, image-based virtual try-on methods have made great progress.

Generative Adversarial Network Virtual Try-on

Paper
Add Code

Cross-Modal Graph with Meta Concepts for Video Captioning

1 code implementation • 14 Aug 2021 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

Video captioning targets interpreting the complex visual contents as text descriptions, which requires the model to fully understand video scenes including objects and their interactions.

object-detection Object Detection +1

Paper
Code

Few-Shot Segmentation with Global and Local Contrastive Learning

1 code implementation • 11 Aug 2021 • Weide Liu, Zhonghua Wu, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

To this end, we first propose a prior extractor to learn the query information from the unlabeled images with our proposed global-local contrastive learning.

Contrastive Learning Image Segmentation +2

Paper
Code

Learning Meta-class Memory for Few-Shot Semantic Segmentation

1 code implementation • ICCV 2021 • Zhonghua Wu, Xiangxi Shi, Guosheng Lin, Jianfei Cai

To explicitly learn meta-class representations in few-shot segmentation task, we propose a novel Meta-class Memory based few-shot segmentation method (MM-Net), where we introduce a set of learnable memory embeddings to memorize the meta-class information during the base class training and transfer to novel classes during the inference stage.

Few-Shot Semantic Segmentation Segmentation +1

Paper
Code

M2IOSR: Maximal Mutual Information Open Set Recognition

no code implementations • 5 Aug 2021 • Xin Sun, Henghui Ding, Chi Zhang, Guosheng Lin, Keck-Voon Ling

In this work, we aim to address the challenging task of open set recognition (OSR).

Open Set Learning

Paper
Add Code

Point Discriminative Learning for Data-efficient 3D Point Cloud Analysis

no code implementations • 4 Aug 2021 • Fayao Liu, Guosheng Lin, Chuan-Sheng Foo, Chaitanya K. Joshi, Jie Lin

In this work we propose PointDisc, a point discriminative learning method to leverage self-supervisions for data-efficient 3D point cloud classification and segmentation.

3D Object Classification 3D Part Segmentation +5

Paper
Add Code

Cycle-Consistent Inverse GAN for Text-to-Image Synthesis

no code implementations • 3 Aug 2021 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

In this paper, we propose a novel unified framework of Cycle-consistent Inverse GAN (CI-GAN) for both text-to-image generation and text-guided image manipulation tasks.

Image Manipulation Text-to-Image Generation

Paper
Add Code

Remember What You have drawn: Semantic Image Manipulation with Memory

no code implementations • 27 Jul 2021 • Xiangxi Shi, Zhonghua Wu, Guosheng Lin, Jianfei Cai, Shafiq Joty

Therefore, in this paper, we propose a memory-based Image Manipulation Network (MIM-Net), where a set of memories learned from images is introduced to synthesize the texture information with the guidance of the textual description.

Image Manipulation

Paper
Add Code

Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds

no code implementations • 23 Jul 2021 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung

While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels.

Point Cloud Segmentation Scene Understanding +3

Paper
Add Code

Progressive Modality Reinforcement for Human Multimodal Emotion Recognition From Unaligned Multimodal Sequences

no code implementations • CVPR 2021 • Fengmao Lv, Xiang Chen, Yanyong Huang, Lixin Duan, Guosheng Lin

In turn, it also collects the reinforced features from each modality and uses them to generate a reinforced common message.

Multimodal Emotion Recognition Time Series +1

Paper
Add Code

Self-Point-Flow: Self-Supervised Scene Flow Estimation from Point Clouds with Optimal Transport and Random Walk

no code implementations • CVPR 2021 • Ruibo Li, Guosheng Lin, Lihua Xie

Due to the scarcity of annotated scene flow data, self-supervised scene flow learning in point clouds has attracted increasing attention.

Self-Supervised Learning Self-supervised Scene Flow Estimation

Paper
Add Code

HCRF-Flow: Scene Flow from Point Clouds with Continuous High-order CRFs and Position-aware Flow Embedding

no code implementations • CVPR 2021 • Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen

Scene flow in 3D point clouds plays an important role in understanding dynamic environments.

Position

Paper
Add Code

CT-Net: Complementary Transfering Network for Garment Transfer with Arbitrary Geometric Changes

no code implementations • CVPR 2021 • Fan Yang, Guosheng Lin

Garment transfer shows great potential in realistic applications with the goal of transfering outfits across different people images.

Paper
Add Code

Few-Shot Incremental Learning with Continually Evolved Classifiers

1 code implementation • CVPR 2021 • Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu

First, we adopt a simple but effective decoupled learning strategy of representations and classifiers that only the classifiers are updated in each incremental session, which avoids knowledge forgetting in the representations.

Ranked #7 on Few-Shot Class-Incremental Learning on CIFAR-100

Few-Shot Class-Incremental Learning Incremental Learning

136

Paper
Code

Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation

1 code implementation • ICCV 2021 • Yukun Su, Ruizhou Sun, Guosheng Lin, Qingyao Wu

Data augmentation is vital for deep learning neural networks.

Data Augmentation Object +2

Paper
Code

Online Active Proposal Set Generation for Weakly Supervised Object Detection

no code implementations • 20 Jan 2021 • Ruibing Jin, Guosheng Lin, Changyun Wen

Online proposal sampling is an intuitive solution to these issues.

object-detection Weakly Supervised Object Detection

Paper
Add Code

Progressive Self-Guided Loss for Salient Object Detection

1 code implementation • 7 Jan 2021 • Sheng Yang, Weisi Lin, Guosheng Lin, Qiuping Jiang, Zichuan Liu

We present a simple yet effective progressive self-guided loss function to facilitate deep learning-based salient object detection (SOD) in images.

Object object-detection +2

Paper
Code

CycleSegNet: Object Co-segmentation with Cycle Refinement and Region Correspondence

no code implementations • 5 Jan 2021 • Chi Zhang, Guankai Li, Guosheng Lin, Qingyao Wu, Rui Yao

Image co-segmentation is an active computer vision task that aims to segment the common objects from a set of images.

Segmentation

Paper
Add Code

Self-Supervised 3D Skeleton Action Representation Learning With Motion Consistency and Continuity

no code implementations • ICCV 2021 • Yukun Su, Guosheng Lin, Qingyao Wu

Recently, self-supervised learning (SSL) has been proved very effective and it can help boost the performance in learning representations from unlabeled data in the image domain.

Action Recognition Representation Learning +2

Paper
Add Code

Attention Is Not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion

no code implementations • ICCV 2021 • Tao Liang, Guosheng Lin, Lei Feng, Yan Zhang, Fengmao Lv

To this end, both the marginal distribution and the elements with high-confidence correlations are aligned over the common space of the query and key vectors which are computed from different modalities.

Time Series Time Series Analysis +1

Paper
Add Code

Compositional Prototype Network with Multi-view Comparision for Few-Shot Point Cloud Semantic Segmentation

no code implementations • 28 Dec 2020 • Xiaoyu Chen, Chi Zhang, Guosheng Lin, Jing Han

Moreover, when we use our network to handle the long-tail problem in a fully supervised point cloud segmentation dataset, it can also effectively boost the performance of the few-shot classes.

Few-Shot Learning Point Cloud Segmentation +2

Paper
Add Code

On Lightweight Privacy-Preserving Collaborative Learning for Internet of Things by Independent Random Projections

1 code implementation • 11 Dec 2020 • Linshan Jiang, Rui Tan, Xin Lou, Guosheng Lin

This paper considers the design and implementation of a practical privacy-preserving collaborative learning scheme, in which a curious learning coordinator trains a better machine learning model based on the data samples contributed by a number of IoT objects, while the confidentiality of the raw forms of the training data is protected against the coordinator.

BIG-bench Machine Learning Privacy Preserving

Paper
Code

Learning Regional Purity for Instance Segmentation on 3D Point Clouds

1 code implementation • European Conference on Computer Vision (ECCV) 2020 • Shichao Dong, Guosheng Lin, Tzu-Yi Hung

In this paper, we define a novel concept of “regional purity” as the percentage of neighboring points belonging to the same instance within a fixed-radius 3D space.

Ranked #13 on 3D Instance Segmentation on ScanNet(v2)

3D Instance Segmentation 3D Semantic Segmentation +4

Paper
Code

Feature Flow: In-network Feature Flow Estimation for Video Object Detection

no code implementations • 21 Sep 2020 • Ruibing Jin, Guosheng Lin, Changyun Wen, Jianliang Wang, Fayao Liu

Optical flow, which expresses pixel displacement, is widely used in many computer vision tasks to provide pixel-level motion information.

object-detection Optical Flow Estimation +1

Paper
Add Code

Structure-Aware Generation Network for Recipe Generation from Images

1 code implementation • ECCV 2020 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

We investigate an open research task of generating cooking instructions based on only food images and ingredients, which is similar to the image captioning task.

Image Captioning Recipe Generation +1

Paper
Code

Graph Edit Distance Reward: Learning to Edit Scene Graph

no code implementations • ECCV 2020 • Lichang Chen, Guosheng Lin, Shijie Wang, Qingyao Wu

Scene Graph, as a vital tool to bridge the gap between language domain and image domain, has been widely adopted in the cross-modality task like VQA.

Graph Matching Image Retrieval +2

Paper
Add Code

Open Set Recognition with Conditional Probabilistic Generative Models

no code implementations • 12 Aug 2020 • Xin Sun, Chi Zhang, Guosheng Lin, Keck-Voon Ling

A typical challenge that hinders their real-world applications is that unknown samples may be fed into the system during the testing phase, but traditional deep neural networks will wrongly recognize these unknown samples as one of the known classes.

Open Set Learning

Paper
Add Code

Decomposing Generation Networks with Structure Prediction for Recipe Generation

no code implementations • 27 Jul 2020 • Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

Recipe generation from food images and ingredients is a challenging task, which requires the interpretation of the information from another modality.

Image Captioning Recipe Generation +1

Paper
Add Code

Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds

1 code implementation • CVPR 2020 • Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Tzu-Yi Hung, Lihua Xie

To the best of our knowledge, this is the first method that uses cloud-level weak labels on raw 3D space to train a point cloud semantic segmentation network.

3D Semantic Segmentation Point Cloud Segmentation +2

Paper
Code

CRNet: Cross-Reference Networks for Few-Shot Segmentation

no code implementations • CVPR 2020 • Weide Liu, Chi Zhang, Guosheng Lin, Fayao Liu

In this paper, we propose a cross-reference network (CRNet) for few-shot segmentation.

Image Segmentation Segmentation +1

Paper
Add Code

Exploring Bottom-up and Top-down Cues with Attentive Learning for Webly Supervised Object Detection

no code implementations • CVPR 2020 • Zhonghua Wu, Qingyi Tao, Guosheng Lin, Jianfei Cai

To reduce the human labeling effort, we propose a novel webly supervised object detection (WebSOD) method for novel classes which only requires the web images without further annotations.

Object object-detection +2

Paper
Add Code

DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning

5 code implementations • 15 Mar 2020 • Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen

We employ the Earth Mover's Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance.

Classification Few-Shot Image Classification +4

561

Paper
Code

Video Object Segmentation and Tracking: A Survey

no code implementations • 19 Apr 2019 • Rui Yao, Guosheng Lin, Shixiong Xia, Jiaqi Zhao, Yong Zhou

Second, we provide a detailed discussion and overview of the technical characteristics of the different methods.

Autonomous Vehicles Object +7

Paper
Add Code

A Dilated Inception Network for Visual Saliency Prediction

1 code implementation • 7 Apr 2019 • Sheng Yang, Guosheng Lin, Qiuping Jiang, Weisi Lin

In this work, we proposed an end-to-end dilated inception network (DINet) for visual saliency prediction.

Ranked #6 on Video Saliency Detection on MSU Video Saliency Prediction

Saliency Prediction Video Saliency Detection

Paper
Code

Towards Robust Curve Text Detection with Conditional Spatial Expansion

no code implementations • CVPR 2019 • Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh

It is challenging to detect curve texts due to their irregular shapes and varying sizes.

object-detection Object Detection +1

Paper
Add Code

CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning

1 code implementation • CVPR 2019 • Chi Zhang, Guosheng Lin, Fayao Liu, Rui Yao, Chunhua Shen

Recent progress in semantic segmentation is driven by deep Convolutional Neural Networks and large-scale labeled image datasets.

Ranked #86 on Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot)

Few-Shot Semantic Segmentation Segmentation +1

185

Paper
Code

On Lightweight Privacy-Preserving Collaborative Learning for IoT Objects

no code implementations • 13 Feb 2019 • Linshan Jiang, Rui Tan, Xin Lou, Guosheng Lin

BIG-bench Machine Learning Privacy Preserving

Paper
Add Code

M2E-Try On Net: Fashion from Model to Everyone

no code implementations • 21 Nov 2018 • Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai

Instead, we present a novel virtual Try-On network, M2E-Try On Net, which transfers the clothes from a model image to a person image without the need of any clean product images.

Virtual Try-on

Paper
Add Code

Correlation Propagation Networks for Scene Text Detection

no code implementations • 30 Sep 2018 • Zichuan Liu, Guosheng Lin, Wang Ling Goh, Fayao Liu, Chunhua Shen, Xiaokang Yang

In this work, we propose a novel hybrid method for scene text detection namely Correlation Propagation Network (CPN).

Scene Text Detection Text Detection

Paper
Add Code

Keypoint Based Weakly Supervised Human Parsing

no code implementations • 14 Sep 2018 • Zhonghua Wu, Guosheng Lin, Jianfei Cai

We develop an iterative learning method to generate pseudo part segmentation masks from keypoint labels.

Human Parsing Segmentation +1

Paper
Add Code

Bootstrapping the Performance of Webly Supervised Semantic Segmentation

1 code implementation • CVPR 2018 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

In this work, we focus on weak supervision, developing a method for training a high-quality pixel-level classifier for semantic segmentation, using only image-level class labels as the provided ground-truth.

Segmentation Transfer Learning +2

Paper
Code

MoNet: Deep Motion Exploitation for Video Object Segmentation

no code implementations • CVPR 2018 • Huaxin Xiao, Jiashi Feng, Guosheng Lin, Yu Liu, Maojun Zhang

In this paper, we propose a novel MoNet model to deeply exploit motion cues for boosting video object segmentation performance from two aspects, i. e., frame representation learning and segmentation refinement.

Object Optical Flow Estimation +5

Paper
Add Code

Learning Markov Clustering Networks for Scene Text Detection

no code implementations • CVPR 2018 • Zichuan Liu, Guosheng Lin, Sheng Yang, Jiashi Feng, Weisi Lin, Wang Ling Goh

MCN predicts instance-level bounding boxes by firstly converting an image into a Stochastic Flow Graph (SFG) and then performing Markov Clustering on this graph.

Clustering Scene Text Detection +1

Paper
Add Code

Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation

no code implementations • 7 Mar 2018 • Tianyi Zhang, Guosheng Lin, Jianfei Cai, Tong Shen, Chunhua Shen, Alex C. Kot

In our work, we focus on the weakly supervised semantic segmentation with image label annotations.

Image Captioning Segmentation +2

Paper
Add Code

Weakly Supervised Semantic Segmentation Based on Web Image Co-segmentation

no code implementations • 25 May 2017 • Tong Shen, Guosheng Lin, Lingqiao Liu, Chunhua Shen, Ian Reid

Training a Fully Convolutional Network (FCN) for semantic segmentation requires a large number of masks with pixel level labelling, which involves a large amount of human labour and time for annotation.

Segmentation Weakly supervised Semantic Segmentation +1

Paper
Add Code

Structured Learning of Tree Potentials in CRF for Image Segmentation

no code implementations • 26 Mar 2017 • Fayao Liu, Guosheng Lin, Ruizhi Qiao, Chunhua Shen

In this fashion, we easily achieve nonlinear learning of potential functions on both unary and pairwise terms in CRFs.

Image Segmentation Semantic Segmentation

Paper
Add Code

Efficient Dense Labeling of Human Activity Sequences from Wearables using Fully Convolutional Networks

no code implementations • 20 Feb 2017 • Rui Yao, Guosheng Lin, Qinfeng Shi, Damith Ranasinghe

We conduct extensive experiments and demonstrate that our proposed approach is able to outperform the state-of-the-arts in terms of classification and label misalignment measures on three challenging datasets: Opportunity, Hand Gesture, and our new dataset.

Paper
Add Code

Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation

no code implementations • 25 Jan 2017 • Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

Semantic image segmentation is a fundamental task in image understanding.

Image Segmentation Segmentation +1

Paper
Add Code

Sequential Person Recognition in Photo Albums with a Recurrent Network

no code implementations • CVPR 2017 • Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton Van Den Hengel

In this work, we propose to model the relational information between people as a sequence prediction task.

Person Recognition

Paper
Add Code

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

13 code implementations • CVPR 2017 • Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid

Recently, very deep convolutional neural networks (CNNs) have shown outstanding performance in object recognition and have also been the first choice for dense classification problems such as semantic segmentation.

Ranked #13 on Semantic Segmentation on Trans10K

3D Absolute Human Pose Estimation Semantic Segmentation +1

584

Paper
Code

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps

no code implementations • 29 Apr 2016 • Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun

Crowd counting is an important task in computer vision, which has many applications in video surveillance.

Attribute Crowd Counting

Paper
Add Code

Exploring Context with Deep Structured models for Semantic Segmentation

no code implementations • 10 Mar 2016 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, Ian Reid

We formulate deep structured models by combining CNNs and Conditional Random Fields (CRFs) for learning the patch-patch context between image regions.

Image Segmentation Segmentation +1

Paper
Add Code

Fast Training of Triplet-based Deep Binary Embedding Networks

no code implementations • CVPR 2016 • Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian Reid

To solve the first stage, we design a large-scale high-order binary codes inference algorithm to reduce the high-order objective to a standard binary quadratic problem such that graph cuts can be used to efficiently infer the binary code which serve as the label of each training datum.

Image Retrieval Multi-Label Classification +1

Paper
Add Code

Structured Learning of Binary Codes with Column Generation

no code implementations • 22 Feb 2016 • Guosheng Lin, Fayao Liu, Chunhua Shen, Jianxin Wu, Heng Tao Shen

Our column generation based method can be further generalized from the triplet loss to a general structured learning based framework that allows one to directly optimize multivariate performance measures.

Image Retrieval Information Retrieval +1

Paper
Add Code

Discriminative Training of Deep Fully-connected Continuous CRF with Task-specific Loss

no code implementations • 28 Jan 2016 • Fayao Liu, Guosheng Lin, Chunhua Shen

We exemplify the usefulness of the proposed model on multi-class semantic labelling (discrete) and the robust depth estimation (continuous) problems.

Depth Estimation Multi-class Classification

Paper
Add Code

Deeply Learning the Messages in Message Passing Inference

no code implementations • NeurIPS 2015 • Guosheng Lin, Chunhua Shen, Ian Reid, Anton Van Den Hengel

The network output dimension for message estimation is the same as the number of classes, in contrast to the network output for general CNN potential functions in CRFs, which is exponential in the order of the potentials.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Efficient piecewise training of deep structured models for semantic segmentation

no code implementations • CVPR 2016 • Guosheng Lin, Chunhua Shen, Anton van dan Hengel, Ian Reid

Recent advances in semantic image segmentation have mostly been achieved by training deep convolutional neural networks (CNNs).

Ranked #54 on Semantic Segmentation on PASCAL Context

Image Segmentation Segmentation +1

Paper
Add Code

CRF Learning with CNN Features for Image Segmentation

no code implementations • 28 Mar 2015 • Fayao Liu, Guosheng Lin, Chunhua Shen

The deep CNN is trained on the ImageNet dataset and transferred to image segmentations here for constructing potentials of superpixels.

Image Segmentation Segmentation +2

Paper
Add Code

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields

1 code implementation • 26 Feb 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin, Ian Reid

Therefore, here we present a deep convolutional neural field model for estimating depths from single monocular images, aiming to jointly explore the capacity of deep CNN and continuous CRF.

Depth Estimation

Paper
Code

Deep Convolutional Neural Fields for Depth Estimation from a Single Image

no code implementations • CVPR 2015 • Fayao Liu, Chunhua Shen, Guosheng Lin

Therefore, we in this paper present a deep convolutional neural field model for estimating depths from a single image, aiming to jointly explore the capacity of deep CNN and continuous CRF.

Depth Estimation

Paper
Add Code

Supervised Hashing Using Graph Cuts and Boosted Decision Trees

1 code implementation • 24 Aug 2014 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel

The proposed framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods.

Descriptive Image Retrieval +1

Paper
Code

Optimizing Ranking Measures for Compact Binary Code Learning

no code implementations • 4 Jul 2014 • Guosheng Lin, Chunhua Shen, Jianxin Wu

Hashing has proven a valuable tool for large-scale information retrieval.

Image Retrieval Information Retrieval +1

Paper
Add Code

Fast Supervised Hashing with Decision Trees for High-Dimensional Data

1 code implementation • CVPR 2014 • Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton Van Den Hengel, David Suter

Here we propose to use boosted decision trees for achieving non-linearity in hashing, which are fast to train and evaluate, hence more suitable for hashing with high dimensional data.

Retrieval Vocal Bursts Intensity Prediction

Paper
Code

Fast Training of Effective Multi-class Boosting Using Coordinate Descent Optimization

no code implementations • 23 Nov 2013 • Guosheng Lin, Chunhua Shen, Anton Van Den Hengel, David Suter

Different from most existing multi-class boosting methods, which use the same set of weak learners for all the classes, we train class specified weak learners (i. e., each class has a different set of weak learners).

Multi-class Classification

Paper
Add Code

A General Two-Step Approach to Learning-Based Hashing

no code implementations • 7 Sep 2013 • Guosheng Lin, Chunhua Shen, David Suter, Anton Van Den Hengel

This framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods.

Vocal Bursts Valence Prediction

Paper
Add Code

StructBoost: Boosting Methods for Predicting Structured Output Variables

no code implementations • 14 Feb 2013 • Chunhua Shen, Guosheng Lin, Anton Van Den Hengel

Inspired by structured support vector machines (SSVM), here we propose a new boosting algorithm for structured output prediction, which we refer to as StructBoost.

Image Segmentation Multi-class Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.