Search Results for author: Wenxiong Kang

Found 14 papers, 6 papers with code

SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention

no code implementations13 Mar 2024 Feng Xiao, Hongbin Xu, Qiuxia Wu, Wenxiong Kang

3D visual grounding aims to automatically locate the 3D region of the specified object given the corresponding textual description.

Graph Attention Relation +2

StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields

no code implementations13 Mar 2024 Hongbin Xu, Weitao Chen, Feng Xiao, Baigui Sun, Wenxiong Kang

In this paper, we introduce StyleDyRF, a method that represents the 4D feature space by deforming a canonical feature volume and learns a linear style transformation matrix on the feature volume in a data-driven fashion.

Style Transfer

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

no code implementations18 Dec 2023 Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Moreover, to facilitate disentangled representation learning, we introduce four well-designed constraints: an auxiliary style classifier, an auxiliary inverse classifier, a content contrastive loss, and a pair of latent cycle losses, which can effectively contribute to the construction of the identity-related style space and semantic-related content space.

Disentanglement

A Novel Transfer Learning Method Utilizing Acoustic and Vibration Signals for Rotating Machinery Fault Diagnosis

no code implementations20 Oct 2023 Zhongliang Chen, Zhuofei Huang, Wenxiong Kang

Fault diagnosis of rotating machinery plays a important role for the safety and stability of modern industrial systems.

Transfer Learning

CostFormer:Cost Transformer for Cost Aggregation in Multi-view Stereo

no code implementations17 May 2023 Weitao Chen, Hongbin Xu, Zhipeng Zhou, Yang Liu, Baigui Sun, Wenxiong Kang, Xuansong Xie

The Residual Depth-Aware Cost Transformer(RDACT) is proposed to aggregate long-range features on cost volume via self-attention mechanisms along the depth and spatial dimensions.

PointDC:Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering

1 code implementation18 Apr 2023 Zisheng Chen, Hongbin Xu, Weitao Chen, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang

Semantic segmentation of point clouds usually requires exhausting efforts of human annotations, hence it attracts wide attention to the challenging topic of learning from unlabeled or weaker forms of annotations.

Clustering Segmentation +1

PointDC: Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-Modal Distillation and Super-Voxel Clustering

1 code implementation ICCV 2023 Zisheng Chen, Hongbin Xu, Weitao Chen, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang

Semantic segmentation of point clouds usually requires exhausting efforts of human annotations, hence it attracts wide attention to a challenging topic of learning from unlabeled or weaker form of annotations.

Clustering Segmentation +1

Semi-supervised Deep Multi-view Stereo

no code implementations24 Jul 2022 Hongbin Xu, Weitao Chen, Yang Liu, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang

For further troublesome case that the basic assumption is conflicted in MVS data, we propose a novel style consistency loss to alleviate the negative effect caused by the distribution gap.

Digging into Uncertainty in Self-supervised Multi-view Stereo

1 code implementation ICCV 2021 Hongbin Xu, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao

Specially, the limitations can be categorized into two types: ambiguious supervision in foreground and invalid supervision in background.

Image Reconstruction Self-Supervised Learning

Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation

1 code implementation12 Apr 2021 Hongbin Xu, Zhipeng Zhou, Yu Qiao, Wenxiong Kang, Qiuxia Wu

Recent studies have witnessed that self-supervised methods based on view synthesis obtain clear progress on multi-view stereo (MVS).

Data Augmentation

JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image

1 code implementation ECCV 2020 Linpu Fang, Xingyan Liu, Li Liu, Hang Xu, Wenxiong Kang

The key ideas are two-fold: a) explicitly modeling the dependencies among joints and the relations between the pixels and the joints for better local feature representation learning; b) unifying the dense pixel-wise offset predictions and direct joint regression for end-to-end training.

3D Hand Pose Estimation regression +1

Dynamic Group Convolution for Accelerating Convolutional Neural Networks

1 code implementation ECCV 2020 Zhuo Su, Linpu Fang, Wenxiong Kang, Dewen Hu, Matti Pietikäinen, Li Liu

In this paper, we propose dynamic group convolution (DGC) that adaptively selects which part of input channels to be connected within each group for individual samples on the fly.

Computational Efficiency Image Classification

Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN

no code implementations18 Feb 2020 Hang Xu, Linpu Fang, Xiaodan Liang, Wenxiong Kang, Zhenguo Li

Finally, an InterDomain Transfer Module is proposed to exploit diverse transfer dependencies across all domains and enhance the regional feature representation by attending and transferring semantic contexts globally.

Object object-detection +2

Cannot find the paper you are looking for? You can Submit a new open access paper.