Search Results for author: Wenxiong Kang

Found 14 papers, 6 papers with code

SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention

no code implementations • 13 Mar 2024 • Feng Xiao, Hongbin Xu, Qiuxia Wu, Wenxiong Kang

3D visual grounding aims to automatically locate the 3D region of the specified object given the corresponding textual description.

Graph Attention Relation +2

Paper
Add Code

StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields

no code implementations • 13 Mar 2024 • Hongbin Xu, Weitao Chen, Feng Xiao, Baigui Sun, Wenxiong Kang

In this paper, we introduce StyleDyRF, a method that represents the 4D feature space by deforming a canonical feature volume and learns a linear style transformation matrix on the feature volume in a data-driven fashion.

Style Transfer

Paper
Add Code

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

no code implementations • 18 Dec 2023 • Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Moreover, to facilitate disentangled representation learning, we introduce four well-designed constraints: an auxiliary style classifier, an auxiliary inverse classifier, a content contrastive loss, and a pair of latent cycle losses, which can effectively contribute to the construction of the identity-related style space and semantic-related content space.

Disentanglement

Paper
Add Code

A Novel Transfer Learning Method Utilizing Acoustic and Vibration Signals for Rotating Machinery Fault Diagnosis

no code implementations • 20 Oct 2023 • Zhongliang Chen, Zhuofei Huang, Wenxiong Kang

Fault diagnosis of rotating machinery plays a important role for the safety and stability of modern industrial systems.

Transfer Learning

Paper
Add Code

CostFormer:Cost Transformer for Cost Aggregation in Multi-view Stereo

no code implementations • 17 May 2023 • Weitao Chen, Hongbin Xu, Zhipeng Zhou, Yang Liu, Baigui Sun, Wenxiong Kang, Xuansong Xie

The Residual Depth-Aware Cost Transformer(RDACT) is proposed to aggregate long-range features on cost volume via self-attention mechanisms along the depth and spatial dimensions.

Paper
Add Code

PointDC:Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering

1 code implementation • 18 Apr 2023 • Zisheng Chen, Hongbin Xu, Weitao Chen, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang

Semantic segmentation of point clouds usually requires exhausting efforts of human annotations, hence it attracts wide attention to the challenging topic of learning from unlabeled or weaker forms of annotations.

Clustering Segmentation +1

Paper
Code

PointDC: Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-Modal Distillation and Super-Voxel Clustering

1 code implementation • ICCV 2023 • Zisheng Chen, Hongbin Xu, Weitao Chen, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang

Semantic segmentation of point clouds usually requires exhausting efforts of human annotations, hence it attracts wide attention to a challenging topic of learning from unlabeled or weaker form of annotations.

Clustering Segmentation +1

Paper
Code

Semi-supervised Deep Multi-view Stereo

no code implementations • 24 Jul 2022 • Hongbin Xu, Weitao Chen, Yang Liu, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang

For further troublesome case that the basic assumption is conflicted in MVS data, we propose a novel style consistency loss to alleviate the negative effect caused by the distribution gap.

Paper
Add Code

Unconstrained Face Sketch Synthesis via Perception-Adaptive Network and A New Benchmark

no code implementations • 2 Dec 2021 • Lin Nie, Lingbo Liu, Zhengtao Wu, Wenxiong Kang

Face sketch generation has attracted much attention in the field of visual computing.

Face Sketch Synthesis Representation Learning

Paper
Add Code

Digging into Uncertainty in Self-supervised Multi-view Stereo

1 code implementation • ICCV 2021 • Hongbin Xu, Zhipeng Zhou, Yali Wang, Wenxiong Kang, Baigui Sun, Hao Li, Yu Qiao

Specially, the limitations can be categorized into two types: ambiguious supervision in foreground and invalid supervision in background.

Image Reconstruction Self-Supervised Learning

Paper
Code

Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation

1 code implementation • 12 Apr 2021 • Hongbin Xu, Zhipeng Zhou, Yu Qiao, Wenxiong Kang, Qiuxia Wu

Recent studies have witnessed that self-supervised methods based on view synthesis obtain clear progress on multi-view stereo (MVS).

Data Augmentation

150

Paper
Code

JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image

1 code implementation • ECCV 2020 • Linpu Fang, Xingyan Liu, Li Liu, Hang Xu, Wenxiong Kang

The key ideas are two-fold: a) explicitly modeling the dependencies among joints and the relations between the pixels and the joints for better local feature representation learning; b) unifying the dense pixel-wise offset predictions and direct joint regression for end-to-end training.

3D Hand Pose Estimation regression +1

Paper
Code

Dynamic Group Convolution for Accelerating Convolutional Neural Networks

1 code implementation • ECCV 2020 • Zhuo Su, Linpu Fang, Wenxiong Kang, Dewen Hu, Matti Pietikäinen, Li Liu

In this paper, we propose dynamic group convolution (DGC) that adaptively selects which part of input channels to be connected within each group for individual samples on the fly.

Computational Efficiency Image Classification

126

Paper
Code

Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN

no code implementations • 18 Feb 2020 • Hang Xu, Linpu Fang, Xiaodan Liang, Wenxiong Kang, Zhenguo Li

Finally, an InterDomain Transfer Module is proposed to exploit diverse transfer dependencies across all domains and enhance the regional feature representation by attending and transferring semantic contexts globally.

Object object-detection +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.