Search Results for author: Yuxi Wang

Found 28 papers, 12 papers with code

OOD-HOI: Text-Driven 3D Whole-Body Human-Object Interactions Generation Beyond Training Domains

no code implementations27 Nov 2024 Yixuan Zhang, Hui Yang, Chuanchen Luo, Junran Peng, Yuxi Wang, Zhaoxiang Zhang

Generating realistic 3D human-object interactions (HOIs) from text descriptions is a active research topic with potential applications in virtual and augmented reality, robotics, and animation.

Human-Object Interaction Detection

Enhancing Sound Source Localization via False Negative Elimination

1 code implementation29 Aug 2024 Zengjie Song, Jiangshe Zhang, Yuxi Wang, Junsong Fan, Zhaoxiang Zhang

To address this issue, we propose a novel audio-visual learning framework which is instantiated with two individual learning schemes: self-supervised predictive learning (SSPL) and semantic-aware contrastive learning (SACL).

audio-visual learning Contrastive Learning +3

CityX: Controllable Procedural Content Generation for Unbounded 3D Cities

no code implementations24 Jul 2024 Shougao Zhang, Mengqi Zhou, Yuxi Wang, Chuanchen Luo, Rongyu Wang, Yiwei Li, XuCheng Yin, Zhaoxiang Zhang, Junran Peng

To address these problems, we propose a novel multi-modal controllable procedural content generation method, named CityX, which enhances realistic, unbounded 3D city generation guided by multiple layout conditions, including OSM, semantic maps, and satellite images.

Scene Generation

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

no code implementations18 Jul 2024 Pengfei Wang, Yuxi Wang, Shuai Li, Zhaoxiang Zhang, Zhen Lei, Lei Zhang

The scarcity of large-scale 3D-text paired data poses a great challenge on open vocabulary 3D scene understanding, and hence it is popular to leverage internet-scale 2D data and transfer their open vocabulary capabilities to 3D models through knowledge distillation.

Knowledge Distillation Representation Learning +1

MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets

no code implementations22 Apr 2024 Zeyu Li, Ruitong Gan, Chuanchen Luo, Yuxi Wang, Jiaheng Liu, Ziwei Zhu Man Zhang, Qing Li, XuCheng Yin, Zhaoxiang Zhang, Junran Peng

Driven by powerful image diffusion models, recent research has achieved the automatic creation of 3D objects from textual or visual guidance.

SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition

1 code implementation31 Jan 2024 Xu Hu, Yuxi Wang, Lue Fan, Junsong Fan, Junran Peng, Zhen Lei, Qing Li, Zhaoxiang Zhang

3D Gaussian Splatting has emerged as an alternative 3D representation for novel view synthesis, benefiting from its high-quality rendering results and real-time rendering speed.

Novel View Synthesis Segmentation +1

FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes

no code implementations7 Jan 2024 Genghao Zhang, Yuxi Wang, Chuanchen Luo, Shibiao Xu, Zhaoxiang Zhang, Man Zhang, Junran Peng

Indoor scene generation has attracted significant attention recently as it is crucial for applications of gaming, virtual reality, and interior design.

Diversity Scene Generation

HardMo: A Large-Scale Hardcase Dataset for Motion Capture

no code implementations CVPR 2024 Jiaqi Liao, Chuanchen Luo, Yinuo Du, Yuxi Wang, XuCheng Yin, Man Zhang, Zhaoxiang Zhang, Junran Peng

Empirically we find that the prediction failure in dance and martial arts is mainly characterized by the misalignment of hand-wrist and foot-ankle.

Human Mesh Recovery

Bootstrap Masked Visual Modeling via Hard Patches Mining

1 code implementation21 Dec 2023 Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tiancai Wang, Xiangyu Zhang, Zhaoxiang Zhang

To empower the model as a teacher, we propose Hard Patches Mining (HPM), predicting patch-wise losses and subsequently determining where to mask.

CompenHR: Efficient Full Compensation for High-resolution Projector

1 code implementation22 Nov 2023 Yuxi Wang, Haibin Ling, Bingyao Huang

Full projector compensation is a practical task of projector-camera systems.

Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation

no code implementations ICCV 2023 Yuxi Wang, Jian Liang, Jun Xiao, Shuqi Mei, Yuran Yang, Zhaoxiang Zhang

One-shot domain adaptation methods attempt to overcome these challenges by transferring the pre-trained source model to the target domain using only one target data.

Domain Adaptation Semantic Segmentation +1

DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

1 code implementation NeurIPS 2023 Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tong Wang, Zhaoxiang Zhang

As it is empirically observed that Vision Transformers (ViTs) are quite insensitive to the order of input tokens, the need for an appropriate self-supervised pretext task that enhances the location awareness of ViTs is becoming evident.

Position Spatial Reasoning

TDG: Text-guided Domain Generalization

no code implementations19 Aug 2023 Geng Liu, Yuxi Wang

Domain generalization (DG) attempts to generalize a model trained on single or multiple source domains to the unseen target domain.

Domain Generalization

DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation

no code implementations2 Aug 2023 Jingfan Chen, Yuxi Wang, Pengfei Wang, Xiao Chen, Zhaoxiang Zhang, Zhen Lei, Qing Li

The Class Incremental Semantic Segmentation (CISS) extends the traditional segmentation task by incrementally learning newly added classes.

Class-Incremental Semantic Segmentation Segmentation

Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation

1 code implementation4 Jun 2023 Haochen Wang, Yuchao Wang, Yujun Shen, Junsong Fan, Yuxi Wang, Zhaoxiang Zhang

A common practice is to select the highly confident predictions as the pseudo-ground-truths for each pixel, but it leads to a problem that most pixels may be left unused due to their unreliability.

Semantic Segmentation

Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation

1 code implementation23 May 2023 Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Liwei Wu, Yuxi Wang, Zhaoxiang Zhang

To this end, we propose T2S-DA, which we interpret as a form of pulling Target to Source for Domain Adaptation, encouraging the model in learning similar cross-domain features.

Domain Generalization Semantic Segmentation

Hard Patches Mining for Masked Image Modeling

1 code implementation CVPR 2023 Haochen Wang, Kaiyou Song, Junsong Fan, Yuxi Wang, Jin Xie, Zhaoxiang Zhang

We observe that the reconstruction loss can naturally be the metric of the difficulty of the pre-training task.

A Survey of Deep Visual Cross-Domain Few-Shot Learning

no code implementations16 Mar 2023 Wenjian Wang, Lijuan Duan, Yuxi Wang, Junsong Fan, Zhi Gong, Zhaoxiang Zhang

Research into Cross-Domain Few-Shot (CDFS) has emerged to address this issue, forming a more challenging and realistic setting.

cross-domain few-shot learning Survey +1

TexPrax: A Messaging Application for Ethical, Real-time Data Collection and Annotation

1 code implementation16 Aug 2022 Lorenz Stangier, Ji-Ung Lee, Yuxi Wang, Marvin Müller, Nicholas Frick, Joachim Metternich, Iryna Gurevych

We evaluate TexPrax in a user-study with German factory employees who ask their colleagues for solutions on problems that arise during their daily work.

Chatbot Sentence

Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

no code implementations CVPR 2022 Wenjian Wang, Lijuan Duan, Yuxi Wang, Qing En, Junsong Fan, Zhaoxiang Zhang

To remedy this problem, we propose an interesting and challenging cross-domain few-shot semantic segmentation task, where the training and test tasks perform on different domains.

Contrastive Learning Cross-Domain Few-Shot +3

A Curriculum-style Self-training Approach for Source-Free Semantic Segmentation

1 code implementation22 Jun 2021 Yuxi Wang, Jian Liang, Zhaoxiang Zhang

Source-free domain adaptation has developed rapidly in recent years, where the well-trained source model is adapted to the target domain instead of the source data, offering the potential for privacy concerns and intellectual property protection.

Representation Learning Segmentation +2

Uncertainty-Aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation

no code implementations ICCV 2021 Yuxi Wang, Junran Peng, Zhaoxiang Zhang

Unsupervised domain adaptation for semantic segmentation aims to assign the pixel-level labels for unlabeled target domain by transferring knowledge from the labeled source domain.

Pseudo Label Self-Supervised Learning +2

High-speed real-time single-pixel microscopy based on Fourier sampling

no code implementations15 Jun 2016 Qiang Guo, Hongwei Chen, Yuxi Wang, Yong Guo, Peng Liu, Xiurui Zhu, Zheng Cheng, Zhenming Yu, Minghua Chen, Sigang Yang, Shizhong Xie

However, according to CS theory, image reconstruction is an iterative process that consumes enormous amounts of computational time and cannot be performed in real time.

Image Reconstruction Image Restoration +1

Cannot find the paper you are looking for? You can Submit a new open access paper.