Search Results for author: Xingyu Chen

Found 53 papers, 21 papers with code

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

no code implementations 28 Feb 2024 Zeyang Liu, Lipeng Wan, Xinrui Yang, Zhuoran Chen, Xingyu Chen, Xuguang Lan

To address this limitation, we propose Imagine, Initialize, and Explore (IIE), a novel method that offers a promising solution for efficient multi-agent exploration in complex scenarios.

Action Generation, SMAC+

ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization

no code implementations 17 Jan 2024 Weiyao Wang, Pierre Gleize, Hao Tang, Xingyu Chen, Kevin J Liang, Matt Feiszli

Neural Radiance Fields (NeRF) exhibit remarkable performance for Novel View Synthesis (NVS) given a set of 2D images.

Novel View Synthesis

Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data

no code implementations 30 Nov 2023 Yu Deng, Duomin Wang, Xiaohang Ren, Xingyu Chen, Baoyuan Wang

The key is to first learn a part-wise 4D generative model from monocular images via adversarial learning, to synthesize multi-view images of diverse identities and full motions as training data; then leverage a transformer-based animatable triplane reconstructor to learn 4D head reconstruction using the synthetic data.

3D Reconstruction

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images

no code implementations 27 Nov 2023 Xihe Yang, Xingyu Chen, Shaohui Wang, Daiheng Gao, Xiaoguang Han, Baoyuan Wang

As for human avatar reconstruction, contemporary techniques commonly necessitate the acquisition of costly data and struggle to achieve satisfactory results from a small number of casual images.

Head-Related Transfer Function Interpolation with a Spherical CNN

1 code implementation 15 Sep 2023 Xingyu Chen, Fei Ma, Yile Zhang, Amy Bastine, Prasanga N. Samarasinghe

The proposed method realizes the convolution process by decomposing and reconstructing the HRTF through spherical harmonics (SHs).

A Sigmoid-based car-following model to improve acceleration stability in traffic oscillation and following failure in free flow

no code implementations 3 Sep 2023 Xingyu Chen, Haijian Bai

This paper proposes an improved Intelligent Driver Model (Sigmoid-IDM) to address the problems of excessive acceleration in traffic oscillation and following failure in free flow.

Spatial Upsampling of Head-Related Transfer Functions Using a Physics-Informed Neural Network

1 code implementation 27 Jul 2023 Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Xingyu Chen

Head-related transfer functions (HRTFs) capture the information that a person uses to localize sound sources in space, and thus are crucial for creating personalized virtual acoustic experiences.

Sound Field Estimation around a Rigid Sphere with Physics-informed Neural Network

1 code implementation 26 Jul 2023 Xingyu Chen, Fei Ma, Amy Bastine, Prasanga Samarasinghe, Huiyuan Sun

To overcome this challenge, this paper proposes a method for sound field estimation based on a physics-informed neural network.

ESMC: Entire Space Multi-Task Model for Post-Click Conversion Rate via Parameter Constraint

no code implementations 18 Jul 2023 Zhenhao Jiang, Biao Zeng, Hao Feng, Jin Liu, Jicong Fan, Jie Zhang, Jia Jia, Ning Hu, Xingyu Chen, Xuguang Lan

We propose a novel Entire Space Multi-Task Model for Post-Click Conversion Rate via Parameter Constraint (ESMC) and two alternatives: Entire Space Multi-Task Model with Siamese Network (ESMS) and Entire Space Multi-Task Model in Global Domain (ESMG) to address the PSC issue.

Decision Making, Recommendation Systems

Reinforced Disentanglement for Face Swapping without Skip Connection

no code implementations ICCV 2023 Xiaohang Ren, Xingyu Chen, Pengfei Yao, Heung-Yeung Shum, Baoyuan Wang

The SOTA face swap models still suffer from the problem of either the target identity (i.e., shape) being leaked or the target non-identity attributes (i.e., background, hair) failing to be fully preserved in the final results.

Disentanglement, Face Swapping

MMRDN: Consistent Representation for Multi-View Manipulation Relationship Detection in Object-Stacked Scenes

no code implementations 25 Apr 2023 Han Wang, Jiayuan Zhang, Lipeng Wan, Xingyu Chen, Xuguang Lan, Nanning Zheng

Manipulation relationship detection (MRD) aims to guide the robot to grasp objects in the right order, which is important to ensure the safety and reliability of grasping in object-stacked scenes.

Position, Relationship Detection

SimTS: Rethinking Contrastive Representation Learning for Time Series Forecasting

1 code implementation 31 Mar 2023 Xiaochen Zheng, Xingyu Chen, Manuel Schürch, Amina Mollaysa, Ahmed Allam, Michael Krauthammer

Contrastive learning methods have shown an impressive ability to learn meaningful representations for image or time series classification.

Contrastive Learning, Representation Learning

Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation

no code implementations ICCV 2023 Xingyu Chen, Yu Deng, Baoyuan Wang

Improving the photorealism via CNN-based 2D super-resolution can break the strict 3D consistency, while keeping the 3D consistency by learning high-resolution 3D representations for direct rendering often compromises image quality.

Image Generation, Representation Learning

FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning

1 code implementation 1 Dec 2022 Yulei Qin, Xingyu Chen, Chao Chen, Yunhang Shen, Bo Ren, Yun Gu, Jie Yang, Chunhua Shen

Most existing methods focus on learning noise-robust models from web images while neglecting the performance drop caused by the differences between the web domain and the real-world domain.

Contrastive Learning, Representation Learning

Hand Avatar: Free-Pose Hand Animation and Rendering from Monocular Video

no code implementations CVPR 2023 Xingyu Chen, Baoyuan Wang, Heung-Yeung Shum

We present HandAvatar, a novel representation for hand animation and rendering, which can generate smoothly compositional geometry and self-occlusion-aware texture.

Disentanglement

Place Recognition under Occlusion and Changing Appearance via Disentangled Representations

no code implementations 21 Nov 2022 Yue Chen, Xingyu Chen, Yicen Li

Place recognition is a critical and challenging task for mobile robots, aiming to retrieve an image captured at the same place as a query image from a database.

Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields

no code implementations CVPR 2023 Yue Chen, Xingyu Chen, Xuan Wang, Qi Zhang, Yu Guo, Ying Shan, Fei Wang

Neural Radiance Fields (NeRF) have achieved photorealistic novel view synthesis; however, the requirement of accurate camera poses limits their application.

Frequency-Aware Self-Supervised Monocular Depth Estimation

1 code implementation 11 Oct 2022 Xingyu Chen, Thomas H. Li, Ruonan Zhang, Ge Li

We present two versatile methods to generally enhance self-supervised monocular depth estimation (MDE) models.

Depth Prediction, Monocular Depth Estimation

Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios

no code implementations 10 Oct 2022 Xingyu Chen, Jianru Xue, Jianwu Fang, Yuxin Pan, Nanning Zheng

In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU.

Object, object-detection

Sparse Semantic Map-Based Monocular Localization in Traffic Scenes Using Learned 2D-3D Point-Line Correspondences

no code implementations 10 Oct 2022 Xingyu Chen, Jianru Xue, Shanmin Pang

The proposed sparse semantic map-based localization approach is robust against occlusion and long-term appearance changes in the environments.

Autonomous Vehicles

ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement

1 code implementation 25 Sep 2022 Dongli Tan, Jiang-Jiang Liu, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji

In this paper, we propose an efficient structure named Efficient Correspondence Transformer (ECO-TR) that finds correspondences in a coarse-to-fine manner, which significantly improves the efficiency of functional correspondence models.

Outlier Detection

UC-OWOD: Unknown-Classified Open World Object Detection

1 code implementation 23 Jul 2022 Zhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu

In this work, we propose a novel OWOD problem called Unknown-Classified Open World Object Detection (UC-OWOD).

Object, object-detection

META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI

no code implementations 23 May 2022 Liangtai Sun, Xingyu Chen, Lu Chen, Tianle Dai, Zichen Zhu, Kai Yu

However, this API-based architecture greatly limits the information-searching capability of intelligent assistants and may even lead to task failure if TOD-specific APIs are not available or the task is too complicated to be executed by the provided APIs.

Scheduling

UV Volumes for Real-time Rendering of Editable Free-view Human Performance

1 code implementation CVPR 2023 Yue Chen, Xuan Wang, Xingyu Chen, Qi Zhang, Xiaoyu Li, Yu Guo, Jue Wang, Fei Wang

Neural volume rendering enables photo-realistic renderings of a human performer in free-view, a critical task in immersive VR/AR applications.

AutoTS: Automatic Time Series Forecasting Model Design Based on Two-Stage Pruning

no code implementations 26 Mar 2022 Chunnan Wang, Xingyu Chen, Chengyue Wu, Hongzhi Wang

We allow the effective combination of design experience from different sources, so as to create an effective search space containing a variety of TSF models to support different TSF tasks.

Neural Architecture Search, Time Series

MobRecon: Mobile-Friendly Hand Mesh Reconstruction from Monocular Image

1 code implementation CVPR 2022 Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo

In this work, we propose a framework for single-view hand mesh reconstruction, which can simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal coherence.

3D Hand Pose Estimation, Position

Hallucinated Neural Radiance Fields in the Wild

no code implementations CVPR 2022 Xingyu Chen, Qi Zhang, Xiaoyu Li, Yue Chen, Ying Feng, Xuan Wang, Jue Wang

This paper studies the problem of hallucinated NeRF: i.e., recovering a realistic NeRF at a different time of day from a group of tourism images.

Hallucination, Novel View Synthesis

Greedy-based Value Representation for Efficient Coordination in Multi-agent Reinforcement Learning

no code implementations 29 Sep 2021 Lipeng Wan, Zeyang Liu, Xingyu Chen, Han Wang, Xuguang Lan

Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning (MARL) methods with linear or monotonic value decomposition cannot ensure the optimal consistency (i.e., the correspondence between the individual greedy actions and the maximal true Q value), leading to instability and poor coordination.

Multi-agent Reinforcement Learning, reinforcement-learning

HybrUR: A Hybrid Physical-Neural Solution for Unsupervised Underwater Image Restoration

no code implementations 6 Jul 2021 Shuaizheng Yan, Xingyu Chen, Zhengxing Wu, Min Tan, Junzhi Yu

Experimental results show that the proposed method can be used to perform high-quality restoration of unconstrained underwater images without supervision.

Underwater Image Restoration

Adaptive Feature Alignment for Adversarial Training

no code implementations 31 May 2021 Tao Wang, Ruixin Zhang, Xingyu Chen, Kai Zhao, Xiaolin Huang, Yuge Huang, Shaoxin Li, Jilin Li, Feiyue Huang

Based on this observation, we propose the adaptive feature alignment (AFA) to generate features of arbitrary attacking strengths.

Adversarial Defense

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

1 code implementation CVPR 2021 Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner

Tests on AFLW2000-3D and BIWI show that our method runs in real time and outperforms state-of-the-art (SotA) face pose estimators.

3D Face Alignment, Face Alignment

A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning

2 code implementations ECCV 2020 Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng

Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem into a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.

Generalized Zero-Shot Learning

Reveal of Domain Effect: How Visual Restoration Contributes to Object Detection in Aquatic Scenes

no code implementations 4 Mar 2020 Xingyu Chen, Yue Lu, Zhengxing Wu, Junzhi Yu, Li Wen

According to our analysis, five key discoveries are reported: 1) domain quality has a negligible effect on within-domain convolutional representation and detection accuracy; 2) low-quality domain leads to higher generalization ability in cross-domain detection; 3) low-quality domain can hardly be well learned in a domain-mixed learning process; 4) degrading recall efficiency, restoration cannot improve within-domain detection accuracy; 5) visual restoration is beneficial to detection in the wild by reducing the domain shift between training data and real-world scenes.

Object, object-detection

Rethinking Temporal Object Detection from Robotic Perspectives

no code implementations 22 Dec 2019 Xingyu Chen, Zhengxing Wu, Junzhi Yu, Li Wen

From a robotic perspective, the importance of recall continuity and localization stability is equal to that of accuracy, but the AP is insufficient to reflect detectors' performance across time.

Multi-Object Tracking, Object

Proportionally Fair Clustering

no code implementations 9 May 2019 Xingyu Chen, Brandon Fain, Liang Lyu, Kamesh Munagala

We extend the fair machine learning literature by considering the problem of proportional centroid clustering in a metric context.

Clustering, Fairness

Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos

1 code implementation 23 Jul 2018 Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, Li Wen

As for temporal detection in videos, temporal refinement networks (TRNet) and temporal dual refinement networks (TDRNet) are developed by propagating the refinement information across time.

Object, object-detection

Temporally Identity-Aware SSD with Attentional LSTM

1 code implementation 1 Mar 2018 Xingyu Chen, Junzhi Yu, Zhengxing Wu

Moreover, we develop a creative temporal analysis unit, namely, attentional ConvLSTM (AC-LSTM), in which a temporal attention mechanism is specially tailored for background suppression and scale suppression while a ConvLSTM integrates attention-aware features across time.

object-detection, Object Detection

Towards Real-Time Advancement of Underwater Visual Quality with GAN

1 code implementation 3 Dec 2017 Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, Xi Fang, Li Wen

More specifically, an underwater index is investigated to describe underwater properties, and a loss function based on the underwater index is designed to train the critic branch for underwater noise suppression.
