no code implementations • IWSLT (ACL) 2022 • Qinpei Zhu, Renshou Wu, Guangfeng Liu, Xinyu Zhu, Xingyu Chen, Yang Zhou, Qingliang Miao, Rui Wang, Kai Yu
This paper describes AISP-SJTU’s submissions for the IWSLT 2022 Simultaneous Translation task.
no code implementations • 28 Feb 2024 • Zeyang Liu, Lipeng Wan, Xinrui Yang, Zhuoran Chen, Xingyu Chen, Xuguang Lan
To address this limitation, we propose Imagine, Initialize, and Explore (IIE), a novel method that offers a promising solution for efficient multi-agent exploration in complex scenarios.
no code implementations • 17 Jan 2024 • Weiyao Wang, Pierre Gleize, Hao Tang, Xingyu Chen, Kevin J Liang, Matt Feiszli
Neural Radiance Fields (NeRF) exhibit remarkable performance for Novel View Synthesis (NVS) given a set of 2D images.
no code implementations • 30 Nov 2023 • Yu Deng, Duomin Wang, Xiaohang Ren, Xingyu Chen, Baoyuan Wang
The key is to first learn a part-wise 4D generative model from monocular images via adversarial learning, to synthesize multi-view images of diverse identities and full motions as training data; then leverage a transformer-based animatable triplane reconstructor to learn 4D head reconstruction using the synthetic data.
no code implementations • 27 Nov 2023 • Xihe Yang, Xingyu Chen, Shaohui Wang, Daiheng Gao, Xiaoguang Han, Baoyuan Wang
As for human avatar reconstruction, contemporary techniques commonly necessitate the acquisition of costly data and struggle to achieve satisfactory results from a small number of casual images.
no code implementations • 22 Nov 2023 • Xingyu Chen, Xinyu Zhang, Qiyue Xia, Xinmin Fang, Chris Xiaoxuan Lu, Zhengxiong Li
We propose DiffSBR, a differentiable framework for mmWave-based 3D reconstruction.
no code implementations • 13 Nov 2023 • Xingyu Chen, Xiaochen Zheng, Amina Mollaysa, Manuel Schürch, Ahmed Allam, Michael Krauthammer
Irregular multivariate time series data is prevalent in the clinical and healthcare domains.
1 code implementation • 23 Oct 2023 • Xingyu Chen, Lemao Liu, Guoping Huang, Zhirui Zhang, Mingming Yang, Shuming Shi, Rui Wang
Word-Level Auto-Completion (WLAC) plays a crucial role in Computer-Assisted Translation.
1 code implementation • 15 Sep 2023 • Xingyu Chen, Fei Ma, Yile Zhang, Amy Bastine, Prasanga N. Samarasinghe
The proposed method realizes the convolution process by decomposing and reconstructing HRTF through the Spherical Harmonics (SHs).
no code implementations • 3 Sep 2023 • Xingyu Chen, Haijian Bai
This paper proposes an improved Intelligent driving model (Sigmoid-IDM) to address the problems of excessive acceleration in traffic oscillation and following failure in free flow.
1 code implementation • 27 Jul 2023 • Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Xingyu Chen
Head-related transfer function (HRTF) capture the information that a person uses to localize sound sources in space, and thus is crucial for creating personalized virtual acoustic experiences.
1 code implementation • 26 Jul 2023 • Xingyu Chen, Fei Ma, Amy Bastine, Prasanga Samarasinghe, Huiyuan Sun
To overcome this challenge, this paper proposes a method for sound field estimation based on a physics-informed neural network.
no code implementations • 18 Jul 2023 • Zhenhao Jiang, Biao Zeng, Hao Feng, Jin Liu, Jicong Fan, Jie Zhang, Jia Jia, Ning Hu, Xingyu Chen, Xuguang Lan
We propose a novel Entire Space Multi-Task Model for Post-Click Conversion Rate via Parameter Constraint (ESMC) and two alternatives: Entire Space Multi-Task Model with Siamese Network (ESMS) and Entire Space Multi-Task Model in Global Domain (ESMG) to address the PSC issue.
no code implementations • ICCV 2023 • Xiaohang Ren, Xingyu Chen, Pengfei Yao, Heung-Yeung Shum, Baoyuan Wang
The SOTA face swap models still suffer the problem of either target identity (i. e., shape) being leaked or the target non-identity attributes (i. e., background, hair) failing to be fully preserved in the final results.
no code implementations • 25 Apr 2023 • Han Wang, Jiayuan Zhang, Lipeng Wan, Xingyu Chen, Xuguang Lan, Nanning Zheng
Manipulation relationship detection (MRD) aims to guide the robot to grasp objects in the right order, which is important to ensure the safety and reliability of grasping in object stacked scenes.
1 code implementation • 31 Mar 2023 • Xiaochen Zheng, Xingyu Chen, Manuel Schürch, Amina Mollaysa, Ahmed Allam, Michael Krauthammer
Contrastive learning methods have shown an impressive ability to learn meaningful representations for image or time series classification.
no code implementations • ICCV 2023 • Xingyu Chen, Yu Deng, Baoyuan Wang
Improving the photorealism via CNN-based 2D super-resolution can break the strict 3D consistency, while keeping the 3D consistency by learning high-resolution 3D representations for direct rendering often compromises image quality.
no code implementations • ICCV 2023 • Peri Akiva, Jing Huang, Kevin J Liang, Rama Kovvuri, Xingyu Chen, Matt Feiszli, Kristin Dana, Tal Hassner
Understanding the visual world from the perspective of humans (egocentric) has been a long-standing challenge in computer vision.
1 code implementation • 1 Dec 2022 • Yulei Qin, Xingyu Chen, Chao Chen, Yunhang Shen, Bo Ren, Yun Gu, Jie Yang, Chunhua Shen
Most existing methods focus on learning noise-robust models from web images while neglecting the performance drop caused by the differences between web domain and real-world domain.
no code implementations • CVPR 2023 • Xingyu Chen, Baoyuan Wang, Heung-Yeung Shum
We present HandAvatar, a novel representation for hand animation and rendering, which can generate smoothly compositional geometry and self-occlusion-aware texture.
no code implementations • 22 Nov 2022 • Lipeng Wan, Zeyang Liu, Xingyu Chen, Xuguang Lan, Nanning Zheng
To ensure optimal consistency, the optimal node is required to be the unique STN.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 21 Nov 2022 • Yue Chen, Xingyu Chen, Yicen Li
Place recognition is a critical and challenging task for mobile robots, aiming to retrieve an image captured at the same place as a query image from a database.
no code implementations • CVPR 2023 • Yue Chen, Xingyu Chen, Xuan Wang, Qi Zhang, Yu Guo, Ying Shan, Fei Wang
Neural Radiance Fields (NeRF) have achieved photorealistic novel views synthesis; however, the requirement of accurate camera poses limits its application.
1 code implementation • 11 Oct 2022 • Xingyu Chen, Thomas H. Li, Ruonan Zhang, Ge Li
We present two versatile methods to generally enhance self-supervised monocular depth estimation (MDE) models.
no code implementations • 10 Oct 2022 • Xingyu Chen, Jianru Xue, Jianwu Fang, Yuxin Pan, Nanning Zheng
In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU.
no code implementations • 10 Oct 2022 • Xingyu Chen, Jianru Xue, Shanmin Pang
The proposed sparse semantic map-based localization approach is robust against occlusion and long-term appearance changes in the environments.
1 code implementation • 2 Oct 2022 • Xingyu Chen, Ruonan Zhang, Ji Jiang, Yan Wang, Ge Li, Thomas H. Li
In this paper, we redesign the patch-based triplet loss in MDE to alleviate the ubiquitous edge-fattening issue.
Ranked #1 on Unsupervised Monocular Depth Estimation on Kitti Raw
1 code implementation • 25 Sep 2022 • Dongli Tan, Jiang-Jiang Liu, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji
In this paper, we propose an efficient structure named Efficient Correspondence Transformer (ECO-TR) by finding correspondences in a coarse-to-fine manner, which significantly improves the efficiency of functional correspondence model.
1 code implementation • 23 Jul 2022 • Zhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu
In this work, we propose a novel OWOD problem called Unknown-Classified Open World Object Detection (UC-OWOD).
no code implementations • 23 May 2022 • Liangtai Sun, Xingyu Chen, Lu Chen, Tianle Dai, Zichen Zhu, Kai Yu
However, this API-based architecture greatly limits the information-searching capability of intelligent assistants and may even lead to task failure if TOD-specific APIs are not available or the task is too complicated to be executed by the provided APIs.
1 code implementation • NAACL 2022 • Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu
Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests.
1 code implementation • CVPR 2023 • Yue Chen, Xuan Wang, Xingyu Chen, Qi Zhang, Xiaoyu Li, Yu Guo, Jue Wang, Fei Wang
Neural volume rendering enables photo-realistic renderings of a human performer in free-view, a critical task in immersive VR/AR applications.
no code implementations • 26 Mar 2022 • Chunnan Wang, Xingyu Chen, Chengyue Wu, Hongzhi Wang
We allow the effective combination of design experience from different sources, so as to create an effective search space containing a variety of TSF models to support different TSF tasks.
1 code implementation • CVPR 2022 • Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo
In this work, we propose a framework for single-view hand mesh reconstruction, which can simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal coherence.
Ranked #7 on 3D Hand Pose Estimation on DexYCB
no code implementations • CVPR 2022 • Xingyu Chen, Qi Zhang, Xiaoyu Li, Yue Chen, Ying Feng, Xuan Wang, Jue Wang
This paper studies the problem of hallucinated NeRF: i. e., recovering a realistic NeRF at a different time of day from a group of tourism images.
no code implementations • 29 Sep 2021 • Lipeng Wan, Zeyang Liu, Xingyu Chen, Han Wang, Xuguang Lan
Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning (MARL) methods with linear or monotonic value decomposition can not ensure the optimal consistency (i. e. the correspondence between the individual greedy actions and the maximal true Q value), leading to instability and poor coordination.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 29 Aug 2021 • Xun Tan, Xingyu Chen, Guowei Zhang, Jishiyu Ding, Xuguang Lan
Fusing the two kinds of data usually helps to improve the detection results.
no code implementations • 14 Jul 2021 • Jie Xu, Xingyu Chen, Xuguang Lan, Nanning Zheng
The experimental results show that our approach makes the interaction more efficient and safer.
no code implementations • 6 Jul 2021 • Shuaizheng Yan, Xingyu Chen, Zhengxing Wu, Min Tan, Junzhi Yu
Experimental results show that the proposed method can be used to perform high-quality restoration of unconstrained underwater images without supervision.
no code implementations • 1 Jul 2021 • Zhiyuan Guo, Yuexin Li, Guo Chen, Xingyu Chen, Akshat Gupta
Spoken dialogue systems such as Siri and Alexa provide great convenience to people's everyday life.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +6
no code implementations • 31 May 2021 • Tao Wang, Ruixin Zhang, Xingyu Chen, Kai Zhao, Xiaolin Huang, Yuge Huang, Shaoxin Li, Jilin Li, Feiyue Huang
Based on this observation, we propose the adaptive feature alignment (AFA) to generate features of arbitrary attacking strengths.
1 code implementation • CVPR 2021 • Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang
Thus, we propose a novel unsupervised FIQA method that incorporates Similarity Distribution Distance for Face Image Quality Assessment (SDD-FIQA).
1 code implementation • CVPR 2021 • Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng
In the root-relative mesh recovery task, we exploit semantic relations among joints to generate a 3D mesh from the extracted 2D cues.
1 code implementation • EMNLP 2021 • Xingyu Chen, Zihan Zhao, Lu Chen, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu
In this paper, we introduce the task of structural reading comprehension (SRC) on web.
1 code implementation • CVPR 2021 • Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner
Tests on AFLW2000-3D and BIWI show that our method runs at real-time and outperforms state of the art (SotA) face pose estimators.
Ranked #5 on Head Pose Estimation on BIWI
no code implementations • 13 Oct 2020 • Junming Ma, Chaofan Yu, Aihui Zhou, Bingzhe Wu, Xibin Wu, Xingyu Chen, Xiangqun Chen, Lei Wang, Donggang Cao
We present S3ML, a secure serving system for machine learning inference in this paper.
2 code implementations • ECCV 2020 • Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng
Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.
no code implementations • 4 Mar 2020 • Xingyu Chen, Yue Lu, Zhengxing Wu, Junzhi Yu, Li Wen
According to our analysis, five key discoveries are reported: 1) Domain quality has an ignorable effect on within-domain convolutional representation and detection accuracy; 2) low-quality domain leads to higher generalization ability in cross-domain detection; 3) low-quality domain can hardly be well learned in a domain-mixed learning process; 4) degrading recall efficiency, restoration cannot improve within-domain detection accuracy; 5) visual restoration is beneficial to detection in the wild by reducing the domain shift between training data and real-world scenes.
no code implementations • 22 Dec 2019 • Xingyu Chen, Zhengxing Wu, Junzhi Yu, Li Wen
From a robotic perspective, the importance of recall continuity and localization stability is equal to that of accuracy, but the AP is insufficient to reflect detectors' performance across time.
no code implementations • 9 May 2019 • Xingyu Chen, Brandon Fain, Liang Lyu, Kamesh Munagala
We extend the fair machine learning literature by considering the problem of proportional centroid clustering in a metric context.
1 code implementation • 23 Jul 2018 • Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, Li Wen
As for temporal detection in videos, temporal refinement networks (TRNet) and temporal dual refinement networks (TDRNet) are developed by propagating the refinement information across time.
1 code implementation • 1 Mar 2018 • Xingyu Chen, Junzhi Yu, Zhengxing Wu
Moreover, we develop a creative temporal analysis unit, namely, attentional ConvLSTM (AC-LSTM), in which a temporal attention mechanism is specially tailored for background suppression and scale suppression while a ConvLSTM integrates attention-aware features across time.
1 code implementation • 3 Dec 2017 • Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, Xi Fang, Li Wen
More specifically, an underwater index is investigated to describe underwater properties, and a loss function based on the underwater index is designed to train the critic branch for underwater noise suppression.