no code implementations • IWSLT (ACL) 2022 • Qinpei Zhu, Renshou Wu, Guangfeng Liu, Xinyu Zhu, Xingyu Chen, Yang Zhou, Qingliang Miao, Rui Wang, Kai Yu
This paper describes AISP-SJTU’s submissions for the IWSLT 2022 Simultaneous Translation task.
1 code implementation • 1 Dec 2022 • Yulei Qin, Xingyu Chen, Chao Chen, Yunhang Shen, Bo Ren, Yun Gu, Jie Yang, Chunhua Shen
Most existing methods focus on learning noise-robust models from web images while neglecting the performance drop caused by the differences between web domain and real-world domain.
no code implementations • 23 Nov 2022 • Xingyu Chen, Baoyuan Wang, Heung-Yeung Shum
We present HandAvatar, a novel representation for hand animation and rendering, which can generate smoothly compositional geometry and self-occlusion-aware texture.
no code implementations • 22 Nov 2022 • Lipeng Wan, Zeyang Liu, Xingyu Chen, Xuguang Lan, Nanning Zheng
To ensure optimal consistency, the optimal node is required to be the unique STN.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
1 code implementation • 21 Nov 2022 • Yue Chen, Xingyu Chen
Place recognition is a critical and challenging task for mobile robots, aiming to retrieve an image captured at the same place as a query image from a database.
no code implementations • 21 Nov 2022 • Yue Chen, Xingyu Chen, Xuan Wang, Qi Zhang, Yu Guo, Ying Shan, Fei Wang
Neural Radiance Fields (NeRF) have achieved photorealistic novel views synthesis; however, the requirement of accurate camera poses limits its application.
1 code implementation • 11 Oct 2022 • Xingyu Chen, Thomas H. Li, Ruonan Zhang, Ge Li
We present two versatile methods to generally enhance self-supervised monocular depth estimation (MDE) models.
no code implementations • 10 Oct 2022 • Xingyu Chen, Jianru Xue, Jianwu Fang, Yuxin Pan, Nanning Zheng
In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU.
no code implementations • 10 Oct 2022 • Xingyu Chen, Jianru Xue, Shanmin Pang
The proposed sparse semantic map-based localization approach is robust against occlusion and long-term appearance changes in the environments.
1 code implementation • 2 Oct 2022 • Xingyu Chen, Ruonan Zhang, Ji Jiang, Yan Wang, Ge Li, Thomas H. Li
In this paper, we redesign the patch-based triplet loss in MDE to alleviate the ubiquitous edge-fattening issue.
Ranked #1 on
Unsupervised Monocular Depth Estimation
on Kitti Raw
no code implementations • 25 Sep 2022 • Dongli Tan, Jiang-Jiang Liu, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji
In this paper, we propose an efficient structure named Efficient Correspondence Transformer (ECO-TR) by finding correspondences in a coarse-to-fine manner, which significantly improves the efficiency of functional correspondence model.
1 code implementation • 23 Jul 2022 • Zhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu
In this work, we propose a novel OWOD problem called Unknown-Classified Open World Object Detection (UC-OWOD).
no code implementations • 23 May 2022 • Liangtai Sun, Xingyu Chen, Lu Chen, Tianle Dai, Zichen Zhu, Kai Yu
However, this API-based architecture greatly limits the information-searching capability of intelligent assistants and may even lead to task failure if TOD-specific APIs are not available or the task is too complicated to be executed by the provided APIs.
1 code implementation • NAACL 2022 • Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu
Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests.
1 code implementation • 27 Mar 2022 • Yue Chen, Xuan Wang, Xingyu Chen, Qi Zhang, Xiaoyu Li, Yu Guo, Jue Wang, Fei Wang
Neural volume rendering enables photo-realistic renderings of a human performer in free-view, a critical task in immersive VR/AR applications.
no code implementations • 26 Mar 2022 • Chunnan Wang, Xingyu Chen, Chengyue Wu, Hongzhi Wang
We allow the effective combination of design experience from different sources, so as to create an effective search space containing a variety of TSF models to support different TSF tasks.
1 code implementation • CVPR 2022 • Xingyu Chen, Yufeng Liu, Yajiao Dong, Xiong Zhang, Chongyang Ma, Yanmin Xiong, Yuan Zhang, Xiaoyan Guo
In this work, we propose a framework for single-view hand mesh reconstruction, which can simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal coherence.
no code implementations • CVPR 2022 • Xingyu Chen, Qi Zhang, Xiaoyu Li, Yue Chen, Ying Feng, Xuan Wang, Jue Wang
This paper studies the problem of hallucinated NeRF: i. e., recovering a realistic NeRF at a different time of day from a group of tourism images.
no code implementations • 29 Sep 2021 • Lipeng Wan, Zeyang Liu, Xingyu Chen, Han Wang, Xuguang Lan
Due to the representation limitation of the joint Q value function, multi-agent reinforcement learning (MARL) methods with linear or monotonic value decomposition can not ensure the optimal consistency (i. e. the correspondence between the individual greedy actions and the maximal true Q value), leading to instability and poor coordination.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
no code implementations • 29 Aug 2021 • Xun Tan, Xingyu Chen, Guowei Zhang, Jishiyu Ding, Xuguang Lan
Fusing the two kinds of data usually helps to improve the detection results.
no code implementations • 14 Jul 2021 • Jie Xu, Xingyu Chen, Xuguang Lan, Nanning Zheng
The experimental results show that our approach makes the interaction more efficient and safer.
no code implementations • 6 Jul 2021 • Shuaizheng Yan, Xingyu Chen, Zhengxing Wu, Jian Wang, Yue Lu, Min Tan, Junzhi Yu
Our experimental results show that the proposed method is able to perform high-quality restoration for unconstrained underwater images without any supervision.
no code implementations • 1 Jul 2021 • Zhiyuan Guo, Yuexin Li, Guo Chen, Xingyu Chen, Akshat Gupta
Spoken dialogue systems such as Siri and Alexa provide great convenience to people's everyday life.
no code implementations • 31 May 2021 • Tao Wang, Ruixin Zhang, Xingyu Chen, Kai Zhao, Xiaolin Huang, Yuge Huang, Shaoxin Li, Jilin Li, Feiyue Huang
Based on this observation, we propose the adaptive feature alignment (AFA) to generate features of arbitrary attacking strengths.
1 code implementation • CVPR 2021 • Fu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang
Thus, we propose a novel unsupervised FIQA method that incorporates Similarity Distribution Distance for Face Image Quality Assessment (SDD-FIQA).
1 code implementation • CVPR 2021 • Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng
In the root-relative mesh recovery task, we exploit semantic relations among joints to generate a 3D mesh from the extracted 2D cues.
1 code implementation • EMNLP 2021 • Xingyu Chen, Zihan Zhao, Lu Chen, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu
In this paper, we introduce the task of structural reading comprehension (SRC) on web.
1 code implementation • CVPR 2021 • Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner
Tests on AFLW2000-3D and BIWI show that our method runs at real-time and outperforms state of the art (SotA) face pose estimators.
Ranked #4 on
Head Pose Estimation
on AFLW2000
no code implementations • 13 Oct 2020 • Junming Ma, Chaofan Yu, Aihui Zhou, Bingzhe Wu, Xibin Wu, Xingyu Chen, Xiangqun Chen, Lei Wang, Donggang Cao
We present S3ML, a secure serving system for machine learning inference in this paper.
2 code implementations • ECCV 2020 • Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng
Using a gating mechanism that discriminates the unseen samples from the seen samples can decompose the GZSL problem to a conventional Zero-Shot Learning (ZSL) problem and a supervised classification problem.
no code implementations • 4 Mar 2020 • Xingyu Chen, Yue Lu, Zhengxing Wu, Junzhi Yu, Li Wen
According to our analysis, five key discoveries are reported: 1) Domain quality has an ignorable effect on within-domain convolutional representation and detection accuracy; 2) low-quality domain leads to higher generalization ability in cross-domain detection; 3) low-quality domain can hardly be well learned in a domain-mixed learning process; 4) degrading recall efficiency, restoration cannot improve within-domain detection accuracy; 5) visual restoration is beneficial to detection in the wild by reducing the domain shift between training data and real-world scenes.
no code implementations • 22 Dec 2019 • Xingyu Chen, Zhengxing Wu, Junzhi Yu, Li Wen
From a robotic perspective, the importance of recall continuity and localization stability is equal to that of accuracy, but the AP is insufficient to reflect detectors' performance across time.
no code implementations • 9 May 2019 • Xingyu Chen, Brandon Fain, Liang Lyu, Kamesh Munagala
We extend the fair machine learning literature by considering the problem of proportional centroid clustering in a metric context.
1 code implementation • 23 Jul 2018 • Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, Li Wen
As for temporal detection in videos, temporal refinement networks (TRNet) and temporal dual refinement networks (TDRNet) are developed by propagating the refinement information across time.
1 code implementation • 1 Mar 2018 • Xingyu Chen, Junzhi Yu, Zhengxing Wu
Moreover, we develop a creative temporal analysis unit, namely, attentional ConvLSTM (AC-LSTM), in which a temporal attention mechanism is specially tailored for background suppression and scale suppression while a ConvLSTM integrates attention-aware features across time.
1 code implementation • 3 Dec 2017 • Xingyu Chen, Junzhi Yu, Shihan Kong, Zhengxing Wu, Xi Fang, Li Wen
More specifically, an underwater index is investigated to describe underwater properties, and a loss function based on the underwater index is designed to train the critic branch for underwater noise suppression.