no code implementations • 3 Sep 2024 • Zhiheng Peng, Kai Zhao, Xiaoran Chen, Li Ma, Siyu Xia, Changjie Fan, Weijian Shang, Wei Jing
In this work, we also develop a progressive training strategy and integrated it with an enhanced optimization process, enabling the network to obtain initial weights using only a small skin dataset and achieve self-supervision in skeleton reconstruction.
no code implementations • 25 Jun 2024 • Kaichen Chi, Wei Jing, Junjie Li, Qiang Li, Qi Wang
To fill this gap, we propose a weakly supervised shadow removal network with a spherical feature space, dubbed S2-ShadowNet, to explore the best of both worlds for visible and infrared modalities.
no code implementations • 30 May 2024 • Dixuan Lin, Yuxiang Zhang, Mengcheng Li, Yebin Liu, Wei Jing, Qi Yan, Qianying Wang, Hongwen Zhang
The results on in-the-wild videos and real-world scenarios demonstrate the superior performances of our approach for interactive hand reconstruction.
1 code implementation • CVPR 2024 • Ke Guo, Zhenwei Miao, Wei Jing, Weiwei Liu, Weizi Li, Dayang Hao, Jia Pan
Due to the covariate shift issue, existing imitation learning-based simulators often fail to generate stable long-term simulations.
no code implementations • 25 Mar 2024 • Yinke Dong, Haifeng Yuan, Hongkun Liu, Wei Jing, Fangzhen Li, Hongmin Liu, Bin Fan
In this work, a progressive interaction network is proposed to enable the agent's feature to progressively focus on relevant maps, in order to better learn agents' feature representation capturing the relevant map constraints.
1 code implementation • 2 Aug 2023 • Tengju Ye, Wei Jing, Chunyong Hu, Shikun Huang, Lingping Gao, Fangzhen Li, Jingke Wang, Ke Guo, Wencong Xiao, Weibo Mao, Hang Zheng, Kun Li, Junbo Chen, Kaicheng Yu
Building a multi-modality multi-task neural network toward accurate and robust performance is a de-facto standard in perception task of autonomous driving.
no code implementations • 13 Feb 2022 • En Yen Puang, Hao Zhang, Hongyuan Zhu, Wei Jing
In this paper we present SA-CNN, a hierarchical and lightweight self-attention based encoding and decoding architecture for representation learning of point cloud data.
no code implementations • 12 Feb 2022 • Tianying Wang, En Yen Puang, Marcus Lee, Yan Wu, Wei Jing
The proposed method learns keypoints from camera images as the state representation, through a self-supervised autoencoder architecture.
no code implementations • 20 Jan 2022 • Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou
Temporal sentence grounding in videos (TSGV), \aka natural language video localization (NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that semantically corresponds to a language query from an untrimmed video.
no code implementations • 8 Nov 2021 • Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou
In this paper, we propose two debiasing strategies, data debiasing and model debiasing, to "force" a TSGV model to capture cross-modal interactions.
2 code implementations • NeurIPS 2021 • Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Wei Jing, Cheston Tan, Bryan Kian Hsiang Low
The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories.
1 code implementation • 22 Sep 2021 • Yunkai Wang, Dongkun Zhang, Yuxiang Cui, Zexi Chen, Wei Jing, Junbo Chen, Rong Xiong, Yue Wang
In this paper, we propose a domain generalization method for vision-based driving trajectory generation for autonomous vehicles in urban environments, which can be seen as a solution to extend the Invariant Risk Minimization (IRM) method in complex problems.
no code implementations • Findings (ACL) 2021 • Hao Zhang, Aixin Sun, Wei Jing, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh
In this work, we propose a Parallel Attention Network with Sequence matching (SeqPAN) to address the challenges in this task: multi-modal representation learning, and target moment boundary prediction.
1 code implementation • 13 May 2021 • Hao Zhang, Aixin Sun, Wei Jing, Guoshun Nan, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh
We adopt the first approach and introduce two contrastive learning objectives to refine video encoder and text encoder to learn video and text representations separately but with better alignment for VCMR.
no code implementations • 26 Feb 2021 • Hao Zhang, Aixin Sun, Wei Jing, Liangli Zhen, Joey Tianyi Zhou, Rick Siow Mong Goh
Our study suggests that the span-based QA framework is an effective strategy to solve the NLVL problem.
no code implementations • 6 Oct 2020 • Sicheng Yu, Hao Zhang, Wei Jing, Jing Jiang
In addition to the effective reduction of human efforts of our approach compared, through extensive experiments on OpenbookQA, we show that the proposed approach outperforms the models that use the same backbone and more training data; and our parameter analysis also demonstrates the interpretability of our approach.
1 code implementation • 28 Jul 2020 • En Yen Puang, Keng Peng Tee, Wei Jing
We train the deep neural network only in the simulated environment; and the trained model could be directly used for real-world visual servoing tasks.
1 code implementation • ACL 2020 • Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou
Given an untrimmed video and a text query, natural language video localization (NLVL) is to locate a matching span from the video that semantically corresponds to the query.
no code implementations • 5 Apr 2020 • Wei Jing, Feng Tian, Jizhong Zhang, Kuo-Ming Chao, Zhenxin Hong, Xu Liu
The main cause of this problem is the loss of discriminative feature due to reduced resolution.
Facial Expression Recognition
Facial Expression Recognition (FER)
+4
no code implementations • 11 Dec 2019 • Tianying Wang, Wei Qi Toh, Hao Zhang, Xiuchao Sui, Shaohua Li, Yong liu, Wei Jing
The proposed RoboCoDraw system takes a real human face image as input, converts it to a stylized avatar, then draws it with a robotic arm.
Robotics Graphics
no code implementations • 11 Dec 2019 • Tianying Wang, Hao Zhang, Wei Qi Toh, Hongyuan Zhu, Cheston Tan, Yan Wu, Yong liu, Wei Jing
The proposed method is able to efficiently generalize the previously learned task by model fusion to solve the environment adaptation problem.
no code implementations • 24 Sep 2019 • Yi Cheng, Hongyuan Zhu, Ying Sun, Cihan Acar, Wei Jing, Yan Wu, Liyuan Li, Cheston Tan, Joo-Hwee Lim
To our best knowledge, this is the first work to explore effective intra- and inter-modality fusion in 6D pose estimation.
no code implementations • 2019年8月8日 2019 • Wei Jing, Di Deng2, Zhe Xiao3, Yong Liu1, Kenji Shimada2
In this paper, we propose a novel planning method to directly sample and plan the inspection path for a camera-equipped UAV to acquire visual and geometric information of the target structures as a video stream setting in complex 3D environment.