1 code implementation • 30 May 2024 • Chunhui Zhang, Li Liu, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang
Most existing trackers are tailored for open-air environments, leading to performance degradation when applied to UOT due to domain gaps.
4 code implementations • 23 May 2024 • Chunhui Zhang, Li Liu, Hao Wen, Xi Zhou, Yanfeng Wang
To leverage more modalities, some recent efforts have been made to learn a unified visual object tracking model for any modality.
no code implementations • 26 Apr 2024 • Zhenrong Zhang, Jianan Liu, Xi Zhou, Tao Huang, Qing-Long Han, Jingxin Liu, Hongbin Liu
Cooperative perception is essential to enhance the efficiency and safety of future transportation systems, requiring extensive data sharing among vehicles on the road, which raises significant privacy concerns.
no code implementations • 5 Oct 2023 • Tao Huang, Jianan Liu, Xi Zhou, Dinh C. Nguyen, Mostafa Rahimi Azghadi, Yuxuan Xia, Qing-Long Han, Sumei Sun
To address this gap, this paper provides a comprehensive overview of the evolution of CP technologies, spanning from early explorations to recent developments, including advancements in V2X communication technologies.
1 code implementation • 31 Aug 2023 • Xiao Shen, Shirui Pan, Kup-Sze Choi, Xi Zhou
Cross-network node classification (CNNC), which aims to classify nodes in a label-deficient target network by transferring knowledge from a source network with abundant labels, has drawn increasing attention recently.
1 code implementation • ICCV 2023 • Zhiqiang Shen, Xiaoxiao Sheng, Hehe Fan, Longguang Wang, Yulan Guo, Qiong Liu, Hao Wen, Xi Zhou
In this paper, we propose a Masked Spatio-Temporal Structure Prediction (MaST-Pre) method to capture the structure of point cloud videos without human annotations.
no code implementations • 7 Jul 2023 • Chunhui Zhang, Xin Sun, Li Liu, Yiqian Yang, Qiong Liu, Xi Zhou, Yanfeng Wang
This approach achieves feature integration in a unified backbone, removing the need for carefully-designed fusion modules and resulting in a more effective and efficient VL tracking framework.
1 code implementation • CVPR 2023 • Zhiqiang Shen, Xiaoxiao Sheng, Longguang Wang, Yulan Guo, Qiong Liu, Xi Zhou
Self-supervised learning can extract representations of good quality from solely unlabeled data, which is appealing for point cloud videos due to their high labelling cost.
1 code implementation • 10 Oct 2022 • Chunhui Zhang, Yixiong Chen, Li Liu, Qiong Liu, Xi Zhou
This work proposes a hierarchical contrastive learning (HiCo) method to improve the transferability for the US video model pretraining.
no code implementations • 26 Jul 2022 • Dan Zhang, Xi Zhou, Zi-Hao Wang, Yan Peng, Shao-Rong Xie
This paper presents a novel data-driven methodology to provide a multi-step prediction of ship roll motions in high sea states.
1 code implementation • 25 May 2022 • Xin Sun, Xuan Wang, Jialin Gao, Qiong Liu, Xi Zhou
Moment retrieval in videos is a challenging task that aims to retrieve the most relevant video moment in an untrimmed video given a sentence description.
no code implementations • 31 Jan 2022 • Xi Zhou, Qinghao Ye, Xiaolin Yang, Jiakuan Chen, Haiqin Ma, Jun Xia, Javier Del Ser, Guang Yang
Finally, we verify the reliability of the model and achieve automatic measurement of VV and ICV.
1 code implementation • EMNLP 2021 • Jialin Gao, Xin Sun, Mengmeng Xu, Xi Zhou, Bernard Ghanem
Temporal language grounding in videos aims to localize the temporal span relevant to the given query sentence.
no code implementations • 14 Sep 2020 • Zhuosheng Zhang, Yiqing Zhang, Hai Zhao, Xi Zhou, Xiang Zhou
This paper presents a novel method to generate answers for non-extraction machine reading comprehension (MRC) tasks whose answers cannot be simply extracted as one span from the given passages.
1 code implementation • 14 Sep 2020 • Longxiang Liu, Zhuosheng Zhang, Hai Zhao, Xi Zhou, Xiang Zhou
A multi-turn dialogue is composed of multiple utterances from two or more different speaker roles.
no code implementations • 31 Aug 2020 • Guanshuo Wang, Yufeng Yuan, Jiwei Li, Shiming Ge, Xi Zhou
Current stripe-based feature learning approaches have delivered impressive accuracy, but they do not make a proper trade-off between diversity, locality, and robustness, and so easily suffer from part semantic inconsistency caused by the conflict between rigid partitioning and misalignment.
no code implementations • 9 Mar 2020 • Jialin Gao, Zhixiang Shi, Jiani Li, Guanshuo Wang, Yufeng Yuan, Shiming Ge, Xi Zhou
Accurate temporal action proposals play an important role in detecting actions from untrimmed videos.
no code implementations • 24 Dec 2019 • Jialin Gao, Tong He, Xi Zhou, Shiming Ge
A collection of approaches based on graph convolutional networks have proven successful in skeleton-based action recognition by exploring neighborhood information and dense dependencies between intra-frame joints.
Ranked #38 on Skeleton Based Action Recognition on NTU RGB+D
no code implementations • 28 Oct 2019 • Weiwei Zhang, Changsheng Chen, Xuechun Wu, Jialin Gao, Di Bao, Jiwei Li, Xi Zhou
In this paper, we propose an adaptive pruning method.
1 code implementation • 5 Sep 2019 • Zhuosheng Zhang, Yuwei Wu, Hai Zhao, Zuchao Li, Shuailiang Zhang, Xi Zhou, Xiang Zhou
The latest work on language representations carefully integrates contextualized features into language model training, which has enabled a series of successes, especially in various machine reading comprehension and natural language inference tasks.
Ranked #6 on Natural Language Inference on SNLI
2 code implementations • 30 Aug 2019 • Shuailiang Zhang, Hai Zhao, Yuwei Wu, Zhuosheng Zhang, Xi Zhou, Xiang Zhou
Multi-choice reading comprehension is a challenging task of selecting an answer from a set of candidate options given a passage and a question.
no code implementations • 9 Aug 2019 • Jialin Gao, Zhixiang Shi, Jiani Li, Yufeng Yuan, Jiwei Li, Xi Zhou
In this technical report, we describe our solution to temporal action proposal (task 1) in ActivityNet Challenge 2019.
no code implementations • 27 Jan 2019 • Shuailiang Zhang, Hai Zhao, Yuwei Wu, Zhuosheng Zhang, Xi Zhou, Xiang Zhou
Multi-choice reading comprehension is a challenging task that requires a complex reasoning procedure.
Ranked #3 on Question Answering on RACE
1 code implementation • 16 Jan 2019 • Zuchao Li, Shexia He, Hai Zhao, Yiqing Zhang, Zhuosheng Zhang, Xi Zhou, Xiang Zhou
Semantic role labeling (SRL) aims to discover the predicate-argument structure of a sentence.
Ranked #9 on Semantic Role Labeling on CoNLL 2005
no code implementations • 19 Nov 2018 • Yuan Li, Yuanjie Yu, Zefeng Li, Yangkun Lin, Meifang Xu, Jiwei Li, Xi Zhou
Recently, semantic segmentation and general object detection frameworks have been widely adopted by scene text detecting tasks.
no code implementations • 29 Oct 2018 • Xinpei Zhou, Jiwei Li, Xi Zhou
Automatic speech recognition (ASR) tasks are resolved by end-to-end deep learning models, which require less preparation of raw data and make transfer between languages easier.
no code implementations • 26 Oct 2018 • Xuerui Yang, Jiwei Li, Xi Zhou
Deep Feedforward Sequential Memory Network (DFSMN) has shown superior performance on speech recognition tasks.
no code implementations • COLING 2018 • Chenggang Mi, Yating Yang, Lei Wang, Xi Zhou, Tonghai Jiang
Neural machine translation models integrating results of loanword identification experiments achieve the best results on OOV translation (with 0.5-0.9 BLEU improvements).
16 code implementations • 4 Apr 2018 • Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou
Instead of learning on semantic regions, we uniformly partition the images into several stripes, and vary the number of parts in different local branches to obtain local feature representations with multiple granularities.
Ranked #3 on Person Re-Identification on SYSU-30k (using extra training data)
4 code implementations • ECCV 2018 • Yao Feng, Fan Wu, Xiaohu Shao, Yan-Feng Wang, Xi Zhou
We propose a straightforward method that simultaneously reconstructs the 3D facial structure and provides dense alignment.
Ranked #1 on 3D Face Reconstruction on Florence
no code implementations • RANLP 2017 • Chenggang Mi, Yating Yang, Rui Dong, Xi Zhou, Lei Wang, Xiao Li, Tonghai Jiang
To alleviate data sparsity in spoken Uyghur machine translation, we proposed a log-linear based morphological segmentation approach.
1 code implementation • CVPR 2017 • Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou
At the global stage, given an image with a rough face detection result, the full face region is firstly re-initialized by a supervised spatial transformer network to a canonical shape state and then trained to regress a coarse landmark estimation.
no code implementations • LREC 2016 • Yang Liu, Jiajun Zhang, Cheng-qing Zong, Yating Yang, Xi Zhou
Existing discourse research only focuses on the monolingual languages and the inconsistency between languages limits the power of the discourse theory in multilingual applications such as machine translation.