Search Results for author: Xi Zhou

Found 26 papers, 9 papers with code

You Need to Read Again: Multi-granularity Perception Network for Moment Retrieval in Videos

no code implementations 25 May 2022 Xin Sun, Xuan Wang, Jialin Gao, Qiong Liu, Xi Zhou

Moment retrieval in videos is a challenging task that aims to retrieve the most relevant video moment in an untrimmed video given a sentence description.

Moment Retrieval Reading Comprehension

Relation-aware Video Reading Comprehension for Temporal Language Grounding

1 code implementation EMNLP 2021 Jialin Gao, Xin Sun, Mengmeng Xu, Xi Zhou, Bernard Ghanem

Temporal language grounding in videos aims to localize the temporal span relevant to the given query sentence.

Reading Comprehension

Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue

1 code implementation 14 Sep 2020 Longxiang Liu, Zhuosheng Zhang, Hai Zhao, Xi Zhou, Xiang Zhou

A multi-turn dialogue is composed of multiple utterances from two or more different speaker roles.

Composing Answer from Multi-spans for Reading Comprehension

no code implementations 14 Sep 2020 Zhuosheng Zhang, Yiqing Zhang, Hai Zhao, Xi Zhou, Xiang Zhou

This paper presents a novel method to generate answers for non-extraction machine reading comprehension (MRC) tasks whose answers cannot be simply extracted as one span from the given passages.

Machine Reading Comprehension

Task-specific Objectives of Pre-trained Language Models for Dialogue Adaptation

1 code implementation 10 Sep 2020 Junlong Li, Zhuosheng Zhang, Hai Zhao, Xi Zhou, Xiang Zhou

In this work, we focus on Dialogue-related Natural Language Processing (DrNLP) tasks and design a Dialogue-Adaptive Pre-training Objective (DAPO) based on some important qualities for assessing dialogues which are usually ignored by general LM pre-training objectives.

Natural Language Processing

Receptive Multi-granularity Representation for Person Re-Identification

no code implementations 31 Aug 2020 Guanshuo Wang, Yufeng Yuan, Jiwei Li, Shiming Ge, Xi Zhou

Current stripe-based feature learning approaches have delivered impressive accuracy, but do not make a proper trade-off between diversity, locality, and robustness, which easily suffers from part semantic inconsistency for the conflict between rigid partition and misalignment.

Person Re-Identification

Focusing and Diffusion: Bidirectional Attentive Graph Convolutional Networks for Skeleton-based Action Recognition

no code implementations 24 Dec 2019 Jialin Gao, Tong He, Xi Zhou, Shiming Ge

A collection of approaches based on graph convolutional networks have proven successful in skeleton-based action recognition by exploring neighborhood information and dense dependencies between intra-frame joints.

Action Recognition Skeleton Based Action Recognition

Semantics-aware BERT for Language Understanding

1 code implementation 5 Sep 2019 Zhuosheng Zhang, Yuwei Wu, Hai Zhao, Zuchao Li, Shuailiang Zhang, Xi Zhou, Xiang Zhou

The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference tasks.

Language Modelling Machine Reading Comprehension +5

DCMN+: Dual Co-Matching Network for Multi-choice Reading Comprehension

2 code implementations 30 Aug 2019 Shuailiang Zhang, Hai Zhao, Yuwei Wu, Zhuosheng Zhang, Xi Zhou, Xiang Zhou

Multi-choice reading comprehension is a challenging task that requires selecting an answer from a set of candidate options given a passage and a question.

Reading Comprehension

Relation-Aware Pyramid Network (RapNet) for temporal action proposal

no code implementations 9 Aug 2019 Jialin Gao, Zhixiang Shi, Jiani Li, Yufeng Yuan, Jiwei Li, Xi Zhou

In this technical report, we describe our solution to temporal action proposal (task 1) in ActivityNet Challenge 2019.

Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks

no code implementations 19 Nov 2018 Yuan Li, Yuanjie Yu, Zefeng Li, Yangkun Lin, Meifang Xu, Jiwei Li, Xi Zhou

Recently, semantic segmentation and general object detection frameworks have been widely adopted for scene text detection tasks.

Object Detection +1

Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition

no code implementations 29 Oct 2018 Xinpei Zhou, Jiwei Li, Xi Zhou

Automatic speech recognition (ASR) tasks are resolved by end-to-end deep learning models, which benefits us by less preparation of raw data, and easier transformation between languages.

Automatic Speech Recognition

A novel pyramidal-FSMN architecture with lattice-free MMI for speech recognition

no code implementations 26 Oct 2018 Xuerui Yang, Jiwei Li, Xi Zhou

Deep Feedforward Sequential Memory Network (DFSMN) has shown superior performance on speech recognition tasks.

Sound Audio and Speech Processing

Toward Better Loanword Identification in Uyghur Using Cross-lingual Word Embeddings

no code implementations COLING 2018 Chenggang Mi, Yating Yang, Lei Wang, Xi Zhou, Tonghai Jiang

Neural machine translation models integrating results of loanword identification experiments achieve the best results on OOV translation (with 0.5-0.9 BLEU improvements).

Cross-Lingual Word Embeddings Language Modelling +3

Learning Discriminative Features with Multiple Granularities for Person Re-Identification

14 code implementations 4 Apr 2018 Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou

Instead of learning on semantic regions, we uniformly partition the images into several stripes, and vary the number of parts in different local branches to obtain local feature representations with multiple granularities.

Ranked #3 on Person Re-Identification on SYSU-30k (using extra training data)

Person Re-Identification Re-Ranking

Log-linear Models for Uyghur Segmentation in Spoken Language Translation

no code implementations RANLP 2017 Chenggang Mi, Yating Yang, Rui Dong, Xi Zhou, Lei Wang, Xiao Li, Tonghai Jiang

To alleviate data sparsity in spoken Uyghur machine translation, we propose a log-linear based morphological segmentation approach.

Machine Translation Translation +1

A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection

1 code implementation CVPR 2017 Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou

At the global stage, given an image with a rough face detection result, the full face region is firstly re-initialized by a supervised spatial transformer network to a canonical shape state and then trained to regress a coarse landmark estimation.

Face Detection Facial Landmark Detection

A Bilingual Discourse Corpus and Its Applications

no code implementations LREC 2016 Yang Liu, Jiajun Zhang, Cheng-qing Zong, Yating Yang, Xi Zhou

Existing discourse research focuses only on monolingual settings, and the inconsistency between languages limits the power of discourse theory in multilingual applications such as machine translation.

Machine Translation Translation
