Search Results for author: Yuxiang Zhang

Found 37 papers, 9 papers with code

The Impact of Silence on Speech Anti-Spoofing

no code implementations21 Sep 2023 Yuxiang Zhang, Zhuo Li, Jingze Lu, Hua Hua, Wenchao Wang, Pengyuan Zhang

First, the reasons for the impact are explored, including the proportion of silence duration and the content of silence.

Action Detection Activity Detection +1

Improving Short Utterance Anti-Spoofing with AASIST2

no code implementations15 Sep 2023 Yuxiang Zhang, Jingze Lu, Zengqiang Shang, Wenchao Wang, Pengyuan Zhang

The modified Res2Net blocks can extract multi-scale features and improve the detection performance for speech of different durations, thus improving the short utterance evaluation performance.

Graph Attention Speaker Verification

You talk what you read: Understanding News Comment Behavior by Dispositional and Situational Attribution

no code implementations4 Aug 2023 Yuhang Wang, Yuxiang Zhang, Dongyuan Lu, Jitao Sang

Many news comment mining studies are based on the assumption that comment is explicitly linked to the corresponding news.

News Summarization

Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning

no code implementations3 Jul 2023 Yuxiang Zhang, Hongwen Zhang, Liangxiao Hu, Hongwei Yi, Shengping Zhang, Yebin Liu

For more accurate and physically plausible predictions, a contact-aware neural motion descent module is proposed in our network so that it can be aware of foot-ground contact and motion misalignment with the proxy observations.

3D Human Pose Estimation

UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective

no code implementations17 May 2023 Ping Yang, Junyu Lu, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Jiaxing Zhang, Pingjian Zhang

We propose a new paradigm for universal information extraction (IE) that is compatible with any schema format and applicable to a list of IE tasks, such as named entity recognition, relation extraction, event extraction and sentiment analysis.

Event Extraction named-entity-recognition +3

NER-to-MRC: Named-Entity Recognition Completely Solving as Machine Reading Comprehension

no code implementations6 May 2023 Yuxiang Zhang, Junjie Wang, Xinyu Zhu, Tetsuya Sakai, Hayato Yamana

Named-entity recognition (NER) detects texts with predefined semantic labels and is an essential building block for natural language processing (NLP).

Machine Reading Comprehension named-entity-recognition +2

StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

1 code implementation1 May 2023 Lizhen Wang, Xiaochen Zhao, Jingxiang Sun, Yuxiang Zhang, Hongwen Zhang, Tao Yu, Yebin Liu

Results and experiments demonstrate the superiority of our method in terms of image quality, full portrait video generation, and real-time re-animation compared to existing facial reenactment methods.

Face Reenactment Translation +1

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

no code implementations CVPR 2023 Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu

In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses.

APE: An Open and Shared Annotated Dataset for Learning Urban Pedestrian Path Networks

no code implementations4 Mar 2023 Yuxiang Zhang, Nicholas Bolten, Sachin Mehta, Anat Caspi

The process features the use of a multi-input segmentation network trained on our dataset to predict important classes in the pedestrian environment and then generate a connected pedestrian path network.

Autonomous Driving

OASIS: Automated Assessment of Urban Pedestrian Paths at Scale

no code implementations4 Mar 2023 Yuxiang Zhang, Suresh Devalapalli, Sachin Mehta, Anat Caspi

The inspection of the Public Right of Way (PROW) for accessibility barriers is necessary for monitoring and maintaining the built environment for communities' walkability, rollability, safety, active transportation, and sustainability.

How to Extend 3D GBSM to RIS Cascade Channel with Non-ideal Phase Modulation?

no code implementations15 Feb 2023 Huiwen Gong, Jianhua Zhang, Yuxiang Zhang, Zhengfu Zhou, Guangyi Liu

In the modeling process, we consider the non-ideal phase modulation of the RIS element, so as to accurately characterize the dependence of its phase modulation on the incoming wave angle.

Capacity Analysis of Holographic MIMO Channels with Practical Constraints

no code implementations29 Dec 2022 Yuan Zhang, Jianhua Zhang, Yuxiang Zhang, Yuan YAO, Guangyi Liu

However, the channel might not satisfy isotropic scattering because of generalized angle distributions, and the antenna gain is limited by the array aperture in reality.

SrTR: Self-reasoning Transformer with Visual-linguistic Knowledge for Scene Graph Generation

no code implementations19 Dec 2022 Yuxiang Zhang, Zhenbo Liu, Shuai Wang

The execution efficiency of the one-stage scene graph generation approaches are quite high, which infer the effective relation between entity pairs using sparse proposal sets and a few queries.

Graph Generation Relational Reasoning +1

Background-Mixed Augmentation for Weakly Supervised Change Detection

1 code implementation21 Nov 2022 Rui Huang, Ruofei Wang, Qing Guo, Jieda Wei, Yuxiang Zhang, Wei Fan, Yang Liu

Change detection (CD) is to decouple object changes (i. e., object missing or appearing) from background changes (i. e., environment variations) like light and season variations in two images captured in the same scene over a long time span, presenting critical applications in disaster management, urban development, etc.

Change Detection Data Augmentation +1

A Shared Cluster-based Stochastic Channel Model for Joint Communication and Sensing Systems

no code implementations12 Nov 2022 Yameng Liu, Jianhua Zhang, Yuxiang Zhang, Zhiqiang Yuan, Guangyi Liu

Then, a stochastic JCAS channel model is proposed to capture the sharing feature, where shared and non-shared clusters by the two channels are defined and superimposed.

Solving Math Word Problems via Cooperative Reasoning induced Language Models

1 code implementation28 Oct 2022 Xinyu Zhu, Junjie Wang, Lin Zhang, Yuxiang Zhang, Yongfeng Huang, Ruyi Gan, Jiaxing Zhang, Yujiu Yang

This inspires us to develop a cooperative reasoning-induced PLM for solving MWPs, called Cooperative Reasoning (CoRe), resulting in a human-like reasoning architecture with system 1 as the generator and system 2 as the verifier.

Arithmetic Reasoning

Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion

no code implementations13 Oct 2022 Yuxiang Zhang, Jingze Lu, Xingming Wang, Zhuo Li, Runqiu Xiao, Wenchao Wang, Ming Li, Pengyuan Zhang

The overfitting of the model to the training set leads to extreme values of the scores and low correlation of the score distributions, which makes score fusion difficult.

Data Augmentation DeepFake Detection +1

Language-aware Domain Generalization Network for Cross-Scene Hyperspectral Image Classification

no code implementations6 Sep 2022 Yuxiang Zhang, Mengmeng Zhang, Wei Li, Shuai Wang, Ran Tao

Text information including extensive prior knowledge about land cover classes has been ignored in hyperspectral image classification (HSI) tasks.

Contrastive Learning Domain Generalization +1

Towards No.1 in CLUE Semantic Matching Challenge: Pre-trained Language Model Erlangshen with Propensity-Corrected Loss

1 code implementation5 Aug 2022 Junjie Wang, Yuxiang Zhang, Ping Yang, Ruyi Gan

This report describes a pre-trained language model Erlangshen with propensity-corrected loss, the No. 1 in CLUE Semantic Matching Challenge.

Language Modelling Masked Language Modeling

IDET: Iterative Difference-Enhanced Transformers for High-Quality Change Detection

1 code implementation15 Jul 2022 Qing Guo, Ruofei Wang, Rui Huang, Shuifa Sun, Yuxiang Zhang

Change detection (CD) aims to detect change regions within an image pair captured at different times, playing a significant role in diverse real-world applications.

Change Detection Vocal Bursts Intensity Prediction

PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images

1 code implementation13 Jul 2022 Hongwen Zhang, Yating Tian, Yuxiang Zhang, Mengcheng Li, Liang An, Zhenan Sun, Yebin Liu

To address these issues, we propose a Pyramidal Mesh Alignment Feedback (PyMAF) loop in our regression network for well-aligned human mesh recovery and extend it as PyMAF-X for the recovery of expressive full-body models.

Ranked #20 on 3D Human Pose Estimation on 3DPW (using extra training data)

3D human pose and shape estimation Human Mesh Recovery +2

SASV Based on Pre-trained ASV System and Integrated Scoring Module

no code implementations1 Jul 2022 Yuxiang Zhang, Zhuo Li, Wenchao Wang, Pengyuan Zhang

Based on the assumption that there is a correlation between anti-spoofing and speaker verification, a Total-Divide-Total integrated Spoofing-Aware Speaker Verification (SASV) system based on pre-trained automatic speaker verification (ASV) system and integrated scoring module is proposed and submitted to the SASV 2022 Challenge.

Speaker Verification

Adversarial Training-Aided Time-Varying Channel Prediction for TDD/FDD Systems

no code implementations25 Apr 2022 Zhen Zhang, Yuxiang Zhang, Jianhua Zhang, Feifei Gao

In this paper, a time-varying channel prediction method based on conditional generative adversarial network (CPcGAN) is proposed for time division duplexing/frequency division duplexing (TDD/FDD) systems.

Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras

no code implementations ICCV 2021 Yuxiang Zhang, Zhe Li, Liang An, Mengcheng Li, Tao Yu, Yebin Liu

Overall, we propose the first light-weight total capture system and achieves fast, robust and accurate multi-person total motion capture performance.

3D Multi-Person Pose Estimation

Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection

no code implementations21 Jan 2021 Yuxiang Zhang, Sachin Mehta, Anat Caspi

Semantic segmentation is a prerequisite for this task since it maps contiguous regions of the same class as single entities.

Autonomous Navigation Model Selection +2

Incorporating Linguistic Constraints into Keyphrase Generation

no code implementations ACL 2019 Jing Zhao, Yuxiang Zhang

Keyphrases, that concisely describe the high-level topics discussed in a document, are very useful for a wide range of natural language processing tasks.

Keyphrase Generation Multi-Task Learning

Training Bit Fully Convolutional Network for Fast Semantic Segmentation

no code implementations1 Dec 2016 He Wen, Shuchang Zhou, Zhe Liang, Yuxiang Zhang, Dieqiao Feng, Xinyu Zhou, Cong Yao

Fully convolutional neural networks give accurate, per-pixel prediction for input images and have applications like semantic segmentation.

Semantic Segmentation

An Open Source Testing Tool for Evaluating Handwriting Input Methods

no code implementations30 May 2015 Liquan Qiu, Lianwen Jin, Ruifen Dai, Yuxiang Zhang, Lei LI

This paper presents an open source tool for testing the recognition accuracy of Chinese handwriting input methods.

Handwriting Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.