Search Results for author: Ya Li

Found 30 papers, 10 papers with code

Dual-Path Distillation: A Unified Framework to Improve Black-Box Attacks

no code implementations • ICML 2020 • Yonggang Zhang, Ya Li, Tongliang Liu, Xinmei Tian

To obtain sufficient knowledge for crafting adversarial examples, previous methods query the target model with inputs that are perturbed with different searching directions.

Paper
Add Code

Cross Attention Augmented Transducer Networks for Simultaneous Translation

1 code implementation • EMNLP 2021 • Dan Liu, Mengge Du, Xiaoxi Li, Ya Li, Enhong Chen

This paper proposes a novel architecture, Cross Attention Augmented Transducer (CAAT), for simultaneous translation.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Perceptual learning in contour detection transfer across changes in contour path and orientation

no code implementations • 18 Mar 2024 • Yue Ding, Hongqiao Shi, Shuang Song, Yonghui Wang, Ya Li

The integration of local elements into shape contours is critical for target detection and identification in cluttered scenes.

Contour Detection Specificity

Paper
Add Code

CRB Minimization for RIS-aided mmWave Integrated Sensing and Communications

no code implementations • 2 Jan 2024 • Wanting Lyu, Songjie Yang, Yue Xiu, Ya Li, Hongjun He, Chau Yuen, Zhongpei Zhang

In this paper, reconfigurable intelligent surface (RIS) is employed in a millimeter wave (mmWave) integrated sensing and communications (ISAC) system.

Paper
Add Code

Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation

1 code implementation • 2 Jan 2024 • Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li

Drawing inspiration from state-of-the-art Text-to-Image (T2I) diffusion models, we introduce Auffusion, a TTA system adapting T2I model frameworks to TTA task, by effectively leveraging their inherent generative strengths and precise cross-modal alignment.

Ranked #5 on Audio Generation on AudioCaps

Audio Generation Style Transfer

114

Paper
Code

Frame-level emotional state alignment method for speech emotion recognition

1 code implementation • 27 Dec 2023 • Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li

To address this problem, we propose a frame-level emotional state alignment method for SER.

Speech Emotion Recognition

Paper
Code

Hypergraph Enhanced Knowledge Tree Prompt Learning for Next-Basket Recommendation

no code implementations • 26 Dec 2023 • Zi-Feng Mai, Chang-Dong Wang, Zhongjie Zeng, Ya Li, Jiaquan Chen, Philip S. Yu

To settle the above challenges, we propose a novel method HEKP4NBR, which transforms the knowledge graph (KG) into prompts, namely Knowledge Tree Prompt (KTP), to help PLM encode the OOV item IDs in the user's basket sequence.

Next-basket recommendation

Paper
Add Code

CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis

no code implementations • 16 Dec 2023 • Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li

In this paper, we introduce a contrastive learning-based CSS framework, CONCSS.

Contrastive Learning Self-Supervised Learning +1

Paper
Add Code

Mutual Information-Based Integrated Sensing and Communications: A WMMSE Framework

1 code implementation • 19 Oct 2023 • Yizhou Peng, Songjie Yang, Wanting Lyu, Ya Li, Hongjun He, Zhongpei Zhang, Chadi Assi

In this letter, a weighted minimum mean square error (WMMSE) empowered integrated sensing and communication (ISAC) system is investigated.

Paper
Code

Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis

no code implementations • 5 Jun 2023 • Dengfeng Ke, Yayue Deng, Yukang Jia, Jinlong Xue, Qi Luo, Ya Li, Jianqing Sun, Jiaen Liang, Binghuai Lin

Regressive Text-to-Speech (TTS) system utilizes attention mechanism to generate alignment between text and acoustic feature sequence.

Sentence Speech Synthesis

Paper
Add Code

M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis

no code implementations • 3 May 2023 • Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, JianHua Tao, Jianqing Sun, Jiaen Liang

However, it is still a challenge to comprehensively model the conversation, and a majority of conversational TTS systems only focus on extracting global information and omit local prosody features, which contain important fine-grained information like keywords and emphasis.

Speech Synthesis Text-To-Speech Synthesis

Paper
Add Code

Multi-Stage Coarse-to-Fine Contrastive Learning for Conversation Intent Induction

no code implementations • 9 Mar 2023 • Caiyuan Chu, Ya Li, Yifan Liu, Jia-Chen Gu, Quan Liu, Yongxin Ge, Guoping Hu

The key to automatic intention induction is that, for any given set of new data, the sentence representation obtained by the model can be well distinguished from different labels.

Clustering Contrastive Learning +3

Paper
Add Code

DKT-STDRL: Spatial and Temporal Representation Learning Enhanced Deep Knowledge Tracing for Learning Performance Prediction

no code implementations • 15 Feb 2023 • Liting Lyu, Zhifeng Wang, Haihong Yun, Zexue Yang, Ya Li

Then, the spatial features are connected with the original students' exercise features as joint learning features.

Knowledge Tracing Representation Learning

Paper
Add Code

A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis

no code implementations • 7 Oct 2022 • Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang

Then we used keypoint decomposition to extract video synthesis controlling parameters from the backend output and the source image.

Paper
Add Code

Towards Lightweight Black-Box Attacks against Deep Neural Networks

1 code implementation • 29 Sep 2022 • Chenghao Sun, Yonggang Zhang, Wan Chaoqun, Qizhou Wang, Ya Li, Tongliang Liu, Bo Han, Xinmei Tian

As it is hard to mitigate the approximation error with few available samples, we propose Error TransFormer (ETF) for lightweight attacks.

Paper
Code

ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis

1 code implementation • 20 Mar 2022 • Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang

In recent years, neural network based methods for multi-speaker text-to-speech synthesis (TTS) have made significant progress.

Speaker Verification Speech Synthesis +1

Paper
Code

Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking

1 code implementation • CVPR 2020 • Hongjun Wang, Guangrun Wang, Ya Li, Dongyu Zhang, Liang Lin

To examine the robustness of ReID systems is rather important because the insecurity of ReID systems may cause severe losses, e. g., the criminals may use the adversarial perturbations to cheat the CCTV systems.

Adversarial Attack Person Re-Identification

Paper
Code

Expression Analysis Based on Face Regions in Read-world Conditions

no code implementations • 23 Oct 2019 • Zheng Lian, Ya Li, Jian-Hua Tao, Jian Huang, Ming-Yue Niu

To sum up, the contributions of this paper lie in two areas: 1) We visualize concerned areas of human faces in emotion recognition; 2) We analyze the contribution of different face areas to different emotions in real-world conditions through experimental analysis.

Facial Emotion Recognition Facial Expression Recognition +1

Paper
Add Code

Speech Emotion Recognition via Contrastive Loss under Siamese Networks

no code implementations • 23 Oct 2019 • Zheng Lian, Ya Li, Jian-Hua Tao, Jian Huang

It outperforms the baseline system that is optimized without the contrastive loss function with 1. 14% and 2. 55% in the weighted accuracy and the unweighted accuracy, respectively.

feature selection Speech Emotion Recognition

Paper
Add Code

On Better Exploring and Exploiting Task Relationships in Multi-Task Learning: Joint Model and Feature Learning

no code implementations • 3 Apr 2019 • Ya Li, Xinmei Tian, Tongliang Liu, DaCheng Tao

The objective of our proposed method is to transform the features from different tasks into a common feature space in which the tasks are closely related and the shared parameters can be better optimized.

Multi-Task Learning

Paper
Add Code

Learning Efficient Lexically-Constrained Neural Machine Translation with External Memory

no code implementations • 31 Jan 2019 • Ya Li, Xinyu Liu, Dan Liu, Xueqiang Zhang, Junhua Liu

Recent years has witnessed dramatic progress of neural machine translation (NMT), however, the method of manually guiding the translation procedure remains to be better explored.

Machine Translation NMT +2

Paper
Add Code

Improving speech emotion recognition via Transformer-based Predictive Coding through transfer learning

no code implementations • 11 Nov 2018 • Zheng Lian, Ya Li, Jian-Hua Tao, Jian Huang

I have submitted a new version to arXiv:1910. 13806.

Speech Emotion Recognition Transfer Learning

Paper
Add Code

Investigation of Multimodal Features, Classifiers and Fusion Methods for Emotion Recognition

1 code implementation • 13 Sep 2018 • Zheng Lian, Ya Li, Jian-Hua Tao, Jian Huang

We test our method in the EmotiW 2018 challenge and we gain promising results.

Emotion Classification Multimodal Emotion Recognition +1

Paper
Code

Deep Domain Generalization via Conditional Invariant Adversarial Networks

no code implementations • ECCV 2018 • Ya Li, Xinmei Tian, Mingming Gong, Yajing Liu, Tongliang Liu, Kun Zhang, DaCheng Tao

Under the assumption that the conditional distribution $P(Y|X)$ remains unchanged across domains, earlier approaches to domain generalization learned the invariant representation $T(X)$ by minimizing the discrepancy of the marginal distribution $P(T(X))$.

Ranked #67 on Domain Generalization on PACS

Domain Generalization Representation Learning

Paper
Add Code

Domain Generalization via Conditional Invariant Representation

1 code implementation • 23 Jul 2018 • Ya Li, Mingming Gong, Xinmei Tian, Tongliang Liu, DaCheng Tao

With the conditional invariant representation, the invariance of the joint distribution $\mathbb{P}(h(X), Y)$ can be guaranteed if the class prior $\mathbb{P}(Y)$ does not change across training and test domains.

Domain Generalization

1,331

Paper
Code

Cost-Effective Active Learning for Deep Image Classification

3 code implementations • 13 Jan 2017 • Keze Wang, Dongyu Zhang, Ya Li, Ruimao Zhang, Liang Lin

In this paper, we propose a novel active learning framework, which is capable of building a competitive classifier with optimal feature representation via a limited amount of labeled training instances in an incremental learning manner.

Active Learning Classification +5

Paper
Code

DARI: Distance metric And Representation Integration for Person Verification

no code implementations • 15 Apr 2016 • Guangrun Wang, Liang Lin, Shengyong Ding, Ya Li, Qing Wang

The past decade has witnessed the rapid development of feature representation learning and distance metric learning, whereas the two steps are often discussed separately.

Ranked #7 on Person Re-Identification on SYSU-30k (using extra training data)

Metric Learning Person Re-Identification +1

Paper
Add Code

Audio Visual Emotion Recognition with Temporal Alignment and Perception Attention

no code implementations • 28 Mar 2016 • Linlin Chao, Jian-Hua Tao, Minghao Yang, Ya Li, Zhengqi Wen

The other one is locating and re-weighting the perception attentions in the whole audio-visual stream for better recognition.

Classification Emotion Recognition +1

Paper
Add Code

Deep Boosting: Joint Feature Selection and Analysis Dictionary Learning in Hierarchy

no code implementations • 8 Aug 2015 • Zhanglin Peng, Ya Li, Zhaoquan Cai, Liang Lin

In each layer, we construct a dictionary of filters by combining the filters from the lower layer, and iteratively optimize the image representation with a joint discriminative-generative formulation, i. e. minimization of empirical classification error plus regularization of analysis image generation over training images.

Classification Dictionary Learning +4

Paper
Add Code

Defuzzify firstly or finally: Dose it matter in fuzzy DEMATEL under uncertain environment?

no code implementations • 20 Mar 2014 • Yunpeng Li, Ya Li, Jie Liu, Yong Deng

The results of defuzzification at the first step are not coincide with the results of defuzzification at the final step. It seems that the alternative is to defuzzification in the final step in fuzzy DEMATEL.

Decision Making

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.