Search Results for author: Yihan Wu

Found 25 papers, 6 papers with code

Entity Alignment with Unlabeled Dangling Cases

no code implementations • 16 Mar 2024 • Hang Yin, Dong Ding, Liyao Xiang, Yuheng He, Yihan Wu, Xinbing Wang, Chenghu Zhou

We investigate the entity alignment problem with unlabeled dangling cases, meaning that there are entities in the source or target graph having no counterparts in the other, and those entities remain unlabeled.

Entity Alignment Representation Learning

Paper
Add Code

Few-Shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt

no code implementations • 14 Mar 2024 • Chenxi Liu, Zhenyi Wang, Tianyi Xiong, Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang

Few-Shot Class-Incremental Learning (FSCIL) models aim to incrementally learn new classes with scarce samples while preserving knowledge of old ones.

Few-Shot Class-Incremental Learning Incremental Learning

Paper
Add Code

Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection

1 code implementation • 19 Feb 2024 • Ruibo Chen, Yihan Wu, Lichang Chen, Guodong Liu, Qi He, Tianyi Xiong, Chenxi Liu, Junfeng Guo, Heng Huang

In the first stage, we devise a scoring network to evaluate the difficulty of training instructions, which is co-trained with the VLM.

Instruction Following Language Modelling

Paper
Code

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

no code implementations • 31 Jan 2024 • Yihan Wu, Soumi Maiti, Yifan Peng, Wangyou Zhang, Chenda Li, Yuyue Wang, Xihua Wang, Shinji Watanabe, Ruihua Song

Existing speech language models typically utilize task-dependent prompt tokens to unify various speech tasks in a single model.

Language Modelling Speech Enhancement +4

Paper
Add Code

GPT-4 Vision on Medical Image Classification -- A Case Study on COVID-19 Dataset

no code implementations • 27 Oct 2023 • Ruibo Chen, Tianyi Xiong, Yihan Wu, Guodong Liu, Zhengmian Hu, Lichang Chen, Yanshuo Chen, Chenxi Liu, Heng Huang

This technical report delves into the application of GPT-4 Vision (GPT-4V) in the nuanced realm of COVID-19 image classification, leveraging the transformative potential of in-context learning to enhance diagnostic processes.

Image Classification In-Context Learning +1

Paper
Add Code

DiPmark: A Stealthy, Efficient and Resilient Watermark for Large Language Models

no code implementations • 11 Oct 2023 • Yihan Wu, Zhengmian Hu, Hongyang Zhang, Heng Huang

Watermarking techniques offer a promising way to secure data via embedding covert information into the data.

Language Modelling

Paper
Add Code

Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation

no code implementations • 4 Oct 2023 • Yihan Wu, Brandon Y. Feng, Heng Huang

In this paper, we introduce an innovative method of safeguarding user privacy against the generative capabilities of Neural Radiance Fields (NeRF) models.

3D Scene Reconstruction Privacy Preserving

Paper
Add Code

Markov Chain-Guided Graph Construction and Sampling Depth Optimization for EEG-Based Mental Disorder Detection

no code implementations • 18 Sep 2023 • Yihan Wu, Tao Chang, Peng Xu, Yangsong Zhang

Graph Neural Networks (GNNs) have received considerable attention since its introduction.

EEG graph construction

Paper
Add Code

Characterizing normal perinatal development of the human brain structural connectivity

no code implementations • 22 Aug 2023 • Yihan Wu, Lana Vasung, Camilo Calixto, Ali Gholipour, Davood Karimi

The new computational method and results are useful for assessing normal and abnormal development of the structural connectome early in life.

Paper
Add Code

Cooperation or Competition: Avoiding Player Domination for Multi-Target Robustness via Adaptive Budgets

no code implementations • CVPR 2023 • Yimu Wang, Dinghuai Zhang, Yihan Wu, Heng Huang, Hongyang Zhang

We identify a phenomenon named player domination in the bargaining game, namely that the existing max-based approaches, such as MAX and MSD, do not converge.

Paper
Add Code

ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios

no code implementations • 20 May 2023 • Yuyue Wang, Huan Xiao, Yihan Wu, Ruihua Song

Considering comedians have diverse personal speech styles, including personal prosody, rhythm, and fillers, it requires real-world datasets and strong speech style modeling capabilities, which brings challenges.

Paper
Add Code

ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

1 code implementation • 30 Dec 2022 • Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo Mandic

Denoising Diffusion Probabilistic Models (DDPMs) are emerging in text-to-speech (TTS) synthesis because of their strong capability of generating high-fidelity samples.

Denoising

Paper
Code

Adversarial Weight Perturbation Improves Generalization in Graph Neural Networks

1 code implementation • 9 Dec 2022 • Yihan Wu, Aleksandar Bojchevski, Heng Huang

In this paper, we extensively study this phenomenon for graph data.

Graph Learning Node Classification

Paper
Code

VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing

1 code implementation • 30 Nov 2022 • Yihan Wu, Junliang Guo, Xu Tan, Chen Zhang, Bohan Li, Ruihua Song, Lei He, Sheng Zhao, Arul Menezes, Jiang Bian

In this paper, we propose a machine translation system tailored for the task of video dubbing, which directly considers the speech duration of each token in translation, to match the length of source and target speech.

Machine Translation Sentence +4

1,286

Paper
Code

PromptTTS: Controllable Text-to-Speech with Text Descriptions

no code implementations • 22 Nov 2022 • Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan

Thus, we develop a text-to-speech (TTS) system (dubbed as PromptTTS) that takes a prompt with both style and content descriptions as input to synthesize the corresponding speech.

Speech Synthesis

Paper
Add Code

Towards Robust Dataset Learning

1 code implementation • 19 Nov 2022 • Yihan Wu, Xinda Li, Florian Kerschbaum, Heng Huang, Hongyang Zhang

In this paper, we study the problem of learning a robust dataset such that any classifier naturally trained on the dataset is adversarially robust.

1,158

Paper
Code

Semantic scene descriptions as an objective of human vision

no code implementations • 23 Sep 2022 • Adrien Doerig, Tim C Kietzmann, Emily Allen, Yihan Wu, Thomas Naselaris, Kendrick Kay, Ian Charest

Interpreting the meaning of a visual scene requires not only identification of its constituent objects, but also a rich semantic characterization of object interrelations.

Paper
Add Code

Schizophrenia detection based on EEG using Recurrent Auto-Encoder framework

no code implementations • 9 Jul 2022 • Yihan Wu, Min Xia, Xiuzhu Wang, Yangsong Zhang

This study demonstrated that the structure of RAE is able to capture the differential features between SZ patients and HC subjects.

EEG

Paper
Add Code

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

no code implementations • 25 Jun 2022 • Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

In this paper, we propose a novel framework for learning style representation from abundant plain text in a self-supervised manner.

Contrastive Learning Deep Clustering +2

Paper
Add Code

RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval

no code implementations • 17 Jun 2022 • Yihan Wu, Hongyang Zhang, Heng Huang

The challenge is to design a provably robust algorithm that takes into consideration the 1-NN search and the high-dimensional nature of the embedding space.

Image Retrieval Retrieval

Paper
Add Code

AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios

no code implementations • 1 Apr 2022 • Yihan Wu, Xu Tan, Bohan Li, Lei He, Sheng Zhao, Ruihua Song, Tao Qin, Tie-Yan Liu

We model the speaker characteristics systematically to improve the generalization on new speakers.

Speech Synthesis

Paper
Add Code

A Law of Robustness beyond Isoperimetry

no code implementations • 23 Feb 2022 • Yihan Wu, Heng Huang, Hongyang Zhang

We prove a Lipschitzness lower bound $\Omega(\sqrt{n/p})$ of the interpolating neural network with $p$ parameters on arbitrary data distributions.

Paper
Add Code

Simultaneously exploring multi-scale and asymmetric EEG features for emotion recognition

no code implementations • 13 Oct 2021 • Yihan Wu, Min Xia, Li Nie, Yangsong Zhang, Andong Fan

In recent years, emotion recognition based on electroencephalography (EEG) has received growing interests in the brain-computer interaction (BCI) field.

EEG Emotion Recognition

Paper
Add Code

Understanding Metric Learning on Unit Hypersphere and Generating Better Examples for Adversarial Training

no code implementations • 29 Sep 2021 • Yihan Wu, Heng Huang

In this paper, we boost the performance of deep metric learning (DML) models with adversarial examples generated by attacking two new objective functions: \textit{intra-class alignment} and \textit{hyperspherical uniformity}.

Metric Learning Representation Learning

Paper
Add Code

NeuroGen: activation optimized image synthesis for discovery neuroscience

2 code implementations • 15 May 2021 • Zijin Gu, Keith W. Jamison, Meenakshi Khosla, Emily J. Allen, Yihan Wu, Thomas Naselaris, Kendrick Kay, Mert R. Sabuncu, Amy Kuceyeski

NeuroGen combines an fMRI-trained neural encoding model of human vision with a deep generative network to synthesize images predicted to achieve a target pattern of macro-scale brain activation.

Image Generation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.