Search Results for author: Yihan Wu

Found 25 papers, 6 papers with code

Entity Alignment with Unlabeled Dangling Cases

no code implementations16 Mar 2024 Hang Yin, Dong Ding, Liyao Xiang, Yuheng He, Yihan Wu, Xinbing Wang, Chenghu Zhou

We investigate the entity alignment problem with unlabeled dangling cases, meaning that there are entities in the source or target graph having no counterparts in the other, and those entities remain unlabeled.

Entity Alignment Representation Learning

Few-Shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt

no code implementations14 Mar 2024 Chenxi Liu, Zhenyi Wang, Tianyi Xiong, Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang

Few-Shot Class-Incremental Learning (FSCIL) models aim to incrementally learn new classes with scarce samples while preserving knowledge of old ones.

Few-Shot Class-Incremental Learning Incremental Learning

GPT-4 Vision on Medical Image Classification -- A Case Study on COVID-19 Dataset

no code implementations27 Oct 2023 Ruibo Chen, Tianyi Xiong, Yihan Wu, Guodong Liu, Zhengmian Hu, Lichang Chen, Yanshuo Chen, Chenxi Liu, Heng Huang

This technical report delves into the application of GPT-4 Vision (GPT-4V) in the nuanced realm of COVID-19 image classification, leveraging the transformative potential of in-context learning to enhance diagnostic processes.

Image Classification In-Context Learning +1

DiPmark: A Stealthy, Efficient and Resilient Watermark for Large Language Models

no code implementations11 Oct 2023 Yihan Wu, Zhengmian Hu, Hongyang Zhang, Heng Huang

Watermarking techniques offer a promising way to secure data via embedding covert information into the data.

Language Modelling

Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation

no code implementations4 Oct 2023 Yihan Wu, Brandon Y. Feng, Heng Huang

In this paper, we introduce an innovative method of safeguarding user privacy against the generative capabilities of Neural Radiance Fields (NeRF) models.

3D Scene Reconstruction Privacy Preserving

Characterizing normal perinatal development of the human brain structural connectivity

no code implementations22 Aug 2023 Yihan Wu, Lana Vasung, Camilo Calixto, Ali Gholipour, Davood Karimi

The new computational method and results are useful for assessing normal and abnormal development of the structural connectome early in life.

Cooperation or Competition: Avoiding Player Domination for Multi-Target Robustness via Adaptive Budgets

no code implementations CVPR 2023 Yimu Wang, Dinghuai Zhang, Yihan Wu, Heng Huang, Hongyang Zhang

We identify a phenomenon named player domination in the bargaining game, namely that the existing max-based approaches, such as MAX and MSD, do not converge.

ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios

no code implementations20 May 2023 Yuyue Wang, Huan Xiao, Yihan Wu, Ruihua Song

Considering comedians have diverse personal speech styles, including personal prosody, rhythm, and fillers, it requires real-world datasets and strong speech style modeling capabilities, which brings challenges.

ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

1 code implementation30 Dec 2022 Zehua Chen, Yihan Wu, Yichong Leng, Jiawei Chen, Haohe Liu, Xu Tan, Yang Cui, Ke Wang, Lei He, Sheng Zhao, Jiang Bian, Danilo Mandic

Denoising Diffusion Probabilistic Models (DDPMs) are emerging in text-to-speech (TTS) synthesis because of their strong capability of generating high-fidelity samples.

Denoising

VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing

1 code implementation30 Nov 2022 Yihan Wu, Junliang Guo, Xu Tan, Chen Zhang, Bohan Li, Ruihua Song, Lei He, Sheng Zhao, Arul Menezes, Jiang Bian

In this paper, we propose a machine translation system tailored for the task of video dubbing, which directly considers the speech duration of each token in translation, to match the length of source and target speech.

Machine Translation Sentence +4

PromptTTS: Controllable Text-to-Speech with Text Descriptions

no code implementations22 Nov 2022 Zhifang Guo, Yichong Leng, Yihan Wu, Sheng Zhao, Xu Tan

Thus, we develop a text-to-speech (TTS) system (dubbed as PromptTTS) that takes a prompt with both style and content descriptions as input to synthesize the corresponding speech.

Speech Synthesis

Towards Robust Dataset Learning

1 code implementation19 Nov 2022 Yihan Wu, Xinda Li, Florian Kerschbaum, Heng Huang, Hongyang Zhang

In this paper, we study the problem of learning a robust dataset such that any classifier naturally trained on the dataset is adversarially robust.

Semantic scene descriptions as an objective of human vision

no code implementations23 Sep 2022 Adrien Doerig, Tim C Kietzmann, Emily Allen, Yihan Wu, Thomas Naselaris, Kendrick Kay, Ian Charest

Interpreting the meaning of a visual scene requires not only identification of its constituent objects, but also a rich semantic characterization of object interrelations.

Schizophrenia detection based on EEG using Recurrent Auto-Encoder framework

no code implementations9 Jul 2022 Yihan Wu, Min Xia, Xiuzhu Wang, Yangsong Zhang

This study demonstrated that the structure of RAE is able to capture the differential features between SZ patients and HC subjects.

EEG

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

no code implementations25 Jun 2022 Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

In this paper, we propose a novel framework for learning style representation from abundant plain text in a self-supervised manner.

Contrastive Learning Deep Clustering +2

RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval

no code implementations17 Jun 2022 Yihan Wu, Hongyang Zhang, Heng Huang

The challenge is to design a provably robust algorithm that takes into consideration the 1-NN search and the high-dimensional nature of the embedding space.

Image Retrieval Retrieval

AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios

no code implementations1 Apr 2022 Yihan Wu, Xu Tan, Bohan Li, Lei He, Sheng Zhao, Ruihua Song, Tao Qin, Tie-Yan Liu

We model the speaker characteristics systematically to improve the generalization on new speakers.

Speech Synthesis

A Law of Robustness beyond Isoperimetry

no code implementations23 Feb 2022 Yihan Wu, Heng Huang, Hongyang Zhang

We prove a Lipschitzness lower bound $\Omega(\sqrt{n/p})$ of the interpolating neural network with $p$ parameters on arbitrary data distributions.

Simultaneously exploring multi-scale and asymmetric EEG features for emotion recognition

no code implementations13 Oct 2021 Yihan Wu, Min Xia, Li Nie, Yangsong Zhang, Andong Fan

In recent years, emotion recognition based on electroencephalography (EEG) has received growing interests in the brain-computer interaction (BCI) field.

EEG Emotion Recognition

Understanding Metric Learning on Unit Hypersphere and Generating Better Examples for Adversarial Training

no code implementations29 Sep 2021 Yihan Wu, Heng Huang

In this paper, we boost the performance of deep metric learning (DML) models with adversarial examples generated by attacking two new objective functions: \textit{intra-class alignment} and \textit{hyperspherical uniformity}.

Metric Learning Representation Learning

NeuroGen: activation optimized image synthesis for discovery neuroscience

2 code implementations15 May 2021 Zijin Gu, Keith W. Jamison, Meenakshi Khosla, Emily J. Allen, Yihan Wu, Thomas Naselaris, Kendrick Kay, Mert R. Sabuncu, Amy Kuceyeski

NeuroGen combines an fMRI-trained neural encoding model of human vision with a deep generative network to synthesize images predicted to achieve a target pattern of macro-scale brain activation.

Image Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.