1 code implementation • 29 Sep 2023 • Yunxiang Li, Bowen Jing, Zihan Li, Jing Wang, You Zhang
The recent developments of foundation models in computer vision, especially the Segment Anything Model (SAM), allow scalable and domain-agnostic image segmentation to serve as a general-purpose segmentation tool.
no code implementations • 14 Sep 2023 • Yongyi Zang, You Zhang, Mojtaba Heydari, Zhiyao Duan
These unique properties make singing voice deepfake detection a relevant but significantly different problem from synthetic speech detection.
1 code implementation • 27 Jul 2023 • Yutong Wen, You Zhang, Zhiyao Duan
We further show that these normalized HRTFs can be used to learn a more unified HRTF representation across databases than the prior art.
1 code implementation • 24 May 2023 • Yunxiang Li, Meixu Chen, Wenxuan Yang, Kai Wang, Jun Ma, Alan C. Bovik, You Zhang
Image translation has wide applications, such as style transfer and modality conversion, usually aiming to generate images having both high degrees of realism and faithfulness.
no code implementations • 5 Apr 2023 • Yunxiang Li, Hua-Chieh Shao, Xiao Liang, Liyuan Chen, RuiQi Li, Steve Jiang, Jing Wang, You Zhang
However, existing diffusion models for medical image translation do not accurately retain structural information: the structural details of source-domain images are lost during the forward diffusion process and cannot be fully recovered through learned reverse diffusion, even though the integrity of anatomical structures is extremely important in medical images.
1 code implementation • 24 Mar 2023 • Yunxiang Li, Zihan Li, Kai Zhang, Ruilong Dan, Steve Jiang, You Zhang
The primary aim of this research was to address the limitations observed in the medical knowledge of prevalent large language models (LLMs) such as ChatGPT, by creating a specialized language model with enhanced accuracy in medical advice.
1 code implementation • 4 Nov 2022 • Siwen Ding, You Zhang, Zhiyao Duan
Our previous research on one-class learning has improved the generalization ability to unseen attacks by compacting the bona fide speech in the embedding space.
1 code implementation • 27 Oct 2022 • You Zhang, Yuxiang Wang, Zhiyao Duan
In this work, we propose to use neural fields, a differentiable representation of functions through neural networks, to model HRTFs with arbitrary spatial sampling schemes.
1 code implementation • 22 Sep 2022 • Kai Wang, Yunxiang Li, Michael Dohopolski, Tao Peng, Weiguo Lu, You Zhang, Jing Wang
For head and neck cancer (HNC) patient management, automatic gross tumor volume (GTV) segmentation and accurate pre-treatment cancer recurrence prediction are of great importance in helping physicians design personalized management plans, which have the potential to improve treatment outcomes and quality of life for HNC patients.
1 code implementation • 28 Jul 2022 • Yuxiang Wang, You Zhang, Zhiyao Duan, Mark Bocko
For the HRTF data, we use truncated spherical harmonic (SH) coefficients to represent the HRTF magnitudes and onsets.
1 code implementation • 29 Jun 2022 • Zihan Li, Yunxiang Li, Qingde Li, Puyang Wang, Dazhou Guo, Le Lu, Dakai Jin, You Zhang, Qingqi Hong
In our LViT model, medical text annotation is incorporated to compensate for the quality deficiency in image data.
Ranked #1 on Medical Image Segmentation on MoNuSeg
no code implementations • 21 Jun 2022 • Abudukelimu Wuerkaixi, You Zhang, Zhiyao Duan, ChangShui Zhang
This clarification of the definition is motivated by our extensive experiments, through which we discover that existing ASD methods fail to model audio-visual synchronization and often classify unsynchronized videos as active speaking.
no code implementations • 8 Mar 2022 • Yunxiang Li, Ruilong Dan, Shuai Wang, Yifan Cao, Xiangde Luo, Chenghao Tan, Gangyong Jia, Huiyu Zhou, You Zhang, Yaqi Wang, Li Wang
For instance, a model trained on a dataset with specific imaging parameters does not transfer well to other datasets with different imaging parameters.
1 code implementation • 10 Feb 2022 • You Zhang, Ge Zhu, Zhiyao Duan
We further propose fusion strategies for direct inference and fine-tuning to predict the SASV score based on the framework.
2 code implementations • 26 Jul 2021 • Xinhui Chen, You Zhang, Ge Zhu, Zhiyao Duan
Different from previous ASVspoof challenges, the LA task this year presents codec and transmission channel variability, while the new task DF presents general audio compression.
no code implementations • 23 Apr 2021 • Jaehee Chun, Justin C. Park, Sven Olberg, You Zhang, Dan Nguyen, Jing Wang, Jin Sung Kim, Steve Jiang
Finally, in the sCT reconstruction task, the MAE is reduced from 68 to 22 HU by utilizing the IDOL framework.
3 code implementations • 3 Apr 2021 • You Zhang, Ge Zhu, Fei Jiang, Zhiyao Duan
Spoofing countermeasure (CM) systems are critical in speaker verification; they aim to discern spoofing attacks from bona fide speech trials.
3 code implementations • 27 Oct 2020 • You Zhang, Fei Jiang, Zhiyao Duan
Human voices can be used to authenticate the identity of the speaker, but automatic speaker verification (ASV) systems are vulnerable to voice spoofing attacks, such as impersonation, replay, text-to-speech, and voice conversion.
1 code implementation • 8 Aug 2020 • Sefik Emre Eskimez, You Zhang, Zhiyao Duan
Visual emotion expression plays an important role in audiovisual speech communication.
no code implementations • 3 May 2020 • Liang Huang, You Zhang, Weijian Pan, Jinyin Chen, Li Ping Qian, Yuan Wu
Extensive numerical results show that both the CNN-based and LSTM-based classifiers extract similar radio features related to modulation reference points.
no code implementations • 6 Dec 2019 • Liang Huang, Weijian Pan, You Zhang, LiPing Qian, Nan Gao, Yuan Wu
Deep learning has recently been applied to automatically classify the modulation categories of received radio signals without relying on manual expertise.
no code implementations • 15 May 2019 • Xuaner Zhang, Kevin Matzen, Vivien Nguyen, Dillon Yao, You Zhang, Ren Ng
We present a system that synthetically renders refocusable video from a deep DOF video shot with a smartphone, and analyzes future video frames to deliver context-aware autofocus for the current frame.
no code implementations • 26 Jun 2018 • Fei Wen, You Zhang, Wei Wang
Then, the normalized Laplacian spectra of $G_1^S\bowtie (G_2^V\cup G_3^E)$ and $G_1^S\diamondsuit(G_2^V\cup G_3^E)$ are respectively determined in terms of the corresponding normalized Laplacian spectra of the connected regular graphs $G_{1}$, $G_{2}$ and $G_{3}$, which extend the corresponding results of [A. Das, P. Panigrahi, Linear Multil.
Combinatorics
no code implementations • SEMEVAL 2018 • You Zhang, Jin Wang, Xue-jie Zhang
Our system mainly uses a BiLSTM (bidirectional long short-term memory) model with an attention mechanism.
no code implementations • IJCNLP 2017 • Hang Yuan, You Zhang, Jin Wang, Xue-jie Zhang
The shared task is a typical question answering task that tests how accurately participants can answer exam questions.
no code implementations • 4 Oct 2017 • Zhiguo Zhou, Zhi-Jie Zhou, Hongxia Hao, Shulong Li, Xi Chen, You Zhang, Michael Folkert, Jing Wang
First, the predictive performance of the model may be reduced when features extracted from an individual imaging modality are blindly combined into a single predictive model.
no code implementations • WS 2017 • You Zhang, Hang Yuan, Jin Wang, Xue-jie Zhang
In this paper, we present a system that uses a convolutional neural network with long short-term memory (CNN-LSTM) model to complete the task.