no code implementations • 11 Dec 2024 • Xin Zhao, Xiaojun Chen, Haoyu Gao
Due to the remarkable generative potential of diffusion-based models, numerous studies have investigated jailbreak attacks targeting these frameworks.
no code implementations • 26 Nov 2024 • Guojian Zhan, Qiang Ge, Haoyu Gao, Yuming Yin, Bin Zhao, Shengbo Eben Li
After validation, we conduct comprehensive simulations comparing our proposed model with both kinematic models and existing dynamic models discretized through the forward Euler method.
no code implementations • 22 May 2024 • Senmao Tian, Haoyu Gao, Gangyi Hong, Shuyun Wang, JingJie Wang, Xin Yu, Shunli Zhang
Minor variations in silhouette sequences can be diminished in the network's intermediate layers due to the accumulation of quantization errors.
no code implementations • 22 Sep 2023 • Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Yongbin Li
Task-oriented dialogue (TOD) systems help users carry out various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts.
1 code implementation • NeurIPS 2023 • Shuzheng Si, Wentao Ma, Haoyu Gao, Yuchuan Wu, Ting-En Lin, Yinpei Dai, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li
SpokenWOZ further incorporates common spoken characteristics such as word-by-word processing and reasoning in spoken language.
1 code implementation • 19 May 2023 • Tianshu Yu, Haoyu Gao, Ting-En Lin, Min Yang, Yuchuan Wu, Wentao Ma, Chao Wang, Fei Huang, Yongbin Li
In this paper, we propose Speech-text dialog Pre-training for spoken dialog understanding with ExpliCiT cRoss-Modal Alignment (SPECTRA), which is the first-ever speech-text dialog pre-training model.
Ranked #2 on Multimodal Sentiment Analysis on CMU-MOSI (Acc-2 metric, using extra training data)
cross-modal alignment
Emotion Recognition in Conversation
1 code implementation • 4 May 2023 • Haoyu Gao, Rui Wang, Ting-En Lin, Yuchuan Wu, Min Yang, Fei Huang, Yongbin Li
Dialogue Topic Segmentation (DTS) plays an essential role in a variety of dialogue modeling tasks.