Search Results for author: Yihong Tang

Found 16 papers, 7 papers with code

Vision-to-Music Generation: A Survey

2 code implementations · 27 Mar 2025 · Zhaokai Wang, Chenxi Bao, Le Zhuo, Jingrui Han, Yang Yue, Yihong Tang, Victor Shea-Jay Huang, Yue Liao

Vision-to-music generation, including video-to-music and image-to-music tasks, is a significant branch of multimodal artificial intelligence demonstrating vast application prospects in fields such as film scoring, short video creation, and dance music synthesis.

multimodal generation Music Generation +1

INTENT: Trajectory Prediction Framework with Intention-Guided Contrastive Clustering

no code implementations · 6 Mar 2025 · Yihong Tang, Wei Ma

To this end, we present INTENT, an efficient intention-guided trajectory prediction model that relies solely on information contained in the road agent's trajectory.

Autonomous Driving Clustering +2

The Rise of Darkness: Safety-Utility Trade-Offs in Role-Playing Dialogue Agents

no code implementations · 28 Feb 2025 · Yihong Tang, Kehai Chen, Xuefeng Bai, ZhengYu Niu, Bo Wang, Jie Liu, Min Zhang

Large Language Models (LLMs) have made remarkable advances in role-playing dialogue agents, demonstrating their utility in character simulations.

Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning

no code implementations · 21 Oct 2024 · Yihong Tang, Ao Qu, Zhaokai Wang, Dingyi Zhuang, Zhaofeng Wu, Wei Ma, Shenhao Wang, Yunhan Zheng, Zhan Zhao, Jinhua Zhao

Our central hypothesis is that mastering these basic spatial capabilities can significantly enhance a model's performance on composite spatial tasks requiring advanced spatial understanding and combinatorial problem-solving, with generalized improvements in visual-spatial tasks.

Spatial Reasoning Synthetic Data Generation

RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems

no code implementations · 25 Sep 2024 · Yihong Tang, Bo Wang, Xu Wang, Dongming Zhao, Jing Liu, Jijun Zhang, Ruifang He, Yuexian Hou

Role-playing systems powered by large language models (LLMs) have become increasingly influential in emotional communication applications.

Hallucination

ERABAL: Enhancing Role-Playing Agents through Boundary-Aware Learning

no code implementations · 23 Sep 2024 · Yihong Tang, Jiao Ou, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai

Role-playing is an emerging application in the field of Human-Computer Interaction (HCI), primarily implemented through the alignment training of a large language model (LLM) with assigned characters.

Language Modeling Language Modelling +1

Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement

no code implementations · 16 Feb 2024 · Yihong Tang, Jiao Ou, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai

Experiments on models improved by RoleAD indicate that our adversarial dataset ameliorates this deficiency, with the improvements demonstrating a degree of generalizability in ordinary scenarios.

Dialogue Generation

DialogBench: Evaluating LLMs as Human-like Dialogue Systems

1 code implementation · 3 Nov 2023 · Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Kun Gai

In this paper, we propose DialogBench, a dialogue evaluation benchmark that contains 12 dialogue tasks to probe the capabilities that LLMs should have as human-like dialogue systems.

Dialogue Evaluation

Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense Persona

1 code implementation · 19 May 2023 · Yihong Tang, Bo Wang, Miao Fang, Dongming Zhao, Kun Huang, Ruifang He, Yuexian Hou

We design a Contrastive Latent Variable-based model (CLV) that clusters the dense persona descriptions into sparse categories, which are combined with the history query to generate personalized responses.

Dialogue Generation

Activity-aware Human Mobility Prediction with Hierarchical Graph Attention Recurrent Network

1 code implementation · 14 Oct 2022 · Yihong Tang, Junlin He, Zhan Zhao

Human mobility prediction is a fundamental task essential for various applications in urban planning, location-based services and intelligent transportation systems.

Decoder Graph Attention

Few-Sample Traffic Prediction with Graph Networks using Locale as Relational Inductive Biases

1 code implementation · 8 Mar 2022 · Mingxi Li, Yihong Tang, Wei Ma

Currently, most of the state-of-the-art prediction models are based on graph neural networks (GNNs), and the required training samples are proportional to the size of the traffic network.

Management Open-Ended Question Answering +2

Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles

no code implementations · 4 Nov 2021 · Ao Qu, Yihong Tang, Wei Ma

In view of this, this paper formulates, for the first time, a novel task in which a group of vehicles can cooperatively send falsified information to "cheat" DRL-based ATCS in order to reduce their total travel time.

Deep Reinforcement Learning reinforcement-learning +2
