Search Results for author: Ziyan Wang

Found 40 papers, 12 papers with code

LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation

no code implementations15 Jul 2025 Ziyan Wang, Yingpeng Du, Zhu Sun, Jieyi Bi, Haoyan Chua, Tianjun Wei, Jie Zhang

To alleviate the limited insights derived from individual users' behaviors, at the user-crowd level, we propose aggregating user cliques into synthesized users with rich behaviors for more comprehensive LLM-driven multi-interest analysis.

Contrastive Learning

Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models

no code implementations24 Jun 2025 Zhicheng Zhang, Ziyan Wang, Yali Du, Fei Fang

Developing effective instruction-following policies in reinforcement learning remains challenging due to the reliance on extensive human-labeled instruction datasets and the difficulty of learning from sparse rewards.

Instruction Following reinforcement-learning +1

M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality

no code implementations3 Mar 2025 Ziyan Wang, Zhicheng Zhang, Fei Fang, Yali Du

We introduce Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality ($\text{M}^3\text{HF}$), a novel framework that integrates multi-phase human feedback of mixed quality into the MARL training process.

Multi-agent Reinforcement Learning reinforcement-learning +1

Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

1 code implementation21 Feb 2025 Qi Le, Enmao Diao, Ziyan Wang, Xinran Wang, Jie Ding, Li Yang, Ali Anwar

In the probing stage, PP selects a small yet crucial set of hidden states, based on residual importance, to run a few model layers ahead.

Active Large Language Model-based Knowledge Distillation for Session-based Recommendation

no code implementations15 Dec 2024 Yingpeng Du, Zhu Sun, Ziyan Wang, Haoyan Chua, Jie Zhang, Yew-Soon Ong

Knowledge distillation (KD)-based methods can alleviate these issues by transferring the knowledge to a small student, which trains a student based on the predictions of a cumbersome teacher.

Active Learning Knowledge Distillation +4

Boolean Product Graph Neural Networks

no code implementations21 Sep 2024 Ziyan Wang, Bin Liu, Ling Xiang

To mitigate fluctuations in latent graph structure learning, this paper proposes a novel Boolean product-based graph residual connection in GNNs to link the latent graph and the original graph.

Graph structure learning

Probability Passing for Graph Neural Networks: Graph Structure and Representations Joint Learning

1 code implementation15 Jul 2024 Ziyan Wang, YaXuan He, Bin Liu

To solve this problem, Latent Graph Inference (LGI) is proposed to infer a task-specific latent structure by computing similarity or edge probability of node features and then apply a GNN to produce predictions.

Graph Neural Network

ZeroDDI: A Zero-Shot Drug-Drug Interaction Event Prediction Method with Semantic Enhanced Learning and Dual-Modal Uniform Alignment

1 code implementation1 Jul 2024 Ziyan Wang, Zhankun Xiong, Feng Huang, Xuan Liu, Wen Zhang

Drug-drug interactions (DDIs) can result in various pharmacological changes, which can be categorized into different classes known as DDI events (DDIEs).

Representation Learning

CEST-KAN: Kolmogorov-Arnold Networks for CEST MRI Data Analysis

1 code implementation23 Jun 2024 Jiawen Wang, Pei Cai, Ziyan Wang, Huabin Zhang, Jianpan Huang

Results: The water and CEST maps generated by both MLP and KAN were visually comparable to the MPLF results.

Kolmogorov-Arnold Networks

Towards Domain Adaptive Neural Contextual Bandits

no code implementations13 Jun 2024 Ziyan Wang, Xiaoming Huo, Hao Wang

Our approach learns a bandit model for the target domain by collecting feedback from the source domain.

Decision Making Domain Adaptation +1

Safe Multi-agent Reinforcement Learning with Natural Language Constraints

no code implementations30 May 2024 Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, Yali Du

These embeddings are then integrated into the multi-agent policy learning process, enabling agents to learn policies that minimize constraint violations while optimizing rewards.

Autonomous Vehicles Multi-agent Reinforcement Learning +2

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

no code implementations30 May 2024 Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang

Communication is a fundamental aspect of human society, facilitating the exchange of information and beliefs among people.

Reinforcement Learning (RL)

Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation

no code implementations25 Mar 2024 Ziyan Wang, Yingpeng Du, Zhu Sun, Haoyan Chua, Kaidong Feng, Wenya Wang, Jie Zhang

However, the former methods struggle with optimal prompts to elicit the correct reasoning of LLMs due to the lack of task-specific feedback, leading to unsatisfactory recommendations.

Language Modeling Language Modelling +2

ANIM: Accurate Neural Implicit Model for Human Reconstruction from a single RGB-D image

no code implementations CVPR 2024 Marco Pesavento, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Ziyan Wang, Chun-Han Yao, Marco Volino, Edmond Boyer, Adrian Hilton, Tony Tung

In this paper, we explore the benefits of incorporating depth observations in the reconstruction process by introducing ANIM, a novel method that reconstructs arbitrary 3D human shapes from single-view RGB-D images with an unprecedented level of accuracy.

Large Language Model with Graph Convolution for Recommendation

no code implementations14 Feb 2024 Yingpeng Du, Ziyan Wang, Zhu Sun, Haoyan Chua, Hongzhi Liu, Zhonghai Wu, Yining Ma, Jie Zhang, Youchen Sun

To adapt text-based LLMs with structured graphs, We use the LLM as an aggregator in graph processing, allowing it to understand graph-based information step by step.

Hallucination Language Modeling +2

Natural Language Reinforcement Learning

no code implementations11 Feb 2024 Xidong Feng, Ziyu Wan, Mengyue Yang, Ziyan Wang, Girish A. Koushik, Yali Du, Ying Wen, Jun Wang

Reinforcement Learning (RL) has shown remarkable abilities in learning policies for decision-making tasks.

Decision Making reinforcement-learning +2

Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

no code implementations15 Jan 2024 Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du

Through the use of pre-trained LMs and the elimination of the need for a ground-truth cost, our method enhances safe policy learning under a diverse set of human-derived free-form natural language constraints.

Form Reinforcement Learning (RL) +1

A Local Appearance Model for Volumetric Capture of Diverse Hairstyle

no code implementations14 Dec 2023 Ziyan Wang, Giljoo Nam, Aljaz Bozic, Chen Cao, Jason Saragih, Michael Zollhoefer, Jessica Hodgins

In this paper, we present a novel method for creating high-fidelity avatars with diverse hairstyles.

A Comprehensive Survey for Evaluation Methodologies of AI-Generated Music

no code implementations26 Aug 2023 Zeyu Xiong, Weitao Wang, Jing Yu, Yue Lin, Ziyan Wang

In recent years, AI-generated music has made significant progress, with several models performing well in multimodal and complex musical genres and scenes.

ChessGPT: Bridging Policy Learning and Language Modeling

1 code implementation NeurIPS 2023 Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang

Thus, we propose ChessGPT, a GPT model bridging policy learning and language modeling by integrating data from these two sources in Chess games.

Decision Making Language Modeling +1

Variational Imbalanced Regression: Fair Uncertainty Quantification via Probabilistic Smoothing

1 code implementation NeurIPS 2023 Ziyan Wang, Hao Wang

Existing regression models tend to fall short in both accuracy and uncertainty estimation when the label distribution is imbalanced.

Probabilistic Deep Learning regression +1

Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach

no code implementations NeurIPS 2023 Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy

While the majority of current approaches construct the reward redistribution in an uninterpretable manner, we propose to explicitly model the contributions of state and action from a causal perspective, resulting in an interpretable reward redistribution and preserving policy invariance.

reinforcement-learning Reinforcement Learning

Neural Strands: Learning Hair Geometry and Appearance from Multi-View Images

no code implementations28 Jul 2022 Radu Alexandru Rosu, Shunsuke Saito, Ziyan Wang, Chenglei Wu, Sven Behnke, Giljoo Nam

Furthermore, we introduce a novel neural rendering framework based on rasterization of the learned hair strands.

Neural Rendering

Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

1 code implementation14 Feb 2022 Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar

Satisfying safety constraints almost surely (or with probability one) can be critical for the deployment of Reinforcement Learning (RL) in real-life applications.

reinforcement-learning Reinforcement Learning +2

HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture

no code implementations CVPR 2022 Ziyan Wang, Giljoo Nam, Tuur Stuyck, Stephen Lombardi, Michael Zollhoefer, Jessica Hodgins, Christoph Lassner

Capturing and rendering life-like hair is particularly challenging due to its fine geometric structure, the complex physical interaction and its non-trivial visual appearance. Yet, hair is a critical component for believable avatars.

Neural Rendering Optical Flow Estimation

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

no code implementations27 Oct 2021 David Mguni, Usman Islam, Yaqi Sun, Xiuling Zhang, Joel Jennings, Aivar Sootla, Changmin Yu, Ziyan Wang, Jun Wang, Yaodong Yang

In this paper, we introduce a new generation of RL solvers that learn to minimise safety violations while maximising the task reward to the extent that can be tolerated by the safe policy.

OpenAI Gym reinforcement-learning +3

Multi-Agent Constrained Policy Optimisation

4 code implementations6 Oct 2021 Shangding Gu, Jakub Grudzien Kuba, Munning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, Yaodong Yang

To fill these gaps, in this work, we formulate the safe MARL problem as a constrained Markov game and solve it with policy optimisation methods.

MuJoCo Multi-agent Reinforcement Learning +3

Learning Compositional Radiance Fields of Dynamic Human Heads

1 code implementation CVPR 2021 Ziyan Wang, Timur Bagautdinov, Stephen Lombardi, Tomas Simon, Jason Saragih, Jessica Hodgins, Michael Zollhöfer

In addition, we show that the learned dynamic radiance field can be used to synthesize novel unseen expressions based on a global animation code.

NeRF Neural Rendering +1

Geometry-Aware Recurrent Neural Networks for Active Visual Recognition

no code implementations NeurIPS 2018 Ricson Cheng, Ziyan Wang, Katerina Fragkiadaki

We present recurrent geometry-aware neural networks that integrate visual information across multiple views of a scene into 3D latent feature tensors, while maintaining an one-to-one mapping between 3D physical locations in the world scene and latent feature locations.

3D Reconstruction Object +3

Semantic Photometric Bundle Adjustment on Natural Sequences

no code implementations30 Nov 2017 Rui Zhu, Chaoyang Wang, Chen-Hsuan Lin, Ziyan Wang, Simon Lucey

More recently, excellent results have been attained through the application of photometric bundle adjustment (PBA) methods -- which directly minimize the photometric error across frames.

Object Object Reconstruction

Object-Centric Photometric Bundle Adjustment with Deep Shape Prior

no code implementations4 Nov 2017 Rui Zhu, Chaoyang Wang, Chen-Hsuan Lin, Ziyan Wang, Simon Lucey

Reconstructing 3D shapes from a sequence of images has long been a problem of interest in computer vision.

Object

Virtual to Real Reinforcement Learning for Autonomous Driving

6 code implementations13 Apr 2017 Xinlei Pan, Yurong You, Ziyan Wang, Cewu Lu

To our knowledge, this is the first successful case of driving policy trained by reinforcement learning that can adapt to real world driving data.

Autonomous Driving Domain Adaptation +6

Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition

no code implementations6 Apr 2016 Ziyan Wang, Jiwen Lu, Ruogu Lin, Jianjiang Feng, Jie zhou

Specifically, we construct a pair of deep convolutional neural networks (CNNs) for the RGB and depth data, and concatenate them at the top layer of the network with a loss function which learns a new feature space where both correlated part and the individual part of the RGB-D information are well modelled.

Object Object Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.