Search Results for author: Zhiyuan Hu

Found 19 papers, 13 papers with code

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

1 code implementation31 Aug 2024 Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, Yan Wang, Wei Shen, Qing Gu, Anh Tuan Luu, See-Kiong Ng, Zhiwei Jiang, Bryan Hooi

Large language models (LLMs) face significant challenges in handling long-context tasks because of their limited effective context window size during pretraining, which restricts their ability to generalize over extended sequences.

8k

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases

1 code implementation7 Aug 2024 Xiangyan Liu, Bo Lan, Zhiyuan Hu, Yang Liu, Zhicheng Zhang, Fei Wang, Michael Shieh, Wenmeng Zhou

Similarity-based retrieval often has low recall in complex tasks, while manual tools and APIs are typically task-specific and require expert knowledge, reducing their generalizability across diverse code tasks and real-world applications.

HumanEval Retrieval

MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

1 code implementation4 Jul 2024 Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

To address these problems, we propose MAMA, a new approach to learning video-language representations by utilizing a contrastive objective with a subtractive angular margin to regularize cross-modal representations in their effort to reach perfect similarity.

Language Modelling Question Answering +3

READ: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling

1 code implementation12 Dec 2023 Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Khoi Le, Zhiyuan Hu, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

Fully fine-tuning pretrained large-scale transformer models has become a popular paradigm for video-language modeling tasks, such as temporal language grounding and video-language summarization.

Language Modelling Transfer Learning

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration

1 code implementation14 Nov 2023 Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See Kiong Ng, Jiashi Feng

Large Language Models (LLMs) have marked a significant advancement in the field of natural language processing, demonstrating exceptional capabilities in reasoning, tool usage, and memory.

Benchmarking Language Modelling +1

Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

no code implementations16 Sep 2023 Zhiyuan Hu, Yue Feng, Yang Deng, Zekun Li, See-Kiong Ng, Anh Tuan Luu, Bryan Hooi

Recently, the development of large language models (LLMs) has been significantly enhanced the question answering and dialogue generation, and makes them become increasingly popular in current practical scenarios.

Dialogue Generation Language Modelling +3

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs

1 code implementation22 Jun 2023 Miao Xiong, Zhiyuan Hu, Xinyang Lu, Yifei Li, Jie Fu, Junxian He, Bryan Hooi

To better break down the problem, we define a systematic framework with three components: prompting strategies for eliciting verbalized confidence, sampling methods for generating multiple responses, and aggregation techniques for computing consistency.

Arithmetic Reasoning Benchmarking +1

Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

1 code implementation16 Jun 2023 Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, Aldo Lipani

This approach uses LLM as annotation-free user simulator to assess dialogue responses, combining them with smaller fine-tuned end-to-end TOD models.

Language Modelling Large Language Model

PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation

1 code implementation14 Jun 2023 Zhiyuan Hu, Chumin Liu, Yue Feng, Anh Tuan Luu, Bryan Hooi

Controllable text generation is a challenging and meaningful field in natural language generation (NLG).

Denoising Sentence +1

POP: Prompt Of Prompts for Continual Learning

no code implementations14 Jun 2023 Zhiyuan Hu, Jiancheng Lyu, Dashan Gao, Nuno Vasconcelos

We show that a foundation model equipped with POP learning is able to outperform classic CL methods by a significant margin.

Continual Learning Open-Ended Question Answering

Modeling User Satisfaction Dynamics in Dialogue via Hawkes Process

1 code implementation21 May 2023 Fanghua Ye, Zhiyuan Hu, Emine Yilmaz

It assumes that the performance of a dialogue system can be measured by user satisfaction and uses an estimator to simulate users.

Dense Network Expansion for Class Incremental Learning

no code implementations CVPR 2023 Zhiyuan Hu, Yunsheng Li, Jiancheng Lyu, Dashan Gao, Nuno Vasconcelos

This is accomplished by the introduction of dense connections between the intermediate layers of the task expert networks, that enable the transfer of knowledge from old to new tasks via feature sharing and reusing.

class-incremental learning Class Incremental Learning +1

Safe Reinforcement Learning of Dynamic High-Dimensional Robotic Tasks: Navigation, Manipulation, Interaction

no code implementations27 Sep 2022 Puze Liu, Kuo Zhang, Davide Tateo, Snehal Jauhri, Zhiyuan Hu, Jan Peters, Georgia Chalvatzaki

Our proposed approach achieves state-of-the-art performance in simulated high-dimensional and dynamic tasks while avoiding collisions with the environment.

reinforcement-learning Reinforcement Learning +3

Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction

1 code implementation17 Nov 2019 Haozhe Wu, Zhiyuan Hu, Jia Jia, Yaohua Bu, Xiangnan He, Tat-Seng Chua

Next, we define user's attributes as two categories: spatial attributes (e. g., social role of user) and temporal attributes (e. g., post content of user).

Informativeness

Syntax Encoding with Application in Authorship Attribution

no code implementations EMNLP 2018 Richong Zhang, Zhiyuan Hu, Hongyu Guo, Yongyi Mao

We propose a novel strategy to encode the syntax parse tree of sentence into a learnable distributed representation.

Authorship Attribution Benchmarking +3

Cannot find the paper you are looking for? You can Submit a new open access paper.