Search Results for author: Wenjun Li

Found 14 papers, 5 papers with code

Understanding R1-Zero-Like Training: A Critical Perspective

1 code implementation26 Mar 2025 Zichen Liu, Changyu Chen, Wenjun Li, Penghui Qi, Tianyu Pang, Chao Du, Wee Sun Lee, Min Lin

DeepSeek-R1-Zero has shown that reinforcement learning (RL) at scale can directly enhance the reasoning capabilities of LLMs without supervised fine-tuning.

Reinforcement Learning (RL)

Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger

no code implementations18 Feb 2025 Wenjun Li, Dexun Li, Kuicai Dong, Cong Zhang, Hao Zhang, Weiwen Liu, Yasheng Wang, Ruiming Tang, Yong liu

Large language models (LLMs) have shown remarkable emergent capabilities, transforming the execution of functional tasks by leveraging external tools for complex problems that require specialized processing or real-time data.

Decision Making

Improving Environment Novelty Quantification for Effective Unsupervised Environment Design

no code implementations8 Feb 2025 Jayden Teoh, Wenjun Li, Pradeep Varakantham

Unsupervised Environment Design (UED) formalizes the problem of autocurricula through interactive training between a teacher agent and a student agent.

A Survey of Foundation Models for Music Understanding

no code implementations15 Sep 2024 Wenjun Li, Ying Cai, Ziyang Wu, Wenyi Zhang, Yifan Chen, Rundong Qi, Mengqi Dong, Peigen Chen, Xiao Dong, Fenghao Shi, Lei Guo, Junwei Han, Bao Ge, Tianming Liu, Lin Gan, Tuo Zhang

Music is essential in daily life, fulfilling emotional and entertainment needs, and connecting us personally, socially, and culturally.

Survey

Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval

no code implementations21 Jun 2024 Wenjun Li, Shudong Wang, Dong Zhao, Shenghui Xu, Zhaoming Pan, Zhimin Zhang

To address this, we propose a novel multi-granularity feature interaction module called MGFI, consisting of text-frame and word-frame, for video-text representations alignment.

Retrieval Sentence +3

Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning

no code implementations15 Jun 2024 Wenjun Li, Changyu Chen, Pradeep Varakantham

To address this challenge, we propose the Maximum Diversity Fine-Tuning (MDFT) strategy to improve the sample efficiency of fine-tuning in the planning domain.

Diversity valid

Controllable Talking Face Generation by Implicit Facial Keypoints Editing

1 code implementation5 Jun 2024 Dong Zhao, Jiaying Shi, Wenjun Li, Shudong Wang, Shenghui Xu, Zhaoming Pan

Audio-driven talking face generation has garnered significant interest within the domain of digital human research.

Talking Face Generation

ODD: A Benchmark Dataset for the Natural Language Processing based Opioid Related Aberrant Behavior Detection

1 code implementation5 Jul 2023 Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee L. Sung, Joel I. Reisman, Wenjun Li, Robert D. Kerns, William Becker, Hong Yu

Experimental results show that the prompt-tuning models outperformed the fine-tuning models in most categories and the gains were especially higher among uncommon categories (Suggested Aberrant Behavior, Confirmed Aberrant Behaviors, Diagnosed Opioid Dependence, and Medication Change).

ChatABL: Abductive Learning via Natural Language Interaction with ChatGPT

no code implementations21 Apr 2023 Tianyang Zhong, Yaonai Wei, Li Yang, Zihao Wu, Zhengliang Liu, Xiaozheng Wei, Wenjun Li, Junjie Yao, Chong Ma, Xiang Li, Dajiang Zhu, Xi Jiang, Junwei Han, Dinggang Shen, Tianming Liu, Tuo Zhang

The proposed method uses the strengths of LLMs' understanding and logical reasoning to correct the incomplete logical facts for optimizing the performance of perceptual module, by summarizing and reorganizing reasoning rules represented in natural language format.

Decipherment Logical Reasoning

Diversity Induced Environment Design via Self-Play

no code implementations4 Feb 2023 Dexun Li, Wenjun Li, Pradeep Varakantham

In this paper, we aim to introduce diversity in the Unsupervised Environment Design (UED) framework.

Diversity

Generalization through Diversity: Improving Unsupervised Environment Design

no code implementations19 Jan 2023 Wenjun Li, Pradeep Varakantham, Dexun Li

Agent decision making using Reinforcement Learning (RL) heavily relies on either a model or simulator of the environment (e. g., moving in an 8x8 maze with three rooms, playing Chess on an 8x8 board).

Decision Making Diversity +1

Facilitating human-wildlife cohabitation through conflict prediction

no code implementations22 Sep 2021 Susobhan Ghosh, Pradeep Varakantham, Aniket Bhatkhande, Tamanna Ahmad, Anish Andheria, Wenjun Li, Aparna Taneja, Divy Thakkar, Milind Tambe

With increasing world population and expanded use of forests as cohabited regions, interactions and conflicts with wildlife are increasing, leading to large-scale loss of lives (animal and human) and livelihoods (economic).

Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.