Search Results for author: Qiaosheng Zhang

Found 21 papers, 4 papers with code

Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute

1 code implementation1 Apr 2025 Jianhao Chen, Zishuo Xun, Bocheng Zhou, Han Qi, Hangfan Zhang, Qiaosheng Zhang, Yang Chen, Wei Hu, Yuzhong Qu, Wanli Ouyang, Shuyue Hu

This paper presents a simple, effective, and cost-efficient strategy to improve LLM performance by scaling test-time compute.

If Multi-Agent Debate is the Answer, What is the Question?

no code implementations12 Feb 2025 Hangfan Zhang, Zhiyao Cui, Xinrun Wang, Qiaosheng Zhang, Zhen Wang, Dinghao Wu, Shuyue Hu

Multi-agent debate (MAD) has emerged as a promising approach to enhance the factual accuracy and reasoning quality of large language models (LLMs) by engaging multiple agents in iterative discussions during inference.

Online Preference Alignment for Language Models via Count-based Exploration

1 code implementation22 Jan 2025 Chenjia Bai, Yang Zhang, Shuang Qiu, Qiaosheng Zhang, Kang Xu, Xuelong Li

Then, we reformulate our objective to direct preference optimization with an exploration term, where the UCB-term can be converted to a count-based exploration bonus.

Instruction Following

Community Detection for Contextual-LSBM: Theoretical Limitations of Misclassification Rate and Efficient Algorithms

no code implementations19 Jan 2025 Dian Jin, Yuqian Zhang, Qiaosheng Zhang

The integration of network information and node attribute information has recently gained significant attention in the community detection literature.

Attribute Community Detection +1

Graph Attention is Not Always Beneficial: A Theoretical Analysis of Graph Attention Mechanisms via Contextual Stochastic Block Models

no code implementations20 Dec 2024 Zhongtian Ma, Qiaosheng Zhang, Bocheng Zhou, Yexin Zhang, Shuyue Hu, Zhen Wang

Specifically, by appropriately defining \emph{structure noise} and \emph{feature noise} in graphs, we show that graph attention mechanisms can enhance classification performance when structure noise exceeds feature noise.

Graph Attention Node Classification

A Fast Convergence Theory for Offline Decision Making

no code implementations3 Jun 2024 Chenjie Mao, Qiaosheng Zhang

This paper proposes the first generic fast convergence result in general function approximation for offline decision making problems, which include offline reinforcement learning (RL) and off-policy evaluation (OPE) as special cases.

Decision Making Offline RL +1

Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning

no code implementations12 May 2024 Changhong Wang, Xudong Yu, Chenjia Bai, Qiaosheng Zhang, Zhen Wang

To address this problem, our work builds upon the investigation of successor representations for task generalization in online RL and extends the framework to incorporate offline-to-online learning.

Offline RL Reinforcement Learning (RL) +1

Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

no code implementations30 Apr 2024 Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

Finally, we extend Reg-MAIDS to multi-player general-sum MGs and prove that it can learn either the Nash equilibrium or coarse correlated equilibrium in a sample efficient manner.

Multi-agent Reinforcement Learning

Community Detection in the Multi-View Stochastic Block Model

no code implementations17 Jan 2024 Yexin Zhang, Zhongtian Ma, Qiaosheng Zhang, Zhen Wang, Xuelong Li

This paper considers the problem of community detection on multiple potentially correlated graphs from an information-theoretical perspective.

Community Detection Stochastic Block Model

Matrix Completion with Hypergraphs:Sharp Thresholds and Efficient Algorithms

no code implementations16 Jan 2024 Zhongtian Ma, Qiaosheng Zhang, Zhen Wang

Theoretical analyses show that our algorithm succeeds with high probability as long as the sample probability exceeds the aforementioned threshold, and this theoretical result is further validated by synthetic experiments.

Matrix Completion

Mutual Information Learned Regressor: an Information-theoretic Viewpoint of Training Regression Systems

no code implementations23 Nov 2022 Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao, Yusen He, Yaohua Wang

We first argue that the MSE minimization approach is equivalent to a conditional entropy learning problem, and then propose a mutual information learning formulation for solving regression problems by using a reparameterization technique.

regression

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

no code implementations3 Oct 2022 Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao

Deep learning systems have been reported to acheive state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well trained classifiers on benchmark datasets which can be used as backbone feature extractors in downstream tasks.

Binary Classification Data Augmentation

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

no code implementations21 Sep 2022 Jirong Yi, Qiaosheng Zhang, Zhen Chen, Qiao Liu, Wei Shao

Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets.

Binary Classification

Exact Recovery in the General Hypergraph Stochastic Block Model

no code implementations11 May 2021 Qiaosheng Zhang, Vincent Y. F. Tan

This paper investigates fundamental limits of exact recovery in the general d-uniform hypergraph stochastic block model (d-HSBM), wherein n nodes are partitioned into k disjoint communities with relative sizes (p1,..., pk).

Clustering Stochastic Block Model

MC2G: An Efficient Algorithm for Matrix Completion with Social and Item Similarity Graphs

no code implementations8 Jun 2020 Qiaosheng Zhang, Geewon Suh, Changho Suh, Vincent Y. F. Tan

In this paper, we design and analyze MC2G (Matrix Completion with 2 Graphs), an algorithm that performs matrix completion in the presence of social and item similarity graphs.

Clustering Matrix Completion +1

Optimal Change-Point Detection with Training Sequences in the Large and Moderate Deviations Regimes

no code implementations13 Mar 2020 Haiyun He, Qiaosheng Zhang, Vincent Y. F. Tan

This paper investigates a novel offline change-point detection problem from an information-theoretic perspective.

Change Point Detection

Community Detection and Matrix Completion with Social and Item Similarity Graphs

no code implementations6 Dec 2019 Qiaosheng Zhang, Vincent Y. F. Tan, Changho Suh

We consider the problem of recovering a binary rating matrix as well as clusters of users and items based on a partially observed matrix together with side-information in the form of social and item similarity graphs.

Community Detection Matrix Completion +1

Cannot find the paper you are looking for? You can Submit a new open access paper.