Search Results for author: Zhihai Wang

Found 12 papers, 3 papers with code

HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking

no code implementations • 5 May 2025 • Runquan Gui, Zhihai Wang, Jie Wang, Chi Ma, HuiLing Zhen, Mingxuan Yuan, Jianye Hao, Defu Lian, Enhong Chen, Feng Wu

However, these methods encounter challenges with complex planning tasks, primarily due to extended reasoning steps, diverse constraints, and the challenge of handling multiple distinct sub-tasks.

Logical Reasoning

Accelerating Large Language Model Reasoning via Speculative Search

no code implementations • 3 May 2025 • Zhihai Wang, Jie Wang, Jilai Pan, Xilin Xia, HuiLing Zhen, Mingxuan Yuan, Jianye Hao, Feng Wu

Experiments on both the Qwen and Llama models demonstrate that SpecSearch significantly outperforms state-of-the-art approaches, achieving up to 2.12$\times$ speedup with comparable reasoning quality.

Language Modeling, Language Modelling +1

AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference

1 code implementation • 6 Feb 2025 • Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Chen Chen, Lei Chen, Xianzhi Yu, Wulong Liu, Jianye Hao, Mingxuan Yuan, Bin Li

With the development of large language models (LLMs), efficient inference through Key-Value (KV) cache compression has attracted considerable attention, especially for long-context generation.

Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms

no code implementations • 3 Jul 2024 • Zhihai Wang, Zijie Geng, Zhaojie Tu, Jie Wang, Yuxi Qian, Zhexuan Xu, Ziyan Liu, Siyuan Xu, Zhentao Tang, Shixiong Kai, Mingxuan Yuan, Jianye Hao, Bin Li, Yongdong Zhang, Feng Wu

We executed six state-of-the-art AI-based chip placement algorithms on these designs and plugged the results of each single-point algorithm into the physical implementation workflow to obtain the final PPA results.

Benchmarking

Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming

no code implementations • 19 Apr 2024 • Jie Wang, Zhihai Wang, Xijun Li, Yufei Kuang, Zhihao Shi, Fangzhou Zhu, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

Moreover, we observe that (P3) what order of selected cuts to prefer significantly impacts the efficiency of MILP solvers as well.

Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming

no code implementations • 31 Jan 2024 • Haotian Ling, Zhihai Wang, Jie Wang

A key problem for cuts is when to stop cuts generation, which is important for the efficiency of solving MILPs.

Decision Making

Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

no code implementations • 11 Jan 2024 • Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao, Kun Mao

To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional optimization techniques.

Decision Making, Management

Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning

no code implementations • 18 Oct 2023 • Yufei Kuang, Xijun Li, Jie Wang, Fangzhou Zhu, Meng Lu, Zhihai Wang, Jia Zeng, Houqiang Li, Yongdong Zhang, Feng Wu

Specifically, we formulate the routine design task as a Markov decision process and propose an RL framework with adaptive action sequences to generate high-quality presolve routines efficiently.

reinforcement-learning, Reinforcement Learning +1

A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design

1 code implementation • 22 Aug 2023 • Zhihai Wang, Lei Chen, Jie Wang, Xing Li, Yinqi Bai, Xijun Li, Mingxuan Yuan, Jianye Hao, Yongdong Zhang, Feng Wu

In particular, we notice that the runtime of the Resub and Mfs2 operators often dominates the overall runtime of LS optimization processes.

Domain Generalization

Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model

no code implementations • 1 Feb 2023 • Zhihai Wang, Xijun Li, Jie Wang, Yufei Kuang, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

Cut selection -- which aims to select a proper subset of the candidate cuts to improve the efficiency of solving MILPs -- heavily depends on (P1) which cuts should be preferred, and (P2) how many cuts should be selected.

Efficient Exploration in Resource-Restricted Reinforcement Learning

no code implementations • 14 Dec 2022 • Zhihai Wang, Taoxing Pan, Qi Zhou, Jie Wang

In many real-world applications of reinforcement learning (RL), performing actions requires consuming certain types of resources that are non-replenishable in each episode.

Efficient Exploration, reinforcement-learning +2
