no code implementations • 16 Nov 2023 • Qingyuan Li, Ran Meng, Yiduo Li, Bo Zhang, Liang Li, Yifan Lu, Xiangxiang Chu, Yerui Sun, Yuchen Xie
The large language model era urges faster and less costly inference.
no code implementations • 30 Aug 2023 • Qingyuan Li, Yifan Zhang, Liang Li, Peng Yao, Bo Zhang, Xiangxiang Chu, Yerui Sun, Li Du, Yuchen Xie
In this study, we propose a novel W4A8 post-training quantization method for the available open-sourced LLMs, which combines the advantages of both two recipes.
1 code implementation • The International Conference on Machine Learning (ICML) 2023 • Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, Tingwen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis
Distributed training of large deep neural networks requires frequent exchange of massive data between machines, thus communication efficiency is a major concern.
no code implementations • 17 May 2023 • Shigeng Sun, Yuchen Xie
In this paper, we propose a novel algorithm for adaptive step length selection in the classical SGD framework, which can be readily adapted to other stochastic algorithms.
1 code implementation • 11 Oct 2022 • Boris Chen, Amir Ziai, Rebecca Tucker, Yuchen Xie
A match cut is a transition between a pair of shots that uses similar framing, composition, or action to fluidly bring the viewer from one scene to the next.
no code implementations • 31 Dec 2020 • Yuchen Xie, Raghu Bollapragada, Richard Byrd, Jorge Nocedal
The motivation for this paper stems from the desire to develop an adaptive sampling method for solving constrained optimization problems in which the objective function is stochastic and the constraints are deterministic.
1 code implementation • 9 Oct 2020 • Hao-Jun Michael Shi, Yuchen Xie, Richard Byrd, Jorge Nocedal
This paper describes an extension of the BFGS and L-BFGS methods for the minimization of a nonlinear function subject to errors.
Optimization and Control
no code implementations • 1 Jan 2019 • Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang
Despite the great empirical success of deep reinforcement learning, its theoretical foundation is less well understood.
no code implementations • NeurIPS 2018 • Yi Chen, Zhuoran Yang, Yuchen Xie, Princeton Zhaoran Wang
In this paper, we study a semiparametric model where the pairwise measurements follow a natural exponential family distribution with an unknown base measure.