Search Results for author: Yuchen Xie

Found 9 papers, 3 papers with code

A Speed Odyssey for Deployable Quantization of LLMs

no code implementations • 16 Nov 2023 • Qingyuan Li, Ran Meng, Yiduo Li, Bo Zhang, Liang Li, Yifan Lu, Xiangxiang Chu, Yerui Sun, Yuchen Xie

The large language model era urges faster and less costly inference.

Language Modelling Large Language Model +2

Paper
Add Code

FPTQ: Fine-grained Post-Training Quantization for Large Language Models

no code implementations • 30 Aug 2023 • Qingyuan Li, Yifan Zhang, Liang Li, Peng Yao, Bo Zhang, Xiangxiang Chu, Yerui Sun, Li Du, Yuchen Xie

In this study, we propose a novel W4A8 post-training quantization method for the available open-sourced LLMs, which combines the advantages of both two recipes.

Quantization

Paper
Add Code

SLAMB: Accelerated Large Batch Training with Sparse Communication

1 code implementation • The International Conference on Machine Learning (ICML) 2023 • Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, Tingwen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis

Distributed training of large deep neural networks requires frequent exchange of massive data between machines, thus communication efficiency is a major concern.

Paper
Code

Stochastic Ratios Tracking Algorithm for Large Scale Machine Learning Problems

no code implementations • 17 May 2023 • Shigeng Sun, Yuchen Xie

In this paper, we propose a novel algorithm for adaptive step length selection in the classical SGD framework, which can be readily adapted to other stochastic algorithms.

Paper
Add Code

Match Cutting: Finding Cuts with Smooth Visual Transitions

1 code implementation • 11 Oct 2022 • Boris Chen, Amir Ziai, Rebecca Tucker, Yuchen Xie

A match cut is a transition between a pair of shots that uses similar framing, composition, or action to fluidly bring the viewer from one scene to the next.

Metric Learning

Paper
Code

Constrained and Composite Optimization via Adaptive Sampling Methods

no code implementations • 31 Dec 2020 • Yuchen Xie, Raghu Bollapragada, Richard Byrd, Jorge Nocedal

The motivation for this paper stems from the desire to develop an adaptive sampling method for solving constrained optimization problems in which the objective function is stochastic and the constraints are deterministic.

Paper
Add Code

A Noise-Tolerant Quasi-Newton Algorithm for Unconstrained Optimization

1 code implementation • 9 Oct 2020 • Hao-Jun Michael Shi, Yuchen Xie, Richard Byrd, Jorge Nocedal

This paper describes an extension of the BFGS and L-BFGS methods for the minimization of a nonlinear function subject to errors.

Optimization and Control

Paper
Code

A Theoretical Analysis of Deep Q-Learning

no code implementations • 1 Jan 2019 • Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang

Despite the great empirical success of deep reinforcement learning, its theoretical foundation is less well understood.

Q-Learning

Paper
Add Code

Contrastive Learning from Pairwise Measurements

no code implementations • NeurIPS 2018 • Yi Chen, Zhuoran Yang, Yuchen Xie, Princeton Zhaoran Wang

In this paper, we study a semiparametric model where the pairwise measurements follow a natural exponential family distribution with an unknown base measure.

Contrastive Learning Data Augmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.