Search Results for author: Yuchen Xie

Found 9 papers, 3 papers with code

FPTQ: Fine-grained Post-Training Quantization for Large Language Models

no code implementations30 Aug 2023 Qingyuan Li, Yifan Zhang, Liang Li, Peng Yao, Bo Zhang, Xiangxiang Chu, Yerui Sun, Li Du, Yuchen Xie

In this study, we propose a novel W4A8 post-training quantization method for the available open-sourced LLMs, which combines the advantages of both two recipes.

Quantization

SLAMB: Accelerated Large Batch Training with Sparse Communication

1 code implementation The International Conference on Machine Learning (ICML) 2023 Hang Xu, Wenxuan Zhang, Jiawei Fei, Yuzhe Wu, Tingwen Xie, Jun Huang, Yuchen Xie, Mohamed Elhoseiny, Panos Kalnis

Distributed training of large deep neural networks requires frequent exchange of massive data between machines, thus communication efficiency is a major concern.

Stochastic Ratios Tracking Algorithm for Large Scale Machine Learning Problems

no code implementations17 May 2023 Shigeng Sun, Yuchen Xie

In this paper, we propose a novel algorithm for adaptive step length selection in the classical SGD framework, which can be readily adapted to other stochastic algorithms.

Match Cutting: Finding Cuts with Smooth Visual Transitions

1 code implementation11 Oct 2022 Boris Chen, Amir Ziai, Rebecca Tucker, Yuchen Xie

A match cut is a transition between a pair of shots that uses similar framing, composition, or action to fluidly bring the viewer from one scene to the next.

Metric Learning

Constrained and Composite Optimization via Adaptive Sampling Methods

no code implementations31 Dec 2020 Yuchen Xie, Raghu Bollapragada, Richard Byrd, Jorge Nocedal

The motivation for this paper stems from the desire to develop an adaptive sampling method for solving constrained optimization problems in which the objective function is stochastic and the constraints are deterministic.

A Noise-Tolerant Quasi-Newton Algorithm for Unconstrained Optimization

1 code implementation9 Oct 2020 Hao-Jun Michael Shi, Yuchen Xie, Richard Byrd, Jorge Nocedal

This paper describes an extension of the BFGS and L-BFGS methods for the minimization of a nonlinear function subject to errors.

Optimization and Control

A Theoretical Analysis of Deep Q-Learning

no code implementations1 Jan 2019 Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang

Despite the great empirical success of deep reinforcement learning, its theoretical foundation is less well understood.

Q-Learning

Contrastive Learning from Pairwise Measurements

no code implementations NeurIPS 2018 Yi Chen, Zhuoran Yang, Yuchen Xie, Princeton Zhaoran Wang

In this paper, we study a semiparametric model where the pairwise measurements follow a natural exponential family distribution with an unknown base measure.

Contrastive Learning Data Augmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.