1 code implementation • ICML 2020 • Yu-Sheng Li, Wei-Lin Chiang, Ching-pei Lee
Expensive inter-machine communication is the bottleneck of distributed optimization.
2 code implementations • 21 Mar 2024 • Zih-Syuan Huang, Ching-pei Lee
We propose a Regularized Adaptive Momentum Dual Averaging (RAMDA) algorithm for training structured neural networks.
no code implementations • 29 Apr 2022 • Ching-pei Lee, Ling Liang, Tianyun Tang, Kim-Chuan Toh
This work proposes a rapid algorithm, BM-Global, for nuclear-norm-regularized convex and low-rank matrix optimization problems.
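BM-Global itself is not reproduced here, but the standard building block of nuclear-norm-regularized optimization is the proximal operator of the nuclear norm, which soft-thresholds the singular values and thereby promotes low rank; a minimal NumPy sketch (generic, not the paper's algorithm):

```python
import numpy as np

def prox_nuclear(X, lam):
    """Proximal operator of lam * ||X||_* :
    soft-threshold the singular values of X by lam."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    s_thresh = np.maximum(s - lam, 0.0)  # shrink each singular value
    return U @ np.diag(s_thresh) @ Vt

# Small singular values are zeroed, reducing the rank.
A = np.diag([3.0, 1.0, 0.2])
B = prox_nuclear(A, 0.5)  # singular values become 2.5, 0.5, 0.0
```

Here the third singular value (0.2) falls below the threshold 0.5 and is set to zero, so the result has rank 2.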
2 code implementations • ICLR 2022 • Zih-Syuan Huang, Ching-pei Lee
This paper proposes an algorithm (RMDA) for training neural networks (NNs) with a regularization term for promoting desired structures.
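The details of RMDA are not given in this snippet, but structure-promoting regularizers of this kind are typically handled through their proximal operator; a minimal sketch of one common choice, the group lasso, which zeroes out whole columns of a weight matrix (a generic illustration, not the paper's method):

```python
import numpy as np

def prox_group_lasso(W, lam):
    """Proximal operator of lam * sum_j ||W[:, j]||_2 :
    shrink each column's norm by lam, zeroing columns
    whose norm falls below lam (column-wise group sparsity)."""
    norms = np.linalg.norm(W, axis=0, keepdims=True)
    scale = np.maximum(1.0 - lam / np.maximum(norms, 1e-12), 0.0)
    return W * scale

W = np.array([[3.0, 0.1],
              [4.0, 0.1]])
W_new = prox_group_lasso(W, 1.0)
# column 0 (norm 5) is shrunk; column 1 (norm ~0.14) is zeroed entirely
```

Zeroing entire columns rather than individual entries is what makes the induced sparsity "structured": a zero column can be removed from the network, unlike scattered zero weights.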
no code implementations • 29 Sep 2021 • Zih-Syuan Huang, Ching-pei Lee
Stochastic gradient descent with momentum (SGD+M) is widely used to empirically improve the convergence behavior and the generalization performance of plain stochastic gradient descent (SGD) when training deep learning models, but our theoretical understanding of SGD+M is still very limited.
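The SGD+M (heavy-ball) update referred to above is standard and can be sketched as follows; the step size and momentum coefficient here are illustrative, and the quadratic objective is just a toy example:

```python
import numpy as np

def sgd_momentum_step(w, v, grad, lr=0.1, beta=0.9):
    """One heavy-ball SGD+M update:
    v <- beta * v + grad ;  w <- w - lr * v."""
    v = beta * v + grad
    w = w - lr * v
    return w, v

# Toy example: minimize f(w) = 0.5 * ||w||^2, whose gradient is w.
w = np.ones(2)
v = np.zeros(2)
for _ in range(100):
    w, v = sgd_momentum_step(w, v, grad=w)
# w converges toward the minimizer at the origin
```

In actual training, `grad` would be a stochastic mini-batch gradient rather than the exact gradient, which is where the analysis becomes difficult.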
no code implementations • 4 Dec 2020 • Ching-pei Lee
We show that for a wide class of degenerate solutions, ISQA+ possesses superlinear convergence not only in iterations but also in running time, because the cost per iteration is bounded.
Optimization and Control
1 code implementation • 12 Dec 2019 • Ching-pei Lee, Cong Han Lim, Stephen J. Wright
When applied to the distributed dual ERM problem, unlike state-of-the-art methods that use only the block-diagonal part of the Hessian, our approach utilizes global curvature information and is thus orders of magnitude faster.
1 code implementation • 4 Mar 2018 • Ching-pei Lee, Cong Han Lim, Stephen J. Wright
Initial computational results on convex problems demonstrate that our method significantly reduces communication cost and running time compared with current state-of-the-art methods.
no code implementations • 13 Jun 2015 • Ching-pei Lee
In this document, we show that the algorithm CoCoA+ (Ma et al., ICML 2015), under the setting used in its experiments (which is also the best setting suggested by its authors), is equivalent to the practical variant of DisDCA (Yang, NIPS 2013).
no code implementations • 8 Jun 2015 • Ching-pei Lee, Kai-Wei Chang, Shyam Upadhyay, Dan Roth
Training structured prediction models is time-consuming.