Search Results for author: Zih-Syuan Huang

Found 3 papers, 2 papers with code

Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact Subproblem Solver for Training Structured Neural Network

2 code implementations21 Mar 2024 Zih-Syuan Huang, Ching-pei Lee

We propose a Regularized Adaptive Momentum Dual Averaging (RAMDA) algorithm for training structured neural networks.

Language Modelling

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

2 code implementations ICLR 2022 Zih-Syuan Huang, Ching-pei Lee

This paper proposes an algorithm (RMDA) for training neural networks (NNs) with a regularization term for promoting desired structures.

Data Augmentation

Momentum as Variance-Reduced Stochastic Gradient

no code implementations29 Sep 2021 Zih-Syuan Huang, Ching-pei Lee

Stochastic gradient descent with momentum (SGD+M) is widely used to empirically improve the convergence behavior and the generalization performance of plain stochastic gradient descent (SGD) in the training of deep learning models, but our theoretical understanding for SGD+M is still very limited.

Data Augmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.