Search Results for author: Huang Bojun

Found 5 papers, 0 papers with code

Utility-Probability Duality of Neural Networks

no code implementations • 24 May 2023 • Huang Bojun, Fei Yuan

In this perspective, training of the neural network corresponds to a utility learning process.

Paper
Add Code

Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)

no code implementations • 22 Jul 2022 • Huang Bojun

The paper then turns to demonstrate a symmetry breaking phenomenon regarding the optimality of the Lagrangian saddle points, which justifies a largely overlooked direction in developing the Lagrangian method.

Imitation Learning Machine Translation +1

Paper
Add Code

Lagrangian Method for Episodic Learning

no code implementations • 29 Sep 2021 • Huang Bojun

Our paper brings new perspectives to this general approach in the following aspects: 1) Inspired by the usually-used linear $V$-form Lagrangian, we proposed a nonlinear $Q$-form Lagrangian function and proved that it enjoys strong duality property in spite of its nonlinearity.

Imitation Learning Machine Translation

Paper
Add Code

Simpson's Bias in NLP Training

no code implementations • 13 Mar 2021 • Fei Yuan, Longtu Zhang, Huang Bojun, Yaobo Liang

In most machine learning tasks, we evaluate a model $M$ on a given data population $S$ by measuring a population-level metric $F(S;M)$.

Multi-class Classification Sentence +1

Paper
Add Code

Steady State Analysis of Episodic Reinforcement Learning

no code implementations • NeurIPS 2020 • Huang Bojun

This paper proves that the episodic learning environment of every finite-horizon decision task has a unique steady state under any behavior policy, and that the marginal distribution of the agent's input indeed converges to the steady-state distribution in essentially all episodic learning processes.

Continual Learning reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.