no code implementations • 3 Sep 2022 • Hideitsu Hino, Shotaro Akaho, Noboru Murata
The Expectation--Maximization (EM) algorithm is a simple meta-algorithm that has been used for many years as a methodology for statistical inference when there are missing measurements in the observed data or when the data is composed of observables and unobservables.
no code implementations • 17 Feb 2022 • Kazuya Takabatake, Shotaro Akaho
The FSLL model is a "highest-order" Boltzmann machine; nevertheless, we can compute the dual parameters of the model distribution, which plays important roles in exponential families, in $O(|X|\log|X|)$ time.
no code implementations • ICLR 2022 • Ryo Karakida, Shotaro Akaho
Even for the same target, the trained model shows some transfer and forgetting depending on the sample size of each task.
no code implementations • 15 Jul 2021 • Hideaki Ishibashi, Shotaro Akaho
This paper proposes an extension of principal component analysis for Gaussian process (GP) posteriors, denoted by GP-PCA.
no code implementations • 2 Jul 2021 • Kazuya Takabatake, Shotaro Akaho
Like Bayesian networks, the structure of a dependency network is represented by a directed graph, and each node has a conditional probability table.
no code implementations • 14 Oct 2019 • Ryo Karakida, Shotaro Akaho, Shun-ichi Amari
The Fisher information matrix (FIM) plays an essential role in statistics and machine learning as a Riemannian metric tensor or a component of the Hessian matrix of loss functions.
no code implementations • 27 Sep 2019 • Shotaro Akaho, Hideitsu Hino, Noboru Murata
In this paper, we examine a geometrical projection algorithm for statistical inference.
no code implementations • NeurIPS 2019 • Ryo Karakida, Shotaro Akaho, Shun-ichi Amari
Thus, we can conclude that batch normalization in the last layer significantly contributes to decreasing the sharpness induced by the FIM.
no code implementations • 4 Jun 2018 • Ryo Karakida, Shotaro Akaho, Shun-ichi Amari
The Fisher information matrix (FIM) is a fundamental quantity to represent the characteristics of a stochastic model, including deep neural networks (DNNs).
no code implementations • 13 Sep 2006 • Shotaro Akaho
Canonical correlation analysis is a technique to extract common features from a pair of multivariate data.