
no code implementations • 24 May 2023 • Rishi Sonthalia, Anna Seigal, Guido Montufar

We define the supermodular rank of a function on a lattice.

1 code implementation • 15 Nov 2022 • Michael Murray, Hui Jin, Benjamin Bowman, Guido Montufar

We provide expressions for the coefficients of this power series, which depend on both the Hermite coefficients of the activation function and the depth of the network.

no code implementations • 6 Jun 2022 • Benjamin Bowman, Guido Montufar

This bias depends only on the model architecture and the input distribution; it does not depend on the target function, which need not lie in the RKHS of the kernel.

no code implementations • ICLR 2022 • Benjamin Bowman, Guido Montufar

We study the dynamics of a neural network in function space when optimizing the mean squared error via gradient flow.

no code implementations • ICLR Workshop Neural_Compression 2021 • Pradeep Kr. Banerjee, Guido Montufar

We point out that a number of well-known PAC-Bayesian-style and information-theoretic generalization bounds for randomized learning algorithms can be derived under a common framework starting from a fundamental information exponential inequality.

no code implementations • ICLR 2019 • Alex Tong Lin, Wuchen Li, Stanley Osher, Guido Montufar

We introduce a new method for training generative adversarial networks by applying the Wasserstein-2 metric proximal on the generators.

1 code implementation • 13 Feb 2021 • Xuebin Zheng, Bingxin Zhou, Junbin Gao, Yu Guang Wang, Pietro Lio, Ming Li, Guido Montufar

The graph neural networks with the proposed framelet convolution and pooling achieve state-of-the-art performance in many node and graph prediction tasks.

no code implementations • 21 Dec 2020 • Quynh Nguyen, Marco Mondelli, Guido Montufar

In this paper, we provide tight bounds on the smallest eigenvalue of NTK matrices for deep ReLU nets, both in the limiting case of infinite widths and for finite widths.
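As a rough illustration of the object being bounded, the sketch below forms the empirical NTK Gram matrix of a small finite-width two-layer ReLU network with random weights and checks its smallest eigenvalue. All sizes and variable names here are illustrative, not taken from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, m = 8, 5, 512                                # samples, input dim, hidden width
X = rng.standard_normal((n, d))
W = rng.standard_normal((m, d)) / np.sqrt(d)       # first-layer weights
a = rng.choice([-1.0, 1.0], size=m) / np.sqrt(m)   # fixed output weights

def jacobian_row(x):
    """Gradient of f(x) = a^T relu(W x) with respect to W, flattened."""
    act = (W @ x > 0).astype(float)                # ReLU derivative (0/1 mask)
    return np.outer(a * act, x).ravel()

J = np.stack([jacobian_row(x) for x in X])
K = J @ J.T                                        # empirical NTK Gram matrix
lam_min = np.linalg.eigvalsh(K)[0]                 # smallest eigenvalue
```

Since K is a Gram matrix it is positive semidefinite, so `lam_min` is nonnegative up to floating-point error; the paper's contribution is a tight quantitative lower bound on this quantity for deep ReLU nets.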

1 code implementation • ICLR 2020 • Michael Arbel, Arthur Gretton, Wuchen Li, Guido Montufar

Many machine learning problems can be expressed as the optimization of some cost functional over a parametric family of probability distributions.

1 code implementation • ICML 2020 • Yu Guang Wang, Ming Li, Zheng Ma, Guido Montufar, Xiaosheng Zhuang, Yanan Fan

Deep Graph Neural Networks (GNNs) are useful models for graph classification and graph-based regression tasks.

no code implementations • 25 Sep 2019 • Yu Guang Wang, Ming Li, Zheng Ma, Guido Montufar, Xiaosheng Zhuang, Yanan Fan

The input of each pooling layer is transformed by the compressive Haar basis of the corresponding clustering.

no code implementations • 25 Sep 2019 • Yonatan Dukler, Quanquan Gu, Guido Montufar

We present a proof of convergence for ReLU networks trained with weight normalization.
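For context, weight normalization reparametrizes each weight vector as w = g·v/‖v‖, decoupling its scale g from its direction v (Salimans & Kingma, 2016). A minimal sketch, with a hypothetical helper name:

```python
import numpy as np

def weight_norm(v, g):
    """Weight-normalized vector w = g * v / ||v||: scale g, direction from v."""
    return g * v / np.linalg.norm(v)

v = np.array([3.0, 4.0])
w = weight_norm(v, g=2.0)
# ||w|| equals g by construction; the direction of w matches v.
```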

no code implementations • 15 Sep 2019 • Alex Tong Lin, Yonatan Dukler, Wuchen Li, Guido Montufar

We propose regularization strategies for learning discriminative models that are robust to in-class variations of the input data.

no code implementations • 6 Feb 2019 • Alex Tong Lin, Mark J. Debord, Katia Estabridis, Gary Hewer, Guido Montufar, Stanley Osher

To obtain multiple agents that act in a decentralized manner, we introduce a novel algorithm under the popular framework of centralized training with decentralized execution.

1 code implementation • 19 Jun 2018 • Guido Montufar

The restricted Boltzmann machine is a network of stochastic units with undirected interactions between pairs of visible and hidden units.
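The bipartite structure makes block Gibbs sampling straightforward: hidden units are conditionally independent given the visibles, and vice versa. A minimal NumPy sketch of the energy function and one sampling step (variable names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
nv, nh = 6, 3                                  # visible and hidden layer sizes
W = 0.1 * rng.standard_normal((nv, nh))        # pairwise interaction weights
b = np.zeros(nv)                               # visible biases
c = np.zeros(nh)                               # hidden biases

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def energy(v, h):
    """RBM energy E(v, h) = -v^T W h - b^T v - c^T h."""
    return -(v @ W @ h + b @ v + c @ h)

def gibbs_step(v):
    """One block Gibbs update: sample h | v, then v | h."""
    ph = sigmoid(c + v @ W)                    # P(h_j = 1 | v)
    h = (rng.random(nh) < ph).astype(float)
    pv = sigmoid(b + W @ h)                    # P(v_i = 1 | h)
    v_new = (rng.random(nv) < pv).astype(float)
    return v_new, h

v = rng.integers(0, 2, nv).astype(float)
v, h = gibbs_step(v)
```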

no code implementations • 15 Sep 2017 • Anna Seigal, Guido Montufar

We compare two statistical models of three binary random variables.

no code implementations • 6 Apr 2017 • Guido Montufar, Johannes Rauh

We investigate the geometry of optimal memoryless, time-independent decision making in relation to the amount of information that the acting agent has about the state of the system.

no code implementations • 31 May 2016 • Guido Montufar, Keyan Ghazi-Zahedi, Nihat Ay

Reinforcement learning for embodied agents is a challenging problem.

2 code implementations • 1 Dec 2015 • Keyan Ghazi-Zahedi, Daniel F. B. Haeufle, Guido Montufar, Syn Schmitt, Nihat Ay

An important aspect of morphological computation is that it cannot be assigned to an embodied system per se; rather, as we show, it is behavior- and state-dependent.

no code implementations • 10 Nov 2015 • Guido Montufar, Jason Morton

The limit is described by the tropical morphism: a piecewise-linear map with pieces corresponding to slicings of the visible matrix by the normal fan of the hidden matrix.

no code implementations • 14 Aug 2015 • Guido Montufar, Johannes Rauh

We investigate the representation of hierarchical models in terms of marginals of other hierarchical models with smaller interactions.

no code implementations • 24 Mar 2015 • Guido Montufar

We establish upper bounds for the minimal number of hidden units for which a binary stochastic feedforward network with sigmoid activation probabilities and a single hidden layer is a universal approximator of Markov kernels.

no code implementations • 24 Mar 2015 • Guido Montufar, Keyan Ghazi-Zahedi, Nihat Ay

For partially observable Markov decision processes (POMDPs), optimal memoryless policies are generally stochastic.

no code implementations • 14 Nov 2014 • Guido Montufar

We show that deep narrow Boltzmann machines are universal approximators of probability distributions on the activities of their visible units, provided they have sufficiently many hidden layers, each containing the same number of units as the visible layer.

no code implementations • NeurIPS 2011 • Guido Montufar, Johannes Rauh, Nihat Ay

We present explicit classes of probability distributions that can be learned by Restricted Boltzmann Machines (RBMs) depending on the number of units they contain, and which are representative of the expressive power of the model.

no code implementations • 14 Feb 2014 • Guido Montufar, Nihat Ay, Keyan Ghazi-Zahedi

Conditional restricted Boltzmann machines are undirected stochastic neural networks with a layer of input and output units connected bipartitely to a layer of hidden units.

no code implementations • 20 Dec 2013 • Razvan Pascanu, Guido Montufar, Yoshua Bengio

For a $k$ layer model with $n$ hidden units on each layer it is $\Omega(\left\lfloor {n}/{n_0}\right\rfloor^{k-1}n^{n_0})$.
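The quoted rate can be evaluated directly; the small sketch below (function name is hypothetical, with $n_0$ the input dimension) shows that depth enters the bound exponentially while width enters only polynomially:

```python
def region_lower_bound(n, n0, k):
    """Growth rate Omega(floor(n/n0)^(k-1) * n^n0) for the number of linear
    regions of a k-layer model with n hidden units per layer and n0 inputs."""
    return (n // n0) ** (k - 1) * n ** n0

# Depth pays off exponentially: going from k=1 to k=3 multiplies the
# bound by floor(n/n0)^2.
print(region_lower_bound(n=4, n0=2, k=3))  # floor(4/2)^2 * 4^2 = 64
```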

no code implementations • 15 Jan 2013 • Guido Montufar, Jason Morton

We describe discrete restricted Boltzmann machines: probabilistic graphical models with bipartite interactions between visible and hidden discrete variables.
