Search Results for author: Nikolay Malkin

Found 37 papers, 25 papers with code

Machine learning and information theory concepts towards an AI Mathematician

no code implementations • 7 Mar 2024 • Yoshua Bengio, Nikolay Malkin

The current state of the art in artificial intelligence is impressive, especially in terms of mastery of language, but not so much in terms of mathematical reasoning.

Mathematical Reasoning

Discrete Probabilistic Inference as Control in Multi-path Environments

1 code implementation • 15 Feb 2024 • Tristan Deleu, Padideh Nouri, Nikolay Malkin, Doina Precup, Yoshua Bengio

We consider the problem of sampling from a discrete and structured distribution as a sequential decision problem, where the objective is to find a stochastic policy such that objects are sampled at the end of this sequential process proportionally to some predefined reward.
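
In symbols (our notation, not necessarily the paper's): the objective is a stochastic policy $\pi$ whose terminal sampling distribution is proportional to the predefined reward,

$$P_\pi(x) = \frac{R(x)}{Z}, \qquad Z = \sum_{x'} R(x').$$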

V-STaR: Training Verifiers for Self-Taught Reasoners

no code implementations • 9 Feb 2024 • Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron Courville, Alessandro Sordoni, Rishabh Agarwal

Common self-improvement approaches for large language models (LLMs), such as STaR (Zelikman et al., 2022), iteratively fine-tune LLMs on self-generated solutions to improve their problem-solving ability.

Code Generation • Math

Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

1 code implementation • 9 Feb 2024 • Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science.
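
For context, "Boltzmann densities" refers to unnormalized densities of the form (generic notation, ours)

$$p(x) = \frac{e^{-\mathcal{E}(x)}}{Z}, \qquad Z = \int e^{-\mathcal{E}(x)}\, dx,$$

where the energy $\mathcal{E}$ can be evaluated pointwise but the normalizing constant $Z$ is intractable.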

Denoising • Efficient Exploration

PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation

no code implementations • 6 Feb 2024 • Pablo Lemos, Sammy Sharief, Nikolay Malkin, Laurence Perreault-Levasseur, Yashar Hezaveh

The proposed approach enables the estimation of the probability that two sets of samples are drawn from the same distribution, providing a statistically rigorous method for assessing the performance of a single generative model or comparing multiple competing models trained on the same dataset.
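
As a rough illustration of testing whether two sample sets carry the same probability mass across regions of sample space (a hedged sketch of the general idea only, not the paper's exact PQMass statistic; the partitioning scheme and function name below are our assumptions):

```python
# Generic two-sample test over probability mass in regions of sample space.
# This is a sketch of the idea, NOT the paper's exact PQMass statistic.
import numpy as np
from scipy.stats import chi2


def two_sample_region_test(x, y, n_regions=50, seed=0):
    """Compare region-wise counts of two sample sets with a chi-squared test."""
    rng = np.random.default_rng(seed)
    pooled = np.concatenate([x, y], axis=0)
    # Random reference points induce a Voronoi-like partition of the space.
    refs = pooled[rng.choice(len(pooled), size=n_regions, replace=False)]

    def assign(s):
        # Index of the nearest reference point for each sample.
        d = ((s[:, None, :] - refs[None, :, :]) ** 2).sum(axis=-1)
        return d.argmin(axis=1)

    cx = np.bincount(assign(x), minlength=n_regions)
    cy = np.bincount(assign(y), minlength=n_regions)

    # Chi-squared test of homogeneity on the 2 x n_regions count table.
    nx, ny = len(x), len(y)
    totals = cx + cy
    mask = totals > 0
    ex = totals[mask] * nx / (nx + ny)  # expected counts under H0
    ey = totals[mask] * ny / (nx + ny)
    stat = ((cx[mask] - ex) ** 2 / ex).sum() + ((cy[mask] - ey) ** 2 / ey).sum()
    dof = mask.sum() - 1
    return stat, chi2.sf(stat, dof)


# Example: two sets of samples from the same Gaussian should yield a large p-value.
# a, b = np.random.randn(1000, 2), np.random.randn(1000, 2)
# stat, pval = two_sample_region_test(a, b)
```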

Dimensionality Reduction

Improving Gradient-guided Nested Sampling for Posterior Inference

1 code implementation • 6 Dec 2023 • Pablo Lemos, Nikolay Malkin, Will Handley, Yoshua Bengio, Yashar Hezaveh, Laurence Perreault-Levasseur

We present a performant, general-purpose gradient-guided nested sampling algorithm, ${\tt GGNS}$, combining the state of the art in differentiable programming, Hamiltonian slice sampling, clustering, mode separation, dynamic nested sampling, and parallelization.

Clustering

PhyloGFN: Phylogenetic inference with generative flow networks

1 code implementation • 12 Oct 2023 • Mingyang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio

Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities.

Variational Inference

Amortizing intractable inference in large language models

1 code implementation • 6 Oct 2023 • Edward J. Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin

Autoregressive large language models (LLMs) compress knowledge from their training data through next-token conditional distributions.
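
The next-token factorization referred to above, in standard notation:

$$p_\theta(x_{1:T}) = \prod_{t=1}^{T} p_\theta(x_t \mid x_{1:t-1}).$$

Distributions of interest that do not factor this way (e.g., conditioning on information beyond a prefix) are intractable to sample exactly, which is what motivates amortizing the inference.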

Bayesian Inference

Delta-AI: Local objectives for amortized inference in sparse graphical models

1 code implementation • 3 Oct 2023 • Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Thomas Jiralerspong, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio

We present a new algorithm for amortized inference in sparse probabilistic graphical models (PGMs), which we call $\Delta$-amortized inference ($\Delta$-AI).

Discrete, compositional, and symbolic representations through attractor dynamics

no code implementations • 3 Oct 2023 • Andrew Nam, Eric Elmoznino, Nikolay Malkin, Chen Sun, Yoshua Bengio, Guillaume Lajoie

Compositionality is an important feature of discrete symbolic systems, such as language and programs, as it enables them to have infinite capacity despite a finite symbol set.

Quantization

Simulation-free Schrödinger bridges via score and flow matching

1 code implementation • 7 Jul 2023 • Alexander Tong, Nikolay Malkin, Kilian Fatras, Lazar Atanackovic, Yanlei Zhang, Guillaume Huguet, Guy Wolf, Yoshua Bengio

We present simulation-free score and flow matching ([SF]$^2$M), a simulation-free objective for inferring stochastic dynamics given unpaired samples drawn from arbitrary source and target distributions.

Thompson sampling for improved exploration in GFlowNets

no code implementations • 30 Jun 2023 • Jarrid Rector-Brooks, Kanika Madan, Moksh Jain, Maksym Korablyov, Cheng-Hao Liu, Sarath Chandar, Nikolay Malkin, Yoshua Bengio

Generative flow networks (GFlowNets) are amortized variational inference algorithms that treat sampling from a distribution over compositional objects as a sequential decision-making problem with a learnable action policy.

Active Learning • Decision Making +3

BatchGFN: Generative Flow Networks for Batch Active Learning

1 code implementation • 26 Jun 2023 • Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

We introduce BatchGFN, a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward.

Active Learning

Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets

1 code implementation • 26 May 2023 • Dinghuai Zhang, Hanjun Dai, Nikolay Malkin, Aaron Courville, Yoshua Bengio, Ling Pan

In this paper, we design Markov decision processes (MDPs) for different combinatorial problems and propose to train conditional GFlowNets to sample from the solution space.

Combinatorial Optimization

GFlowNet-EM for learning compositional latent variable models

1 code implementation • 13 Feb 2023 • Edward J. Hu, Nikolay Malkin, Moksh Jain, Katie Everett, Alexandros Graikos, Yoshua Bengio

Latent variable models (LVMs) with discrete compositional latents are an important but challenging setting due to a combinatorially large number of possible configurations of the latents.

Variational Inference

Better Training of GFlowNets with Local Credit and Incomplete Trajectories

2 code implementations • 3 Feb 2023 • Ling Pan, Nikolay Malkin, Dinghuai Zhang, Yoshua Bengio

Generative flow networks (GFlowNets) are related to Markov chain Monte Carlo methods (as they sample from a distribution specified by an energy function), reinforcement learning (as they learn a policy to sample composed objects through a sequence of steps), generative models (as they learn to represent and sample from a distribution), and amortized variational methods (as they can be used to learn to approximate and sample from an otherwise intractable posterior, given a prior and a likelihood).

Improving and generalizing flow-based generative models with minibatch optimal transport

2 code implementations • 1 Feb 2023 • Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Guy Wolf, Yoshua Bengio

Conditional flow matching (CFM) features a stable regression objective like that used to train the stochastic flow in diffusion models but enjoys the efficient inference of deterministic flow models.
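
For reference, the CFM regression objective as commonly written in the flow-matching literature (notation may differ in details from the paper's):

$$\mathcal{L}_{\mathrm{CFM}}(\theta) = \mathbb{E}_{t,\, z \sim q(z),\, x \sim p_t(x \mid z)} \big\| v_\theta(t, x) - u_t(x \mid z) \big\|^2,$$

where $u_t(\cdot \mid z)$ is the vector field generating the conditional probability path $p_t(\cdot \mid z)$.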

A theory of continuous generative flow networks

1 code implementation • 30 Jan 2023 • Salem Lahlou, Tristan Deleu, Pablo Lemos, Dinghuai Zhang, Alexandra Volokhova, Alex Hernández-García, Léna Néhale Ezzine, Yoshua Bengio, Nikolay Malkin

Generative flow networks (GFlowNets) are amortized variational inference algorithms that are trained to sample from unnormalized target distributions over compositional objects.

Variational Inference

Posterior samples of source galaxies in strong gravitational lenses with score-based priors

no code implementations • 7 Nov 2022 • Alexandre Adam, Adam Coogan, Nikolay Malkin, Ronan Legin, Laurence Perreault-Levasseur, Yashar Hezaveh, Yoshua Bengio

Inferring accurate posteriors for high-dimensional representations of the brightness of gravitationally lensed sources is a major challenge, in part due to the difficulties of accurately quantifying the priors.

GFlowOut: Dropout with Generative Flow Networks

no code implementations • 24 Oct 2022 • Dianbo Liu, Moksh Jain, Bonaventure Dossou, Qianli Shen, Salem Lahlou, Anirudh Goyal, Nikolay Malkin, Chris Emezue, Dinghuai Zhang, Nadhir Hassen, Xu Ji, Kenji Kawaguchi, Yoshua Bengio

These methods face two important challenges: (a) the posterior distribution over masks can be highly multi-modal, which can be difficult to approximate with standard variational inference, and (b) it is not trivial to fully utilize sample-dependent information and correlation among dropout masks to improve posterior estimation.

Bayesian Inference • Variational Inference

ThinkSum: Probabilistic reasoning over sets using large language models

no code implementations • 4 Oct 2022 • Batu Ozturkler, Nikolay Malkin, Zhen Wang, Nebojsa Jojic

Our results suggest that because the probabilistic inference in ThinkSum is performed outside of calls to the LLM, ThinkSum is less sensitive to prompt design, yields more interpretable predictions, and can be flexibly combined with latent variable models to extract structured knowledge from LLMs.

In-Context Learning • Retrieval

GFlowNets and variational inference

1 code implementation • 2 Oct 2022 • Nikolay Malkin, Salem Lahlou, Tristan Deleu, Xu Ji, Edward Hu, Katie Everett, Dinghuai Zhang, Yoshua Bengio

This paper builds bridges between two families of probabilistic algorithms: (hierarchical) variational inference (VI), which is typically used to model distributions over continuous spaces, and generative flow networks (GFlowNets), which have been used for distributions over discrete structures such as graphs.

Reinforcement Learning (RL) • Variational Inference

Learning GFlowNets from partial episodes for improved convergence and stability

3 code implementations • 26 Sep 2022 • Kanika Madan, Jarrid Rector-Brooks, Maksym Korablyov, Emmanuel Bengio, Moksh Jain, Andrei Nica, Tom Bosc, Yoshua Bengio, Nikolay Malkin

Generative flow networks (GFlowNets) are a family of algorithms for training a sequential sampler of discrete objects under an unnormalized target density and have been successfully used for various probabilistic modeling tasks.

Unifying Generative Models with GFlowNets and Beyond

no code implementations • 6 Sep 2022 • Dinghuai Zhang, Ricky T. Q. Chen, Nikolay Malkin, Yoshua Bengio

Our framework provides a means for unifying training and inference algorithms and a route to shining a unifying light over many generative models.

Decision Making

Diffusion models as plug-and-play priors

1 code implementation • 17 Jun 2022 • Alexandros Graikos, Nikolay Malkin, Nebojsa Jojic, Dimitris Samaras

We consider the problem of inferring high-dimensional data $\mathbf{x}$ in a model that consists of a prior $p(\mathbf{x})$ and an auxiliary differentiable constraint $c(\mathbf{x},\mathbf{y})$ on $\mathbf{x}$ given some additional information $\mathbf{y}$.
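
The inference target implied by this setup (our notation): samples are drawn from the product of the prior and the constraint,

$$p(\mathbf{x} \mid \mathbf{y}) \propto p(\mathbf{x})\, c(\mathbf{x}, \mathbf{y}).$$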

Combinatorial Optimization • Denoising +2

Resolving label uncertainty with implicit posterior models

1 code implementation • 28 Feb 2022 • Esther Rolf, Nikolay Malkin, Alexandros Graikos, Ana Jojic, Caleb Robinson, Nebojsa Jojic

We propose a method for jointly inferring labels across a collection of data samples, where each sample consists of an observation and a prior belief about the label.

Common Sense Reasoning • Segmentation +2

Generative Flow Networks for Discrete Probabilistic Modeling

2 code implementations • 3 Feb 2022 • Dinghuai Zhang, Nikolay Malkin, Zhen Liu, Alexandra Volokhova, Aaron Courville, Yoshua Bengio

We present energy-based generative flow networks (EB-GFN), a novel probabilistic modeling algorithm for high-dimensional discrete data.

Trajectory balance: Improved credit assignment in GFlowNets

3 code implementations • 31 Jan 2022 • Nikolay Malkin, Moksh Jain, Emmanuel Bengio, Chen Sun, Yoshua Bengio

Generative flow networks (GFlowNets) are a method for learning a stochastic policy for generating compositional objects, such as graphs or strings, from a given unnormalized density by sequences of actions, where many possible action sequences may lead to the same object.
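
The trajectory balance objective, as usually stated (our transcription): for a complete trajectory $\tau = (s_0 \to s_1 \to \dots \to s_n = x)$,

$$\mathcal{L}_{\mathrm{TB}}(\tau) = \left( \log \frac{Z_\theta \prod_{t=0}^{n-1} P_F(s_{t+1} \mid s_t; \theta)}{R(x) \prod_{t=0}^{n-1} P_B(s_t \mid s_{t+1}; \theta)} \right)^2,$$

which is driven to zero on all trajectories; at the optimum, the terminal sampling distribution is proportional to $R$.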

Resolving label uncertainty with implicit generative models

no code implementations • 29 Sep 2021 • Esther Rolf, Nikolay Malkin, Alexandros Graikos, Ana Jojic, Caleb Robinson, Nebojsa Jojic

In prediction problems, coarse and imprecise sources of input can provide rich information about labels, but are not readily used by discriminative learners.

Common Sense Reasoning • Segmentation +2

Studying word order through iterative shuffling

1 code implementation • EMNLP 2021 • Nikolay Malkin, Sameera Lanka, Pranav Goel, Nebojsa Jojic

As neural language models approach human performance on NLP benchmark tasks, their advances are widely seen as evidence of an increasingly complex understanding of syntax.

Language Modelling • Sentence

GPT Perdetry Test: Generating new meanings for new words

no code implementations • NAACL 2021 • Nikolay Malkin, Sameera Lanka, Pranav Goel, Sudha Rao, Nebojsa Jojic

Human innovation in language, such as inventing new words, is a challenge for pretrained language models.

Mining self-similarity: Label super-resolution with epitomic representations

1 code implementation • ECCV 2020 • Nikolay Malkin, Anthony Ortiz, Caleb Robinson, Nebojsa Jojic

We show that simple patch-based models, such as epitomes, can have superior performance to the current state of the art in semantic segmentation and label super-resolution, which uses deep convolutional neural networks.

Semantic Segmentation • Super-Resolution

Shuffle relations for Hodge and motivic correlators

no code implementations • 14 Mar 2020 • Nikolay Malkin

Motivic correlators have several advantages: they obey dihedral symmetry relations at all points, not only at roots of unity; they are defined for any curve, and the double shuffle relations admit a generalization to elliptic curves; and they describe elements of the motivic Lie coalgebra rather than its universal enveloping algebra.

Algebraic Geometry
