Search Results for author: Yiwei Bai

Found 10 papers, 2 papers with code

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

no code implementations 13 Sep 2017 Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Wei-Nan Zhang, Ying Wen, Yong Yu

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligent agents, driven by million-agent reinforcement learning.

Reinforcement Learning (RL)

Deep Reasoning Networks: Thinking Fast and Slow

no code implementations 3 Jun 2019 Di Chen, Yiwei Bai, Wenting Zhao, Sebastian Ament, John M. Gregoire, Carla P. Gomes

At a high level, DRNets encode a structured latent space of the input data, which is constrained to adhere to prior knowledge by a reasoning module.

Deep Reasoning Networks: Thinking Fast and Slow, for Pattern De-mixing

no code implementations 25 Sep 2019 Di Chen, Yiwei Bai, Wenting Zhao, Sebastian Ament, John M. Gregoire, Carla P. Gomes

We introduce Deep Reasoning Networks (DRNets), an end-to-end framework that combines deep learning with reasoning for solving pattern de-mixing problems, typically in an unsupervised or weakly-supervised setting.

Zero Training Overhead Portfolios for Learning to Solve Combinatorial Problems

no code implementations 5 Feb 2021 Yiwei Bai, Wenting Zhao, Carla P. Gomes

There has been an increasing interest in harnessing deep learning to tackle combinatorial optimization (CO) problems in recent years.

BIG-bench Machine Learning, Combinatorial Optimization

Fairness of Exposure in Stochastic Bandits

no code implementations 3 Mar 2021 Lequn Wang, Yiwei Bai, Wen Sun, Thorsten Joachims

Contextual bandit algorithms have become widely used for recommendation in online systems (e.g., marketplaces, music streaming, news), where they now wield substantial influence on which items get exposed to the users.

Fairness, Multi-Armed Bandits

Automating Crystal-Structure Phase Mapping: Combining Deep Learning with Constraint Reasoning

no code implementations 21 Aug 2021 Di Chen, Yiwei Bai, Sebastian Ament, Wenting Zhao, Dan Guevarra, Lan Zhou, Bart Selman, R. Bruce van Dover, John M. Gregoire, Carla P. Gomes

DRNets compensate for the limited data by exploiting and magnifying the rich prior knowledge about the thermodynamic rules governing the mixtures of crystals with constraint reasoning seamlessly integrated into neural network optimization.

Unsupervised Learning for Solving the Travelling Salesman Problem

1 code implementation NeurIPS 2023 Yimeng Min, Yiwei Bai, Carla P. Gomes

Our loss function consists of two parts: one pushes the model to find the shortest path and the other serves as a surrogate for the constraint that the route should form a Hamiltonian Cycle.
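
As a rough illustration of a two-part loss of this kind, the sketch below scores a soft edge matrix by expected tour length plus a penalty for violating conditions that every Hamiltonian cycle satisfies. It is not the paper's formulation: the function name `two_part_loss`, the `penalty_weight` parameter, and the specific penalty (row and column sums equal to 1, no self-loops) are assumptions made for this example. The penalty is only a necessary condition, so disjoint subtours would also score zero on it.

```python
import math

def two_part_loss(dist, probs, penalty_weight=10.0):
    """Hypothetical two-part loss over a soft edge matrix.

    dist[i][j]  : distance between city i and city j
    probs[i][j] : soft weight for using the edge i -> j
    """
    n = len(dist)
    # Part 1: expected tour length under the soft edge weights.
    length = sum(dist[i][j] * probs[i][j] for i in range(n) for j in range(n))
    # Part 2: surrogate for the cycle constraint (assumed form).
    # Each city should have one outgoing and one incoming edge, and no self-loop.
    row_pen = sum((sum(probs[i]) - 1.0) ** 2 for i in range(n))
    col_pen = sum((sum(probs[i][j] for i in range(n)) - 1.0) ** 2 for j in range(n))
    self_loops = sum(probs[i][i] for i in range(n))
    penalty = row_pen + col_pen + self_loops
    return length + penalty_weight * penalty

# Four cities on the corners of a unit square.
cities = [(0, 0), (1, 0), (1, 1), (0, 1)]
dist = [[math.dist(a, b) for b in cities] for a in cities]

# A valid tour 0 -> 1 -> 2 -> 3 -> 0 written as a 0/1 edge matrix.
tour = [[0.0] * 4 for _ in range(4)]
for i in range(4):
    tour[i][(i + 1) % 4] = 1.0

# A uniform matrix: rows and columns sum to 1, but it puts weight on self-loops.
uniform = [[0.25] * 4 for _ in range(4)]

tour_loss = two_part_loss(dist, tour)
uniform_loss = two_part_loss(dist, uniform)
```

Here the valid tour incurs no penalty, so its loss is just the perimeter of the square, while the uniform matrix is penalized for its self-loop weight.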

Deep Reasoning Networks for Unsupervised Pattern De-mixing with Constraint Reasoning

no code implementations ICML 2020 Di Chen, Yiwei Bai, Wenting Zhao, Sebastian Ament, John Gregoire, Carla Gomes

We introduce Deep Reasoning Networks (DRNets), an end-to-end framework that combines deep learning with constraint reasoning for solving pattern de-mixing problems, typically in an unsupervised or very-weakly-supervised setting.
