Playing the Game of 2048

25 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

ofirpress/attention_with_linear_biases ICLR 2022

Since the introduction of the transformer model by Vaswani et al. (2017), a fundamental question has yet to be answered: how does a model achieve extrapolation at inference time for sequences that are longer than it saw during training?

Mastering 2048 with Delayed Temporal Coherence Learning, Multi-Stage Weight Promotion, Redundant Encoding and Carousel Shaping

aszczepanski/2048 18 Apr 2016

With the aim to develop a strong 2048 playing program, we employ temporal difference learning with systematic n-tuple networks.

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding

facebookresearch/fairscale ICLR 2021

Neural network scaling has been critical for improving the model quality in many real-world machine learning applications with vast amounts of training data and compute.

GSPMD: General and Scalable Parallelization for ML Computation Graphs

apple/axlearn 10 May 2021

We present GSPMD, an automatic, compiler-based parallelization system for common machine learning computations.

Long Short-Term Transformer for Online Action Detection

amazon-research/long-short-term-transformer NeurIPS 2021

We present Long Short-term TRansformer (LSTR), a temporal modeling algorithm for online action detection, which employs a long- and short-term memory mechanism to model prolonged sequence data.

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization

stefanleong/s2crnet 13 Sep 2021

To this end, we propose a novel spatial-separated curve rendering network(S$^2$CRNet) for efficient and high-resolution image harmonization for the first time.

Planning in Stochastic Environments with a Learned Model

opendilab/LightZero ICLR 2022

However, previous instantiations of this approach were limited to the use of deterministic models.

ImageNet Training in Minutes

fuentesdt/livermask 14 Sep 2017

If we can make full use of the supercomputer for DNN training, we should be able to finish the 90-epoch ResNet-50 training in one minute.

Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes

deepsense-ai/Distributed-BA3C 9 Jan 2018

We present a study in Distributed Deep Reinforcement Learning (DDRL) focused on scalability of a state-of-the-art Deep Reinforcement Learning algorithm known as Batch Asynchronous Advantage ActorCritic (BA3C).

Improving Electron Micrograph Signal-to-Noise with an Atrous Convolutional Encoder-Decoder

Jeffrey-Ede/Electron-Micrograph-Denoiser 30 Jul 2018

Our neural network was trained end-to-end to remove Poisson noise applied to low-dose ($\ll$ 300 counts ppx) micrographs created from a new dataset of 17267 2048$\times$2048 high-dose ($>$ 2500 counts ppx) micrographs and then fine-tuned for ordinary doses (200-2500 counts ppx).