Search Results for author: Xiaomeng Yang

Found 10 papers, 3 papers with code

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

1 code implementation • 12 Apr 2024 • Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang, Jonathan May, Luke Zettlemoyer, Omer Levy, Chunting Zhou

The quadratic complexity and weak length extrapolation of Transformers limit their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and state space models exist, they empirically underperform Transformers in pretraining efficiency and downstream task accuracy.

290

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

no code implementations • 19 Dec 2023 • Xiaomeng Yang, Zhi Qiao, Yu Zhou, Weiping Wang

Scene text recognition has attracted growing attention due to its diverse applications.

Conditional Text Generation • Language Modelling • +1

End-to-end Story Plot Generator

no code implementations • 13 Oct 2023 • Hanlin Zhu, Andrew Cohen, Danqing Wang, Kevin Yang, Xiaomeng Yang, Jiantao Jiao, Yuandong Tian

Story plots, while short, carry most of the essential information of a full story that may contain tens of thousands of words.

Blocking

Learning Personalized Story Evaluation

no code implementations • 5 Oct 2023 • Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei LI, Yuandong Tian

We further develop a personalized story evaluation model PERSE to infer reviewer preferences and provide a personalized evaluation.

Retrieval • Text Generation

TorchRL: A data-driven decision-making library for PyTorch

2 code implementations • 1 Jun 2023 • Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni de Fabritiis, Vincent Moens

PyTorch has ascended as a premier machine learning framework, yet it lacks a native and comprehensive library for decision and control tasks suitable for large development teams dealing with complex real-world data and environments.

Computational Efficiency • Decision Making • +1

1,850

Masked and Permuted Implicit Context Learning for Scene Text Recognition

no code implementations • 25 May 2023 • Xiaomeng Yang, Zhi Qiao, Jin Wei, Dongbao Yang, Yu Zhou

We utilize the training procedure of PLM, and to integrate MLM, we incorporate word length information into the decoding process and replace the undetermined characters with mask tokens.

Language Modelling • Masked Language Modeling • +1

Learning Compiler Pass Orders using Coreset and Normalized Value Prediction

no code implementations • 9 Jan 2023 • Youwei Liang, Kevin Stone, Ali Shameli, Chris Cummins, Mostafa Elhoushi, Jiadong Guo, Benoit Steiner, Xiaomeng Yang, Pengtao Xie, Hugh Leather, Yuandong Tian

Finding the optimal pass sequence of compilation can lead to a significant reduction in program size and/or improvement in program efficiency.

Compiler Optimization • Graph Learning • +1

Sample-efficient Surrogate Model for Frequency Response of Linear PDEs using Self-Attentive Complex Polynomials

no code implementations • 6 Jan 2023 • Andrew Cohen, Weiping Dou, Jiang Zhu, Slawomir Koziel, Peter Renner, Jan-Ove Mattsson, Xiaomeng Yang, Beidi Chen, Kevin Stone, Yuandong Tian

Linear Partial Differential Equations (PDEs) govern the spatial-temporal dynamics of physical systems that are essential to building modern technology.

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world

1 code implementation • 20 Jun 2022 • Eugene Vinitsky, Nathan Lichtlé, Xiaomeng Yang, Brandon Amos, Jakob Foerster

We introduce Nocturne, a new 2D driving simulator for investigating multi-agent coordination under partial observability.

Imitation Learning

239

Hybrid Composition with IdleBlock: More Efficient Networks for Image Recognition

no code implementations • 19 Nov 2019 • Bing Xu, Andrew Tulloch, Yunpeng Chen, Xiaomeng Yang, Lin Qiao

We propose a new building block, IdleBlock, which naturally prunes connections within the block.

Neural Architecture Search
