Search Results for author: Florian Mai

Found 11 papers, 8 papers with code

Learning to Plan for Language Modeling from Unlabeled Data

no code implementations31 Mar 2024 Nathan Cornille, Marie-Francine Moens, Florian Mai

By training to predict the next token in an unlabeled corpus, large language models learn to perform many tasks without any labeled data.

Language Modelling Self-Supervised Learning

HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition

2 code implementations29 May 2023 Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet, Petr Motlicek

In particular, multi-head HyperConformer achieves comparable or higher recognition performance while being more efficient than Conformer in terms of inference speed, memory, parameter count, and available training data.

speech-recognition Speech Recognition

BQ-NCO: Bisimulation Quotienting for Efficient Neural Combinatorial Optimization

2 code implementations NeurIPS 2023 Darko Drakulic, Sofia Michel, Florian Mai, Arnaud Sors, Jean-Marc Andreoli

In this paper, we present a novel formulation of Combinatorial Optimization Problems (COPs) as Markov Decision Processes (MDPs) that effectively leverages common symmetries of COPs to improve out-of-distribution robustness.

Combinatorial Optimization Out-of-Distribution Generalization

HyperMixer: An MLP-based Low Cost Alternative to Transformers

3 code implementations7 Mar 2022 Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, Francois Marelli, Francois Fleuret, James Henderson

We find that existing architectures such as MLPMixer, which achieves token mixing through a static MLP applied to each feature independently, are too detached from the inductive biases required for natural language understanding.

Natural Language Understanding

Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation

no code implementations13 Oct 2021 Florian Mai, James Henderson

We address this issue by extending their method to Bag-of-Vectors Autoencoders (BoV-AEs), which encode the text into a variable-size bag of vectors that grows with the size of the text, as in attention-based models.

Conditional Text Generation Sentence Summarization

Optimizer Benchmarking Needs to Account for Hyperparameter Tuning

no code implementations ICML 2020 Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi, François Fleuret

The performance of optimizers, particularly in deep learning, depends considerably on their chosen hyperparameter configuration.

Benchmarking

Multi-Modal Adversarial Autoencoders for Recommendations of Citations and Subject Labels

1 code implementation22 Jul 2019 Lukas Galke, Florian Mai, Iacopo Vagliano, Ansgar Scherp

We present multi-modal adversarial autoencoders for recommendation and evaluate them on two different tasks: citation recommendation and subject label recommendation.

Citation Recommendation

CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model

1 code implementation ICLR 2019 Florian Mai, Lukas Galke, Ansgar Scherp

In order to address this shortcoming, we propose a learning algorithm for the Continuous Matrix Space Model, which we call Continual Multiplication of Words (CMOW).

Word Embeddings

Using Titles vs. Full-text as Source for Automated Semantic Document Annotation

1 code implementation15 May 2017 Lukas Galke, Florian Mai, Alan Schelten, Dennis Brunsch, Ansgar Scherp

For the first time, we offer a systematic comparison of classification approaches to investigate how far semantic annotations can be conducted using just the metadata of the documents such as titles published as labels on the Linked Open Data cloud.

Document Classification General Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.