Search Results for author: Yi-An Lai

Found 12 papers, 2 papers with code

DeAL: Decoding-time Alignment for Large Language Models

no code implementations5 Feb 2024 James Y. Huang, Sailik Sengupta, Daniele Bonadiman, Yi-An Lai, Arshit Gupta, Nikolaos Pappas, Saab Mansour, Katrin Kirchhoff, Dan Roth

Current work focuses on alignment at model training time, through techniques such as Reinforcement Learning with Human Feedback (RLHF).

Improving Prediction Backward-Compatiblility in NLP Model Upgrade with Gated Fusion

no code implementations4 Feb 2023 Yi-An Lai, Elman Mansimov, Yuqing Xie, Yi Zhang

When upgrading neural models to a newer version, new errors that were not encountered in the legacy version can be introduced, known as regression errors.

regression

Backward Compatibility During Data Updates by Weight Interpolation

no code implementations25 Jan 2023 Raphael Schumann, Elman Mansimov, Yi-An Lai, Nikolaos Pappas, Xibin Gao, Yi Zhang

This method interpolates between the weights of the old and new model and we show in extensive experiments that it reduces negative flips without sacrificing the improved accuracy of the new model.

regression

Efficient Domain Adaptation of Language Models via Adaptive Tokenization

no code implementations EMNLP (sustainlp) 2021 Vin Sachidananda, Jason S. Kessler, Yi-An Lai

While adaptive tokenization incurs a 6% increase in model parameters in our experimentation, due to the introduction of 10k new domain-specific tokens, our approach, using 64 vCPUs, is 72x faster than further pretraining the language model on domain-specific corpora on 8 TPUs.

Domain Adaptation Language Modelling

Context Analysis for Pre-trained Masked Language Models

no code implementations Findings of the Association for Computational Linguistics 2020 Yi-An Lai, Garima Lalwani, Yi Zhang

Pre-trained language models that learn contextualized word representations from a large un-annotated corpus have become a standard component for many state-of-the-art NLP systems.

Attribute-aware Collaborative Filtering: Survey and Classification

no code implementations20 Oct 2018 Wen-Hao Chen, Chin-Chi Hsu, Yi-An Lai, Vincent Liu, Mi-Yen Yeh, Shou-De Lin

Attribute-aware CF models aims at rating prediction given not only the historical rating from users to items, but also the information associated with users (e. g. age), items (e. g. price), or even ratings (e. g. rating time).

Attribute Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.