Search Results for author: Rui Pan

Found 17 papers, 10 papers with code

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

1 code implementation • 26 Mar 2024 • Rui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang

Attempting to complement this deficiency, we investigate the layerwise properties of LoRA on fine-tuning tasks and observe an unexpected but consistent skewness of weight norms across different layers.

GSM8K Language Modelling +1

8,060

Paper
Code

A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

no code implementations • 17 Mar 2024 • Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, Jing Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang

A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades.

Distributed Computing

Paper
Add Code

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

no code implementations • 13 Mar 2024 • Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang

To mitigate this issue, we propose Bootstrapped Preference Optimization (BPO), which conducts preference learning with datasets containing negative responses bootstrapped from the model itself.

Ranked #32 on Visual Question Answering on MM-Vet

Language Modelling Large Language Model +1

Paper
Add Code

The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs

1 code implementation • 6 Feb 2024 • Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang

In this paper, we identify a typical class of inputs that baffles MLLMs, which consist of images that are highly relevant but inconsistent with answers, causing MLLMs to suffer from hallucination.

Hallucination

Paper
Code

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance

1 code implementation • 5 Jan 2024 • Renjie Pi, Tianyang Han, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang

The deployment of multimodal large language models (MLLMs) has brought forth a unique vulnerability: susceptibility to malicious attacks through visual inputs.

Paper
Code

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets

no code implementations • 3 Jan 2024 • Ernest Perkowski, Rui Pan, Tuan Dung Nguyen, Yuan-Sen Ting, Sandor Kruk, Tong Zhang, Charlie O'Neill, Maja Jablonska, Zechang Sun, Michael J. Smith, Huiling Liu, Kevin Schawinski, Kartheik Iyer, Ioana Ciucă for UniverseTBD

We explore the potential of enhancing LLM performance in astronomy-focused question-answering through targeted, continual pre-training.

Astronomy Benchmarking +1

Paper
Add Code

Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise

no code implementations • 22 Dec 2023 • Rui Pan, Yuxing Liu, Xiaoyu Wang, Tong Zhang

This means SGD with heavy-ball momentum is useful in the large-batch settings such as distributed machine learning or federated learning, where a smaller number of iterations can significantly reduce the number of communication rounds, leading to acceleration in practice.

Federated Learning

Paper
Add Code

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

no code implementations • 8 Dec 2023 • Yinwei Dai, Rui Pan, Anand Iyer, Kai Li, Ravi Netravali

Machine learning (ML) inference platforms are tasked with balancing two competing goals: ensuring high throughput given many requests, and delivering low-latency responses to support interactive applications.

Paper
Add Code

Plum: Prompt Learning using Metaheuristic

1 code implementation • 14 Nov 2023 • Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, Kashun Shum, Renjie Pi, Jipeng Zhang, Tong Zhang

Since the emergence of large language models, prompt learning has become a popular method for optimizing and customizing these models.

Image Generation

134

Paper
Code

Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?

1 code implementation • 31 Oct 2023 • Yichi Zhang, Jiayi Pan, Yuchen Zhou, Rui Pan, Joyce Chai

Vision-Language Models (VLMs) are trained on vast amounts of data captured by humans emulating our understanding of the world.

Paper
Code

Mitigating the Alignment Tax of RLHF

no code implementations • 12 Sep 2023 • Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan YAO, Tong Zhang

Building on the analysis and the observation that averaging different layers of the transformer leads to significantly different reward-tax trade-offs, we propose Adaptive Model Averaging (AMA) to adaptively find various combination ratios of model layers.

Common Sense Reasoning Continual Learning

Paper
Add Code

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

1 code implementation • 21 Jun 2023 • Shizhe Diao, Rui Pan, Hanze Dong, Ka Shun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang

As the number of available foundation models and specialized tasks keeps growing, the job of training scientific language models becomes highly nontrivial.

8,060

Paper
Code

DetGPT: Detect What You Need via Reasoning

1 code implementation • 23 May 2023 • Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang

Overall, our proposed paradigm and DetGPT demonstrate the potential for more sophisticated and intuitive interactions between humans and machines.

Autonomous Driving Object +2

725

Paper
Code

Effective Bilevel Optimization via Minimax Reformulation

no code implementations • 22 May 2023 • Xiaoyu Wang, Rui Pan, Renjie Pi, Tong Zhang

To address this issue, we propose a reformulation of bilevel optimization as a minimax problem, effectively decoupling the outer-inner dependency.

Bilevel Optimization Meta-Learning

Paper
Add Code

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

3 code implementations • 13 Apr 2023 • Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang

Utilizing a reward model and a sufficient number of samples, our approach selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhancing the model by fine-tuning on these filtered samples.

Ethics

8,060

Paper
Code

ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT

1 code implementation • 30 Nov 2022 • Rui Pan, Shizhe Diao, Jianlin Chen, Tong Zhang

In this paper, we present ExtremeBERT, a toolkit for accelerating and customizing BERT pretraining.

Molecular System Prediction Sentence Classification

281

Paper
Code

Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums

1 code implementation • ICLR 2022 • Rui Pan, Haishan Ye, Tong Zhang

In this paper, we propose Eigencurve, the first family of learning rate schedules that can achieve minimax optimal convergence rates (up to a constant) for SGD on quadratic objectives when the eigenvalue distribution of the underlying Hessian matrix is skewed.

Image Classification

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.