Search Results for author: Jipeng Zhang

Found 26 papers, 20 papers with code

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

1 code implementation • 26 Mar 2024 • Rui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang

Attempting to complement this deficiency, we investigate layerwise properties of LoRA on fine-tuning tasks and observe an uncommon skewness of weight norms across different layers.

GSM8K Language Modelling +1

7,973

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

no code implementations • 13 Mar 2024 • Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang

To mitigate this issue, we propose Bootstrapped Preference Optimization (BPO), which conducts preference learning with datasets containing negative responses bootstrapped from the model itself.

Ranked #29 on Visual Question Answering on MM-Vet

Language Modelling Large Language Model +1

The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs

1 code implementation • 6 Feb 2024 • Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang

In this paper, we identify a typical class of inputs that baffles MLLMs, which consist of images that are highly relevant but inconsistent with answers, causing MLLMs to suffer from hallucination.

Hallucination

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark

no code implementations • 6 Feb 2024 • Zhenwen Liang, Kehan Guo, Gang Liu, Taicheng Guo, Yujun Zhou, Tianyu Yang, Jiajun Jiao, Renjie Pi, Jipeng Zhang, Xiangliang Zhang

The paper introduces SceMQA, a novel benchmark for scientific multimodal question answering at the college entrance level.

Multiple-choice Question Answering

PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs

no code implementations • 31 Jan 2024 • Ying Su, Jipeng Zhang, Yangqiu Song, Tong Zhang

To facilitate the evaluation of pruned subgraphs, we also propose a graph attention network (GAT) based module to reason with the subgraph data.

Graph Attention Knowledge Graphs +1

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance

1 code implementation • 5 Jan 2024 • Renjie Pi, Tianyang Han, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang

The deployment of multimodal large language models (MLLMs) has brought forth a unique vulnerability: susceptibility to malicious attacks through visual inputs.

Exploring Boundary of GPT-4V on Marine Analysis: A Preliminary Case Study

no code implementations • 4 Jan 2024 • Ziqiang Zheng, YiWei Chen, Jipeng Zhang, Tuan-Anh Vu, Huimin Zeng, Yue Him Wong Tim, Sai-Kit Yeung

In this study, we carry out a preliminary and comprehensive case study of utilizing GPT-4V for marine analysis.

G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

1 code implementation • 18 Dec 2023 • Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, YuFei Wang, Lanqing Hong, Jianhua Han, Hang Xu, Zhenguo Li, Lingpeng Kong

We first analyze the limitations of current Multimodal Large Language Models (MLLMs) in this area: they struggle to accurately comprehend basic geometric elements and their relationships.

Language Modelling Large Language Model

Plum: Prompt Learning using Metaheuristic

1 code implementation • 14 Nov 2023 • Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, Kashun Shum, Renjie Pi, Jipeng Zhang, Tong Zhang

Since the emergence of large language models, prompt learning has become a popular method for optimizing and customizing these models.

Image Generation

129

PerceptionGPT: Effectively Fusing Visual Perception into LLM

no code implementations • 11 Nov 2023 • Renjie Pi, Lewei Yao, Jiahui Gao, Jipeng Zhang, Tong Zhang

In this paper, we present a novel end-to-end framework named PerceptionGPT, which efficiently and effectively equips the VLLMs with visual perception abilities by leveraging the representation power of LLMs' token embedding.

MarineGPT: Unlocking Secrets of Ocean to the Public

1 code implementation • 20 Oct 2023 • Ziqiang Zheng, Jipeng Zhang, Tuan-Anh Vu, Shizhe Diao, Yue Him Wong Tim, Sai-Kit Yeung

Large language models (LLMs), such as ChatGPT/GPT-4, have proven to be powerful tools in promoting the user experience as an AI assistant.

Language Modelling

Non-Autoregressive Sentence Ordering

1 code implementation • 19 Oct 2023 • Yi Bin, Wenhao Shi, Bin Ji, Jipeng Zhang, Yujuan Ding, Yang Yang

Existing sentence ordering approaches generally employ encoder-decoder frameworks with the pointer net to recover the coherence by recurrently predicting each sentence step-by-step.

Sentence Sentence Ordering

Mitigating the Alignment Tax of RLHF

no code implementations • 12 Sep 2023 • Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan YAO, Tong Zhang

Building on the analysis and the observation that averaging different layers of the transformer leads to significantly different reward-tax trade-offs, we propose Adaptive Model Averaging (AMA) to adaptively find various combination ratios of model layers.

Common Sense Reasoning Continual Learning

tdCoxSNN: Time-Dependent Cox Survival Neural Network for Continuous-time Dynamic Prediction

1 code implementation • 12 Jul 2023 • Lang Zeng, Jipeng Zhang, Wei Chen, Ying Ding

In pursuit of constructing a dynamic prediction model for a progressive eye disorder, age-related macular degeneration (AMD), we propose a time-dependent Cox survival neural network (tdCoxSNN) to predict its progression using longitudinal fundus images.

LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

1 code implementation • 21 Jun 2023 • Shizhe Diao, Rui Pan, Hanze Dong, Ka Shun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang

As the number of available models and specialized tasks keeps growing, the job of general finetuning becomes highly nontrivial.

7,973

DetGPT: Detect What You Need via Reasoning

1 code implementation • 23 May 2023 • Renjie Pi, Jiahui Gao, Shizhe Diao, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang

Overall, our proposed paradigm and DetGPT demonstrate the potential for more sophisticated and intuitive interactions between humans and machines.

Autonomous Driving Object +2

716

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

1 code implementation • 13 Apr 2023 • Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang

Utilizing a reward model and a sufficient number of samples, our approach selects the high-quality samples, discards those that exhibit undesired behavior, and subsequently enhances the model by fine-tuning on these filtered samples.

Ethics

7,973

Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks

1 code implementation • 12 Jan 2023 • Xinsong Zhang, Yan Zeng, Jipeng Zhang, Hang Li

X-FM has one language encoder, one vision encoder, and one fusion encoder, as well as a new training method.

Ranked #3 on Visual Grounding on RefCOCO+ test B

Cross-Modal Retrieval Open-Ended Question Answering +3

Generalizing Math Word Problem Solvers via Solution Diversification

1 code implementation • 1 Dec 2022 • Zhenwen Liang, Jipeng Zhang, Lei Wang, Yan Wang, Jie Shao, Xiangliang Zhang

In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator.

Math

Analogical Math Word Problems Solving with Enhanced Problem-Solution Association

1 code implementation • 1 Dec 2022 • Zhenwen Liang, Jipeng Zhang, Xiangliang Zhang

In this paper, we propose to build a novel MWP solver by leveraging analogical MWPs, which advance the solver's generalization ability across different kinds of MWPs.

Math Question Answering

X$^2$-VLM: All-In-One Pre-trained Model For Vision-Language Tasks

2 code implementations • 22 Nov 2022 • Yan Zeng, Xinsong Zhang, Hang Li, Jiawei Wang, Jipeng Zhang, Wangchunshu Zhou

Vision language pre-training aims to learn alignments between vision and language from a large amount of data.

Ranked #1 on Cross-Modal Retrieval on Flickr30k (using extra training data)

Cross-Modal Retrieval Image Captioning +7

430

Execution-based Evaluation for Data Science Code Generation Models

1 code implementation • 17 Nov 2022 • JunJie Huang, Chenglong Wang, Jipeng Zhang, Cong Yan, Haotian Cui, Jeevana Priya Inala, Colin Clement, Nan Duan, Jianfeng Gao

Code generation models can benefit data scientists' productivity by automatically generating code from context and text descriptions.

Code Generation Model Selection

MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving

1 code implementation • Findings (NAACL) 2022 • Zhenwen Liang, Jipeng Zhang, Lei Wang, Wei Qin, Yunshi Lan, Jie Shao, Xiangliang Zhang

Math word problem (MWP) solving faces a dilemma in number representation learning.

Ranked #5 on Math Word Problem Solving on MathQA

Common Sense Reasoning Language Modelling +4

Graph-to-Tree Learning for Solving Math Word Problems

1 code implementation • ACL 2020 • Jipeng Zhang, Lei Wang, Roy Ka-Wei Lee, Yi Bin, Yan Wang, Jie Shao, Ee-Peng Lim

While the recent tree-based neural models have demonstrated promising results in generating solution expression for the math word problem (MWP), most of these models do not capture the relationships and order information among the quantities well.

Ranked #10 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving

Template-based math word problem solvers with recursive neural networks

1 code implementation • AAAI 2019 • Lei Wang, Dongxiang Zhang, Jipeng Zhang, Xing Xu, Lianli Gao, Bing Tian Dai, Heng Tao Shen

Then, we design a recursive neural network to encode the quantity with Bi-LSTM and self attention, and infer the unknown operator nodes in a bottom-up manner.

Math

155

Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions

1 code implementation • ACL 2019 • Jierui Li, Lei Wang, Jipeng Zhang, Yan Wang, Bing Tian Dai, Dongxiang Zhang

Several deep learning models have been proposed for solving math word problems (MWPs) automatically.

Ranked #13 on Math Word Problem Solving on Math23K

Math Math Word Problem Solving +1
