no code implementations • 13 Feb 2024 • Linxi Zhao, Yihe Deng, Weitong Zhang, Quanquan Gu
The advancement of Large Vision-Language Models (LVLMs) has increasingly highlighted the critical issue of their tendency to hallucinate objects that are not present in the input images.
2 code implementations • 2 Jan 2024 • Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu
In this paper, we delve into the prospect of growing a strong LLM out of a weak one without acquiring additional human-annotated data.
no code implementations • 23 Nov 2023 • Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu
Additionally, when our analysis is specialized to linear regression in the strongly convex setting, it yields a tighter bound for bias error than the best-known result.
3 code implementations • 7 Nov 2023 • Yihe Deng, Weitong Zhang, Zixiang Chen, Quanquan Gu
While it is widely acknowledged that the quality of a prompt, such as a question, significantly impacts the quality of the responses provided by LLMs, systematic methods for crafting questions that LLMs can better comprehend remain underdeveloped.
no code implementations • 2 Oct 2023 • Zixiang Chen, Yihe Deng, Yuanzhi Li, Quanquan Gu
Multi-modal learning has become increasingly popular due to its ability to leverage information from different data sources (e.g., text and images) to improve model performance.
2 code implementations • 4 Aug 2022 • Zixiang Chen, Yihe Deng, Yue Wu, Quanquan Gu, Yuanzhi Li
To our knowledge, this is the first result toward formally understanding the mechanism of the MoE layer in deep learning.
1 code implementation • 9 Aug 2020 • Xiaosen Wang, Yichen Yang, Yihe Deng, Kun He
Adversarial training is the most empirically successful approach to improving the robustness of deep neural networks for image classification. For text classification, however, existing synonym-substitution-based adversarial attacks are effective but too inefficient to be incorporated into practical adversarial training.
1 code implementation • ACL 2019 • Shuhuai Ren, Yihe Deng, Kun He, Wanxiang Che
Experiments on three popular datasets using both convolutional and LSTM models show that PWWS reduces classification accuracy the most while keeping a very low word substitution rate.
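The core idea behind a synonym-substitution attack like PWWS (Probability Weighted Word Saliency) can be sketched in a few lines: rank words by how much deleting them changes the model's output probability (word saliency), weight that by the probability drop achieved by the best synonym substitution, and greedily substitute until the prediction flips. The sketch below is a simplified illustration only, with a hypothetical toy classifier and a hardcoded synonym table; the actual PWWS method draws candidates from WordNet, uses a softmax weighting over saliency scores, and attacks a trained neural classifier.

```python
import math

# Hypothetical toy sentiment lexicon; stands in for a trained model.
POSITIVE = {"good", "great", "excellent"}

def classify(words):
    """Toy 'model': probability that the text is positive sentiment."""
    score = sum(w in POSITIVE for w in words)
    return 1 / (1 + math.exp(-score))

# Hypothetical synonym table; the real PWWS draws candidates from WordNet.
SYNONYMS = {"excellent": ["okay", "fine"], "great": ["okay"]}

def word_saliency(words, i):
    """Probability drop when word i is deleted from the input."""
    return classify(words) - classify(words[:i] + words[i + 1:])

def best_substitution(words, i):
    """Synonym for word i that causes the largest probability drop."""
    candidates = SYNONYMS.get(words[i], [])
    if not candidates:
        return words[i], 0.0
    base = classify(words)
    drop, best = max(
        (base - classify(words[:i] + [c] + words[i + 1:]), c)
        for c in candidates
    )
    return best, drop

def pwws_attack(words):
    """Greedily substitute words ranked by saliency * probability drop."""
    words = list(words)
    ranked = sorted(
        range(len(words)),
        key=lambda i: word_saliency(words, i) * best_substitution(words, i)[1],
        reverse=True,
    )
    for i in ranked:
        sub, drop = best_substitution(words, i)
        if drop <= 0:
            continue  # no synonym helps the attack at this position
        words[i] = sub
        if classify(words) <= 0.5:  # prediction no longer positive
            break
    return words
```

Ranking by saliency times substitution effect, rather than by either signal alone, is what lets the attack succeed with very few substitutions, which matches the low word substitution rate reported above.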