Found 53 papers, 37 papers with code

What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum Probability

no code implementations Findings (EMNLP) 2021 Yaochen Liu, Yazhou Zhang, Qiuchi Li, Benyou Wang, Dawei Song

The QPM framework involves a complex-valued multi-modal representation encoder, a quantum-like fusion subnetwork and a quantum measurement mechanism.

ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model

1 code implementation 18 Feb 2024 Guiming Hardy Chen, Shunian Chen, Ruifei Zhang, Junying Chen, Xiangbo Wu, Zhiyi Zhang, Zhihong Chen, Jianquan Li, Xiang Wan, Benyou Wang

Recent advancements in Large Vision-Language Models (LVLMs) have enabled processing of multimodal inputs in language models but require significant computational resources for deployment, especially in edge devices.

Language Modelling Visual Question Answering

Humans or LLMs as the Judge? A Study on Judgement Biases

no code implementations 16 Feb 2024 Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang

Adopting human and large language models (LLM) as judges (a.k.a. human- and LLM-as-a-judge) for evaluating the performance of existing LLMs has recently gained attention.

Pushing The Limit of LLM Capacity for Text Classification

no code implementations 12 Feb 2024 Yazhou Zhang, Mengyao Wang, Chenyu Ren, Qiuchi Li, Prayag Tiwari, Benyou Wang, Jing Qin

The value of text classification's future research has encountered challenges and uncertainties, due to the extraordinary efficacy demonstrated by large language models (LLMs) across numerous downstream NLP tasks.

Language Modelling text-classification +1

Silkie: Preference Distillation for Large Visual Language Models

no code implementations 17 Dec 2023 Lei LI, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong

This paper explores preference distillation for large vision language models (LVLMs), improving their ability to generate helpful and faithful responses anchoring the visual context.


MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V

1 code implementation 23 Nov 2023 Wentao Ge, Shunian Chen, Guiming Chen, Junying Chen, Zhihong Chen, Shuo Yan, Chenghao Zhu, Ziyue Lin, Wenya Xie, Xidong Wang, Anningzhe Gao, Zhiyi Zhang, Jianquan Li, Xiang Wan, Benyou Wang

In the pursuit of Artificial General Intelligence (AGI), the integration of vision in language models has marked a significant milestone.

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

1 code implementation 16 Nov 2023 Junying Chen, Xidong Wang, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang

We validate the new protocol in the domains where proprietary LLMs like ChatGPT perform relatively poorly, such as Traditional Chinese Medicine.

Domain Adaptation Language Modelling

Outcome-supervised Verifiers for Planning in Mathematical Reasoning

1 code implementation 16 Nov 2023 Fei Yu, Anningzhe Gao, Benyou Wang

These findings offer a novel perspective on the role of outcome supervision in training verifiers for multi-step reasoning tasks and provide theoretical justification for its advantage in value estimation for planning.

GSM8K Mathematical Reasoning

On Elastic Language Models

no code implementations 13 Nov 2023 Chen Zhang, Benyou Wang, Dawei Song

To this end, we propose an elastic language model (ElasticLM) that elastically adjusts the tradeoff according to the request stream.

Information Retrieval Knowledge Distillation +4

Quantifying Self-diagnostic Atomic Knowledge in Chinese Medical Foundation Model: A Computational Analysis

1 code implementation 18 Oct 2023 Yaxin Fan, Feng Jiang, Benyou Wang, Peifeng Li, Haizhou Li

Recent studies have primarily focused on the quality of FMs evaluated by GPT-4 or their ability to pass medical exams; no studies have quantified the extent of self-diagnostic atomic knowledge stored in FMs' memory, which is the basis for foundation models to provide factual and reliable suggestions.

Instruction Following

DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations

1 code implementation 17 Oct 2023 Yazhou Zhang, Mengyao Wang, Youxi Wu, Prayag Tiwari, Qiuchi Li, Benyou Wang, Jing Qin

Large language models (LLMs) and their variants have shown extraordinary efficacy across numerous downstream natural language processing (NLP) tasks, which has presented a new vision for the development of NLP.

Benchmarking Emotion Recognition

AceGPT, Localizing Large Language Models in Arabic

1 code implementation 21 Sep 2023 Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Juncai He, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu

This paper is devoted to the development of a localized Large Language Model (LLM) specifically for Arabic, a language imbued with unique cultural characteristics inadequately addressed by current mainstream models.

Instruction Following Language Modelling +2

PlatoLM: Teaching LLMs via a Socratic Questioning User Simulator

no code implementations 21 Aug 2023 Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang, Benyou Wang

The unparalleled performance of closed-sourced ChatGPT has sparked efforts towards its democratization, with notable strides made by leveraging real user and ChatGPT conversations, as evidenced by Vicuna.

Language Modelling Large Language Model

CMB: A Comprehensive Medical Benchmark in Chinese

1 code implementation 17 Aug 2023 Xidong Wang, Guiming Hardy Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li

We hope this benchmark could facilitate the widespread adoption and enhancement of medical LLMs within China.

A Survey of Quantum-Cognitively Inspired Sentiment Analysis Models

no code implementations 6 Jun 2023 Yaochen Liu, Qiuchi Li, Benyou Wang, Yazhou Zhang, Dawei Song

Quantum theory, originally proposed as a physical theory to describe the motions of microscopic particles, has been applied to various non-physics domains involving human cognition and decision-making that are inherently uncertain and exhibit certain non-classical, quantum-like characteristics.

Decision Making Sentiment Analysis

On the Difference of BERT-style and CLIP-style Text Encoders

1 code implementation 6 Jun 2023 Zhihong Chen, Guiming Hardy Chen, Shizhe Diao, Xiang Wan, Benyou Wang

Masked language modeling (MLM) has been one of the most popular pretraining recipes in natural language processing, e.g., BERT, one of the representative models.

Language Modelling Masked Language Modeling +1

Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias

1 code implementation NeurIPS 2023 Zhongwei Wan, Che Liu, Mi Zhang, Jie Fu, Benyou Wang, Sibo Cheng, Lei Ma, César Quilodrán-Casas, Rossella Arcucci

Med-UniC reaches superior performance across 5 medical image tasks and 10 datasets encompassing over 30 diseases, offering a versatile framework for unifying multi-modal medical data within diverse linguistic communities.


Injecting Knowledge into Biomedical Pre-trained Models via Polymorphism and Synonymous Substitution

1 code implementation 24 May 2023 Hongbo Zhang, Xiang Wan, Benyou Wang

This gives us a hint that relational knowledge might not be redundant to the stored knowledge of PLMs, but rather be complementary.

HuatuoGPT, towards Taming Language Model to Be a Doctor

1 code implementation 24 May 2023 Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li

Experimental results demonstrate that HuatuoGPT achieves state-of-the-art results in performing medical consultation among open-source LLMs in GPT-4 evaluation, human evaluation, and medical benchmark datasets.

Language Modelling Large Language Model

Lifting the Curse of Capacity Gap in Distilling Language Models

1 code implementation 20 May 2023 Chen Zhang, Yang Yang, Jiahao Liu, Jingang Wang, Yunsen Xian, Benyou Wang, Dawei Song

However, when the capacity gap between the teacher and the student is large, a curse of capacity gap appears, invoking a deficiency in distilling LMs.

Knowledge Distillation

Word Grounded Graph Convolutional Network

1 code implementation 10 May 2023 Zhibin Lu, Qianqian Xie, Benyou Wang, Jian-Yun Nie

An inductive Word-grounded Graph Convolutional Network (WGCN) is proposed to learn word and document representations based on WGraph in a supervised manner.

text-classification Text Classification

Huatuo-26M, a Large-scale Chinese Medical QA Dataset

1 code implementation 2 May 2023 Jianquan Li, Xidong Wang, Xiangbo Wu, Zhiyi Zhang, Xiaolong Xu, Jie Fu, Prayag Tiwari, Xiang Wan, Benyou Wang

Moreover, we also experimentally show the benefit of the proposed dataset in many aspects: (i) trained models for other QA datasets in a zero-shot fashion; (ii) as external knowledge for retrieval-augmented generation (RAG); and (iii) improving existing pre-trained language models by using the QA pairs as a pre-training corpus in a continued-training manner.

Language Modelling Question Answering +1

Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary

1 code implementation 20 Apr 2023 Xiaokang Liu, Jianquan Li, Jingjing Mu, Min Yang, Ruifeng Xu, Benyou Wang

In this paper, we introduce novel K-center contrastive learning and adjustable decision boundary learning (CLAB) to improve the effectiveness of open intent classification.

Contrastive Learning intent-classification +1

A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models

no code implementations 18 Apr 2023 Qianqian Xie, Zheheng Luo, Benyou Wang, Sophia Ananiadou

In this paper, we present a systematic review of recent advancements in BTS, leveraging cutting-edge NLP techniques from PLMs to LLMs, to help understand the latest progress, challenges, and future directions.

Information Retrieval Language Modelling +3

Natural Language Reasoning, A Survey

1 code implementation 26 Mar 2023 Fei Yu, Hongbo Zhang, Prayag Tiwari, Benyou Wang

This survey paper proposes a clearer view of natural language reasoning in the field of Natural Language Processing (NLP), both conceptually and practically.

Logical Reasoning Mathematical Reasoning +4

Modular Retrieval for Generalization and Interpretation

1 code implementation 23 Mar 2023 Juhao Liang, Chen Zhang, Zhengyang Tang, Jie Fu, Dawei Song, Benyou Wang

Built upon the paradigm, we propose a retrieval model with modular prompt tuning named REMOP.

Language Modelling Retrieval

Adapting Pre-trained Language Models for Quantum Natural Language Processing

no code implementations 24 Feb 2023 Qiuchi Li, Benyou Wang, Yudong Zhu, Christina Lioma, Qun Liu

The emerging classical-quantum transfer learning paradigm has brought a decent performance to quantum computational models in many tasks, such as computer vision, by enabling a combination of quantum models and classical pre-trained neural networks.

Sentence Sentence Classification +1

Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts

1 code implementation ICCV 2023 Zhihong Chen, Shizhe Diao, Benyou Wang, Guanbin Li, Xiang Wan

Medical vision-and-language pre-training (Med-VLP) has shown promising improvements on many downstream medical tasks owing to its applicability to extracting generic representations from medical images and texts.

Image Retrieval Image-text Classification +7

Document-level Relation Extraction with Relation Correlations

1 code implementation 20 Dec 2022 Ridong Han, Tao Peng, Benyou Wang, Lu Liu, Xiang Wan

Document-level relation extraction faces two overlooked challenges: long-tail problem and multi-label problem.

Document-level Relation Extraction Relation

Spatio-Temporal Contrastive Learning Enhanced GNNs for Session-based Recommendation

1 code implementation 23 Sep 2022 Zhongwei Wan, Xin Liu, Benyou Wang, Jiezhong Qiu, Boyu Li, Ting Guo, Guangyong Chen, Yang Wang

The idea is to supplement the GNN-based main supervised recommendation task with the temporal representation via an auxiliary cross-view contrastive learning mechanism.

Collaborative Filtering Contrastive Learning +1

DPTDR: Deep Prompt Tuning for Dense Passage Retrieval

1 code implementation COLING 2022 Zhengyang Tang, Benyou Wang, Ting Yao

We believe this work facilitates the industry, as it saves enormous efforts and costs of deployment and increases the utility of computing resources.

Language Modelling Natural Questions +2

Doge Tickets: Uncovering Domain-general Language Models by Playing Lottery Tickets

1 code implementation 20 Jul 2022 Yi Yang, Chen Zhang, Benyou Wang, Dawei Song

To uncover the domain-general LM, we propose to identify domain-general parameters by playing lottery tickets (dubbed doge tickets).

Domain Generalization

Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk

1 code implementation 2 Jul 2022 Benyou Wang, Xiangbo Wu, Xiaokang Liu, Jianquan Li, Prayag Tiwari, Qianqian Xie

However, the humor aspect of natural language is relatively under-investigated, especially in the age of pre-trained language models.

Benchmarking Machine Translation +1

Exploring Extreme Parameter Compression for Pre-trained Language Models

1 code implementation ICLR 2022 Yuxin Ren, Benyou Wang, Lifeng Shang, Xin Jiang, Qun Liu

A tiny version achieves 96.7% performance of BERT-base with 1/48 encoder parameters (i.e., less than 2M parameters excluding the embedding layer) and 2.7× faster on inference.

Knowledge Distillation Tensor Decomposition

Pre-trained Language Models in Biomedical Domain: A Systematic Survey

1 code implementation 11 Oct 2021 Benyou Wang, Qianqian Xie, Jiahuan Pei, Zhihong Chen, Prayag Tiwari, Zhao Li, Jie Fu

In this paper, we summarize the recent progress of pre-trained language models in the biomedical domain and their applications in biomedical downstream tasks.

On Position Embeddings in BERT

no code implementations ICLR 2021 Benyou Wang, Lifeng Shang, Christina Lioma, Xin Jiang, Hao Yang, Qun Liu, Jakob Grue Simonsen

Various Position Embeddings (PEs) have been proposed in Transformer-based architectures (e.g., BERT) to model word order.

General Classification Position +1

Meta-Learning for Neural Relation Classification with Distant Supervision

no code implementations 26 Oct 2020 Zhenzhen Li, Jian-Yun Nie, Benyou Wang, Pan Du, Yuhan Zhang, Lixin Zou, Dongsheng Li

Distant supervision provides a means to create a large number of weakly labeled data at low cost for relation classification.

Classification General Classification +3

A Multi-task Learning Framework for Opinion Triplet Extraction

3 code implementations Findings of the Association for Computational Linguistics 2020 Chen Zhang, Qiuchi Li, Dawei Song, Benyou Wang

The state-of-the-art Aspect-based Sentiment Analysis (ABSA) approaches are mainly based on either detecting aspect terms and their corresponding sentiment polarities, or co-extracting aspect and opinion terms.

Aspect-Based Sentiment Analysis Aspect Sentiment Triplet Extraction +2

Encoding word order in complex embeddings

1 code implementation ICLR 2020 Benyou Wang, Donghao Zhao, Christina Lioma, Qiuchi Li, Peng Zhang, Jakob Grue Simonsen

The benefit of continuous functions over variable positions is that word representations shift smoothly with increasing positions.

Language Modelling Machine Translation +5

Leveraging Entanglement Entropy for Deep Understanding of Attention Matrix in Text Matching

no code implementations 25 Sep 2019 Peng Zhang, Xiaoliu Mao, Xindian Ma, Benyou Wang, Jing Zhang, Jun Wang, Dawei Song

We prove that by a mapping (via the trace operator) on the high-dimensional matching matrix, a low-dimensional attention matrix can be derived.

Inductive Bias Question Answering +2

Semantic Hilbert Space for Text Representation Learning

1 code implementation 26 Feb 2019 Benyou Wang, Qiuchi Li, Massimo Melucci, Dawei Song

To address this issue, we propose a new framework that models different levels of semantic units (e.g., sememe, word, sentence, and semantic abstraction) on a single Semantic Hilbert Space, which naturally admits a non-linear semantic composition by means of a complex-valued vector word representation.

Benchmarking General Classification +5

A Quantum Many-body Wave Function Inspired Language Modeling Approach

1 code implementation 28 Aug 2018 Peng Zhang, Zhan Su, Lipeng Zhang, Benyou Wang, Dawei Song

The recently proposed quantum language model (QLM) aimed at a principled approach to modeling term dependency by applying the quantum probability theory.

Language Modelling Question Answering +2

Quantum-inspired Complex Word Embedding

no code implementations WS 2018 Qiuchi Li, Sagar Uprety, Benyou Wang, Dawei Song

A challenging task for word embeddings is to capture the emergent meaning or polarity of a combination of individual words.

Sentence Sentence Classification +1

TextZoo, a New Benchmark for Reconsidering Text Classification

no code implementations 10 Feb 2018 Benyou Wang, Li Wang, Qikang Wei, Lichun Liu

Text representation is a fundamental concern in Natural Language Processing, especially in text classification.

General Classification text-classification +1

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

3 code implementations 30 May 2017 Jun Wang, Lantao Yu, Wei-Nan Zhang, Yu Gong, Yinghui Xu, Benyou Wang, Peng Zhang, Dell Zhang

This paper provides a unified account of two schools of thinking in information retrieval modelling: the generative retrieval focusing on predicting relevant documents given a query, and the discriminative retrieval focusing on predicting relevancy given a query-document pair.

Document Ranking Information Retrieval +2

