Search Results for author: Ziqiang Cao

Found 38 papers, 8 papers with code

ProtTeX: Structure-In-Context Reasoning and Editing of Proteins with Large Language Models

no code implementations11 Mar 2025 Zicheng Ma, Chuanliu Fan, Zhicong Wang, Zhenyu Chen, Xiaohan Lin, Yanheng Li, Shihao Feng, Jun Zhang, Ziqiang Cao, Yi Qin Gao

To address these challenges, we introduce a novel framework, ProtTeX, which tokenizes the protein sequences, structures, and textual information into a unified discrete space.

multimodal generation Protein Design +1

ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model

no code implementations27 Feb 2025 Chuanliu Fan, Ziqiang Cao, Zicheng Ma, Nan Yu, Yimin Peng, Jun Zhang, Yiqin Gao, Guohong Fu

Goal-oriented de novo molecule design, namely generating molecules with specific property or substructure constraints, is a crucial yet challenging task in drug discovery.

Bayesian Optimization Drug Discovery +5

Prot2Chat: Protein LLM with Early Fusion of Sequence and Structure

1 code implementation7 Feb 2025 Zhicong Wang, Zicheng Ma, Ziqiang Cao, Changlong Zhou, Jun Zhang, Yiqin Gao

Proteins play a pivotal role in living organisms, yet understanding their functions presents significant challenges, including the limited flexibility of classification-based methods, the inability to effectively leverage spatial structural information, and the lack of systematic evaluation metrics for protein Q&A systems.

Answer Generation Decoder +4

Interleaved-Modal Chain-of-Thought

no code implementations29 Nov 2024 Jun Gao, Yongqi Li, Ziqiang Cao, Wenjie Li

Chain-of-Thought (CoT) prompting elicits large language models (LLMs) to produce a series of intermediate reasoning steps before arriving at the final answer.

PIP-MM: Pre-Integrating Prompt Information into Visual Encoding via Existing MLLM Structures

no code implementations30 Oct 2024 Tianxiang Wu, Minxin Nie, Ziqiang Cao

However, because the process of image encoding isprompt-agnostic, the extracted visual features only provide a coarsedescription of the image, impossible to focus on the requirements of theprompt.

AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-Context Learning

no code implementations11 Jun 2024 Jun Gao, Qian Qiao, Ziqiang Cao, Zili Wang, Wenjie Li

In-context learning (ICL) facilitates Large Language Models (LLMs) exhibiting emergent ability on downstream tasks without updating billions of parameters.

In-Context Learning

Guiding ChatGPT to Generate Salient Domain Summaries

no code implementations3 Jun 2024 Jun Gao, Ziqiang Cao, Shaoyao Huang, Luozheng Qin, Chunhui Ai

Then, we require ChatGPT to generate $k$ candidate summaries for the inference document at a time under the guidance of the retrieved demonstration.

In-Context Learning

Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization

no code implementations1 Jun 2024 Shichao Sun, Ruifeng Yuan, Ziqiang Cao, Wenjie Li, PengFei Liu

Two strategies are designed to perform this iterative process: Prompt Chaining and Stepwise Prompt.

Text Summarization

SelfCP: Compressing Over-Limit Prompt via the Frozen Large Language Model Itself

no code implementations27 May 2024 Jun Gao, Ziqiang Cao, Wenjie Li

This paper proposes a Self-Compressor (SelfCP), which employs the target LLM itself to compress over-limit prompts into dense vectors while keeping the allowed prompts unmodified.

Decoder In-Context Learning +3

Unifying Demonstration Selection and Compression for In-Context Learning

no code implementations27 May 2024 Jun Gao, Ziqiang Cao, Wenjie Li

In this paper, we propose an ICL framework UniICL, which Unifies demonstration selection and compression, and final response generation via a single frozen LLM.

In-Context Learning Response Generation +2

CoUDA: Coherence Evaluation via Unified Data Augmentation

1 code implementation31 Mar 2024 Dawei Zhu, Wenhao Wu, YiFan Song, Fangwei Zhu, Ziqiang Cao, Sujian Li

Due to the scarcity of annotated data, data augmentation is commonly used for training coherence evaluation models.

Coherence Evaluation Data Augmentation

Personalized Large Language Model Assistant with Evolving Conditional Memory

no code implementations22 Dec 2023 Ruifeng Yuan, Shichao Sun, Yongqi Li, Zili Wang, Ziqiang Cao, Wenjie Li

With the rapid development of large language models, AI assistants like ChatGPT have become increasingly integrated into people's works and lives but are limited in personalized services.

Language Modeling Language Modelling +2

KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model

1 code implementation20 Nov 2023 Lei Geng, Xu Yan, Ziqiang Cao, Juntao Li, Wenjie Li, Sujian Li, Xinjie Zhou, Yang Yang, Jun Zhang

We achieve a biomedical multilingual corpus by incorporating three granularity knowledge alignments (entity, fact, and passage levels) into monolingual corpora.

Language Modeling Relation +2

RSpell: Retrieval-augmented Framework for Domain Adaptive Chinese Spelling Check

1 code implementation16 Aug 2023 Siqi Song, Qi Lv, Lei Geng, Ziqiang Cao, Guohong Fu

In this paper, we propose a retrieval-augmented spelling check framework called RSpell, which searches corresponding domain terms and incorporates them into CSC models.

Retrieval

RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search

1 code implementation23 May 2023 Yang Bai, Min Cao, Daming Gao, Ziqiang Cao, Chen Chen, Zhenfeng Fan, Liqiang Nie, Min Zhang

RA offsets the overfitting risk by introducing a novel positive relation detection task (i. e., learning to distinguish strong and weak positive pairs).

Person Search Relation +2

Text-based Person Search without Parallel Image-Text Data

no code implementations22 May 2023 Yang Bai, Jingyao Wang, Min Cao, Chen Chen, Ziqiang Cao, Liqiang Nie, Min Zhang

Text-based person search (TBPS) aims to retrieve the images of the target person from a large image gallery based on a given natural language description.

Image Captioning Language Modeling +5

Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference!

1 code implementation8 May 2023 Zecheng Tang, Pinzheng Wang, Keyan Zhou, Juntao Li, Ziqiang Cao, Min Zhang

Diffusion models have been successfully adapted to text generation tasks by mapping the discrete text into the continuous space.

Text Generation

Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening

no code implementations14 Mar 2023 Min Cao, Yang Bai, Jingyao Wang, Ziqiang Cao, Liqiang Nie, Min Zhang

The proposed framework equipped with only two embedding layers achieves $O(1)$ querying time complexity, while improving the retrieval efficiency and keeping its performance, when applied prior to the common image-text retrieval methods.

Image-text Retrieval Multi-Label Classification +3

Few-shot Query-Focused Summarization with Prefix-Merging

no code implementations29 Nov 2022 Ruifeng Yuan, Zili Wang, Ziqiang Cao, Wenjie Li

Drawn inspiration from prefix-tuning, we are allowed to integrate the task knowledge from text summarization and question answering into a properly designed prefix and apply the merged prefix to query-focused summarization.

Few-Shot Learning Query-focused Summarization +2

FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness

no code implementations1 Nov 2022 Wenhao Wu, Wei Li, Jiachen Liu, Xinyan Xiao, Ziqiang Cao, Sujian Li, Hua Wu

We first measure a model's factual robustness by its success rate to defend against adversarial attacks when generating factual information.

Abstractive Text Summarization

Visual Subtitle Feature Enhanced Video Outline Generation

no code implementations24 Aug 2022 Qi Lv, Ziqiang Cao, Wenrui Xie, Derui Wang, Jingwen Wang, Zhiwei Hu, Tangkun Zhang, Ba Yuan, Yuanhang Li, Min Cao, Wenjie Li, Sujian Li, Guohong Fu

Furthermore, based on the similarity between video outlines and textual outlines, we use a large number of articles with chapter headings to pretrain our model.

Headline Generation Navigate +4

Revising Image-Text Retrieval via Multi-Modal Entailment

no code implementations22 Aug 2022 Xu Yan, Chunhui Ai, Ziqiang Cao, Min Cao, Sujian Li, Wenjie Li, Guohong Fu

While the builders of existing image-text retrieval datasets strive to ensure that the caption matches the linked image, they cannot prevent a caption from fitting other images.

Image-text Retrieval Natural Language Inference +2

KE-QI: A Knowledge Enhanced Article Quality Identification Dataset

no code implementations15 Jun 2022 Chunhui Ai, Derui Wang, Xu Yan, Yang Xu, Wenrui Xie, Ziqiang Cao

With so many articles of varying qualities being produced every moment, it is a very urgent task to screen outstanding articles and commit them to social media.

Text Classification

BASS: Boosting Abstractive Summarization with Unified Semantic Graph

no code implementations ACL 2021 Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Ziqiang Cao, Sujian Li, Hua Wu, Haifeng Wang

Abstractive summarization for long-document or multi-document remains challenging for the Seq2Seq architecture, as Seq2Seq is not good at analyzing long-distance relations in text.

Abstractive Text Summarization Decoder +3

Incorporating Relevant Knowledge in Context Modeling and Response Generation

no code implementations9 Nov 2018 Yan-ran Li, Wenjie Li, Ziqiang Cao, Chengyao Chen

To sustain engaging conversation, it is critical for chatbots to make good use of relevant knowledge.

Attribute Chatbot +2

Faithful to the Original: Fact Aware Neural Abstractive Summarization

no code implementations13 Nov 2017 Ziqiang Cao, Furu Wei, Wenjie Li, Sujian Li

While previous abstractive summarization approaches usually focus on the improvement of informativeness, we argue that faithfulness is also a vital prerequisite for a practical abstractive summarization system.

Abstractive Text Summarization Extractive Summarization +3

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

13 code implementations IJCNLP 2017 Yan-ran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, Shuzi Niu

We develop a high-quality multi-turn dialog dataset, DailyDialog, which is intriguing in several aspects.

Improving Multi-Document Summarization via Text Classification

no code implementations28 Nov 2016 Ziqiang Cao, Wenjie Li, Sujian Li, Furu Wei

Developed so far, multi-document summarization has reached its bottleneck due to the lack of sufficient training data and diverse categories of documents.

Document Summarization General Classification +3

Joint Copying and Restricted Generation for Paraphrase

no code implementations28 Nov 2016 Ziqiang Cao, Chuwei Luo, Wenjie Li, Sujian Li

In this paper, we develop a novel Seq2Seq model to fuse a copying decoder and a restricted generative decoder.

Abstractive Text Summarization Decoder +3

Multi-Document Summarization via Discriminative Summary Reranking

no code implementations8 Jul 2015 Xiaojun Wan, Ziqiang Cao, Furu Wei, Sujian Li, Ming Zhou

However, according to our quantitative analysis, none of the existing summarization models can always produce high-quality summaries for different document sets, and even a summarization model with good overall performance may produce low-quality summaries for some document sets.

Document Summarization Multi-Document Summarization +2

Cannot find the paper you are looking for? You can Submit a new open access paper.