Search Results for author: Zihan Xu

Found 19 papers, 10 papers with code

FlowAgent: Achieving Compliance and Flexibility for Workflow Agents

1 code implementation20 Feb 2025 Yuchen Shi, Siqi Cai, Zihan Xu, Yuei Qin, Gang Li, Hang Shao, Jiawei Chen, Deqing Yang, Ke Li, Xing Sun

Experiments on three datasets demonstrate that FlowAgent not only adheres to workflows but also effectively manages OOW queries, highlighting its dual strengths in compliance and flexibility.

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

1 code implementation27 Jan 2025 Heting Gao, Hang Shao, Xiong Wang, Chaofan Qiu, Yunhang Shen, Siqi Cai, Yuchen Shi, Zihan Xu, Zuwei Long, Yike Zhang, Shaoqi Dong, Chaoyou Fu, Ke Li, Long Ma, Xing Sun

The film Her features Samantha, a sophisticated AI audio agent who is capable of understanding both linguistic and paralinguistic information in human speech and delivering real-time responses that are natural, informative and sensitive to emotional subtleties.

Question Answering

A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models

no code implementations17 Dec 2024 Gongbo Zhang, Zihan Xu, Qiao Jin, Fangyi Chen, Yilu Fang, Yi Liu, Justin F. Rousseau, Ziyang Xu, Zhiyong Lu, Chunhua Weng, Yifan Peng

While holding great promise for improving and facilitating healthcare, large language models (LLMs) struggle to produce up-to-date responses on evolving topics due to outdated knowledge or hallucination.

Hallucination RAG +1

Deciphering genomic codes using advanced NLP techniques: a scoping review

no code implementations25 Nov 2024 Shuyan Cheng, Yishu Wei, Yiliang Zhou, Zihan Xu, Drew N Wright, Jinze Liu, Yifan Peng

The goal of this review is to assess data and model accessibility in the most recent literature, gaining a better understanding of the existing capabilities and constraints of these tools in processing genomic sequencing data.

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

1 code implementation28 Aug 2024 Yuncheng Yang, Yulei Qin, Tong Wu, Zihan Xu, Gang Li, Pengcheng Guo, Hang Shao, Yuchen Shi, Ke Li, Xing Sun, Jie Yang, Yun Gu

For the latter, we highlight the diversity of constituting experts and that of the fine-tuning instructions throughout the model and data selection process.

Diversity

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

1 code implementation4 Aug 2024 Yulei Qin, Yuncheng Yang, Pengcheng Guo, Gang Li, Hang Shao, Yuchen Shi, Zihan Xu, Yun Gu, Ke Li, Xing Sun

To bridge this gap, we present a comprehensive review on existing literature of data assessment and selection especially for instruction tuning of LLMs.

Sinkhorn Distance Minimization for Knowledge Distillation

1 code implementation27 Feb 2024 Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou, Houqiang Li

We propose the Sinkhorn Knowledge Distillation (SinKD) that exploits the Sinkhorn distance to ensure a nuanced and precise assessment of the disparity between teacher and student distributions.

Decoder Knowledge Distillation

Devil in the Number: Towards Robust Multi-modality Data Filter

no code implementations24 Sep 2023 Yichen Xu, Zihan Xu, Wenhao Chai, Zhonghan Zhao, Enxin Song, Gaoang Wang

In order to appropriately filter multi-modality data sets on a web-scale, it becomes crucial to employ suitable filtering methods to boost performance and reduce training costs.

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

no code implementations30 Mar 2023 Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu Enwei Zhang, Wei Liu, Jie Yang, Ke Li, Xing Sun

During the preceding biennium, vision-language pre-training has achieved noteworthy success on several downstream tasks.

cross-modal alignment Zero-Shot Learning

ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser

no code implementations NAACL 2021 Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, Kai Yu

Given a database schema, Text-to-SQL aims to translate a natural language question into the corresponding SQL query.

Decoder Text-To-SQL

SiMaN: Sign-to-Magnitude Network Binarization

2 code implementations16 Feb 2021 Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Fei Chao, Chia-Wen Lin, Ling Shao

In this paper, we show that our weight binarization provides an analytical solution by encoding high-magnitude weights into +1s, and 0s otherwise.

Binarization

Answer-driven Deep Question Generation based on Reinforcement Learning

no code implementations COLING 2020 Liuyin Wang, Zihan Xu, Zibo Lin, Haitao Zheng, Ying Shen

First, we propose an answer-aware initialization module with a gated connection layer which introduces both document and answer information to the decoder, thus helping to guide the choice of answer-focused question words.

Decoder Question Generation +4

Rotated Binary Neural Network

2 code implementations NeurIPS 2020 Mingbao Lin, Rongrong Ji, Zihan Xu, Baochang Zhang, Yan Wang, Yongjian Wu, Feiyue Huang, Chia-Wen Lin

In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version.

Binarization Quantization

CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking

no code implementations22 Sep 2020 Zhi Chen, Lu Chen, Zihan Xu, Yanbin Zhao, Su Zhu, Kai Yu

In dialogue systems, a dialogue state tracker aims to accurately find a compact representation of the current dialogue status, based on the entire dialogue history.

Dialogue State Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.