1 code implementation • 25 Feb 2024 • Fanqi Wan, ZiYi Yang, Longguang Zhong, Xiaojun Quan, Xinting Huang, Wei Bi
Recently, \textsc{FuseLLM} introduced the concept of knowledge fusion to transfer the collective knowledge of multiple structurally varied LLMs into a target LLM through lightweight continual training.
no code implementations • 21 Feb 2024 • Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong
Multimodal reasoning stands as a pivotal capability for large vision-language models (LVLMs).
1 code implementation • 19 Jan 2024 • Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi
While large language models (LLMs) have demonstrated exceptional performance across various tasks following human alignment, they may still generate responses that sound plausible but contradict factual knowledge, a phenomenon known as \emph{hallucination}.
1 code implementation • 19 Jan 2024 • Fanqi Wan, Xinting Huang, Deng Cai, Xiaojun Quan, Wei Bi, Shuming Shi
In this paper, we introduce the notion of knowledge fusion for LLMs, aimed at combining the capabilities of existing LLMs and transferring them into a single LLM.
1 code implementation • 16 Jan 2024 • Shuming Shi, Enbo Zhao, Deng Cai, Leyang Cui, Xinting Huang, Huayang Li
We present Inferflow, an efficient and highly configurable inference engine for large language models (LLMs).
no code implementations • 31 Oct 2023 • Xinting Huang, Jiajing Wan, Ioannis Kritikos, Nora Hollenstein
Humans read texts at a varying pace, whereas machine learning models process every token identically in terms of computation.
no code implementations • 19 Oct 2023 • Xueliang Zhao, Xinting Huang, Wei Bi, Lingpeng Kong
Large Language Models (LLMs) have driven substantial progress in artificial intelligence in recent years, exhibiting impressive capabilities across a wide range of tasks, including mathematical problem-solving.
1 code implementation • 13 Oct 2023 • Fanqi Wan, Xinting Huang, Tao Yang, Xiaojun Quan, Wei Bi, Shuming Shi
Instruction-tuning can be substantially optimized through enhanced diversity, resulting in models capable of handling a broader spectrum of tasks.
no code implementations • 11 Sep 2023 • Yongrui Chen, Haiyun Jiang, Xinting Huang, Shuming Shi, Guilin Qi
High-quality instruction-tuning data is critical to improving LLM capabilities.
1 code implementation • 3 Sep 2023 • Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi
While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge.
1 code implementation • 15 Jun 2023 • Chenyang Lyu, Minghao Wu, Longyue Wang, Xinting Huang, Bingshuai Liu, Zefeng Du, Shuming Shi, Zhaopeng Tu
Although instruction-tuned large language models (LLMs) have exhibited remarkable capabilities across various NLP tasks, their effectiveness on other data modalities beyond text has not been fully studied.
1 code implementation • 24 May 2023 • Yiyang Li, Xinting Huang, Wei Bi, Hai Zhao
Multi-party dialogues are more difficult for models to understand than two-party dialogues, since they involve multiple interlocutors, resulting in interweaving reply-to relations and information flows.
no code implementations • 22 May 2023 • Yue Zhang, Leyang Cui, Deng Cai, Xinting Huang, Tao Fang, Wei Bi
Proprietary Large Language Models (LLMs), such as ChatGPT, have garnered significant attention due to their exceptional capabilities in handling a diverse range of tasks.
no code implementations • 3 Aug 2022 • Shuming Shi, Enbo Zhao, Duyu Tang, Yan Wang, Piji Li, Wei Bi, Haiyun Jiang, Guoping Huang, Leyang Cui, Xinting Huang, Cong Zhou, Yong Dai, Dongyang Ma
In Effidit, we significantly expand the capacities of a writing assistant by providing functions in five categories: text completion, error checking, text polishing, keywords to sentences (K2S), and cloud input methods (cloud IME).
no code implementations • 20 May 2022 • Shiquan Yang, Xinting Huang, Jey Han Lau, Sarah Erfani
Data artifacts incentivize machine learning models to learn non-transferable generalizations by taking advantage of shortcuts in the data, and there is growing evidence that data artifacts play a role in the strong results that deep learning models achieve on recent natural language processing benchmarks.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang
To alleviate the need of action annotations, latent action learning is introduced to map each utterance to a latent representation.
1 code implementation • SEMEVAL 2020 • Jiajing Wan, Xinting Huang
This paper presents our strategies in SemEval 2020 Task 4: Commonsense Validation and Explanation.
no code implementations • ACL 2020 • Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang
This approach requires complete state-action annotations of human-to-human dialogues (i.e., expert demonstrations), which is labor intensive.
no code implementations • 18 Dec 2019 • Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang
These two components, however, have a discrepancy in their objectives, i.e., task completion and language quality.
no code implementations • 3 Aug 2019 • Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang, Hai-Tao Zheng
To model and utilize the context information for aggregated search, we propose a model with context attention and representation learning (CARL).