Search Results for author: Xiusi Chen

Found 18 papers, 9 papers with code

IterAlign: Iterative Constitutional Alignment of Large Language Models

no code implementations • 27 Mar 2024 • Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, Wei Wang

Such a constitution discovery pipeline can be run iteratively and automatically to discover new constitutions that specifically target the alignment gaps in the current LLM.

Bridging Language and Items for Retrieval and Recommendation

1 code implementation • 6 Mar 2024 • Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, Julian McAuley

This paper introduces BLaIR, a series of pretrained sentence embedding models specialized for recommendation scenarios.

Retrieval, Sentence, +2

TinyLLM: Learning a Small Student from Multiple Large Language Models

no code implementations • 7 Feb 2024 • Yijun Tian, Yikun Han, Xiusi Chen, Wei Wang, Nitesh V. Chawla

To solve the problems and facilitate the learning of compact language models, we propose TinyLLM, a novel knowledge distillation paradigm to learn a small student LLM from multiple large teacher LLMs.

Knowledge Distillation

MEMORYLLM: Towards Self-Updatable Large Language Models

no code implementations • 7 Feb 2024 • Yu Wang, Xiusi Chen, Jingbo Shang, Julian McAuley

Existing Large Language Models (LLMs) usually remain static after deployment, which might make it hard to inject new knowledge into the model.

Model Editing

TPD: Enhancing Student Language Model Reasoning via Principle Discovery and Guidance

no code implementations • 24 Jan 2024 • Haorui Wang, Rongzhi Zhang, Yinghao Li, Lingkai Kong, Yuchen Zhuang, Xiusi Chen, Chao Zhang

The teacher LLM generates problem-solving instructions and corrective principles based on the student LLM's errors.

Language Modelling

"Why Should I Review This Paper?" Unifying Semantic, Topic, and Citation Factors for Paper-Reviewer Matching

no code implementations • 23 Oct 2023 • Yu Zhang, Yanzhen Shen, Xiusi Chen, Bowen Jin, Jiawei Han

As many academic conferences are overwhelmed by a rapidly increasing number of paper submissions, automatically finding appropriate reviewers for each submission becomes a more urgent need than ever.

Information Retrieval, Language Modelling, +1

Language Models As Semantic Indexers

no code implementations • 11 Oct 2023 • Bowen Jin, Hansi Zeng, Guoyin Wang, Xiusi Chen, Tianxin Wei, Ruirui Li, Zhengyang Wang, Zheng Li, Yang Li, Hanqing Lu, Suhang Wang, Jiawei Han, Xianfeng Tang

Semantic identifier (ID) is an important concept in information retrieval that aims to preserve the semantics of objects such as documents and items inside their IDs.

Contrastive Learning, Information Retrieval, +2

MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering

no code implementations • 8 Oct 2023 • Xiusi Chen, Jyun-Yu Jiang, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Wei Wang

Few-shot question answering (QA) aims at achieving satisfactory results on machine question answering when only a few training samples are available.

Data Augmentation, Question Answering, +3

Weakly Supervised Multi-Label Classification of Full-Text Scientific Papers

1 code implementation • 24 Jun 2023 • Yu Zhang, Bowen Jin, Xiusi Chen, Yanzhen Shen, Yunyi Zhang, Yu Meng, Jiawei Han

Instead of relying on human-annotated training samples to build a classifier, weakly supervised scientific paper classification aims to classify papers only using category descriptions (e.g., category names, category-indicative keywords).

Multi-Label Classification

Professional Basketball Player Behavior Synthesis via Planning with Diffusion

no code implementations • 7 Jun 2023 • Xiusi Chen, Wei-Yao Wang, Ziniu Hu, Curtis Chou, Lam Hoang, Kun Jin, Mingyan Liu, P. Jeffrey Brantingham, Wei Wang

To accomplish reward-guided trajectory generation, conditional sampling is introduced to condition the diffusion model on the value function and conduct classifier-guided sampling.

Decision Making

Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation

1 code implementation • 7 Jun 2023 • Xiusi Chen, Yu Zhang, Jinliang Deng, Jyun-Yu Jiang, Wei Wang

Few-shot question answering (QA) aims at precisely discovering answers to a set of questions from context passages while only a few training samples are available.

Data Augmentation, Question Answering

MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information

1 code implementation • 7 Nov 2021 • Yu Zhang, Shweta Garg, Yu Meng, Xiusi Chen, Jiawei Han

We study the problem of weakly supervised text classification, which aims to classify text documents into a set of pre-defined categories with category surface names only and without any annotated training document provided.

Text Classification

A Multi-view Multi-task Learning Framework for Multi-variate Time Series Forecasting

1 code implementation • 2 Sep 2021 • Jinliang Deng, Xiusi Chen, Renhe Jiang, Xuan Song, Ivor W. Tsang

Therefore, there are two fundamental views which can be used to analyze MTS data, namely the spatial view and the temporal view.

Attribute, Multi-Task Learning, +2

Hierarchical Metadata-Aware Document Categorization under Weak Supervision

1 code implementation • 26 Oct 2020 • Yu Zhang, Xiusi Chen, Yu Meng, Jiawei Han

Our experiments demonstrate a consistent improvement of HiMeCat over competitive baselines and validate the contribution of our representation learning and data augmentation modules.

Data Augmentation, Document Classification, +1

TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering

2 code implementations • 22 Dec 2018 • Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen, Meng Jiang, Brian Sadler, Michelle Vanni, Jiawei Han

Our method, TaxoGen, uses term embeddings and hierarchical clustering to construct a topic taxonomy in a recursive fashion.

Databases
