1 code implementation • 2 Nov 2024 • Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang
Prior study shows that LLMs sometimes generate content that violates copyright.
2 code implementations • 16 Jul 2024 • Shicheng Liu, Sina J. Semnani, Harold Triedman, Jialiang Xu, Isaac Dan Zhao, Monica S. Lam
SPINACH achieves a new state of the art on the QALD-7, QALD-9 Plus and QALD-10 datasets by 31. 0%, 27. 0%, and 10. 0% in $F_1$, respectively, and coming within 1. 6% of the fine-tuned LLaMA SOTA model on WikiWebQuestions.
no code implementations • 1 Jun 2024 • Heidi C. Zhang, Sina J. Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica S. Lam
We introduce SPAGHETTI: Semantic Parsing Augmented Generation for Hybrid English information from Text Tables and Infoboxes, a hybrid question-answering (QA) pipeline that utilizes information from heterogeneous knowledge sources, including knowledge base, text, tables, and infoboxes.
1 code implementation • 29 May 2024 • Jialiang Xu, Michael Moor, Jure Leskovec
Concretely, our experiments suggest that RIR augmentation helps by providing further visual and textual cues without necessarily containing the direct answer to a query.
no code implementations • 27 Nov 2023 • Chi Han, Jialiang Xu, Manling Li, Hanning Zhang, Tarek Abdelzaher, Heng Ji
Social media play a significant role in shaping public opinion and influencing ideological communities through information propagation.
1 code implementation • 16 Nov 2023 • Shicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina J. Semnani, Chen Jie Yu, Monica S. Lam
This paper presents the first conversational agent that supports the full generality of hybrid data access for large knowledge corpora, through a language we developed called SUQL (Structured and Unstructured Query Language).
1 code implementation • 22 May 2023 • Chi Han, Jialiang Xu, Manling Li, Yi Fung, Chenkai Sun, Nan Jiang, Tarek Abdelzaher, Heng Ji
In this work, we theoretically and empirically revisit output word embeddings and find that their linear transformations are equivalent to steering language model generation styles.
1 code implementation • 6 Dec 2022 • Hongwei Han, Jialiang Xu, Mengyu Zhou, Yijia Shao, Shi Han, Dongmei Zhang
But current approaches to rich-number tasks with transformer-based language models abandon or lose some of the numeracy information - e. g., breaking numbers into sub-word tokens - which leads to many number-related errors.
no code implementations • 14 Nov 2022 • Jialiang Xu, Mengyu Zhou, Xinyi He, Shi Han, Dongmei Zhang
Numerical Question Answering is the task of answering questions that require numerical capabilities.
no code implementations • 2 Sep 2022 • Xinyi He, Mengyu Zhou, Mingjie Zhou, Jialiang Xu, Xiao Lv, Tianle Li, Yijia Shao, Shi Han, Zejian yuan, Dongmei Zhang
Tabular data analysis is performed every day across various domains.