no code implementations • NAACL (ACL) 2022 • Muhao Chen, Lifu Huang, Manling Li, Ben Zhou, Heng Ji, Dan Roth
This tutorial targets researchers and practitioners who are interested in AI and ML technologies for structural information extraction (IE) from unstructured textual sources.
1 code implementation • 18 Jul 2024 • Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng
This paper investigates the capability of LLMs in storytelling, focusing on narrative development and plot progression.
1 code implementation • 7 Jul 2024 • Yubo Zhang, Shudi Hou, Mingyu Derek Ma, Wei Wang, Muhao Chen, Jieyu Zhao
We introduce CLIMB (shorthand for A Benchmark of Clinical Bias in Large Language Models), a pioneering comprehensive benchmark to evaluate both intrinsic (within LLMs) and extrinsic (on downstream tasks) bias in LLMs for clinical decision tasks.
no code implementations • 4 Jul 2024 • Terry Tong, Jiashu Xu, Qin Liu, Muhao Chen
The security of multi-turn conversational large language models (LLMs) is understudied despite it being one of the most popular LLM utilization.
no code implementations • 1 Jul 2024 • Nan Xu, Fei Wang, Sheng Zhang, Hoifung Poon, Muhao Chen
Motivated by in-context learning (ICL) capabilities of Large Language models (LLMs), multimodal LLMs with additional visual modality are also exhibited with similar ICL abilities when multiple image-text pairs are provided as demonstrations.
no code implementations • 17 Jun 2024 • Bangzheng Li, Ben Zhou, Xingyu Fu, Fei Wang, Dan Roth, Muhao Chen
One popular approach is using perplexity as a way to measure models' familiarity with the prompt.
no code implementations • 17 Jun 2024 • Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen
Direct preference optimization (DPO) has shown to be an effective method for large language model (LLM) alignment.
no code implementations • 15 Jun 2024 • Muhao Chen, Jing Qin
In this work, we develop a deep neural network (DNN) approach to predict the geometric configurations and physical properties-such as nodal coordinates, member forces, and natural frequencies-of any tensegrity structures in equilibrium states.
no code implementations • 13 Jun 2024 • Fei Wang, Xingyu Fu, James Y. Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen
We introduce MuirBench, a comprehensive benchmark that focuses on robust multi-image understanding capabilities of multimodal LLMs.
no code implementations • 6 Jun 2024 • Hadi Askari, Anshuman Chhabra, Muhao Chen, Prasant Mohapatra
To bridge this gap, we propose relevance paraphrasing, a simple strategy that can be used to measure the robustness of LLMs as summarizers.
no code implementations • 16 May 2024 • Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen
To mitigate this issue, this study explores a novel contradictory dialogue processing task that aims to detect and modify contradictory statements in a conversation.
1 code implementation • 17 Apr 2024 • James Y. Huang, Wenxuan Zhou, Fei Wang, Fred Morstatter, Sheng Zhang, Hoifung Poon, Muhao Chen
Despite the strong capabilities of Large Language Models (LLMs) to acquire knowledge from their training corpora, the memorization of sensitive information in the corpora such as copyrighted, harmful, and private content has led to ethical and legal concerns.
no code implementations • 2 Apr 2024 • Victoria Graf, Qin Liu, Muhao Chen
In this paper, we propose Nested Product of Experts(NPoE) defense framework, which involves a mixture of experts (MoE) as a trigger-only ensemble within the PoE defense framework to simultaneously defend against multiple trigger types.
1 code implementation • 30 Mar 2024 • Tenghao Huang, Dongwon Jung, Muhao Chen
Recent advancements in integrating external tools with Large Language Models (LLMs) have opened new frontiers, with applications in mathematical reasoning, code generators, and smart assistants.
no code implementations • 24 Mar 2024 • Qin Liu, Fei Wang, Nan Xu, Tianyi Yan, Tao Meng, Muhao Chen
In this paper, we propose monotonic paraphrasing (MonoPara), an end-to-end decoding strategy that paraphrases given prompts or instructions into their lower perplexity counterparts based on an ensemble of a paraphrase LM for prompt (or instruction) rewriting, and a target LM (i. e. the prompt or instruction executor) that constrains the generation for lower perplexity.
1 code implementation • 14 Mar 2024 • Yu Wang, Xiaogeng Liu, Yu Li, Muhao Chen, Chaowei Xiao
However, with the integration of additional modalities, MLLMs are exposed to new vulnerabilities, rendering them prone to structured-based jailbreak attacks, where semantic content (e. g., "harmful text") has been injected into the images to mislead MLLMs.
no code implementations • 12 Mar 2024 • Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, ZiYi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Akshay Chaudhari, Serena Yeung-Levy, Curtis P. Langlotz, Sheng Wang, Hoifung Poon
Frontier general-domain models such as GPT-4V still have significant performance gaps in multimodal biomedical applications.
1 code implementation • 6 Mar 2024 • Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin
In recent years, few-shot and zero-shot learning, which learn to predict labels with limited annotated instances, have garnered significant attention.
1 code implementation • 22 Feb 2024 • Jiongxiao Wang, Jiazhao Li, Yiquan Li, Xiangyu Qi, Junjie Hu, Yixuan Li, Patrick McDaniel, Muhao Chen, Bo Li, Chaowei Xiao
In particular, service providers will construct prefixed safety examples with a secret prompt, acting as a "backdoor trigger".
1 code implementation • 17 Feb 2024 • Tianyi Lorena Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen
Instruction tuning has been used as a promising approach to improve the performance of large language models (LLMs) on unseen tasks.
no code implementations • 13 Feb 2024 • Yixiang Yao, Fei Wang, Srivatsan Ravi, Muhao Chen
Language Models as a Service (LMaaS) offers convenient access for developers and researchers to perform inference using pre-trained language models.
1 code implementation • 21 Jan 2024 • Jiashu Xu, Fei Wang, Mingyu Derek Ma, Pang Wei Koh, Chaowei Xiao, Muhao Chen
The exorbitant cost of training Large language models (LLMs) from scratch makes it essential to fingerprint the models to protect intellectual property via ownership authentication and to ensure downstream users and developers comply with their license terms (e. g. restricting commercial use).
2 code implementations • 19 Jan 2024 • Yiwei Wang, Muhao Chen, Nanyun Peng, Kai-Wei Chang
How to edit the knowledge in multi-step reasoning has become the major challenge in the knowledge editing (KE) of large language models (LLMs).
1 code implementation • 27 Dec 2023 • Tianyang Liu, Fei Wang, Muhao Chen
Large Language Models (LLMs) have shown to be capable of various tasks, yet their capability in interpreting and reasoning over tabular data remains an underexplored area.
Ranked #1 on Semantic Parsing on WikiTableQuestions
no code implementations • 16 Nov 2023 • Jiongxiao Wang, Junlin Wu, Muhao Chen, Yevgeniy Vorobeychik, Chaowei Xiao
Reinforcement Learning with Human Feedback (RLHF) is a methodology designed to align Large Language Models (LLMs) with human preferences, playing an important role in LLMs alignment.
no code implementations • 16 Nov 2023 • Wenjie Mo, Jiashu Xu, Qin Liu, Jiongxiao Wang, Jun Yan, Chaowei Xiao, Muhao Chen
Existing studies in backdoor defense have predominantly focused on the training phase, overlooking the critical aspect of testing time defense.
no code implementations • 16 Nov 2023 • Nan Xu, Fei Wang, Ben Zhou, Bang Zheng Li, Chaowei Xiao, Muhao Chen
While large language models (LLMs) have demonstrated increasing power, they have also given rise to a wide range of harmful behaviors.
1 code implementation • 16 Nov 2023 • Bangzheng Li, Ben Zhou, Fei Wang, Xingyu Fu, Dan Roth, Muhao Chen
During the construction of the evidence, we purposefully replace semantic clues (entities) that may lead to the correct answer with distractor clues (evidence) that will not directly lead to the correct answer but require a chain-like reasoning process.
1 code implementation • 15 Nov 2023 • Lingbo Mo, Boshi Wang, Muhao Chen, Huan Sun
The rapid progress in open-source Large Language Models (LLMs) is significantly driving AI development forward.
1 code implementation • 28 Oct 2023 • Nan Xu, Fei Wang, Mingtao Dong, Muhao Chen
Many discriminative natural language understanding (NLU) tasks have large label spaces.
1 code implementation • 23 Oct 2023 • Zekun Li, Wenxuan Zhou, Yao-Yi Chiang, Muhao Chen
This paper introduces GeoLM, a geospatially grounded language model that enhances the understanding of geo-entities in natural language.
1 code implementation • 23 Oct 2023 • Tenghao Huang, Ehsan Qasemi, Bangzheng Li, He Wang, Faeze Brahman, Muhao Chen, Snigdha Chaturvedi
Storytelling's captivating potential makes it a fascinating research area, with implications for entertainment, education, therapy, and cognitive studies.
1 code implementation • 20 Oct 2023 • Yiwei Wang, Yujun Cai, Muhao Chen, Yuxuan Liang, Bryan Hooi
We have two main findings: i) ChatGPT's decision is sensitive to the order of labels in the prompt; ii) ChatGPT has a clearly higher chance to select the labels at earlier positions as the answer.
1 code implementation • 4 Oct 2023 • Peifang Wang, Olga Golovneva, Armen Aghajanyan, Xiang Ren, Muhao Chen, Asli Celikyilmaz, Maryam Fazel-Zarandi
By fine-tuning the System-2 module (LLaMA-2 70B) on only a small amount of data on multi-step reasoning, the accuracy of our method is further improved and surpasses the best fully-supervised end-to-end approach by 5. 7% and a pipeline approach with FlanPaLM (540B) by 7. 5% on a challenging dataset with human-authored questions.
2 code implementations • 3 Oct 2023 • Xiaogeng Liu, Nan Xu, Muhao Chen, Chaowei Xiao
In light of these challenges, we intend to answer this question: Can we develop an approach that can automatically generate stealthy jailbreak prompts?
no code implementations • 19 Sep 2023 • Fei Wang, Kuan-Hao Huang, Kai-Wei Chang, Muhao Chen
In this paper, we propose a simple yet effective method, SALT, to improve the zero-shot cross-lingual transfer of the multilingual pretrained language models without the help of such external data.
1 code implementation • 21 Aug 2023 • Tai Nguyen, Yifeng Di, Joohan Lee, Muhao Chen, Tianyi Zhang
Recognizing software entities such as library names from free-form text is essential to enable many software engineering (SE) technologies, such as traceability link recovery, automated documentation, and API recommendation.
1 code implementation • 7 Aug 2023 • Wenxuan Zhou, Sheng Zhang, Yu Gu, Muhao Chen, Hoifung Poon
Instruction tuning has proven effective for distilling LLMs into more cost-efficient models such as Alpaca and Vicuna.
Ranked #1 on Named Entity Recognition (NER) on NCBI Disease
1 code implementation • 7 Jun 2023 • Shudi Hou, Yu Xia, Muhao Chen, Sujian Li
Traditional text classification typically categorizes texts into pre-defined coarse-grained classes, from which the produced models cannot handle the real-world scenario where finer categories emerge periodically for accurate services.
1 code implementation • 28 May 2023 • Fei Wang, James Y. Huang, Tianyi Yan, Wenxuan Zhou, Muhao Chen
However, previous ensemble-based debiasing methods typically apply debiasing on top-level logits without directly addressing biased attention patterns.
1 code implementation • 24 May 2023 • Tanay Dixit, Fei Wang, Muhao Chen
However, most of the prior works on training factuality-aware models have ignored the negative effect it has on summary quality.
no code implementations • 24 May 2023 • Jiongxiao Wang, Zichen Liu, Keun Hee Park, Zhuojun Jiang, Zhaoheng Zheng, Zhuofeng Wu, Muhao Chen, Chaowei Xiao
We propose a novel attack method named advICL, which aims to manipulate only the demonstration without changing the input to mislead the models.
1 code implementation • 24 May 2023 • Fei Wang, Wenjie Mo, Yiwei Wang, Wenxuan Zhou, Muhao Chen
Building upon this SCM, we propose causal intervention techniques to mitigate entity bias for both white-box and black-box settings.
no code implementations • 24 May 2023 • Jiashu Xu, Mingyu Derek Ma, Fei Wang, Chaowei Xiao, Muhao Chen
We investigate security concerns of the emergent instruction tuning paradigm, that models are trained on crowdsourced datasets with task instructions to achieve superior performance.
1 code implementation • 24 May 2023 • Qin Liu, Fei Wang, Chaowei Xiao, Muhao Chen
Language models are often at risk of diverse backdoor attacks, especially data poisoning.
no code implementations • 24 May 2023 • Tianqing Fang, Zhaowei Wang, Wenxuan Zhou, Hongming Zhang, Yangqiu Song, Muhao Chen
However, knowledge conflicts arise when there is a mismatch between the actual temporal relations of events in the context and the prior knowledge or biases learned by the model.
1 code implementation • 24 May 2023 • James Y. Huang, Wenlin Yao, Kaiqiang Song, Hongming Zhang, Muhao Chen, Dong Yu
It is unclear whether the compositional semantics of sentences can be directly reflected as compositional operations in the embedding space.
1 code implementation • 22 May 2023 • Yiwei Wang, Bryan Hooi, Fei Wang, Yujun Cai, Yuxuan Liang, Wenxuan Zhou, Jing Tang, Manjuan Duan, Muhao Chen
In principle, textual context determines the ground-truth relation and the RE models should be able to correctly identify the relations reflected by the textual context.
no code implementations • 22 May 2023 • Ehsan Qasemi, Amani R. Maina-Kilaas, Devadutta Dash, Khalid Alsaggaf, Muhao Chen
However, it is unclear if SOTA visual language models (VLMs) can extract such preconditions and infer the affordance of objects with them.
1 code implementation • 18 May 2023 • Xinze Li, Yixin Cao, Muhao Chen, Aixin Sun
Goal-oriented Script Generation is a new task of generating a list of steps that can fulfill the given goal.
1 code implementation • 20 Mar 2023 • Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen
However, their reliance on parametric knowledge may cause them to overlook contextual cues, leading to incorrect predictions in context-sensitive NLP tasks (e. g., knowledge acquisition tasks).
1 code implementation • 21 Dec 2022 • Keming Lu, I-Hung Hsu, Wenxuan Zhou, Mingyu Derek Ma, Muhao Chen
Relation Extraction (RE) has been extended to cross-document scenarios because many relations are not simply described in a single document.
1 code implementation • 21 Dec 2022 • Jiashu Xu, Mingyu Derek Ma, Muhao Chen
Two key obstacles in biomedical relation extraction (RE) are the scarcity of annotations and the prevalence of instances without explicitly pre-defined labels due to low annotation coverage.
no code implementations • 21 Dec 2022 • Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen, Hoifung Poon
In this paper, we aim at bridging the gap and propose to pretrain and finetune the RE model using consistent objectives of contrastive learning.
no code implementations • 20 Dec 2022 • Tianqing Fang, Wenxuan Zhou, Fangyu Liu, Hongming Zhang, Yangqiu Song, Muhao Chen
However, data augmentation may introduce noisy data that impairs training.
1 code implementation • 3 Nov 2022 • Peifeng Wang, Aaron Chan, Filip Ilievski, Muhao Chen, Xiang Ren
Neural language models (LMs) have achieved impressive results on various language-based reasoning tasks by utilizing latent knowledge encoded in their own pretrained parameters.
1 code implementation • 22 Oct 2022 • Fei Wang, Kaiqiang Song, Hongming Zhang, Lifeng Jin, Sangwoo Cho, Wenlin Yao, Xiaoyang Wang, Muhao Chen, Dong Yu
Recent literature adds extractive summaries as guidance for abstractive summarization models to provide hints of salient content and achieves better performance.
Ranked #7 on Abstractive Text Summarization on CNN / Daily Mail
no code implementations • 21 Oct 2022 • Zekun Li, Jina Kim, Yao-Yi Chiang, Muhao Chen
Characterizing geo-entities is integral to various application domains, such as geo-intelligence and map comprehension, while a key challenge is to capture the spatial-varying context of an entity.
no code implementations • 10 Oct 2022 • Haoyu Wang, Hongming Zhang, Yuqian Deng, Jacob R. Gardner, Dan Roth, Muhao Chen
In this paper, we seek to improve the faithfulness of TempRel extraction models from two perspectives.
Ranked #3 on Temporal Relation Classification on MATRES
1 code implementation • 10 Oct 2022 • Xiaocong Yang, James Y. Huang, Wenxuan Zhou, Muhao Chen
Parameter-efficient tuning aims at updating only a small subset of parameters when adapting a pretrained model to downstream tasks.
no code implementations • 8 Oct 2022 • Haoyu Wang, Hongming Zhang, Yueguan Wang, Yuqian Deng, Muhao Chen, Dan Roth
In this paper, we address this gap by examining the extent to which current models comprehend the essentiality of step events in relation to a goal event.
1 code implementation • 15 Sep 2022 • Shikhar Singh, Ehsan Qasemi, Muhao Chen
While such tasks measure the requisite knowledge to ground and reason over a given visual instance, they do not, however, measure the ability of VLMs to retain and generalize such knowledge.
1 code implementation • 12 Sep 2022 • Jihoon Sohn, Mingyu Derek Ma, Muhao Chen
The chronological hierarchies between knowledge graphs at different timestamps are represented by embedding the knowledge graphs as vectors in a common hyperbolic space.
no code implementations • 26 Jun 2022 • Shuo Ma, Kai Lu, Muhao Chen, Robert E. Skelton
This paper presents an analytical and experimental design and deployment control analysis of a hyperbolic paraboloid cable net based on clustering actuation strategies.
1 code implementation • 16 Jun 2022 • Ehsan Qasemi, Piyush Khanna, Qiang Ning, Muhao Chen
Reasoning with preconditions such as "glass can be used for drinking water unless the glass is shattered" remains an open problem for language models.
no code implementations • 8 Jun 2022 • Shuo Ma, Yiqian Chen, Muhao Chen, Robert E. Skelton
This paper presents the equilibrium and stiffness study of clustered tensegrity structures (CTS) considering pulley sizes.
1 code implementation • 25 May 2022 • Nan Xu, Fei Wang, Bangzheng Li, Mingtao Dong, Muhao Chen
Due to shortcuts from surface patterns to annotated entity labels and biased training, existing entity typing models are subject to the problem of spurious correlations.
1 code implementation • 19 May 2022 • Keming Lu, I-Hung Hsu, Wenxuan Zhou, Mingyu Derek Ma, Muhao Chen
Considering that summarization tasks aim at acquiring concise expressions of synoptical information from the longer context, these tasks naturally align with the objective of RE, i. e., extracting a kind of synoptical information that describes the relation of entity mentions.
Ranked #7 on Relation Extraction on TACRED
1 code implementation • NAACL 2022 • Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi
In this paper, we propose the CORE (Counterfactual Analysis based Relation Extraction) debiasing method that guides the RE models to focus on the main effects of textual context without losing the entity information.
no code implementations • Findings (NAACL) 2022 • Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Bryan Hooi
GRAPHCACHE aggregates the features from sentences in the whole dataset to learn global representations of properties, and use them to augment the local features within individual sentences.
1 code implementation • NAACL 2022 • Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen
This prunes the full self-attention structure into an order-invariant graph attention that captures the connected graph structure of cells belonging to the same row or column, and it differentiates between relevant cells and irrelevant cells from the structural perspective.
Ranked #2 on Data-to-Text Generation on ToTTo
no code implementations • Findings (NAACL) 2022 • Juncheng Liu, Zequn Sun, Bryan Hooi, Yiwei Wang, Dayiheng Liu, Baosong Yang, Xiaokui Xiao, Muhao Chen
We study dangling-aware entity alignment in knowledge graphs (KGs), which is an underexplored but important problem.
1 code implementation • NAACL 2022 • James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen
Semantic typing aims at classifying tokens or spans of interest in a textual context into semantic categories such as relations, entity types, and event types.
Ranked #3 on Relation Extraction on TACRED
1 code implementation • NAACL 2022 • Wenxuan Zhou, Qiang Ning, Heba Elfardy, Kevin Small, Muhao Chen
Current question answering (QA) systems primarily consider the single-answer scenario, where each question is assumed to be paired with one correct answer.
1 code implementation • 12 Feb 2022 • Bangzheng Li, Wenpeng Yin, Muhao Chen
The task of ultra-fine entity typing (UFET) seeks to predict diverse and free-form words or phrases that describe the appropriate types of entities mentioned in sentences.
Ranked #1 on Entity Typing on FIGER
no code implementations • 16 Dec 2021 • Wenxuan Zhou, Fangyu Liu, huan zhang, Muhao Chen
Deep neural networks are often overparameterized and may not easily achieve model generalization.
1 code implementation • ICLR 2022 • Peifeng Wang, Jonathan Zamora, Junfeng Liu, Filip Ilievski, Muhao Chen, Xiang Ren
In this paper, we propose an Imagine-and-Verbalize (I&V) method, which learns to imagine a relational scene knowledge graph (SKG) with relations between the input concepts, and leverage the SKG as a constraint when generating a plausible scene description.
no code implementations • 1 Dec 2021 • Yiwei Wang, Yujun Cai, Yuxuan Liang, Wei Wang, Henghui Ding, Muhao Chen, Jing Tang, Bryan Hooi
Representing a label distribution as a one-hot vector is a common practice in training node classification models.
no code implementations • 17 Oct 2021 • Shuo Ma, Muhao Chen, Robert E. Skelton
This paper presents the formulations of nonlinear and linearized statics, dynamics, and control for any clustered tensegrity system (CTS).
1 code implementation • ACL 2022 • Wenxuan Zhou, Fangyu Liu, Ivan Vulić, Nigel Collier, Muhao Chen
To achieve this, it is crucial to represent multilingual knowledge in a shared/unified space.
no code implementations • Findings (EMNLP) 2021 • Mingyu Derek Ma, Muhao Chen, Te-Lin Wu, Nanyun Peng
Taxonomies are valuable resources for many applications, but the limited coverage due to the expensive manual curation process hinders their general applicability.
no code implementations • EMNLP 2021 • Xiyang Zhang, Muhao Chen, Jonathan May
Storytelling, whether via fables, news reports, documentaries, or memoirs, can be thought of as the communication of interesting and related events that, taken together, form a concrete process.
no code implementations • EMNLP 2021 • Haoyu Wang, Hongming Zhang, Muhao Chen, Dan Roth
The task of subevent detection aims to resolve this granularity issue, recognizing the membership of multi-granular events in event complexes.
1 code implementation • Findings (EMNLP) 2021 • Fei Wang, Kexuan Sun, Jay Pujara, Pedro Szekely, Muhao Chen
From one perspective, our system conducts masked salient token prediction to enhance the model for alignment and reasoning between the table and the statement.
Ranked #9 on Table-based Fact Verification on TabFact
no code implementations • ACL 2021 • Muhao Chen, Hongming Zhang, Qiang Ning, Manling Li, Heng Ji, Kathleen McKeown, Dan Roth
This tutorial targets researchers and practitioners who are interested in AI technologies that help machines understand natural language text, particularly real-world events described in the text.
1 code implementation • Findings (ACL) 2021 • Peifeng Wang, Filip Ilievski, Muhao Chen, Xiang Ren
Inspired by evidence that pretrained language models (LMs) encode commonsense knowledge, recent work has applied LMs to automatically populate commonsense knowledge graphs (CKGs).
no code implementations • 15 Jun 2021 • Shuo Ma, Muhao Chen, Xingfei Yuan, Robert E. Skelton
Results show that the proposed CTS cable dome always has one prestress mode and is globally stable in its deployment trajectory.
1 code implementation • ACL 2021 • Zequn Sun, Muhao Chen, Wei Hu
Since KGs possess different sets of entities, there could be entities that cannot find alignment across them, leading to the problem of dangling entities.
Ranked #1 on Entity Alignment on DBP2.0 zh-en
1 code implementation • 4 May 2021 • Fei Wang, Kexuan Sun, Muhao Chen, Jay Pujara, Pedro Szekely
The task of natural language table retrieval (NLTR) seeks to retrieve semantically relevant tables based on natural language queries.
1 code implementation • 18 Apr 2021 • Ehsan Qasemi, Filip Ilievski, Muhao Chen, Pedro Szekely
To address this gap, we propose a novel challenge of reasoning with circumstantial preconditions.
1 code implementation • EMNLP 2021 • Wenxuan Zhou, Fangyu Liu, Muhao Chen
Pretrained Transformers achieve remarkable performance when training and test data are from the same distribution.
1 code implementation • EMNLP 2021 • Wenxuan Zhou, Muhao Chen
Recent information extraction approaches have relied on training deep neural models.
Ranked #1 on Named Entity Recognition (NER) on CoNLL++
1 code implementation • NAACL 2021 • Xuelu Chen, Michael Boratko, Muhao Chen, Shib Sankar Dasgupta, Xiang Lorraine Li, Andrew McCallum
Knowledge bases often consist of facts which are harvested from a variety of sources, many of which are noisy and some of which conflict, resulting in a level of uncertainty for each triple.
1 code implementation • 15 Mar 2021 • Junheng Hao, Muhao Chen, Wenchao Yu, Yizhou Sun, Wei Wang
The cross-view association model is learned to bridge the embeddings of ontological concepts and their corresponding instance-view entities.
1 code implementation • 7 Mar 2021 • Junheng Hao, Chelsea Ju, Muhao Chen, Yizhou Sun, Carlo Zaniolo, Wei Wang
Leveraging a wide-range of biological knowledge, such as gene ontology and protein-protein interaction (PPI) networks from other closely related species presents a vital approach to infer the molecular impact of a new species.
1 code implementation • 6 Mar 2021 • Changping Meng, Muhao Chen, Jie Mao, Jennifer Neville
Analyzing the readability of articles has been an important sociolinguistic task.
1 code implementation • 2 Feb 2021 • Wenxuan Zhou, Muhao Chen
Sentence-level relation extraction (RE) aims at identifying the relationship between two entities in a sentence.
Ranked #2 on Relation Extraction on Re-TACRED
1 code implementation • 15 Dec 2020 • Cunchao Zhu, Muhao Chen, Changjun Fan, Guangquan Cheng, Yan Zhan
Since such temporal knowledge graphs often suffer from incompleteness, it is important to develop time-aware representation learning models that help to infer the missing temporal facts.
no code implementations • CONLL 2020 • Muhao Chen, Hongming Zhang, Haoyu Wang, Dan Roth
This paper studies a new cognitively motivated semantic typing task, multi-axis event process typing, that, given anevent process, attempts to infer free-form typelabels describing (i) the type of action made bythe process and (ii) the type of object the pro-cess seeks to affect.
no code implementations • EMNLP 2020 • Hongming Zhang, Muhao Chen, Haoyu Wang, Yangqiu Song, Dan Roth
Computational and cognitive studies of event understanding suggest that identifying, comprehending, and predicting events depend on having structured representations of a sequence of events and on conceptualizing (abstracting) its components into (soft) event categories.
no code implementations • 13 Oct 2020 • Muhao Chen, Hongming Zhang, Haoyu Wang, Dan Roth
This paper studies a new cognitively motivated semantic typing task, multi-axis event process typing, that, given an event process, attempts to infer free-form type labels describing (i) the type of action made by the process and (ii) the type of object the process seeks to affect.
no code implementations • EMNLP 2020 • Haoyu Wang, Muhao Chen, Hongming Zhang, Dan Roth
Understanding natural language involves recognizing how multiple event mentions structurally and temporally interact with each other.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Xuelu Chen, Muhao Chen, Changjun Fan, Ankith Uppunda, Yizhou Sun, Carlo Zaniolo
Predicting missing facts in a knowledge graph (KG) is a crucial task in knowledge base construction and reasoning, and it has been the subject of much research in recent works using KG embeddings.
Ranked #2 on Knowledge Graph Completion on DPB-5L (French)
1 code implementation • EMNLP 2020 • Zequn Sun, Muhao Chen, Wei Hu, Chengming Wang, Jian Dai, Wei zhang
Capturing associations for knowledge graphs (KGs) through entity alignment, entity type inference and other related tasks benefits NLP applications with comprehensive knowledge representations.
Ranked #28 on Entity Alignment on DBP15k zh-en
2 code implementations • 28 Sep 2020 • Fangyu Liu, Muhao Chen, Dan Roth, Nigel Collier
This work studies the use of visual semantic representations to align entities in heterogeneous knowledge graphs (KGs).
Ranked #3 on Multi-modal Entity Alignment on UMVM-oea-d-w-v1 (using extra training data)
1 code implementation • EACL 2021 • Muhao Chen, Weijia Shi, Ben Zhou, Dan Roth
Much research effort has been put to multilingual knowledge graph (KG) embedding methods to address the entity alignment task, which seeks to match entities in different languagespecific KGs that refer to the same real-world object.
Ranked #19 on Entity Alignment on DBP15k zh-en
1 code implementation • 10 Mar 2020 • Zequn Sun, Qingheng Zhang, Wei Hu, Chengming Wang, Muhao Chen, Farahnaz Akrami, Chengkai Li
Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings.
1 code implementation • 20 Nov 2019 • Zequn Sun, Chengming Wang, Wei Hu, Muhao Chen, Jian Dai, Wei zhang, Yuzhong Qu
As the direct neighbors of counterpart entities are usually dissimilar due to the schema heterogeneity, AliNet introduces distant neighbors to expand the overlap between their neighborhood structures.
Ranked #29 on Entity Alignment on DBP15k zh-en
no code implementations • IJCNLP 2019 • Weijia Shi, Muhao Chen, Pei Zhou, Kai-Wei Chang
Contextualized word embedding models, such as ELMo, generate meaningful representations of words and their context.
1 code implementation • IJCNLP 2019 • Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao Huang, Muhao Chen, Ryan Cotterell, Kai-Wei Chang
Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora.
2 code implementations • 30 Aug 2019 • Haochen Chen, Syed Fahad Sultan, Yingtao Tian, Muhao Chen, Steven Skiena
Two key features of FastRP are: 1) it explicitly constructs a node similarity matrix that captures transitive relationships in a graph and normalizes matrix entries based on node degrees; 2) it utilizes very sparse random projection, which is a scalable optimization-free method for dimension reduction.
no code implementations • WS 2019 • Weijia Shi, Muhao Chen, Yingtao Tian, Kai-Wei Chang
Bilingual word embeddings, which representlexicons of different languages in a shared em-bedding space, are essential for supporting se-mantic and knowledge transfers in a variety ofcross-lingual NLP tasks.
1 code implementation • 6 Jun 2019 • Qingheng Zhang, Zequn Sun, Wei Hu, Muhao Chen, Lingbing Guo, Yuzhong Qu
Furthermore, we design some cross-KG inference methods to enhance the alignment between two KGs.
1 code implementation • 24 May 2019 • Changjun Fan, Li Zeng, Yuhui Ding, Muhao Chen, Yizhou Sun, Zhong Liu
By training on small-scale networks, the learned model is capable of assigning relative BC scores to nodes for any unseen networks, and thus identifying the highly-ranked nodes.
no code implementations • 4 Dec 2018 • Pei Zhou, Muhao Chen, Kai-Wei Chang, Carlo Zaniolo
Quantifying differences in terminologies from various academic domains has been a longstanding problem yet to be solved.
1 code implementation • 26 Nov 2018 • Xuelu Chen, Muhao Chen, Weijia Shi, Yizhou Sun, Carlo Zaniolo
However, there are many KGs that model uncertain knowledge, which typically model the inherent uncertainty of relations facts with a confidence score, and embedding such uncertain knowledge represents an unresolved challenge.
1 code implementation • 13 Sep 2018 • Haochen Chen, Xiaofei Sun, Yingtao Tian, Bryan Perozzi, Muhao Chen, Steven Skiena
Network embedding methods aim at learning low-dimensional latent representation of nodes in a network.
Social and Information Networks Physics and Society
no code implementations • 7 Sep 2018 • Muhao Chen, Yingtao Tian, Xuelu Chen, Zijun Xue, Carlo Zaniolo
Recent advances in translation-based graph embedding methods for populating instance-level knowledge graphs lead to promising new approaching for the ontology population problem.
1 code implementation • CONLL 2019 • Muhao Chen, Yingtao Tian, Haochen Chen, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo
Bilingual word embeddings have been widely used to capture the similarity of lexical semantics in different human languages.
1 code implementation • 31 Jul 2018 • Muhao Chen, Changping Meng, Gang Huang, Carlo Zaniolo
Nowadays, editors tend to separate different subtopics of a long Wiki-pedia article into multiple sub-articles.
no code implementations • 18 Jun 2018 • Muhao Chen, Yingtao Tian, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo
Since many multilingual KGs also provide literal descriptions of entities, in this paper, we introduce an embedding-based approach which leverages a weakly aligned multilingual KG for semi-supervised cross-lingual learning using entity descriptions.
no code implementations • 24 May 2017 • Tao Zhou, Muhao Chen, Jie Yu, Demetri Terzopoulos
Following the recent progress in image classification and captioning using deep learning, we develop a novel natural language person retrieval system based on an attention mechanism.
2 code implementations • 12 Nov 2016 • Muhao Chen, Yingtao Tian, Mohan Yang, Carlo Zaniolo
Many recent works have demonstrated the benefits of knowledge graph embeddings in completing monolingual knowledge graphs.
Ranked #38 on Entity Alignment on DBP15k zh-en