no code implementations • NAACL (ACL) 2022 • Muhao Chen, Lifu Huang, Manling Li, Ben Zhou, Heng Ji, Dan Roth
This tutorial targets researchers and practitioners who are interested in AI and ML technologies for structural information extraction (IE) from unstructured textual sources.
1 code implementation • 28 May 2023 • Fei Wang, James Y. Huang, Tianyi Yan, Wenxuan Zhou, Muhao Chen
However, previous ensemble-based debiasing methods typically apply debiasing on top-level logits without directly addressing biased attention patterns.
no code implementations • 24 May 2023 • Jiashu Xu, Mingyu Derek Ma, Fei Wang, Chaowei Xiao, Muhao Chen
Instruction-tuned models are trained on crowdsourcing datasets with task instructions to achieve superior performance.
no code implementations • 24 May 2023 • Qin Liu, Fei Wang, Chaowei Xiao, Muhao Chen
Language models are often at risk of diverse backdoor attacks, especially data poisoning.
no code implementations • 24 May 2023 • Jiongxiao Wang, Zichen Liu, Keun Hee Park, Muhao Chen, Chaowei Xiao
With the emergence of more powerful large language models (LLMs), such as ChatGPT and GPT-4, in-context learning (ICL) has gained significant prominence in leveraging these models for specific tasks by utilizing data-label pairs as precondition prompts.
1 code implementation • 24 May 2023 • Tanay Dixit, Fei Wang, Muhao Chen
However, most of the prior works on training factuality-aware models have ignored the negative effect it has on summary quality.
no code implementations • 24 May 2023 • Tianqing Fang, Zhaowei Wang, Wenxuan Zhou, Hongming Zhang, Yangqiu Song, Muhao Chen
Event temporal reasoning aims at identifying the temporal relations between two or more events.
no code implementations • 24 May 2023 • James Y. Huang, Wenlin Yao, Kaiqiang Song, Hongming Zhang, Muhao Chen, Dong Yu
It is unclear whether the compositional semantics of sentences can be directly reflected as compositional operations in the embedding space.
no code implementations • 24 May 2023 • Fei Wang, Wenjie Mo, Yiwei Wang, Wenxuan Zhou, Muhao Chen
Meanwhile, our in-context intervention effectively reduces the knowledge conflicts between parametric knowledge and contextual knowledge in GPT-3. 5 and improves the F1 score by 9. 14 points on a challenging test set derived from Re-TACRED.
1 code implementation • 22 May 2023 • Yiwei Wang, Bryan Hooi, Fei Wang, Yujun Cai, Yuxuan Liang, Wenxuan Zhou, Jing Tang, Manjuan Duan, Muhao Chen
In principle, textual context determines the ground-truth relation and the RE models should be able to correctly identify the relations reflected by the textual context.
1 code implementation • 18 May 2023 • Xinze Li, Yixin Cao, Muhao Chen, Aixin Sun
Goal-oriented Script Generation is a new task of generating a list of steps that can fulfill the given goal.
1 code implementation • 20 Mar 2023 • Wenxuan Zhou, Sheng Zhang, Hoifung Poon, Muhao Chen
Large language models (LLMs) encode parametric knowledge about world facts and have shown remarkable performance in knowledge-driven NLP tasks.
no code implementations • 21 Dec 2022 • Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen, Hoifung Poon
In this paper, we aim at bridging the gap and propose to pretrain and finetune the RE model using consistent objectives of contrastive learning.
no code implementations • 21 Dec 2022 • Keming Lu, I-Hung Hsu, Wenxuan Zhou, Mingyu Derek Ma, Muhao Chen
Additionally, we show that Mr. CoD facilitates evidence retrieval and boosts end-to-end RE performance with effective multi-hop reasoning in both closed and open settings of RE.
1 code implementation • 21 Dec 2022 • Jiashu Xu, Mingyu Derek Ma, Muhao Chen
Two key obstacles in biomedical relation extraction (RE) are the scarcity of annotations and the prevalence of instances without explicitly pre-defined labels due to low annotation coverage.
no code implementations • 20 Dec 2022 • Tianqing Fang, Wenxuan Zhou, Fangyu Liu, Hongming Zhang, Yangqiu Song, Muhao Chen
However, data augmentation may introduce noisy data that impairs training.
1 code implementation • 3 Nov 2022 • Peifeng Wang, Aaron Chan, Filip Ilievski, Muhao Chen, Xiang Ren
Neural language models (LMs) have achieved impressive results on various language-based reasoning tasks by utilizing latent knowledge encoded in their own pretrained parameters.
1 code implementation • 22 Oct 2022 • Fei Wang, Kaiqiang Song, Hongming Zhang, Lifeng Jin, Sangwoo Cho, Wenlin Yao, Xiaoyang Wang, Muhao Chen, Dong Yu
Recent literature adds extractive summaries as guidance for abstractive summarization models to provide hints of salient content and achieves better performance.
Ranked #6 on
Abstractive Text Summarization
on CNN / Daily Mail
no code implementations • 21 Oct 2022 • Zekun Li, Jina Kim, Yao-Yi Chiang, Muhao Chen
Characterizing geo-entities is integral to various application domains, such as geo-intelligence and map comprehension, while a key challenge is to capture the spatial-varying context of an entity.
1 code implementation • 10 Oct 2022 • Xiaocong Yang, James Y. Huang, Wenxuan Zhou, Muhao Chen
Parameter-efficient tuning aims at updating only a small subset of parameters when adapting a pretrained model to downstream tasks.
no code implementations • 10 Oct 2022 • Haoyu Wang, Hongming Zhang, Yuqian Deng, Jacob R. Gardner, Dan Roth, Muhao Chen
In this paper, we seek to improve the faithfulness of TempRel extraction models from two perspectives.
Ranked #3 on
Temporal Relation Classification
on MATRES
no code implementations • 8 Oct 2022 • Hongming Zhang, Yueguan Wang, Yuqian Deng, Haoyu Wang, Muhao Chen, Dan Roth
In this paper, we seek to fill this gap by studying how well current models can understand the essentiality of different step events towards a goal event.
1 code implementation • 15 Sep 2022 • Shikhar Singh, Ehsan Qasemi, Muhao Chen
While such tasks measure the requisite knowledge to ground and reason over a given visual instance, they do not, however, measure the ability of VLMs to retain and generalize such knowledge.
1 code implementation • 12 Sep 2022 • Jihoon Sohn, Mingyu Derek Ma, Muhao Chen
The chronological hierarchies between knowledge graphs at different timestamps are represented by embedding the knowledge graphs as vectors in a common hyperbolic space.
no code implementations • 26 Jun 2022 • Shuo Ma, Kai Lu, Muhao Chen, Robert E. Skelton
This paper presents an analytical and experimental design and deployment control analysis of a hyperbolic paraboloid cable net based on clustering actuation strategies.
no code implementations • 16 Jun 2022 • Ehsan Qasemi, Piyush Khanna, Qiang Ning, Muhao Chen
Reasoning with preconditions such as "glass can be used for drinking water unless the glass is shattered" remains an open problem for language models.
no code implementations • 8 Jun 2022 • Shuo Ma, Yiqian Chen, Muhao Chen, Robert E. Skelton
This paper presents the equilibrium and stiffness study of clustered tensegrity structures (CTS) considering pulley sizes.
1 code implementation • 25 May 2022 • Nan Xu, Fei Wang, Bangzheng Li, Mingtao Dong, Muhao Chen
Due to shortcuts from surface patterns to annotated entity labels and biased training, existing entity typing models are subject to the problem of spurious correlations.
1 code implementation • 19 May 2022 • Keming Lu, I-Hung Hsu, Wenxuan Zhou, Mingyu Derek Ma, Muhao Chen
Considering that summarization tasks aim at acquiring concise expressions of synoptical information from the longer context, these tasks naturally align with the objective of RE, i. e., extracting a kind of synoptical information that describes the relation of entity mentions.
Ranked #6 on
Relation Extraction
on TACRED
no code implementations • Findings (NAACL) 2022 • Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Bryan Hooi
GRAPHCACHE aggregates the features from sentences in the whole dataset to learn global representations of properties, and use them to augment the local features within individual sentences.
1 code implementation • NAACL 2022 • Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi
In this paper, we propose the CORE (Counterfactual Analysis based Relation Extraction) debiasing method that guides the RE models to focus on the main effects of textual context without losing the entity information.
1 code implementation • NAACL 2022 • Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen
This prunes the full self-attention structure into an order-invariant graph attention that captures the connected graph structure of cells belonging to the same row or column, and it differentiates between relevant cells and irrelevant cells from the structural perspective.
Ranked #2 on
Data-to-Text Generation
on ToTTo
no code implementations • Findings (NAACL) 2022 • Juncheng Liu, Zequn Sun, Bryan Hooi, Yiwei Wang, Dayiheng Liu, Baosong Yang, Xiaokui Xiao, Muhao Chen
We study dangling-aware entity alignment in knowledge graphs (KGs), which is an underexplored but important problem.
1 code implementation • NAACL 2022 • James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen
Semantic typing aims at classifying tokens or spans of interest in a textual context into semantic categories such as relations, entity types, and event types.
Ranked #2 on
Relation Extraction
on TACRED
1 code implementation • NAACL 2022 • Wenxuan Zhou, Qiang Ning, Heba Elfardy, Kevin Small, Muhao Chen
Current question answering (QA) systems primarily consider the single-answer scenario, where each question is assumed to be paired with one correct answer.
1 code implementation • 12 Feb 2022 • Bangzheng Li, Wenpeng Yin, Muhao Chen
The task of ultra-fine entity typing (UFET) seeks to predict diverse and free-form words or phrases that describe the appropriate types of entities mentioned in sentences.
Ranked #1 on
Entity Typing
on FIGER
no code implementations • 16 Dec 2021 • Wenxuan Zhou, Fangyu Liu, huan zhang, Muhao Chen
Deep neural networks are often overparameterized and may not easily achieve model generalization.
1 code implementation • ICLR 2022 • Peifeng Wang, Jonathan Zamora, Junfeng Liu, Filip Ilievski, Muhao Chen, Xiang Ren
In this paper, we propose an Imagine-and-Verbalize (I&V) method, which learns to imagine a relational scene knowledge graph (SKG) with relations between the input concepts, and leverage the SKG as a constraint when generating a plausible scene description.
no code implementations • 1 Dec 2021 • Yiwei Wang, Yujun Cai, Yuxuan Liang, Wei Wang, Henghui Ding, Muhao Chen, Jing Tang, Bryan Hooi
Representing a label distribution as a one-hot vector is a common practice in training node classification models.
no code implementations • 17 Oct 2021 • Shuo Ma, Muhao Chen, Robert E. Skelton
This paper presents the formulations of nonlinear and linearized statics, dynamics, and control for any clustered tensegrity system (CTS).
1 code implementation • ACL 2022 • Wenxuan Zhou, Fangyu Liu, Ivan Vulić, Nigel Collier, Muhao Chen
To achieve this, it is crucial to represent multilingual knowledge in a shared/unified space.
no code implementations • Findings (EMNLP) 2021 • Mingyu Derek Ma, Muhao Chen, Te-Lin Wu, Nanyun Peng
Taxonomies are valuable resources for many applications, but the limited coverage due to the expensive manual curation process hinders their general applicability.
no code implementations • EMNLP 2021 • Xiyang Zhang, Muhao Chen, Jonathan May
Storytelling, whether via fables, news reports, documentaries, or memoirs, can be thought of as the communication of interesting and related events that, taken together, form a concrete process.
no code implementations • EMNLP 2021 • Haoyu Wang, Hongming Zhang, Muhao Chen, Dan Roth
The task of subevent detection aims to resolve this granularity issue, recognizing the membership of multi-granular events in event complexes.
1 code implementation • Findings (EMNLP) 2021 • Fei Wang, Kexuan Sun, Jay Pujara, Pedro Szekely, Muhao Chen
From one perspective, our system conducts masked salient token prediction to enhance the model for alignment and reasoning between the table and the statement.
Ranked #6 on
Table-based Fact Verification
on TabFact
no code implementations • ACL 2021 • Muhao Chen, Hongming Zhang, Qiang Ning, Manling Li, Heng Ji, Kathleen McKeown, Dan Roth
This tutorial targets researchers and practitioners who are interested in AI technologies that help machines understand natural language text, particularly real-world events described in the text.
1 code implementation • Findings (ACL) 2021 • Peifeng Wang, Filip Ilievski, Muhao Chen, Xiang Ren
Inspired by evidence that pretrained language models (LMs) encode commonsense knowledge, recent work has applied LMs to automatically populate commonsense knowledge graphs (CKGs).
no code implementations • 15 Jun 2021 • Shuo Ma, Muhao Chen, Xingfei Yuan, Robert E. Skelton
Results show that the proposed CTS cable dome always has one prestress mode and is globally stable in its deployment trajectory.
1 code implementation • ACL 2021 • Zequn Sun, Muhao Chen, Wei Hu
Since KGs possess different sets of entities, there could be entities that cannot find alignment across them, leading to the problem of dangling entities.
Ranked #1 on
Entity Alignment
on DBP2.0 zh-en
1 code implementation • 4 May 2021 • Fei Wang, Kexuan Sun, Muhao Chen, Jay Pujara, Pedro Szekely
The task of natural language table retrieval (NLTR) seeks to retrieve semantically relevant tables based on natural language queries.
1 code implementation • EMNLP 2021 • Wenxuan Zhou, Fangyu Liu, Muhao Chen
Pretrained Transformers achieve remarkable performance when training and test data are from the same distribution.
no code implementations • 18 Apr 2021 • Ehsan Qasemi, Filip Ilievski, Muhao Chen, Pedro Szekely
To address this gap, we propose a novel challenge of reasoning with circumstantial preconditions.
1 code implementation • EMNLP 2021 • Wenxuan Zhou, Muhao Chen
Recent information extraction approaches have relied on training deep neural models.
Ranked #1 on
Named Entity Recognition (NER)
on CoNLL++
1 code implementation • NAACL 2021 • Xuelu Chen, Michael Boratko, Muhao Chen, Shib Sankar Dasgupta, Xiang Lorraine Li, Andrew McCallum
Knowledge bases often consist of facts which are harvested from a variety of sources, many of which are noisy and some of which conflict, resulting in a level of uncertainty for each triple.
1 code implementation • 15 Mar 2021 • Junheng Hao, Muhao Chen, Wenchao Yu, Yizhou Sun, Wei Wang
The cross-view association model is learned to bridge the embeddings of ontological concepts and their corresponding instance-view entities.
1 code implementation • 7 Mar 2021 • Junheng Hao, Chelsea Ju, Muhao Chen, Yizhou Sun, Carlo Zaniolo, Wei Wang
Leveraging a wide-range of biological knowledge, such as gene ontology and protein-protein interaction (PPI) networks from other closely related species presents a vital approach to infer the molecular impact of a new species.
1 code implementation • 6 Mar 2021 • Changping Meng, Muhao Chen, Jie Mao, Jennifer Neville
Analyzing the readability of articles has been an important sociolinguistic task.
1 code implementation • 2 Feb 2021 • Wenxuan Zhou, Muhao Chen
Sentence-level relation extraction (RE) aims at identifying the relationship between two entities in a sentence.
Ranked #2 on
Relation Extraction
on Re-TACRED
1 code implementation • 15 Dec 2020 • Cunchao Zhu, Muhao Chen, Changjun Fan, Guangquan Cheng, Yan Zhan
Since such temporal knowledge graphs often suffer from incompleteness, it is important to develop time-aware representation learning models that help to infer the missing temporal facts.
no code implementations • CONLL 2020 • Muhao Chen, Hongming Zhang, Haoyu Wang, Dan Roth
This paper studies a new cognitively motivated semantic typing task, multi-axis event process typing, that, given anevent process, attempts to infer free-form typelabels describing (i) the type of action made bythe process and (ii) the type of object the pro-cess seeks to affect.
no code implementations • EMNLP 2020 • Hongming Zhang, Muhao Chen, Haoyu Wang, Yangqiu Song, Dan Roth
Computational and cognitive studies of event understanding suggest that identifying, comprehending, and predicting events depend on having structured representations of a sequence of events and on conceptualizing (abstracting) its components into (soft) event categories.
no code implementations • EMNLP 2020 • Haoyu Wang, Muhao Chen, Hongming Zhang, Dan Roth
Understanding natural language involves recognizing how multiple event mentions structurally and temporally interact with each other.
no code implementations • 13 Oct 2020 • Muhao Chen, Hongming Zhang, Haoyu Wang, Dan Roth
This paper studies a new cognitively motivated semantic typing task, multi-axis event process typing, that, given an event process, attempts to infer free-form type labels describing (i) the type of action made by the process and (ii) the type of object the process seeks to affect.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Xuelu Chen, Muhao Chen, Changjun Fan, Ankith Uppunda, Yizhou Sun, Carlo Zaniolo
Predicting missing facts in a knowledge graph (KG) is a crucial task in knowledge base construction and reasoning, and it has been the subject of much research in recent works using KG embeddings.
Ranked #2 on
Knowledge Graph Completion
on DPB-5L (French)
1 code implementation • EMNLP 2020 • Zequn Sun, Muhao Chen, Wei Hu, Chengming Wang, Jian Dai, Wei zhang
Capturing associations for knowledge graphs (KGs) through entity alignment, entity type inference and other related tasks benefits NLP applications with comprehensive knowledge representations.
Ranked #26 on
Entity Alignment
on DBP15k zh-en
2 code implementations • 28 Sep 2020 • Fangyu Liu, Muhao Chen, Dan Roth, Nigel Collier
This work studies the use of visual semantic representations to align entities in heterogeneous knowledge graphs (KGs).
Ranked #11 on
Entity Alignment
on dbp15k ja-en
(using extra training data)
1 code implementation • EACL 2021 • Muhao Chen, Weijia Shi, Ben Zhou, Dan Roth
Much research effort has been put to multilingual knowledge graph (KG) embedding methods to address the entity alignment task, which seeks to match entities in different languagespecific KGs that refer to the same real-world object.
Ranked #17 on
Entity Alignment
on DBP15k zh-en
1 code implementation • 10 Mar 2020 • Zequn Sun, Qingheng Zhang, Wei Hu, Chengming Wang, Muhao Chen, Farahnaz Akrami, Chengkai Li
Recent advancement in KG embedding impels the advent of embedding-based entity alignment, which encodes entities in a continuous embedding space and measures entity similarities based on the learned embeddings.
1 code implementation • 20 Nov 2019 • Zequn Sun, Chengming Wang, Wei Hu, Muhao Chen, Jian Dai, Wei zhang, Yuzhong Qu
As the direct neighbors of counterpart entities are usually dissimilar due to the schema heterogeneity, AliNet introduces distant neighbors to expand the overlap between their neighborhood structures.
Ranked #27 on
Entity Alignment
on DBP15k zh-en
no code implementations • IJCNLP 2019 • Weijia Shi, Muhao Chen, Pei Zhou, Kai-Wei Chang
Contextualized word embedding models, such as ELMo, generate meaningful representations of words and their context.
1 code implementation • IJCNLP 2019 • Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao Huang, Muhao Chen, Ryan Cotterell, Kai-Wei Chang
Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora.
1 code implementation • 30 Aug 2019 • Haochen Chen, Syed Fahad Sultan, Yingtao Tian, Muhao Chen, Steven Skiena
Two key features of FastRP are: 1) it explicitly constructs a node similarity matrix that captures transitive relationships in a graph and normalizes matrix entries based on node degrees; 2) it utilizes very sparse random projection, which is a scalable optimization-free method for dimension reduction.
no code implementations • WS 2019 • Weijia Shi, Muhao Chen, Yingtao Tian, Kai-Wei Chang
Bilingual word embeddings, which representlexicons of different languages in a shared em-bedding space, are essential for supporting se-mantic and knowledge transfers in a variety ofcross-lingual NLP tasks.
1 code implementation • 6 Jun 2019 • Qingheng Zhang, Zequn Sun, Wei Hu, Muhao Chen, Lingbing Guo, Yuzhong Qu
Furthermore, we design some cross-KG inference methods to enhance the alignment between two KGs.
1 code implementation • 24 May 2019 • Changjun Fan, Li Zeng, Yuhui Ding, Muhao Chen, Yizhou Sun, Zhong Liu
By training on small-scale networks, the learned model is capable of assigning relative BC scores to nodes for any unseen networks, and thus identifying the highly-ranked nodes.
no code implementations • 4 Dec 2018 • Pei Zhou, Muhao Chen, Kai-Wei Chang, Carlo Zaniolo
Quantifying differences in terminologies from various academic domains has been a longstanding problem yet to be solved.
1 code implementation • 26 Nov 2018 • Xuelu Chen, Muhao Chen, Weijia Shi, Yizhou Sun, Carlo Zaniolo
However, there are many KGs that model uncertain knowledge, which typically model the inherent uncertainty of relations facts with a confidence score, and embedding such uncertain knowledge represents an unresolved challenge.
1 code implementation • 13 Sep 2018 • Haochen Chen, Xiaofei Sun, Yingtao Tian, Bryan Perozzi, Muhao Chen, Steven Skiena
Network embedding methods aim at learning low-dimensional latent representation of nodes in a network.
Social and Information Networks Physics and Society
no code implementations • 7 Sep 2018 • Muhao Chen, Yingtao Tian, Xuelu Chen, Zijun Xue, Carlo Zaniolo
Recent advances in translation-based graph embedding methods for populating instance-level knowledge graphs lead to promising new approaching for the ontology population problem.
1 code implementation • CONLL 2019 • Muhao Chen, Yingtao Tian, Haochen Chen, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo
Bilingual word embeddings have been widely used to capture the similarity of lexical semantics in different human languages.
1 code implementation • 31 Jul 2018 • Muhao Chen, Changping Meng, Gang Huang, Carlo Zaniolo
Nowadays, editors tend to separate different subtopics of a long Wiki-pedia article into multiple sub-articles.
no code implementations • 18 Jun 2018 • Muhao Chen, Yingtao Tian, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo
Since many multilingual KGs also provide literal descriptions of entities, in this paper, we introduce an embedding-based approach which leverages a weakly aligned multilingual KG for semi-supervised cross-lingual learning using entity descriptions.
no code implementations • 24 May 2017 • Tao Zhou, Muhao Chen, Jie Yu, Demetri Terzopoulos
Following the recent progress in image classification and captioning using deep learning, we develop a novel natural language person retrieval system based on an attention mechanism.
2 code implementations • 12 Nov 2016 • Muhao Chen, Yingtao Tian, Mohan Yang, Carlo Zaniolo
Many recent works have demonstrated the benefits of knowledge graph embeddings in completing monolingual knowledge graphs.
Ranked #36 on
Entity Alignment
on DBP15k zh-en