Search Results for author: Xiang Ren

Found 186 papers, 114 papers with code

Hierarchical Graph Representation Learning with Differentiable Pooling

14 code implementations • NeurIPS 2018 • Rex Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L. Hamilton, Jure Leskovec

Recently, graph neural networks (GNNs) have revolutionized the field of graph representation learning through effectively learned node embeddings, and achieved state-of-the-art results in tasks such as node classification and link prediction.

Ranked #1 on Graph Classification on REDDIT-MULTI-12K

General Classification Graph Classification +3

12,992

Paper
Code

Recurrent Event Network: Autoregressive Structure Inference over Temporal Knowledge Graphs

2 code implementations • 11 Apr 2019 • Woojeong Jin, Meng Qu, Xisen Jin, Xiang Ren

The task becomes more challenging on temporal knowledge graphs, where each fact is associated with a timestamp.

Knowledge Graphs Link Prediction +1

12,992

Paper
Code

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

3 code implementations • 9 Jun 2022 • Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew Dai, Andrew La, Andrew Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartłomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, César Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodola, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, Francois Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocoń, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, Jose Hernandez-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras-Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Şenel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, Maria Jose Ramírez Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Orduna Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael Ivanitskiy, Michael Starritt, Michael Strube, Michał Swędrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T, Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha S. Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Miłkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima, Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, ZiRui Wang, Ziyi Wu

BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models.

Common Sense Reasoning Math +1

2,650

Paper
Code

Automated Phrase Mining from Massive Text Corpora

4 code implementations • 15 Feb 2017 • Jingbo Shang, Jialu Liu, Meng Jiang, Xiang Ren, Clare R. Voss, Jiawei Han

As one of the fundamental tasks in text analysis, phrase mining aims at extracting quality phrases from a text corpus.

General Knowledge POS +1

1,161

Paper
Code

Empower Sequence Labeling with Task-Aware Neural Language Model

3 code implementations • 13 Sep 2017 • Liyuan Liu, Jingbo Shang, Frank F. Xu, Xiang Ren, Huan Gui, Jian Peng, Jiawei Han

In this study, we develop a novel neural framework to extract abundant knowledge hidden in raw texts to empower the sequence labeling task.

Ranked #13 on Part-Of-Speech Tagging on Penn Treebank

Language Modelling named-entity-recognition +5

845

Paper
Code

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

3 code implementations • 5 Jun 2023 • Dongfu Jiang, Xiang Ren, Bill Yuchen Lin

We present LLM-Blender, an ensembling framework designed to attain consistently superior performance by leveraging the diverse strengths of multiple open-source large language models (LLMs).

786

Paper
Code

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

3 code implementations • ICML 2018 • Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, Jure Leskovec

Modeling and generating graphs is fundamental for studying networks in biology, engineering, and social sciences.

Graph Generation

672

Paper
Code

Learning Named Entity Tagger using Domain-Specific Dictionary

1 code implementation • EMNLP 2018 • Jingbo Shang, Liyuan Liu, Xiang Ren, Xiaotao Gu, Teng Ren, Jiawei Han

Recent advances in deep neural models allow us to build reliable named entity recognition (NER) systems without handcrafting features.

named-entity-recognition Named Entity Recognition +1

483

Paper
Code

Indirect Supervision for Relation Extraction using Question-Answer Pairs

2 code implementations • 30 Oct 2017 • Zeqiu Wu, Xiang Ren, Frank F. Xu, Ji Li, Jiawei Han

However, due to the incompleteness of knowledge bases and the context-agnostic labeling, the training data collected via distant supervision (DS) can be very noisy.

Question Answering Relation +1

418

Paper
Code

CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases

2 code implementations • 27 Oct 2016 • Xiang Ren, Zeqiu Wu, Wenqi He, Meng Qu, Clare R. Voss, Heng Ji, Tarek F. Abdelzaher, Jiawei Han

We propose a novel domain-independent framework, called CoType, that runs a data-driven text segmentation algorithm to extract entity mentions, and jointly embeds entity mentions, relation mentions, text features and type labels into two low-dimensional spaces (for entity and relation mentions respectively), where, in each space, objects whose types are close will also have similar representations.

Ranked #11 on Relation Extraction on NYT11-HRL

Joint Entity and Relation Extraction Relation +1

418

Paper
Code

KagNet: Knowledge-Aware Graph Networks for Commonsense Reasoning

2 code implementations • IJCNLP 2019 • Bill Yuchen Lin, Xinyue Chen, Jamin Chen, Xiang Ren

Commonsense reasoning aims to empower machines with the human ability to make presumptions about ordinary situations in our daily life.

Ranked #29 on Common Sense Reasoning on CommonsenseQA (using extra training data)

Common Sense Reasoning Knowledge Base Question Answering +2

267

Paper
Code

Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering

2 code implementations • EMNLP 2020 • Yanlin Feng, Xinyue Chen, Bill Yuchen Lin, Peifeng Wang, Jun Yan, Xiang Ren

Existing work on augmenting question answering (QA) models with external knowledge (e. g., knowledge graphs) either struggle to model multi-hop relations efficiently, or lack transparency into the model's prediction rationale.

Knowledge Graphs Question Answering +2

244

Paper
Code

CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP

3 code implementations • EMNLP 2021 • Qinyuan Ye, Bill Yuchen Lin, Xiang Ren

Humans can learn a new language task efficiently with only few examples, by leveraging their knowledge obtained when learning prior tasks.

Few-Shot Learning

239

Paper
Code

Self-Discover: Large Language Models Self-Compose Reasoning Structures

2 code implementations • 6 Feb 2024 • Pei Zhou, Jay Pujara, Xiang Ren, Xinyun Chen, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou, Swaroop Mishra, Huaixiu Steven Zheng

We introduce SELF-DISCOVER, a general framework for LLMs to self-discover the task-intrinsic reasoning structures to tackle complex reasoning problems that are challenging for typical prompting methods.

Math

227

Paper
Code

Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models

2 code implementations • ICLR 2020 • Xisen Jin, Zhongyu Wei, Junyi Du, xiangyang xue, Xiang Ren

Human and metrics evaluation on both LSTM models and BERT Transformer models on multiple datasets show that our algorithms outperform prior hierarchical explanation algorithms.

Semantic Composition

203

Paper
Code

Discretized Integrated Gradients for Explaining Language Models

2 code implementations • EMNLP 2021 • Soumya Sanyal, Xiang Ren

As a prominent attribution-based explanation algorithm, Integrated Gradients (IG) is widely adopted due to its desirable explanation axioms and the ease of gradient computation.

Feature Importance Sentiment Analysis +1

203

Paper
Code

TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition

1 code implementation • ACL 2020 • Bill Yuchen Lin, Dong-Ho Lee, Ming Shen, Ryan Moreno, Xiao Huang, Prashant Shiralkar, Xiang Ren

In this paper, we introduce "entity triggers," an effective proxy of human explanations for facilitating label-efficient learning of NER models.

named-entity-recognition Named Entity Recognition +2

171

Paper
Code

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling

1 code implementation • EMNLP 2018 • Liyuan Liu, Xiang Ren, Jingbo Shang, Jian Peng, Jiawei Han

Many efforts have been made to facilitate natural language processing tasks with pre-trained language models (LMs), and brought significant improvements to various applications.

Ranked #47 on Named Entity Recognition (NER) on CoNLL 2003 (English)

Language Modelling Named Entity Recognition (NER)

146

Paper
Code

Hierarchical Text Classification with Reinforced Label Assignment

1 code implementation • IJCNLP 2019 • Yuning Mao, Jingjing Tian, Jiawei Han, Xiang Ren

While existing hierarchical text classification (HTC) methods attempt to capture label hierarchies for model training, they either make local decisions regarding each label or completely ignore the hierarchy information during inference.

Ranked #1 on Text Classification on RCV1 (Macro F1 metric)

General Classification text-classification +1

139

Paper
Code

CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning

2 code implementations • Findings of the Association for Computational Linguistics 2020 • Bill Yuchen Lin, Wangchunshu Zhou, Ming Shen, Pei Zhou, Chandra Bhagavatula, Yejin Choi, Xiang Ren

In this paper, we present a constrained text generation task, CommonGen associated with a benchmark dataset, to explicitly test machines for the ability of generative commonsense reasoning.

Ranked #1 on Text Generation on CommonGen

Common Sense Reasoning Question Answering +3

137

Paper
Code

Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning

2 code implementations • 30 Jan 2018 • Xuan Wang, Yu Zhang, Xiang Ren, Yuhao Zhang, Marinka Zitnik, Jingbo Shang, Curtis Langlotz, Jiawei Han

Motivation: State-of-the-art biomedical named entity recognition (BioNER) systems often require handcrafted features specific to each entity type, such as genes, chemicals and diseases.

Feature Engineering Multi-Task Learning +4

129

Paper
Code

Time-Series Event Prediction with Evolutionary State Graph

3 code implementations • 10 May 2019 • Wenjie Hu, Yang Yang, Ziqiang Cheng, Carl Yang, Xiang Ren

In this paper, we present evolutionary state graph, a dynamic graph structure designed to systematically represent the evolving relations (edges) among states (nodes) along time.

Time Series Time Series Classification +1

Paper
Code

Heterogeneous Supervision for Relation Extraction: A Representation Learning Approach

1 code implementation • EMNLP 2017 • Liyuan Liu, Xiang Ren, Qi Zhu, Shi Zhi, Huan Gui, Heng Ji, Jiawei Han

These annotations, referred as heterogeneous supervision, often conflict with each other, which brings a new challenge to the original relation extraction task: how to infer the true label from noisy labels for a given instance.

Relation Relation Extraction +1

Paper
Code

HMEAE: Hierarchical Modular Event Argument Extraction

1 code implementation • IJCNLP 2019 • Xiaozhi Wang, Ziqi Wang, Xu Han, Zhiyuan Liu, Juanzi Li, Peng Li, Maosong Sun, Jie zhou, Xiang Ren

Existing event extraction methods classify each argument role independently, ignoring the conceptual correlations between different argument roles.

Event Argument Extraction Event Extraction +1

Paper
Code

Characterizing and Forecasting User Engagement with In-app Action Graph: A Case Study of Snapchat

1 code implementation • 2 Jun 2019 • Yozen Liu, Xiaolin Shi, Lucas Pierce, Xiang Ren

Here we propose to formalize individual user's in-app action transition patterns as a temporally evolving action graph, and analyze its characteristics in terms of informing future user engagement.

Time Series Analysis

Paper
Code

Commonsense-Focused Dialogues for Response Generation: An Empirical Study

1 code implementation • SIGDIAL (ACL) 2021 • Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

Moreover, existing dialogue datasets do not explicitly focus on exhibiting commonsense as a facet.

Response Generation Text Generation

Paper
Code

Dataless Knowledge Fusion by Merging Weights of Language Models

1 code implementation • 19 Dec 2022 • Xisen Jin, Xiang Ren, Daniel Preotiuc-Pietro, Pengxiang Cheng

In this paper, we study the problem of merging individual models built on different training data sets to obtain a single model that performs well both across all data set domains and can generalize on out-of-domain data.

Multi-Task Learning

Paper
Code

End-to-End Reinforcement Learning for Automatic Taxonomy Induction

1 code implementation • ACL 2018 • Yuning Mao, Xiang Ren, Jiaming Shen, Xiaotao Gu, Jiawei Han

We present a novel end-to-end reinforcement learning approach to automatic taxonomy induction from a set of terms.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Mining Entity Synonyms with Efficient Neural Set Generation

1 code implementation • 16 Nov 2018 • Jiaming Shen, Ruiliang Lyu, Xiang Ren, Michelle Vanni, Brian Sadler, Jiawei Han

Mining entity synonym sets (i. e., sets of terms referring to the same entity) is an important task for many entity-leveraging applications.

Paper
Code

Jointly Learning Explainable Rules for Recommendation with Knowledge Graph

1 code implementation • 9 Mar 2019 • Weizhi Ma, Min Zhang, Yue Cao, Woojeong, Jin, Chenyang Wang, Yiqun Liu, Shaoping Ma, Xiang Ren

The framework encourages two modules to complement each other in generating effective and explainable recommendation: 1) inductive rules, mined from item-centric knowledge graphs, summarize common multi-hop relational patterns for inferring different item associations and provide human-readable explanation for model prediction; 2) recommendation module can be augmented by induced rules and thus have better generalization ability dealing with the cold-start issue.

Explainable Recommendation Knowledge Graphs +1

Paper
Code

Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding

3 code implementations • 17 Feb 2016 • Xiang Ren, Wenqi He, Meng Qu, Clare R. Voss, Heng Ji, Jiawei Han

Current systems of fine-grained entity typing use distant supervision in conjunction with existing knowledge bases to assign categories (type labels) to entity mentions.

Entity Typing Semantic Similarity +2

Paper
Code

AFET: Automatic Fine-Grained Entity Typing by Hierarchical Partial-Label Embedding

1 code implementation • EMNLP 2016 • Xiang Ren, Wenqi He, Meng Qu, Lifu Huang, Heng Ji, Jiawei Han

Entity Typing Named Entity Recognition (NER) +2

Paper
Code

Collaborative Policy Learning for Open Knowledge Graph Reasoning

2 code implementations • IJCNLP 2019 • Cong Fu, Tong Chen, Meng Qu, Woojeong Jin, Xiang Ren

We propose a novel reinforcement learning framework to train two collaborative agents jointly, i. e., a multi-hop graph reasoner and a fact extractor.

Paper
Code

NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction

2 code implementations • 5 Sep 2019 • Wenxuan Zhou, Hongtao Lin, Bill Yuchen Lin, Ziqi Wang, Junyi Du, Leonardo Neves, Xiang Ren

The soft matching module learns to match rules with semantically similar sentences such that raw corpora can be automatically labeled and leveraged by the RE module (in a much better coverage) as augmented supervision, in addition to the exactly matched sentences.

Relation Relation Extraction +1

Paper
Code

Learning Dynamic Context Augmentation for Global Entity Linking

2 code implementations • IJCNLP 2019 • Xiyuan Yang, Xiaotao Gu, Sheng Lin, Siliang Tang, Yueting Zhuang, Fei Wu, Zhigang Chen, Guoping Hu, Xiang Ren

Despite of the recent success of collective entity linking (EL) methods, these "global" inference methods may yield sub-optimal results when the "all-mention coherence" assumption breaks, and often suffer from high computational cost at the inference stage, due to the complex search space.

Ranked #5 on Entity Disambiguation on AIDA-CoNLL

Entity Disambiguation Entity Linking +1

Paper
Code

ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning

2 code implementations • EMNLP 2021 • Rujun Han, Xiang Ren, Nanyun Peng

While pre-trained language models (PTLMs) have achieved noticeable success on many NLP tasks, they still struggle for tasks that require event temporal reasoning, which is essential for event-centric applications.

Ranked #1 on Question Answering on Torque

Continual Pretraining Language Modelling +4

Paper
Code

Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER

1 code implementation • ACL 2022 • Dong-Ho Lee, Akshen Kadakia, Kangmin Tan, Mahak Agarwal, Xinyu Feng, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren

We also find that good demonstration can save many labeled examples and consistency in demonstration contributes to better performance.

Domain Adaptation Few-Shot Text Classification +6

Paper
Code

Automatic Synonym Discovery with Knowledge Bases

1 code implementation • 25 Jun 2017 • Meng Qu, Xiang Ren, Jiawei Han

In this paper, we study the problem of automatic synonym discovery with knowledge bases, that is, identifying synonyms for knowledge base entities in a given domain-specific corpus.

Paper
Code

Looking Beyond Label Noise: Shifted Label Distribution Matters in Distantly Supervised Relation Extraction

1 code implementation • IJCNLP 2019 • Qinyuan Ye, Liyuan Liu, Maosen Zhang, Xiang Ren

In this paper, we study the problem what limits the performance of DS-trained neural models, conduct thorough analyses, and identify a factor that can influence the performance greatly, shifted label distribution.

Relation Relation Extraction

Paper
Code

A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models

1 code implementation • ACL 2022 • Woojeong Jin, Yu Cheng, Yelong Shen, Weizhu Chen, Xiang Ren

Large pre-trained vision-language (VL) models can learn a new task with a handful of examples and generalize to a new task without fine-tuning.

Ranked #4 on Image Captioning on Flickr30k Captions test (SPICE metric)

Image Captioning Language Modelling +2

Paper
Code

Learning Collaborative Agents with Rule Guidance for Knowledge Graph Reasoning

1 code implementation • EMNLP 2020 • Deren Lei, Gangrong Jiang, Xiaotao Gu, Kexuan Sun, Yuning Mao, Xiang Ren

Walk-based models have shown their advantages in knowledge graph (KG) reasoning by achieving decent performance while providing interpretable decisions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Integrating Local Context and Global Cohesiveness for Open Information Extraction

1 code implementation • 26 Apr 2018 • Qi Zhu, Xiang Ren, Jingbo Shang, Yu Zhang, Ahmed El-Kishky, Jiawei Han

However, current Open IE systems focus on modeling local context information in a sentence to extract relation tuples, while ignoring the fact that global statistics in a large corpus can be collectively leveraged to identify high-quality sentence-level extractions.

Open Information Extraction Relation +1

Paper
Code

Learning Dual Retrieval Module for Semi-supervised Relation Extraction

1 code implementation • 20 Feb 2019 • Hongtao Lin, Jun Yan, Meng Qu, Xiang Ren

In this paper, we leverage a key insight that retrieving sentences expressing a relation is a dual task of predicting relation label for a given sentence---two tasks are complementary to each other and can be optimized jointly for mutual enhancement.

MULTI-VIEW LEARNING Relation +3

Paper
Code

Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Peifeng Wang, Nanyun Peng, Filip Ilievski, Pedro Szekely, Xiang Ren

In this paper, we augment a general commonsense QA framework with a knowledgeable path generator.

Knowledge Graphs Language Modelling +1

Paper
Code

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

3 code implementations • ACL 2020 • Brendan Kennedy, Xisen Jin, Aida Mostafazadeh Davani, Morteza Dehghani, Xiang Ren

Hate speech classifiers trained on imbalanced datasets struggle to determine if group identifiers like "gay" or "black" are used in offensive or prejudiced ways.

Paper
Code

SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

1 code implementation • 17 Oct 2019 • Jiaming Shen, Zeqiu Wu, Dongming Lei, Jingbo Shang, Xiang Ren, Jiawei Han

In this study, we propose a novel framework, SetExpan, which tackles this problem, with two techniques: (1) a context feature selection method that selects clean context features for calculating entity-entity distributional similarity, and (2) a ranking-based unsupervised ensemble method for expanding entity set based on denoised context features.

feature selection Question Answering

Paper
Code

SCOTT: Self-Consistent Chain-of-Thought Distillation

1 code implementation • 3 May 2023 • Peifeng Wang, Zhengyang Wang, Zheng Li, Yifan Gao, Bing Yin, Xiang Ren

While CoT can yield dramatically improved performance, such gains are only observed for sufficiently large LMs.

counterfactual Counterfactual Reasoning +1

Paper
Code

Cascade-BGNN: Toward Efficient Self-supervised Representation Learning on Large-scale Bipartite Graphs

1 code implementation • 27 Jun 2019 • Chaoyang He, Tian Xie, Yu Rong, Wenbing Huang, Junzhou Huang, Xiang Ren, Cyrus Shahabi

Existing techniques either cannot be scaled to large-scale bipartite graphs that have limited labels or cannot exploit the unique structure of bipartite graphs, which have distinct node features in two domains.

Recommendation Systems Representation Learning

Paper
Code

Pre-training Text-to-Text Transformers for Concept-centric Common Sense

1 code implementation • 24 Oct 2020 • Wangchunshu Zhou, Dong-Ho Lee, Ravi Kiran Selvam, Seyeon Lee, Bill Yuchen Lin, Xiang Ren

Pre-trained language models (PTLM) have achieved impressive results in a range of natural language understanding (NLU) and generation (NLG) tasks.

Common Sense Reasoning Knowledge Graphs +3

Paper
Code

Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation

1 code implementation • EMNLP 2021 • Mozhdeh Gheini, Xiang Ren, Jonathan May

We study the power of cross-attention in the Transformer architecture within the context of transfer learning for machine translation, and extend the findings of studies into cross-attention when training from scratch.

Machine Translation Transfer Learning +1

Paper
Code

Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning

1 code implementation • ACL 2021 • Bill Yuchen Lin, Seyeon Lee, Xiaoyang Qiao, Xiang Ren

In addition, we also create two new datasets, X-CSQA and X-CODAH, by translating their English versions to 15 other languages, so that we can evaluate popular ML-LMs for cross-lingual commonsense reasoning.

Common Sense Reasoning Sentence

Paper
Code

UNIREX: A Unified Learning Framework for Language Model Rationale Extraction

1 code implementation • BigScience (ACL) 2022 • Aaron Chan, Maziar Sanjabi, Lambert Mathias, Liang Tan, Shaoliang Nie, Xiaochang Peng, Xiang Ren, Hamed Firooz

An extractive rationale explains a language model's (LM's) prediction on a given task instance by highlighting the text inputs that most influenced the prediction.

Language Modelling text-classification +1

Paper
Code

Unsupervised Cross-Task Generalization via Retrieval Augmentation

1 code implementation • 17 Apr 2022 • Bill Yuchen Lin, Kangmin Tan, Chris Miller, Beiwen Tian, Xiang Ren

Humans can perform unseen tasks by recalling relevant skills acquired previously and then generalizing them to the target tasks, even if there is no supervision at all.

Retrieval

Paper
Code

Faith and Fate: Limits of Transformers on Compositionality

1 code implementation • NeurIPS 2023 • Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jiang, Bill Yuchen Lin, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena D. Hwang, Soumya Sanyal, Sean Welleck, Xiang Ren, Allyson Ettinger, Zaid Harchaoui, Yejin Choi

We formulate compositional tasks as computation graphs to systematically quantify the level of complexity, and break down reasoning steps into intermediate sub-procedures.

Paper
Code

An Attention-based Collaboration Framework for Multi-View Network Representation Learning

1 code implementation • 19 Sep 2017 • Meng Qu, Jian Tang, Jingbo Shang, Xiang Ren, Ming Zhang, Jiawei Han

Existing approaches usually study networks with a single type of proximity between nodes, which defines a single view of a network.

Representation Learning

Paper
Code

Visually Grounded Continual Learning of Compositional Phrases

2 code implementations • EMNLP 2020 • Xisen Jin, Junyi Du, Arka Sadhu, Ram Nevatia, Xiang Ren

To study this human-like language acquisition ability, we present VisCOLL, a visually grounded language learning task, which simulates the continual acquisition of compositional phrases from streaming visual scenes.

Continual Learning Grounded language learning +1

Paper
Code

Temporal Attribute Prediction via Joint Modeling of Multi-Relational Structure Evolution

1 code implementation • 9 Mar 2020 • Sankalp Garg, Navodita Sharma, Woojeong Jin, Xiang Ren

We show that if the information contained in the graph and the time series data are closely related, then this inter-dependence can be used to predict the time series with improved accuracy.

Attribute Knowledge Graphs +4

Paper
Code

Learning from Explanations with Neural Execution Tree

1 code implementation • ICLR 2020 • Ziqi Wang, Yujia Qin, Wenxuan Zhou, Jun Yan, Qinyuan Ye, Leonardo Neves, Zhiyuan Liu, Xiang Ren

While deep neural networks have achieved impressive performance on a range of NLP tasks, these data-hungry models heavily rely on labeled data, which restricts their applications in scenarios where data annotation is expensive.

Data Augmentation Multi-hop Question Answering +6

Paper
Code

Constrained Abstractive Summarization: Preserving Factual Consistency with Constrained Generation

2 code implementations • 24 Oct 2020 • Yuning Mao, Xiang Ren, Heng Ji, Jiawei Han

Despite significant progress, state-of-the-art abstractive summarization methods are still prone to hallucinate content inconsistent with the source document.

Abstractive Text Summarization Keyphrase Extraction

Paper
Code

Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation

2 code implementations • EMNLP 2021 • Yuning Mao, Wenchang Ma, Deren Lei, Jiawei Han, Xiang Ren

In this paper, we present a systematic analysis that studies whether current seq2seq models, especially pre-trained language models, are good enough for preserving important input concepts and to what extent explicitly guiding generation with the concepts as lexical constraints is beneficial.

Conditional Text Generation Denoising

Paper
Code

NewsEdits: A News Article Revision Dataset and a Document-Level Reasoning Challenge

1 code implementation • 14 Jun 2022 • Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng

News article revision histories provide clues to narrative and factual evolution in news articles.

Paper
Code

NewsEdits: A News Article Revision Dataset and a Novel Document-Level Reasoning Challenge

1 code implementation • NAACL 2022 • Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng

News article revision histories provide clues to narrative and factual evolution in news articles.

Paper
Code

Learning Contextualized Knowledge Structures for Commonsense Reasoning

1 code implementation • Findings (ACL) 2021 • Jun Yan, Mrigank Raman, Aaron Chan, Tianyu Zhang, Ryan Rossi, Handong Zhao, Sungchul Kim, Nedim Lipka, Xiang Ren

Recently, knowledge graph (KG) augmented models have achieved noteworthy success on various commonsense reasoning tasks.

Knowledge Graphs Natural Language Inference +1

Paper
Code

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement

1 code implementation • 12 Oct 2023 • Linlu Qiu, Liwei Jiang, Ximing Lu, Melanie Sclar, Valentina Pyatkin, Chandra Bhagavatula, Bailin Wang, Yoon Kim, Yejin Choi, Nouha Dziri, Xiang Ren

The ability to derive underlying principles from a handful of observations and then generalize to novel situations -- known as inductive reasoning -- is central to human intelligence.

Paper
Code

Cross-relation Cross-bag Attention for Distantly-supervised Relation Extraction

1 code implementation • 27 Dec 2018 • Yujin Yuan, Liyuan Liu, Siliang Tang, Zhongfei Zhang, Yueting Zhuang, ShiLiang Pu, Fei Wu, Xiang Ren

Distant supervision leverages knowledge bases to automatically label instances, thus allowing us to train relation extractor without human annotations.

Relation Relation Extraction +1

Paper
Code

Gradient-based Editing of Memory Examples for Online Task-free Continual Learning

1 code implementation • NeurIPS 2021 • Xisen Jin, Arka Sadhu, Junyi Du, Xiang Ren

We explore task-free continual learning (CL), in which a model is trained to avoid catastrophic forgetting in the absence of explicit task boundaries or identities.

Continual Learning

Paper
Code

Raw-to-End Name Entity Recognition in Social Media

1 code implementation • 14 Aug 2019 • Liyuan Liu, Zihan Wang, Jingbo Shang, Dandong Yin, Heng Ji, Xiang Ren, Shaowen Wang, Jiawei Han

Our model neither requires the conversion from character sequences to word sequences, nor assumes tokenizer can correctly detect all word boundaries.

named-entity-recognition Named Entity Recognition +1

Paper
Code

SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning

1 code implementation • NeurIPS 2021 • Aaron Chan, Jiashu Xu, Boyuan Long, Soumya Sanyal, Tanishq Gupta, Xiang Ren

and fine (Which nodes/paths in the KG are useful?)

Knowledge Graphs

Paper
Code

FaiRR: Faithful and Robust Deductive Reasoning over Natural Language

1 code implementation • ACL 2022 • Soumya Sanyal, Harman Singh, Xiang Ren

Recent works show that such models can also produce the reasoning steps (i. e., the proof graph) that emulate the model's logical reasoning process.

Fact Selection Logical Reasoning

Paper
Code

Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

1 code implementation • 24 May 2023 • Ximing Lu, Faeze Brahman, Peter West, Jaehun Jang, Khyathi Chandu, Abhilasha Ravichander, Lianhui Qin, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Yuchen Lin, Skyler Hallinan, Xiang Ren, Sean Welleck, Yejin Choi

While extreme-scale language models have demonstrated exceptional performance on a variety of language tasks, the degree of control over these language models through pure prompting can often be limited.

Language Modelling reinforcement-learning +1

Paper
Code

Learning to Generate Task-Specific Adapters from Task Description

1 code implementation • ACL 2021 • Qinyuan Ye, Xiang Ren

Recent study further shows that they can learn to generalize to novel tasks, by including task descriptions as part of the source sequence and training the model with (source, target) examples.

Text Generation Zero-Shot Learning

Paper
Code

Refining Language Models with Compositional Explanations

1 code implementation • NeurIPS 2021 • Huihan Yao, Ying Chen, Qinyuan Ye, Xisen Jin, Xiang Ren

However, such a regularization technique lacks flexibility and coverage, since only importance scores towards a pre-defined list of features are adjusted, while more complex human knowledge such as feature interaction and pattern generalization can hardly be incorporated.

Fairness Language Modelling +2

Paper
Code

X-METRA-ADA: Cross-lingual Meta-Transfer Learning Adaptation to Natural Language Understanding and Question Answering

1 code implementation • NAACL 2021 • Meryem M'hamdi, Doo Soon Kim, Franck Dernoncourt, Trung Bui, Xiang Ren, Jonathan May

We extensively evaluate our framework on two challenging cross-lingual NLU tasks: multilingual task-oriented dialog and typologically diverse question answering.

Meta-Learning Natural Language Understanding +4

Paper
Code

Dynamic Network Embedding via Incremental Skip-gram with Negative Sampling

1 code implementation • 9 Jun 2019 • Hao Peng, Jian-Xin Li, Hao Yan, Qiran Gong, Senzhang Wang, Lin Liu, Lihong Wang, Xiang Ren

Most existing methods focus on learning the structural representations of vertices in a static network, but cannot guarantee an accurate and efficient embedding in a dynamic network scenario.

Link Prediction Multi-Label Classification +1

Paper
Code

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling

1 code implementation • ACL 2020 • Ouyu Lan, Xiao Huang, Bill Yuchen Lin, He Jiang, Liyuan Liu, Xiang Ren

Its performance is largely influenced by the annotation quality and quantity in supervised learning scenarios, and obtaining ground truth labels is often costly.

Paper
Code

IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization

1 code implementation • 2 May 2020 • Wenxuan Zhou, Bill Yuchen Lin, Xiang Ren

Fine-tuning pre-trained language models (PTLMs), such as BERT and its better variant RoBERTa, has been a common practice for advancing performance in natural language understanding (NLU) tasks.

Natural Language Understanding Representation Learning

Paper
Code

Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models

1 code implementation • NAACL 2022 • Qinyuan Ye, Madian Khabsa, Mike Lewis, Sinong Wang, Xiang Ren, Aaron Jaech

Distilling state-of-the-art transformer models into lightweight student models is an effective way to reduce computation cost at inference time.

Domain Generalization Privacy Preserving +4

Paper
Code

DOMINO: A Dual-System for Multi-step Visual Language Reasoning

1 code implementation • 4 Oct 2023 • Peifang Wang, Olga Golovneva, Armen Aghajanyan, Xiang Ren, Muhao Chen, Asli Celikyilmaz, Maryam Fazel-Zarandi

By fine-tuning the System-2 module (LLaMA-2 70B) on only a small amount of data on multi-step reasoning, the accuracy of our method is further improved and surpasses the best fully-supervised end-to-end approach by 5. 7% and a pipeline approach with FlanPaLM (540B) by 7. 5% on a challenging dataset with human-authored questions.

Arithmetic Reasoning Language Modelling +2

Paper
Code

Teaching Machine Comprehension with Compositional Explanations

2 code implementations • Findings of the Association for Computational Linguistics 2020 • Qinyuan Ye, Xiao Huang, Elizabeth Boschee, Xiang Ren

Advances in machine reading comprehension (MRC) rely heavily on the collection of large scale human-annotated examples in the form of (question, paragraph, answer) triples.

Data Augmentation Machine Reading Comprehension +1

Paper
Code

On the Robustness of Reading Comprehension Models to Entity Renaming

1 code implementation • NAACL 2022 • Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren

We study the robustness of machine reading comprehension (MRC) models to entity renaming -- do models make more wrong predictions when the same questions are asked about an entity whose name has been changed?

Continual Pretraining Machine Reading Comprehension

Paper
Code

Contextualized Scene Imagination for Generative Commonsense Reasoning

1 code implementation • ICLR 2022 • Peifeng Wang, Jonathan Zamora, Junfeng Liu, Filip Ilievski, Muhao Chen, Xiang Ren

In this paper, we propose an Imagine-and-Verbalize (I&V) method, which learns to imagine a relational scene knowledge graph (SKG) with relations between the input concepts, and leverage the SKG as a constraint when generating a plausible scene description.

Common Sense Reasoning Descriptive +2

Paper
Code

REV: Information-Theoretic Evaluation of Free-Text Rationales

1 code implementation • 10 Oct 2022 • Hanjie Chen, Faeze Brahman, Xiang Ren, Yangfeng Ji, Yejin Choi, Swabha Swayamdipta

More concretely, we propose a metric called REV (Rationale Evaluation with conditional V-information), to quantify the amount of new, label-relevant information in a rationale beyond the information already available in the input or the label.

Paper
Code

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

1 code implementation • EMNLP 2020 • Yuning Mao, Yanru Qu, Yiqing Xie, Xiang Ren, Jiawei Han

Additionally, the explicit redundancy measure in MMR helps the neural representation of the summary to better capture redundancy.

Document Summarization Multi-Document Summarization +3

Paper
Code

RockNER: A Simple Method to Create Adversarial Examples for Evaluating the Robustness of Named Entity Recognition Models

1 code implementation • EMNLP 2021 • Bill Yuchen Lin, Wenyang Gao, Jun Yan, Ryan Moreno, Xiang Ren

To audit the robustness of named entity recognition (NER) models, we propose RockNER, a simple yet effective method to create natural adversarial examples.

Data Augmentation named-entity-recognition +2

Paper
Code

Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection

1 code implementation • 31 Jul 2023 • Jun Yan, Vikas Yadav, Shiyang Li, Lichang Chen, Zheng Tang, Hai Wang, Vijay Srinivasan, Xiang Ren, Hongxia Jin

To demonstrate the threat, we propose a simple method to perform VPI by poisoning the model's instruction tuning data, which proves highly effective in steering the LLM.

Backdoor Attack

Paper
Code

Facet-Aware Evaluation for Extractive Summarization

1 code implementation • ACL 2020 • Yuning Mao, Liyuan Liu, Qi Zhu, Xiang Ren, Jiawei Han

In this paper, we present a facet-aware evaluation setup for better assessment of the information coverage in extracted summaries.

Extractive Summarization Sentence +1

Paper
Code

BITE: Textual Backdoor Attacks with Iterative Trigger Injection

1 code implementation • 25 May 2022 • Jun Yan, Vansh Gupta, Xiang Ren

We propose BITE, a backdoor attack that poisons the training data to establish strong correlations between the target label and a set of "trigger words".

Backdoor Attack Hate Speech Detection +3

Paper
Code

Retweet-BERT: Political Leaning Detection Using Language Features and Information Diffusion on Social Networks

1 code implementation • 18 Jul 2022 • Julie Jiang, Xiang Ren, Emilio Ferrara

We introduce Retweet-BERT, a simple and scalable model to estimate the political leanings of Twitter users.

Paper
Code

PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning

1 code implementation • 31 May 2023 • Faeze Brahman, Chandra Bhagavatula, Valentina Pyatkin, Jena D. Hwang, Xiang Lorraine Li, Hirona J. Arai, Soumya Sanyal, Keisuke Sakaguchi, Xiang Ren, Yejin Choi

In addition, we introduce a novel task, Counterfactual Planning, that requires a revision of a plan to cope with a counterfactual situation.

Common Sense Reasoning counterfactual +3

Paper
Code

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

1 code implementation • 18 Feb 2024 • Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren

Our analysis of GPT-series models over a rule subset reveals significant gaps in LLMs' logic understanding compared to human performance, especially in compositional and structural complex rules with certain bias patterns.

Logical Reasoning

Paper
Code

Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning

1 code implementation • Findings (EMNLP) 2021 • Xisen Jin, Bill Yuchen Lin, Mohammad Rostami, Xiang Ren

The ability to continuously expand knowledge over time and utilize it to rapidly generalize to new tasks is a key feature of human linguistic intelligence.

Continual Learning Few-Shot Learning +2

Paper
Code

PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales

1 code implementation • 3 Nov 2022 • Peifeng Wang, Aaron Chan, Filip Ilievski, Muhao Chen, Xiang Ren

Neural language models (LMs) have achieved impressive results on various language-based reasoning tasks by utilizing latent knowledge encoded in their own pretrained parameters.

counterfactual Decision Making

Paper
Code

NS3: Neuro-Symbolic Semantic Code Search

1 code implementation • 21 May 2022 • Shushan Arakelyan, Anna Hakhverdyan, Miltiadis Allamanis, Luis Garcia, Christophe Hauser, Xiang Ren

We compare our model - NS3 (Neuro-Symbolic Semantic Search) - to a number of baselines, including state-of-the-art semantic code retrieval methods, and evaluate on two datasets - CodeSearchNet and Code Search and Question Answering.

Code Search Question Answering +2

Paper
Code

Eliciting Knowledge from Experts:Automatic Transcript Parsing for Cognitive Task Analysis

2 code implementations • 26 Jun 2019 • Junyi Du, He Jiang, Jiaming Shen, Xiang Ren

To reduce human efforts and scale the process, automated CTA transcript parsing is desirable.

Relation Extraction Sentence

Paper
Code

Reporting the Unreported: Event Extraction for Analyzing the Local Representation of Hate Crimes

1 code implementation • IJCNLP 2019 • Aida Mostafazadeh Davani, Leigh Yeh, Mohammad Atari, Brendan Kennedy, Gwenyth Portillo-Wightman, Elaine Gonzalez, Natalie Delong, Rhea Bhatia, Arineh Mirinjian, Xiang Ren, Morteza Dehghani

Official reports of hate crimes in the US are under-reported relative to the actual number of such incidents.

Event Extraction

Paper
Code

Cross-lingual Lifelong Learning

1 code implementation • 23 May 2022 • Meryem M'hamdi, Xiang Ren, Jonathan May

The longstanding goal of multi-lingual learning has been to develop a universal cross-lingual model that can withstand the changes in multi-lingual data distributions.

Continual Learning Transfer Learning

Paper
Code

RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning

1 code implementation • 25 May 2022 • Soumya Sanyal, Zeyi Liao, Xiang Ren

Transformers have been shown to be able to perform deductive reasoning on a logical rulebase containing rules and statements written in English natural language.

Logical Reasoning Negation

Paper
Code

Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales

1 code implementation • 11 May 2023 • Brihi Joshi, Ziyi Liu, Sahana Ramnath, Aaron Chan, Zhewei Tong, Shaoliang Nie, Qifan Wang, Yejin Choi, Xiang Ren

Existing metrics like task performance of the LM generating the rationales, or similarity between generated and gold rationales are not good indicators of their human utility.

Paper
Code

In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search

1 code implementation • 13 Nov 2023 • Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren

We further use the data generated by LINK to construct a dataset Logic-Induced-Long-Tail (LINT) that can be used to evaluate downstream models on the long-tail distribution; LINT contains 108K knowledge statements spanning four domains.

Language Modelling Natural Language Inference +1

Paper
Code

Eliciting Knowledge from Experts: Automatic Transcript Parsing for Cognitive Task Analysis

1 code implementation • ACL 2019 • Junyi Du, He Jiang, Jiaming Shen, Xiang Ren

To reduce human efforts and scale the process, automated CTA transcript parsing is desirable.

Relation Extraction Sentence

Paper
Code

Learning to Deceive Knowledge Graph Augmented Models via Targeted Perturbation

1 code implementation • ICLR 2021 • Mrigank Raman, Aaron Chan, Siddhant Agarwal, Peifeng Wang, Hansen Wang, Sungchul Kim, Ryan Rossi, Handong Zhao, Nedim Lipka, Xiang Ren

Knowledge graphs (KGs) have helped neural models improve performance on various knowledge-intensive tasks, like question answering and item recommendation.

Knowledge Graphs Question Answering +3

Paper
Code

Tailoring Self-Rationalizers with Multi-Reward Distillation

1 code implementation • 6 Nov 2023 • Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren

Results on five difficult question-answering datasets StrategyQA, QuaRel, OpenBookQA, NumerSense and QASC show that not only does MaRio improve task accuracy, but it also improves the self-rationalization quality of small LMs across the aforementioned axes better than a supervised fine-tuning (SFT) baseline.

Question Answering StrategyQA

Paper
Code

Do Language Models Perform Generalizable Commonsense Inference?

1 code implementation • Findings (ACL) 2021 • Peifeng Wang, Filip Ilievski, Muhao Chen, Xiang Ren

Inspired by evidence that pretrained language models (LMs) encode commonsense knowledge, recent work has applied LMs to automatically populate commonsense knowledge graphs (CKGs).

Knowledge Graphs

Paper
Code

ER-Test: Evaluating Explanation Regularization Methods for Language Models

1 code implementation • 25 May 2022 • Brihi Joshi, Aaron Chan, Ziyi Liu, Shaoliang Nie, Maziar Sanjabi, Hamed Firooz, Xiang Ren

to align with human rationales (Which input tokens would humans focus on?).

Paper
Code

Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models

1 code implementation • 28 Nov 2022 • Albert Xu, Xiang Ren, Robin Jia

In many task settings, text classification models are likely to encounter examples from novel classes on which they cannot predict correctly.

Language Modelling Large Language Model +2

Paper
Code

Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts

1 code implementation • 25 May 2022 • Qinyuan Ye, Juan Zha, Xiang Ren

Recent works suggest that transformer models are capable of multi-tasking on diverse NLP tasks and adapting to new tasks efficiently.

Multi-Task Learning World Knowledge +1

Paper
Code

How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench

1 code implementation • 24 May 2023 • Qinyuan Ye, Harvey Yiyun Fu, Xiang Ren, Robin Jia

We investigate the predictability of large language model (LLM) capabilities: given records of past experiments using different model families, numbers of parameters, tasks, and numbers of in-context examples, can we accurately predict LLM performance on new experiment configurations?

Language Modelling Large Language Model

Paper
Code

KCAT: A Knowledge-Constraint Typing Annotation Tool

1 code implementation • ACL 2019 • Sheng Lin, Luye Zheng, Bo Chen, Siliang Tang, Yueting Zhuang, Fei Wu, Zhigang Chen, Guoping Hu, Xiang Ren

Fine-grained Entity Typing is a tough task which suffers from noise samples extracted from distant supervision.

Entity Linking Entity Typing

Paper
Code

FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks

1 code implementation • Findings (NAACL) 2022 • Bill Yuchen Lin, Chaoyang He, Zihang Zeng, Hulin Wang, Yufen Huang, Christophe Dupuy, Rahul Gupta, Mahdi Soltanolkotabi, Xiang Ren, Salman Avestimehr

Increasing concerns and regulations about data privacy and sparsity necessitate the study of privacy-preserving, decentralized learning methods for natural language processing (NLP) tasks.

Benchmarking Federated Learning +5

Paper
Code

Machine Translation Robustness to Natural Asemantic Variation

1 code implementation • 25 May 2022 • Jacob Bremerman, Xiang Ren, Jonathan May

We find that existing MT models fail when presented with NAV data, but we demonstrate strategies to improve performance on NAV by fine-tuning them with human-generated variations.

Machine Translation Translation

Paper
Code

Estimating Large Language Model Capabilities without Labeled Test Data

1 code implementation • 24 May 2023 • Harvey Yiyun Fu, Qinyuan Ye, Albert Xu, Xiang Ren, Robin Jia

In this paper, we propose the task of ICL accuracy estimation, in which we predict the accuracy of an LLM when doing in-context learning on a new task given only unlabeled test data for that task.

In-Context Learning Language Modelling +1

Paper
Code

Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step

1 code implementation • 24 Jun 2023 • Liunian Harold Li, Jack Hessel, Youngjae Yu, Xiang Ren, Kai-Wei Chang, Yejin Choi

We release our corpus of chain-of-thought samples and code.

Paper
Code

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting

1 code implementation • 16 Apr 2024 • Huihan Li, Liwei Jiang, Nouha Dziri, Xiang Ren, Yejin Choi

As the utilization of large language models (LLMs) has proliferated worldwide, it is crucial for them to have adequate knowledge and fair representation for diverse global cultures.

Fairness

Paper
Code

Weakly-supervised Relation Extraction by Pattern-enhanced Embedding Learning

no code implementations • 9 Nov 2017 • Meng Qu, Xiang Ren, Yu Zhang, Jiawei Han

We propose a novel co-training framework with a distributional module and a pattern module.

Knowledge Base Completion Relation +1

Paper
Add Code

MetaPAD: Meta Pattern Discovery from Massive Text Corpora

no code implementations • 13 Mar 2017 • Meng Jiang, Jingbo Shang, Taylor Cassidy, Xiang Ren, Lance M. Kaplan, Timothy P. Hanratty, Jiawei Han

We propose an efficient framework, called MetaPAD, which discovers meta patterns from massive corpora with three techniques: (1) it develops a context-aware segmentation method to carefully determine the boundaries of patterns with a learnt pattern quality assessment function, which avoids costly dependency parsing and generates high-quality patterns; (2) it identifies and groups synonymous meta patterns from multiple facets---their types, contexts, and extractions; and (3) it examines type distributions of entities in the instances extracted by each group of patterns, and looks for appropriate type levels to make discovered patterns precise.

Dependency Parsing

Paper
Add Code

Life-iNet: A Structured Network-Based Knowledge Exploration and Analytics System for Life Sciences

no code implementations • ACL 2017 • Xiang Ren, Jiaming Shen, Meng Qu, Xuan Wang, Zeqiu Wu, Qi Zhu, Meng Jiang, Fangbo Tao, Saurabh Sinha, David Liem, Peipei Ping, Richard Weinshilboum, Jiawei Han

Efficient Exploration

Paper
Add Code

Scalable Construction and Reasoning of Massive Knowledge Bases

no code implementations • NAACL 2018 • Xiang Ren, Nanyun Peng, William Yang Wang

In today{'}s information-based society, there is abundant knowledge out there carried in the form of natural language texts (e. g., news articles, social media posts, scientific publications), which spans across various domains (e. g., corporate documents, advertisements, legal acts, medical reports), which grows at an astonishing rate.

Paper
Add Code

Cross-media Event Extraction and Recommendation

no code implementations • NAACL 2016 • Di Lu, Clare Voss, Fangbo Tao, Xiang Ren, Rachel Guan, Rostyslav Korolov, Tongtao Zhang, Dongang Wang, Hongzhi Li, Taylor Cassidy, Heng Ji, Shih-Fu Chang, Jiawei Han, William Wallace, James Hendler, Mei Si, Lance Kaplan

Event Extraction

Paper
Add Code

Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering

no code implementations • NAACL 2019 • Bo Chen, Xiaotao Gu, Yu-Feng Hu, Siliang Tang, Guoping Hu, Yueting Zhuang, Xiang Ren

Recently, distant supervision has gained great success on Fine-grained Entity Typing (FET).

Clustering Entity Typing

Paper
Add Code

Posterior-regularized REINFORCE for Instance Selection in Distant Supervision

1 code implementation • NAACL 2019 • Qi Zhang, Siliang Tang, Xiang Ren, Fei Wu, ShiLiang Pu, Yueting Zhuang

This paper provides a new way to improve the efficiency of the REINFORCE training process.

Reinforcement Learning (RL)

Paper
Code

AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging

no code implementations • ACL 2019 • Bill Yuchen Lin, Dong-Ho Lee, Frank F. Xu, Ouyu Lan, Xiang Ren

We introduce an open-source web-based data annotation framework (AlpacaTag) for sequence tagging tasks such as named-entity recognition (NER).

Active Learning named-entity-recognition +2

Paper
Add Code

HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

no code implementations • 17 Oct 2019 • Jiaming Shen, Zeqiu Wu, Dongming Lei, Chao Zhang, Xiang Ren, Michelle T. Vanni, Brian M. Sadler, Jiawei Han

Taxonomies are of great value to many knowledge-rich applications.

Relation Relation Extraction

Paper
Add Code

Improving BERT Fine-tuning with Embedding Normalization

no code implementations • 10 Nov 2019 • Wenxuan Zhou, Junyi Du, Xiang Ren

Large pre-trained sentence encoders like BERT start a new chapter in natural language processing.

General Classification Sentence +2

Paper
Add Code

Mining News Events from Comparable News Corpora: A Multi-Attribute Proximity Network Modeling Approach

no code implementations • 14 Nov 2019 • Hyungsul Kim, Ahmed El-Kishky, Xiang Ren, Jiawei Han

This proximity network captures the corpus-level co-occurence statistics for candidate event descriptors, event attributes, as well as their connections.

Attribute News Summarization

Paper
Add Code

Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

no code implementations • 10 Mar 2020 • Yankun Ren, Jianbin Lin, Siliang Tang, Jun Zhou, Shuang Yang, Yuan Qi, Xiang Ren

It can attack text classification models with a higher success rate than existing methods, and provide acceptable quality for humans in the meantime.

Adversarial Text General Classification +4

Paper
Add Code

LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

no code implementations • ACL 2020 • Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Jamin Chen, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren

Successfully training a deep neural network demands a huge corpus of labeled data.

named-entity-recognition Named Entity Recognition +3

Paper
Add Code

ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data

no code implementations • ACL 2021 • Woojeong Jin, Rahul Khanna, Suji Kim, Dong-Ho Lee, Fred Morstatter, Aram Galstyan, Xiang Ren

In this work, we aim to formulate a task, construct a dataset, and provide benchmarks for developing methods for event forecasting with large volumes of unstructured text data.

Knowledge Graphs Language Modelling +5

Paper
Add Code

RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms

no code implementations • EMNLP 2021 • Pei Zhou, Rahul Khanna, Seyeon Lee, Bill Yuchen Lin, Daniel Ho, Jay Pujara, Xiang Ren

Pre-trained language models (PTLMs) have achieved impressive performance on commonsense inference benchmarks, but their ability to employ commonsense to make robust inferences, which is crucial for effective communications with humans, is debated.

Paper
Add Code

Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-trained Language Models

no code implementations • EMNLP 2020 • Bill Yuchen Lin, Seyeon Lee, Rahul Khanna, Xiang Ren

Recent works show that pre-trained language models (PTLMs), such as BERT, possess certain commonsense and factual knowledge.

Paper
Add Code

Screenplay Quality Assessment: Can We Predict Who Gets Nominated?

no code implementations • WS 2020 • Ming-Chang Chiu, Tiantian Feng, Xiang Ren, Shrikanth Narayanan

Toward that goal, in this work, we present a method to evaluate the quality of a screenplay based on linguistic cues.

Paper
Add Code

Two Step Joint Model for Drug Drug Interaction Extraction

no code implementations • 28 Aug 2020 • Siliang Tang, Qi Zhang, Tianpeng Zheng, Mengdi Zhou, Zhan Chen, Lixing Shen, Xiang Ren, Yueting Zhuang, ShiLiang Pu, Fei Wu

When patients need to take medicine, particularly taking more than one kind of drug simultaneously, they should be alarmed that there possibly exists drug-drug interaction.

Drug–drug Interaction Extraction named-entity-recognition +4

Paper
Add Code

SynSetExpan: An Iterative Framework for Joint Entity Set Expansion and Synonym Discovery

no code implementations • EMNLP 2020 • Jiaming Shen, Wenda Qiu, Jingbo Shang, Michelle Vanni, Xiang Ren, Jiawei Han

To facilitate the research on studying the interplays of these two tasks, we create the first large-scale Synonym-Enhanced Set Expansion (SE2) dataset via crowdsourcing.

Paper
Add Code

Learning Contextualized Knowledge Graph Structures for Commonsense Reasoning

no code implementations • 1 Jan 2021 • Jun Yan, Mrigank Raman, Tianyu Zhang, Ryan Rossi, Handong Zhao, Sungchul Kim, Nedim Lipka, Xiang Ren

Recently, neural-symbolic architectures have achieved success on commonsense reasoning through effectively encoding relational structures retrieved from external knowledge graphs (KGs) and obtained state-of-the-art results in tasks such as (commonsense) question answering and natural language inference.

Knowledge Graphs Natural Language Inference +1

Paper
Add Code

Pre-training Text-to-Text Transformers to Write and Reason with Concepts

no code implementations • ICLR 2021 • Wangchunshu Zhou, Dong-Ho Lee, Ravi Kiran Selvam, Seyeon Lee, Xiang Ren

To augment PTLMs with common sense, we propose generative and contrastive objectives as intermediate self-supervised pre-training tasks between general pre-training and downstream task-specific fine-tuning.

Common Sense Reasoning Language Modelling +2

Paper
Add Code

Efficient Learning of Less Biased Models with Transfer Learning

no code implementations • 1 Jan 2021 • Xisen Jin, Francesco Barbieri, Leonardo Neves, Xiang Ren

Prediction bias in machine learning models, referring to undesirable model behaviors that discriminates inputs mentioning or produced by certain group, has drawn increasing attention from the research community given its societal impact.

Transfer Learning

Paper
Add Code

Will This Idea Spread Beyond Academia? Understanding Knowledge Transfer of Scientific Concepts across Text Corpora

no code implementations • Findings of the Association for Computational Linguistics 2020 • Hancheng Cao, Mengjie Cheng, Zhepeng Cen, Daniel A. McFarland, Xiang Ren

We extract scientific concepts (i. e., phrases) from corpora as instantiations of "research ideas", create concept-level features as motivated by literature, and then follow the trajectories of over 450, 000 new concepts (emerged from 1995-2014) to identify factors that lead only a small proportion of these ideas to be used in inventions and drug trials.

Transfer Learning

Paper
Add Code

One-shot Learning for Temporal Knowledge Graphs

no code implementations • AKBC 2021 • Mehrnoosh Mirtaheri, Mohammad Rostami, Xiang Ren, Fred Morstatter, Aram Galstyan

Most real-world knowledge graphs are characterized by a long-tail relation frequency distribution where a significant fraction of relations occurs only a handful of times.

Knowledge Graphs Link Prediction +2

Paper
Add Code

On Transferability of Bias Mitigation Effects in Language Model Fine-Tuning

no code implementations • NAACL 2021 • Xisen Jin, Francesco Barbieri, Brendan Kennedy, Aida Mostafazadeh Davani, Leonardo Neves, Xiang Ren

Fine-tuned language models have been shown to exhibit biases against protected groups in a host of modeling tasks such as text classification and coreference resolution.

coreference-resolution Fairness +6

Paper
Add Code

Differentiable Open-Ended Commonsense Reasoning

no code implementations • NAACL 2021 • Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren, William W. Cohen

As a step towards making commonsense reasoning research more realistic, we propose to study open-ended commonsense reasoning (OpenCSR) -- the task of answering a commonsense question without any pre-defined choices -- using as a resource only a corpus of commonsense facts written in natural language.

Multiple-choice

Paper
Add Code

Fair Hate Speech Detection through Evaluation of Social Group Counterfactuals

no code implementations • 24 Oct 2020 • Aida Mostafazadeh Davani, Ali Omrani, Brendan Kennedy, Mohammad Atari, Xiang Ren, Morteza Dehghani

Counterfactual token fairness for a mentioned social group evaluates the model's predictions as to whether they are the same for (a) the actual sentence and (b) a counterfactual instance, which is generated by changing the mentioned social group in the sentence.

counterfactual Fairness +2

Paper
Add Code

Recurrent Event Network: Autoregressive Structure Inferenceover Temporal Knowledge Graphs

no code implementations • EMNLP 2020 • Woojeong Jin, Meng Qu, Xisen Jin, Xiang Ren

The task becomes more challenging on temporal knowledge graphs, where each fact is associated with a timestamp.

Knowledge Graphs Link Prediction +1

Paper
Add Code

Studying Strategically: Learning to Mask for Closed-book QA

no code implementations • 31 Dec 2020 • Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa

Thus, our policy packs task-relevant knowledge into the parameters of a language model.

Language Modelling Question Answering +1

Paper
Add Code

RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge

no code implementations • Findings (ACL) 2021 • Bill Yuchen Lin, Ziyi Wu, Yichi Yang, Dong-Ho Lee, Xiang Ren

Question: I have five fingers but I am not alive.

counterfactual Counterfactual Reasoning +3

Paper
Add Code

MSD: Saliency-aware Knowledge Distillation for Multimodal Understanding

no code implementations • Findings (EMNLP) 2021 • Woojeong Jin, Maziar Sanjabi, Shaoliang Nie, Liang Tan, Xiang Ren, Hamed Firooz

The idea aims at mimicking a teacher's modality-specific predictions by introducing auxiliary loss terms for each modality.

Knowledge Distillation Meta-Learning

Paper
Add Code

Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources

no code implementations • EMNLP 2021 • Ninareh Mehrabi, Pei Zhou, Fred Morstatter, Jay Pujara, Xiang Ren, Aram Galstyan

In addition, we analyze two downstream models that use ConceptNet as a source for commonsense knowledge and find the existence of biases in those models as well.

Paper
Add Code

On the Influence of Masking Policies in Intermediate Pre-training

no code implementations • EMNLP 2021 • Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa

Current NLP models are predominantly trained through a two-stage "pre-train then fine-tune" pipeline.

Abstractive Text Summarization Language Modelling +4

Paper
Add Code

Probing Commonsense Explanation in Dialogue Response Generation

no code implementations • Findings (EMNLP) 2021 • Pei Zhou, Pegah Jandaghi, Bill Yuchen Lin, Justin Cho, Jay Pujara, Xiang Ren

Humans use commonsense reasoning (CSR) implicitly to produce natural and coherent responses in conversations.

Common Sense Reasoning Response Generation

Paper
Add Code

AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding

no code implementations • ACL 2021 • Jun Yan, Nasser Zalmout, Yan Liang, Christan Grant, Xiang Ren, Xin Luna Dong

However, this approach constrains knowledge sharing across different attributes.

Attribute Attribute Extraction +2

Paper
Add Code

TaxoClass: Hierarchical Multi-Label Text Classification Using Only Class Names

no code implementations • NAACL 2021 • Jiaming Shen, Wenda Qiu, Yu Meng, Jingbo Shang, Xiang Ren, Jiawei Han

Hierarchical multi-label text classification (HMTC) aims to tag each document with a set of classes from a taxonomic class hierarchy.

Multi Label Text Classification Multi-Label Text Classification +3

Paper
Add Code

Improving Counterfactual Generation for Fair Hate Speech Detection

no code implementations • ACL (WOAH) 2021 • Aida Mostafazadeh Davani, Ali Omrani, Brendan Kennedy, Mohammad Atari, Xiang Ren, Morteza Dehghani

By applying logit pairing to equalize outcomes on the restricted set of counterfactuals for each instance, we improve fairness metrics while preserving model performance on hate speech detection.

counterfactual Fairness +2

Paper
Add Code

AutoTriggER: Label-Efficient and Robust Named Entity Recognition with Auxiliary Trigger Extraction

no code implementations • 10 Sep 2021 • Dong-Ho Lee, Ravi Kiran Selvam, Sheikh Muhammad Sarwar, Bill Yuchen Lin, Fred Morstatter, Jay Pujara, Elizabeth Boschee, James Allan, Xiang Ren

Deep neural models for named entity recognition (NER) have shown impressive results in overcoming label scarcity and generalizing to unseen entities by leveraging distant supervision and auxiliary information such as explanations.

Low Resource Named Entity Recognition named-entity-recognition +2

Paper
Add Code

KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering

no code implementations • ACL 2022 • Donghan Yu, Chenguang Zhu, Yuwei Fang, Wenhao Yu, Shuohang Wang, Yichong Xu, Xiang Ren, Yiming Yang, Michael Zeng

The recent proposed Fusion-in-Decoder (FiD), which is built on top of the pretrained generative model T5, achieves the state-of-the-art performance in the reading module.

Answer Generation Open-Domain Question Answering +3

Paper
Add Code

Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora

no code implementations • NAACL 2022 • Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew Arnold, Xiang Ren

We evaluate PTLM's ability to adapt to new corpora while retaining learned knowledge in earlier corpora.

Continual Learning Continual Pretraining +2

Paper
Add Code

Think Before You Speak: Explicitly Generating Implicit Commonsense Knowledge for Response Generation

no code implementations • ACL 2022 • Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

Implicit knowledge, such as common sense, is key to fluid human conversations.

Common Sense Reasoning Explainable Models +1

Paper
Add Code

Modality-specific Distillation

no code implementations • NAACL (maiworkshop) 2021 • Woojeong Jin, Maziar Sanjabi, Shaoliang Nie, Liang Tan, Xiang Ren, Hamed Firooz

In this paper, we propose modality-specific distillation (MSD) to effectively transfer knowledge from a teacher on multimodal datasets.

Knowledge Distillation Meta-Learning

Paper
Add Code

Using Word Embedding to Reveal Monetary Policy Explanation Changes

no code implementations • EMNLP (ECONLP) 2021 • Akira Matsui, Xiang Ren, Emilio Ferrara

Documents have been an essential tool of communication for governments to announce their policy operations.

Sentiment Analysis

Paper
Add Code

Think Before You Speak: Learning to Generate Implicit Knowledge for Response Generation by Self-Talk

no code implementations • EMNLP (NLP4ConvAI) 2021 • Pei Zhou, Behnam Hedayatnia, Karthik Gopalakrishnan, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

We further investigate can such models identify when to generate implicit background knowledge and when it is not necessary.

Common Sense Reasoning Response Generation

Paper
Add Code

End-to-End Hierarchical Text Classification with Label Assignment Policy

no code implementations • 27 Sep 2018 • Yuning Mao, Jingjing Tian, Jiawei Han, Xiang Ren

We present an end-to-end reinforcement learning approach to hierarchical text classification where documents are labeled by placing them at the right positions in a given hierarchy.

text-classification Text Classification

Paper
Add Code

Recurrent Event Network : Global Structure Inference Over Temporal Knowledge Graph

no code implementations • 25 Sep 2019 • Woojeong Jin, He Jiang, Meng Qu, Tong Chen, Changlin Zhang, Pedro Szekely, Xiang Ren

We present Recurrent Event Network (RE-Net), a novel autoregressive architecture for modeling temporal sequences of multi-relational graphs (e. g., temporal knowledge graph), which can perform sequential, global structure inference over future time stamps to predict new events.

Link Prediction Temporal Sequences

Paper
Add Code

Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-modal Knowledge Transfer

no code implementations • ACL 2022 • Woojeong Jin, Dong-Ho Lee, Chenguang Zhu, Jay Pujara, Xiang Ren

Pre-trained language models are still far from human performance in tasks that need understanding of properties (e. g. appearance, measurable quantity) and affordances of everyday objects in the real world since the text lacks such information due to reporting bias.

Image Captioning Language Modelling +1

Paper
Add Code

On Continual Model Refinement in Out-of-Distribution Data Streams

no code implementations • ACL 2022 • Bill Yuchen Lin, Sida Wang, Xi Victoria Lin, Robin Jia, Lin Xiao, Xiang Ren, Wen-tau Yih

Real-world natural language processing (NLP) models need to be continually updated to fix the prediction errors in out-of-distribution (OOD) data streams while overcoming catastrophic forgetting.

Benchmarking Continual Learning

Paper
Add Code

Knowledge-Augmented Methods for Natural Language Processing

no code implementations • ACL 2022 • Chenguang Zhu, Yichong Xu, Xiang Ren, Bill Lin, Meng Jiang, Wenhao Yu

Knowledge in natural language processing (NLP) has been a rising trend especially after the advent of large scale pre-trained models.

Text Generation

Paper
Add Code

FRAME: Evaluating Rationale-Label Consistency Metrics for Free-Text Rationales

no code implementations • 2 Jul 2022 • Aaron Chan, Shaoliang Nie, Liang Tan, Xiaochang Peng, Hamed Firooz, Maziar Sanjabi, Xiang Ren

Following how humans communicate, free-text rationales aim to use natural language to explain neural language model (LM) behavior.

Hallucination Language Modelling +2

Paper
Add Code

ER-TEST Evaluating Explanation Regularization Methods for NLP Models

no code implementations • NAACL (TrustNLP) 2022 • Brihi Joshi, Aaron Chan, Ziyi Liu, Xiang Ren

For the latter, explanation regularization (ER) aims to improve NLM generalization by pushing the machine rationales to align with human rationales.

Paper
Add Code

Curriculum Learning for Data-Efficient Vision-Language Alignment

no code implementations • 29 Jul 2022 • Tejas Srinivasan, Xiang Ren, Jesse Thomason

Aligning image and text encoders from scratch using contrastive learning requires large amounts of paired image-text data.

Contrastive Learning Image Retrieval +3

Paper
Add Code

On Grounded Planning for Embodied Tasks with Language Models

no code implementations • 29 Aug 2022 • Bill Yuchen Lin, Chengsong Huang, Qian Liu, Wenda Gu, Sam Sommerer, Xiang Ren

Language models (LMs) have demonstrated their capability in possessing commonsense knowledge of the physical world, a crucial aspect of performing tasks in everyday life.

Paper
Add Code

MMGA: Multimodal Learning with Graph Alignment

no code implementations • 18 Oct 2022 • Xuan Yang, Quanjin Tao, Xiao Feng, Donghong Cai, Xiang Ren, Yang Yang

In this paper, we propose MMGA (Multimodal learning with Graph Alignment), a novel multimodal pre-training framework to incorporate information from graph (social network), image and text modalities on social media to enhance user representation learning.

Representation Learning

Paper
Add Code

XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models

no code implementations • 30 Oct 2022 • Dong-Ho Lee, Akshen Kadakia, Brihi Joshi, Aaron Chan, Ziyi Liu, Kiran Narahari, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren

Explanation-based model debugging aims to resolve spurious biases by showing human users explanations of model behavior, asking users to give feedback on the behavior, then using the feedback to update the model.

text-classification Text Classification

Paper
Add Code

Reflect, Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality

no code implementations • 16 Nov 2022 • Pei Zhou, Hyundong Cho, Pegah Jandaghi, Dong-Ho Lee, Bill Yuchen Lin, Jay Pujara, Xiang Ren

Human communication relies on common ground (CG), the mutual knowledge and beliefs shared by participants, to produce coherent and interesting conversations.

Response Generation

Paper
Add Code

KNIFE: Distilling Reasoning Knowledge From Free-Text Rationales

no code implementations • 19 Dec 2022 • Aaron Chan, Zhiyuan Zeng, Wyatt Lake, Brihi Joshi, Hanjie Chen, Xiang Ren

First, KNIFE finetunes a teacher LM (given task input and FTR) to predict the task output, transferring reasoning knowledge from the FTRs to the teacher's hidden states.

Knowledge Distillation Language Modelling +1

Paper
Add Code

APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning

no code implementations • 19 Dec 2022 • Soumya Sanyal, Yichong Xu, Shuohang Wang, ZiYi Yang, Reid Pryzant, Wenhao Yu, Chenguang Zhu, Xiang Ren

Logical reasoning of text is an important ability that requires understanding the information present in the text, their interconnections, and then reasoning through them to infer new conclusions.

Data Augmentation Language Modelling +3

Paper
Add Code

PairReranker: Pairwise Reranking for Natural Language Generation

no code implementations • 20 Dec 2022 • Dongfu Jiang, Bill Yuchen Lin, Xiang Ren

Pre-trained language models have been successful in natural language generation (NLG) tasks.

Machine Translation Text Generation

Paper
Add Code

I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons

no code implementations • 20 Dec 2022 • Pei Zhou, Andrew Zhu, Jennifer Hu, Jay Pujara, Xiang Ren, Chris Callison-Burch, Yejin Choi, Prithviraj Ammanabrolu

We propose a novel task, G4C, to study teacher-student natural language interactions in a goal-driven and grounded environment.

Reinforcement Learning (RL) Text Generation

Paper
Add Code

Exploring Distributional Shifts in Large Language Models for Code Analysis

no code implementations • 16 Mar 2023 • Shushan Arakelyan, Rocktim Jyoti Das, Yi Mao, Xiang Ren

We systematically study how three large language models with code capabilities - CodeT5, Codex, and ChatGPT - generalize to out-of-domain data.

Code Generation Code Summarization

Paper
Add Code

Design of Reconfigurable Intelligent Surfaces for Wireless Communication: A Review

no code implementations • 27 Apr 2023 • Rujing Xiong, Jianan Zhang, Fuhai Wang, Zhengyu Wang, Xiang Ren, Junshuo Liu, Jialong Lu, Kai Wan, Tiebin Mi, Robert Caiming Qiu

The prototype undergoes rigorous empirical evaluation, encompassing multi-hop RIS signal amplification, image reconstruction, and real-world indoor signal coverage experiments.

Image Reconstruction

Paper
Add Code

GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions

no code implementations • 24 May 2023 • Woojeong Jin, Subhabrata Mukherjee, Yu Cheng, Yelong Shen, Weizhu Chen, Ahmed Hassan Awadallah, Damien Jose, Xiang Ren

Generalization to unseen tasks is an important ability for few-shot learners to achieve better zero-/few-shot performance on diverse tasks.

Object Question Answering +2

Paper
Add Code

Instruction-following Evaluation through Verbalizer Manipulation

no code implementations • 20 Jul 2023 • Shiyang Li, Jun Yan, Hai Wang, Zheng Tang, Xiang Ren, Vijay Srinivasan, Hongxia Jin

We conduct a comprehensive evaluation of four major model families across nine datasets, employing twelve sets of verbalizers for each of them.

Instruction Following

Paper
Add Code

How FaR Are Large Language Models From Agents with Theory-of-Mind?

no code implementations • 4 Oct 2023 • Pei Zhou, Aman Madaan, Srividya Pranavi Potharaju, Aditya Gupta, Kevin R. McKee, Ari Holtzman, Jay Pujara, Xiang Ren, Swaroop Mishra, Aida Nematzadeh, Shyam Upadhyay, Manaal Faruqui

We propose a new evaluation paradigm for large language models (LLMs): Thinking for Doing (T4D), which requires models to connect inferences about others' mental states to actions in social scenarios.

In-Context Learning Question Answering

Paper
Add Code

Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance

no code implementations • 16 Oct 2023 • Jesse Zhang, Jiahui Zhang, Karl Pertsch, Ziyi Liu, Xiang Ren, Minsuk Chang, Shao-Hua Sun, Joseph J. Lim

Instead, our approach BOSS (BOotStrapping your own Skills) learns to accomplish new tasks by performing "skill bootstrapping," where an agent with a set of primitive skills interacts with the environment to practice new skills without receiving reward feedback for tasks outside of the initial skill set.

Language Modelling Large Language Model

Paper
Add Code

Wireless Communications in Cavity: A Reconfigurable Boundary Modulation based Approach

no code implementations • 15 Nov 2023 • Xuehui Dong, Xiang Ren, Bokai Lai, Rujing Xiong, Tiebin Mi, Robert Caiming Qiu

This paper explores the potential wireless communication applications of Reconfigurable Intelligent Surfaces (RIS) in reverberant wave propagation environments.

Position

Paper
Add Code

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

no code implementations • NeurIPS 2023 • Bill Yuchen Lin, Yicheng Fu, Karina Yang, Faeze Brahman, Shiyu Huang, Chandra Bhagavatula, Prithviraj Ammanabrolu, Yejin Choi, Xiang Ren

The Swift module is a small encoder-decoder LM fine-tuned on the oracle agent's action trajectories, while the Sage module employs LLMs such as GPT-4 for subgoal planning and grounding.

Paper
Add Code

Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty

no code implementations • 12 Jan 2024 • Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Maarten Sap

As natural language becomes the default interface for human-AI interaction, there is a critical need for LMs to appropriately communicate uncertainties in downstream applications.

Paper
Add Code

What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement

no code implementations • 2 Feb 2024 • Xisen Jin, Xiang Ren

We propose a partially interpretable forecasting model based on the observation that changes in pre-softmax logit scores of pretraining examples resemble that of online learned examples, which performs decently on BART but fails on T5 models.

Language Modelling

Paper
Add Code

Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification

no code implementations • 6 Feb 2024 • Soumya Sanyal, Tianyi Xiao, Jiacheng Liu, Wenya Wang, Xiang Ren

Finally, we use this model to filter out inconsistent model-generated rationales in self-consistency decoding, resulting in a 6% accuracy improvement on average across three MCQ datasets.

Benchmarking Multiple-choice +3

Paper
Add Code

WinoViz: Probing Visual Properties of Objects Under Different States

no code implementations • 21 Feb 2024 • Woojeong Jin, Tejas Srinivasan, Jesse Thomason, Xiang Ren

We present WinoViz, a text-only evaluation dataset, consisting of 1, 380 examples that probe the reasoning abilities of language models regarding variant visual properties of objects under different contexts or states.

Language Modelling

Paper
Add Code

Logits of API-Protected LLMs Leak Proprietary Information

no code implementations • 14 Mar 2024 • Matthew Finlayson, Xiang Ren, Swabha Swayamdipta

The commercialization of large language models (LLMs) has led to the common practice of high-level API-only access to proprietary models.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.