Search Results for author: Meng Jiang

Found 68 papers, 34 papers with code

Knowledge-Enriched Natural Language Generation

1 code implementation EMNLP (ACL) 2021 Wenhao Yu, Meng Jiang, Zhiting Hu, Qingyun Wang, Heng Ji, Nazneen Rajani

Knowledge-enriched text generation poses unique challenges in modeling and learning, driving active research in several core directions, ranging from integrated modeling of neural representations and symbolic information in sequential/hierarchical/graphical structures, learning without direct supervision due to the cost of structured annotation, and efficient optimization and inference with massive and global constraints, to language grounding on multiple modalities and generative reasoning with implicit commonsense knowledge and background knowledge.

Text Generation

Knowledge-Augmented Methods for Natural Language Processing

no code implementations ACL 2022 Chenguang Zhu, Yichong Xu, Xiang Ren, Bill Lin, Meng Jiang, Wenhao Yu

Knowledge in natural language processing (NLP) has been a rising trend especially after the advent of large scale pre-trained models.

Text Generation

Motif-aware Attribute Masking for Molecular Graph Pre-training

1 code implementation 8 Sep 2023 Eric Inae, Gang Liu, Meng Jiang

Attribute reconstruction is used to predict node or edge features in the pre-training of graph neural networks.

Molecular Property Prediction Property Prediction

Embedding Mental Health Discourse for Community Recommendation

no code implementations 8 Jul 2023 Hy Dang, Bang Nguyen, Noah Ziems, Meng Jiang

Our paper investigates the use of discourse embedding techniques to develop a community recommendation system that focuses on mental health support groups on social media.

Collaborative Filtering

Investigating Cross-Domain Behaviors of BERT in Review Understanding

no code implementations 27 Jun 2023 Albert Lu, Meng Jiang

Review score prediction requires review text understanding, a critical real-world application of natural language processing.

text-classification Text Classification

Improving Language Models via Plug-and-Play Retrieval Feedback

no code implementations 23 May 2023 Wenhao Yu, Zhihan Zhang, Zhenwen Liang, Meng Jiang, Ashish Sabharwal

ReFeed first generates initial outputs, then utilizes a retrieval model to acquire relevant information from large document collections, and finally incorporates the retrieved information into the in-context demonstration for output refinement, thereby addressing the limitations of LLMs in a more efficient and cost-effective manner.

Retrieval

IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions

no code implementations 23 May 2023 Wenhao Yu, Meng Jiang, Peter Clark, Ashish Sabharwal

Although counterfactual reasoning is a fundamental aspect of intelligence, the lack of large-scale counterfactual open-domain question-answering (QA) benchmarks makes it difficult to evaluate and improve models on this ability.

Open-Domain Question Answering Retrieval

Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions

1 code implementation 23 May 2023 Zhihan Zhang, Wenhao Yu, Zheng Ning, Mingxuan Ju, Meng Jiang

Contrast consistency, the ability of a model to make consistently correct predictions in the presence of perturbations, is an essential aspect in NLP.

Data Augmentation Language Modelling +4

Pre-training Language Models for Comparative Reasoning

no code implementations 23 May 2023 Mengxia Yu, Zhihan Zhang, Wenhao Yu, Meng Jiang

In this paper, we propose a novel framework to pre-train language models for enhancing their abilities of comparative reasoning over texts.

Question Answering Question Generation +1

Semi-Supervised Graph Imbalanced Regression

1 code implementation 20 May 2023 Gang Liu, Tong Zhao, Eric Inae, Tengfei Luo, Meng Jiang

The training data balance is achieved by (1) pseudo-labeling more graphs for under-represented labels with a novel regression confidence measurement and (2) augmenting graph examples in latent space for remaining rare labels after data balancing with pseudo-labels.

Graph Regression regression

Large Language Models are Built-in Autoregressive Search Engines

1 code implementation 16 May 2023 Noah Ziems, Wenhao Yu, Zhihan Zhang, Meng Jiang

To overcome this limitation, recent autoregressive search engines replace the dual-encoder architecture by directly generating identifiers for relevant documents in the candidate pool.

Open-Domain Question Answering Retrieval

Data-Centric Learning from Unlabeled Graphs with Diffusion Model

1 code implementation 17 Mar 2023 Gang Liu, Eric Inae, Tong Zhao, Jiaxin Xu, Tengfei Luo, Meng Jiang

A conventional approach is training a model with the unlabeled graphs on self-supervised tasks and then fine-tuning the model on the prediction tasks.

Denoising Graph Property Prediction +2

Very Large Language Model as a Unified Methodology of Text Mining

1 code implementation 19 Dec 2022 Meng Jiang

Text data mining is the process of deriving essential information from language text.

Clustering Language Modelling +4

Retrieval Augmentation for Commonsense Reasoning: A Unified Approach

1 code implementation 23 Oct 2022 Wenhao Yu, Chenguang Zhu, Zhihan Zhang, Shuohang Wang, Zhuosheng Zhang, Yuwei Fang, Meng Jiang

However, applying such methods to commonsense reasoning tasks faces two unique challenges, i.e., the lack of a general large-scale corpus for retrieval and a corresponding effective commonsense retriever.

Retrieval

A Unified Encoder-Decoder Framework with Entity Memory

1 code implementation 7 Oct 2022 Zhihan Zhang, Wenhao Yu, Chenguang Zhu, Meng Jiang

The entity knowledge is stored in the memory as latent representations, and the memory is pre-trained on Wikipedia along with encoder-decoder parameters.

Question Answering Text Generation

Generate rather than Retrieve: Large Language Models are Strong Context Generators

1 code implementation 21 Sep 2022 Wenhao Yu, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya Sanyal, Chenguang Zhu, Michael Zeng, Meng Jiang

We call our method generate-then-read (GenRead), which first prompts a large language model to generate contextual documents based on a given question, and then reads the generated documents to produce the final answer.

Language Modelling Large Language Model +1

Heterogeneous Line Graph Transformer for Math Word Problems

no code implementations 11 Aug 2022 Zijian Hu, Meng Jiang

We originally planned to employ existing models but realized that they processed a math word problem as a sequence or a homogeneous graph of tokens.

Representation Learning Semantic Role Labeling

On the Relationship Between Counterfactual Explainer and Recommender

no code implementations 9 Jul 2022 Gang Liu, Zhihan Zhang, Zheng Ning, Meng Jiang

To enable explainability, recent techniques such as ACCENT and FIA are looking for counterfactual explanations that are specific historical actions of a user, the removal of which leads to a change to the recommendation result.

Collaborative Filtering Counterfactual Explanation +1

Automatic Controllable Product Copywriting for E-Commerce

1 code implementation 21 Jun 2022 Xiaojie Guo, Qingkai Zeng, Meng Jiang, Yun Xiao, Bo Long, Lingfei Wu

Automatic product description generation for e-commerce has witnessed significant advancement in the past decade.

Aspect Extraction Language Modelling +2

Graph Rationalization with Environment-based Augmentations

1 code implementation 6 Jun 2022 Gang Liu, Tong Zhao, Jiaxin Xu, Tengfei Luo, Meng Jiang

Rationale is defined as a subset of input features that best explains or supports the prediction by machine learning models.

Graph Regression Property Prediction +1

A Bottom-Up End-User Intelligent Assistant Approach to Empower Gig Workers against AI Inequality

no code implementations 29 Apr 2022 Toby Jia-Jun Li, Yuwen Lu, Jaylexia Clark, Meng Chen, Victor Cox, Meng Jiang, Yang Yang, Tamara Kay, Danielle Wood, Jay Brockman

The AI inequality is caused by (1) the technology divide in who has access to AI technologies in gig work; and (2) the data divide in who owns the data in gig work, leading to unfair working conditions, a growing pay gap, neglect of workers' diverse preferences, and workers' lack of trust in the platforms.

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods

no code implementations 7 Apr 2022 Zhihan Zhang, Wenhao Yu, Mengxia Yu, Zhichun Guo, Meng Jiang

Multi-task learning (MTL) has become increasingly popular in natural language processing (NLP) because it improves the performance of related tasks by exploiting their commonalities and differences.

Multi-Task Learning

Graph Data Augmentation for Graph Machine Learning: A Survey

1 code implementation 17 Feb 2022 Tong Zhao, Wei Jin, Yozen Liu, Yingheng Wang, Gang Liu, Stephan Günnemann, Neil Shah, Meng Jiang

Overall, our work aims to clarify the landscape of existing literature in graph data augmentation and motivates additional work in this area, providing a helpful resource for researchers and practitioners in the broader graph machine learning domain.

BIG-bench Machine Learning Data Augmentation

Dict-BERT: Enhancing Language Model Pre-training with Dictionary

1 code implementation Findings (ACL) 2022 Wenhao Yu, Chenguang Zhu, Yuwei Fang, Donghan Yu, Shuohang Wang, Yichong Xu, Michael Zeng, Meng Jiang

In addition to training with the masked language modeling objective, we propose two novel self-supervised pre-training tasks on word and sentence-level alignment between input text sequence and rare word definitions to enhance language modeling representation with dictionary.

Language Modelling Masked Language Modeling

Multi-Round Parsing-based Multiword Rules for Scientific OpenIE

no code implementations 4 Aug 2021 Joseph Kuebler, Lingbo Tong, Meng Jiang

Information extraction (IE) in scientific literature has facilitated many down-stream tasks.

Dependency Parsing

Cross-Network Learning with Partially Aligned Graph Convolutional Networks

no code implementations 3 Jun 2021 Meng Jiang

Graph neural networks have been widely used for learning representations of nodes for many downstream tasks on graph data.

Knowledge Graphs Link Prediction +1

Sentence-Permuted Paragraph Generation

1 code implementation EMNLP 2021 Wenhao Yu, Chenguang Zhu, Tong Zhao, Zhichun Guo, Meng Jiang

Generating paragraphs of diverse contents is important in many applications.

TCN: Table Convolutional Network for Web Table Interpretation

1 code implementation 17 Feb 2021 Daheng Wang, Prashant Shiralkar, Colin Lockard, Binxuan Huang, Xin Luna Dong, Meng Jiang

Existing work linearizes table cells and relies heavily on modifying deep language models such as BERT, which only capture related cell information within the same table.

Representation Learning Table annotation +1

Few-Shot Graph Learning for Molecular Property Prediction

1 code implementation 16 Feb 2021 Zhichun Guo, Chuxu Zhang, Wenhao Yu, John Herr, Olaf Wiest, Meng Jiang, Nitesh V. Chawla

The recent success of graph neural networks has significantly boosted molecular property prediction, advancing activities such as drug discovery.

Drug Discovery Graph Learning +6

Traceability Transformed: Generating more Accurate Links with Pre-Trained BERT Models

1 code implementation 8 Feb 2021 Jinfeng Lin, Yalin Liu, Qingkai Zeng, Meng Jiang, Jane Cleland-Huang

In this study, we propose a novel framework called Trace BERT (T-BERT) to generate trace links between source code and natural language artifacts.

Transfer Learning Software Engineering

FGNAS: FPGA-Aware Graph Neural Architecture Search

no code implementations 1 Jan 2021 Qing Lu, Weiwen Jiang, Meng Jiang, Jingtong Hu, Sakyasingha Dasgupta, Yiyu Shi

The success of graph neural networks (GNNs) in the past years has aroused growing interest and effort in designing the best models to handle graph-structured data.

Neural Architecture Search

Tri-Train: Automatic Pre-Fine Tuning between Pre-Training and Fine-Tuning for SciNER

no code implementations Findings of the Association for Computational Linguistics 2020 Qingkai Zeng, Wenhao Yu, Mengxia Yu, Tianwen Jiang, Tim Weninger, Meng Jiang

The training process of scientific NER models is commonly performed in two steps: (i) pre-training a language model with self-supervised tasks on huge data and (ii) fine-tuning with small labelled data.

Language Modelling NER

Action Sequence Augmentation for Early Graph-based Anomaly Detection

1 code implementation 20 Oct 2020 Tong Zhao, Bo Ni, Wenhao Yu, Zhichun Guo, Neil Shah, Meng Jiang

With Eland, anomaly detection performance at an earlier stage is better than that of non-augmented methods that need significantly more observed data, by up to 15% in area under the ROC curve.

Data Augmentation Graph Anomaly Detection

Technical Question Answering across Tasks and Domains

1 code implementation NAACL 2021 Wenhao Yu, Lingfei Wu, Yu Deng, Qingkai Zeng, Ruchi Mahindru, Sinem Guven, Meng Jiang

In this paper, we propose a novel framework of deep transfer learning to effectively address technical QA across tasks and domains.

Question Answering Reading Comprehension +2

A Survey of Knowledge-Enhanced Text Generation

3 code implementations 9 Oct 2020 Wenhao Yu, Chenguang Zhu, Zaitang Li, Zhiting Hu, Qingyun Wang, Heng Ji, Meng Jiang

To address this issue, researchers have considered incorporating various forms of knowledge beyond the input text into the generation models.

Text Generation

Federated Dynamic GNN with Secure Aggregation

no code implementations 15 Sep 2020 Meng Jiang, Taeho Jung, Ryan Karl, Tong Zhao

Given video data from multiple personal devices or street cameras, can we exploit the structural and dynamic information to learn dynamic representation of objects for applications such as distributed surveillance, without storing data at a central server that leads to a violation of user privacy?

Federated Learning

Specification mining and automated task planning for autonomous robots based on a graph-based spatial temporal logic

no code implementations 16 Jul 2020 Zhiyu Liu, Meng Jiang, Hai Lin

For knowledge representation, we use a graph-based spatial temporal logic (GSTL) to capture spatial and temporal information of related skills demonstrated by demo videos.

Data Augmentation for Graph Neural Networks

2 code implementations 11 Jun 2020 Tong Zhao, Yozen Liu, Leonardo Neves, Oliver Woodford, Meng Jiang, Neil Shah

Our work shows that neural edge predictors can effectively encode class-homophilic structure to promote intra-class edges and demote inter-class edges in given graph structure, and our main contribution introduces the GAug graph data augmentation framework, which leverages these insights to improve performance in GNN-based node classification via edge prediction.

Data Augmentation General Classification +1

Calendar Graph Neural Networks for Modeling Time Structures in Spatiotemporal User Behaviors

1 code implementation 11 Jun 2020 Daheng Wang, Meng Jiang, Munira Syed, Oliver Conway, Vishal Juneja, Sriram Subramanian, Nitesh V. Chawla

The user embeddings preserve spatial patterns and temporal patterns of a variety of periodicity (e.g., hourly, weekly, and weekday patterns).

A Probabilistic Model with Commonsense Constraints for Pattern-based Temporal Fact Extraction

no code implementations WS 2020 Yang Zhou, Tong Zhao, Meng Jiang

Textual patterns (e.g., Country's president Person) are specified and/or generated for extracting factual information from unstructured data.

TAG Text Generation

Crossing Variational Autoencoders for Answer Retrieval

no code implementations ACL 2020 Wenhao Yu, Lingfei Wu, Qingkai Zeng, Shu Tao, Yu Deng, Meng Jiang

Existing methods learned semantic representations with dual encoders or dual variational auto-encoders.

Retrieval

Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning

no code implementations 12 Mar 2020 Mandana Saebi, Steven Krieg, Chuxu Zhang, Meng Jiang, Nitesh Chawla

Path-based relational reasoning over knowledge graphs has become increasingly popular due to a variety of downstream applications such as question answering in dialogue systems, fact prediction, and recommender systems.

Knowledge Graphs Question Answering +4

Improving Generalizability of Fake News Detection Methods using Propensity Score Matching

1 code implementation 28 Jan 2020 Bo Ni, Zhichun Guo, Jianing Li, Meng Jiang

Recently, due to the booming influence of online social networks, detecting fake news has been drawing significant attention from both academic communities and the general public.

Fake News Detection regression

Few-Shot Knowledge Graph Completion

1 code implementation 26 Nov 2019 Chuxu Zhang, Huaxiu Yao, Chao Huang, Meng Jiang, Zhenhui Li, Nitesh V. Chawla

Knowledge graphs (KGs) serve as useful resources for various natural language processing applications.

One-Shot Learning

Faceted Hierarchy: A New Graph Type to Organize Scientific Concepts and a Construction Method

no code implementations WS 2019 Qingkai Zeng, Mengxia Yu, Wenhao Yu, JinJun Xiong, Yiyu Shi, Meng Jiang

On a scientific concept hierarchy, a parent concept may have a few attributes, each of which has multiple values being a group of child concepts.

Face Recognition

Multi-Input Multi-Output Sequence Labeling for Joint Extraction of Fact and Condition Tuples from Scientific Text

no code implementations IJCNLP 2019 Tianwen Jiang, Tong Zhao, Bing Qin, Ting Liu, Nitesh Chawla, Meng Jiang

In this work, we propose a new sequence labeling framework (as well as a new tag schema) to jointly extract the fact and condition tuples from statement sentences.

TAG

TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering

2 code implementations 22 Dec 2018 Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen, Meng Jiang, Brian Sadler, Michelle Vanni, Jiawei Han

Our method, TaxoGen, uses term embeddings and hierarchical clustering to construct a topic taxonomy in a recursive fashion.

Databases

PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence

no code implementations 26 Feb 2018 Jinglan Liu, Jiaxin Zhang, Yukun Ding, Xiaowei Xu, Meng Jiang, Yiyu Shi

This work explores the binarization of the deconvolution-based generator in a GAN for memory saving and speedup of image construction.

Binarization

MetaPAD: Meta Pattern Discovery from Massive Text Corpora

no code implementations 13 Mar 2017 Meng Jiang, Jingbo Shang, Taylor Cassidy, Xiang Ren, Lance M. Kaplan, Timothy P. Hanratty, Jiawei Han

We propose an efficient framework, called MetaPAD, which discovers meta patterns from massive corpora with three techniques: (1) it develops a context-aware segmentation method to carefully determine the boundaries of patterns with a learnt pattern quality assessment function, which avoids costly dependency parsing and generates high-quality patterns; (2) it identifies and groups synonymous meta patterns from multiple facets---their types, contexts, and extractions; and (3) it examines type distributions of entities in the instances extracted by each group of patterns, and looks for appropriate type levels to make discovered patterns precise.

Dependency Parsing

Automated Phrase Mining from Massive Text Corpora

4 code implementations 15 Feb 2017 Jingbo Shang, Jialu Liu, Meng Jiang, Xiang Ren, Clare R. Voss, Jiawei Han

As one of the fundamental tasks in text analysis, phrase mining aims at extracting quality phrases from a text corpus.

General Knowledge POS +1

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns

no code implementations 31 Oct 2016 Jingbo Shang, Meng Jiang, Wenzhu Tong, Jinfeng Xiao, Jian Peng, Jiawei Han

In the literature, two series of models have been proposed to address prediction problems including classification and regression.
