Search Results for author: Jing Jiang

Found 154 papers, 68 papers with code

Learning and Evaluating Chinese Idiom Embeddings

1 code implementation • RANLP 2021 • Minghuan Tan, Jing Jiang

We find that our method substantially outperforms existing methods on the evaluation dataset we have constructed.

Paper
Code

Does BERT Understand Idioms? A Probing-Based Empirical Study of BERT Encodings of Idioms

no code implementations • RANLP 2021 • Minghuan Tan, Jing Jiang

Understanding idioms is important in NLP.

Paraphrase Identification

Paper
Add Code

Translate-Train Embracing Translationese Artifacts

no code implementations • ACL 2022 • Sicheng Yu, Qianru Sun, Hao Zhang, Jing Jiang

Translate-train is a general training approach to multilingual tasks.

Paper
Add Code

Coupled Hierarchical Transformer for Stance-Aware Rumor Verification in Social Media Conversations

no code implementations • EMNLP 2020 • Jianfei Yu, Jing Jiang, Ling Min Serena Khoo, Hai Leong Chieu, Rui Xia

The prevalent use of social media enables rapid spread of rumors on a massive scale, which leads to the emerging need of automatic rumor verification (RV).

Multi-Task Learning Stance Classification

Paper
Add Code

What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning

no code implementations • 16 Apr 2024 • Zhihong Deng, Jing Jiang, Guodong Long, Chengqi Zhang

In sequential decision-making problems involving sensitive attributes like race and gender, reinforcement learning (RL) agents must carefully consider long-term fairness while maximizing returns.

Paper
Add Code

Peer-aided Repairer: Empowering Large Language Models to Repair Advanced Student Assignments

no code implementations • 2 Apr 2024 • Qianhui Zhao, Fang Liu, Li Zhang, Yang Liu, Zhen Yan, Zhenghao Chen, Yufei Zhou, Jing Jiang, Ge Li

Automated generation of feedback on programming assignments holds significant benefits for programming education, especially when it comes to advanced assignments.

Language Modelling Large Language Model +1

Paper
Add Code

Dual-Personalizing Adapter for Federated Foundation Models

no code implementations • 28 Mar 2024 • Yiyuan Yang, Guodong Long, Tao Shen, Jing Jiang, Michael Blumenstein

To address challenges in this new setting, we explore a simple yet effective solution to learn a comprehensive foundation model.

Federated Learning

Paper
Add Code

Modularized Networks for Few-shot Hateful Meme Detection

1 code implementation • 19 Feb 2024 • Rui Cao, Roy Ka-Wei Lee, Jing Jiang

We then use the few available annotated samples to train a module composer, which assigns weights to the LoRA modules based on their relevance.

Few-Shot Learning In-Context Learning

Paper
Code

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

1 code implementation • 13 Feb 2024 • Xiangming Gu, Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Ye Wang, Jing Jiang, Min Lin

A multimodal large language model (MLLM) agent can receive instructions, capture images, retrieve histories from memory, and decide which tools to use.

Language Modelling Large Language Model

Paper
Code

Knowledge Generation for Zero-shot Knowledge-based VQA

1 code implementation • 4 Feb 2024 • Rui Cao, Jing Jiang

Previous solutions to knowledge-based visual question answering~(K-VQA) retrieve knowledge from external knowledge bases and use supervised learning to train the K-VQA model.

Question Answering Visual Question Answering

Paper
Code

GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

no code implementations • 3 Feb 2024 • Cunxiao Du, Jing Jiang, Xu Yuanchen, Jiawei Wu, Sicheng Yu, Yongqi Li, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu, Yang You

Speculative decoding is a relatively new decoding framework that leverages small and efficient draft models to reduce the latency of LLMs.

Paper
Add Code

Revisiting the Markov Property for Machine Translation

no code implementations • 3 Feb 2024 • Cunxiao Du, Hao Zhou, Zhaopeng Tu, Jing Jiang

In this paper, we re-examine the Markov property in the context of neural machine translation.

Machine Translation Translation

Paper
Add Code

Enhancing Document-level Translation of Large Language Model via Translation Mixed-instructions

no code implementations • 16 Jan 2024 • Yachao Li, Junhui Li, Jing Jiang, Min Zhang

Our proposed translation mixed-instructions enable LLMs (Llama-2~7B and 13B) to maintain consistent translation performance from the sentence level to documents containing as many as 2048 tokens.

Language Modelling Large Language Model +3

Paper
Add Code

Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey

1 code implementation • 5 Dec 2023 • Shengchao Chen, Guodong Long, Jing Jiang, Dikai Liu, Chengqi Zhang

Furthermore, in relation to the creation and application of foundation models for weather and climate data understanding, we delve into the field's prevailing challenges, offer crucial insights, and propose detailed avenues for future research.

Paper
Code

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

1 code implementation • 28 Nov 2023 • Yijun Yang, Tianyi Zhou, Kanxue Li, Dapeng Tao, Lusong Li, Li Shen, Xiaodong He, Jing Jiang, Yuhui Shi

While large language models (LLMs) excel in a simulated world of texts, they struggle to interact with the more realistic world without perceptions of other modalities such as visual or audio signals.

Imitation Learning

Paper
Code

Uplift Modeling based on Graph Neural Network Combined with Causal Knowledge

1 code implementation • 14 Nov 2023 • Haowen Wang, Xinyan Ye, Yangze Zhou, Zhiyi Zhang, Longhan Zhang, Jing Jiang

Through uplift modeling, we can identify the treatment with the greatest benefit.

Marketing

Paper
Code

Intriguing Properties of Data Attribution on Diffusion Models

1 code implementation • 1 Nov 2023 • Xiaosen Zheng, Tianyu Pang, Chao Du, Jing Jiang, Min Lin

Data attribution seeks to trace model outputs back to training data.

counterfactual

Paper
Code

ROME: Evaluating Pre-trained Vision-Language Models on Reasoning beyond Visual Common Sense

1 code implementation • 30 Oct 2023 • Kankan Zhou, Eason Lai, Wei Bin Au Yeong, Kyriakos Mouratidis, Jing Jiang

Humans possess a strong capability for reasoning beyond common sense.

Common Sense Reasoning Language Modelling

Paper
Code

Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution

no code implementations • 21 Sep 2023 • Shuang Ao, Tianyi Zhou, Guodong Long, Xuan Song, Jing Jiang

Throughout long history, natural species have learned to survive by evolving their physical structures adaptive to the environment changes.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

GPT-Lab: Next Generation Of Optimal Chemistry Discovery By GPT Driven Robotic Lab

no code implementations • 15 Sep 2023 • Xiaokai Qin, Mingda Song, Yangguan Chen, Zhehong Ai, Jing Jiang

The integration of robots in chemical experiments has enhanced experimental efficiency, but lacking the human intelligence to comprehend literature, they seldom provide assistance in experimental design.

Experimental Design

Paper
Add Code

Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection

3 code implementations • 16 Aug 2023 • Rui Cao, Ming Shan Hee, Adriel Kuek, Wen-Haw Chong, Roy Ka-Wei Lee, Jing Jiang

Specifically, we prompt a frozen PVLM by asking hateful content-related questions and use the answers as image captions (which we call Pro-Cap), so that the captions contain information critical for hateful content detection.

Image Captioning Language Modelling +2

Paper
Code

Causal Reinforcement Learning: A Survey

no code implementations • 4 Jul 2023 • Zhihong Deng, Jing Jiang, Guodong Long, Chengqi Zhang

Causality, however, offers a notable advantage as it can formalize knowledge in a systematic manner and leverage invariance for effective knowledge transfer.

reinforcement-learning Transfer Learning

Paper
Add Code

Continual Task Allocation in Meta-Policy Network via Sparse Prompting

1 code implementation • 29 May 2023 • Yijun Yang, Tianyi Zhou, Jing Jiang, Guodong Long, Yuhui Shi

We address it by "Continual Task Allocation via Sparse Prompting (CoTASP)", which learns over-complete dictionaries to produce sparse masks as prompts extracting a sub-network for each task from a meta-policy network.

Paper
Code

Modularized Zero-shot VQA with Pre-trained Models

1 code implementation • 27 May 2023 • Rui Cao, Jing Jiang

We propose a modularized zero-shot network that explicitly decomposes questions into sub reasoning steps and is highly interpretable.

object-detection Object Detection +3

Paper
Code

Spatial-temporal Prompt Learning for Federated Weather Forecasting

no code implementations • 23 May 2023 • Shengchao Chen, Guodong Long, Tao Shen, Tianyi Zhou, Jing Jiang

Federated weather forecasting is a promising collaborative learning framework for analyzing meteorological data across participants from different countries and regions, thus embodying a global-scale real-time weather data predictive analytics platform to tackle climate change.

Time Series Weather Forecasting

Paper
Add Code

Does Continual Learning Equally Forget All Parameters?

no code implementations • 9 Apr 2023 • Haiyan Zhao, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

In this paper, we study which modules in neural networks are more prone to forgetting by investigating their training dynamics during CL.

Attribute Continual Learning

Paper
Add Code

Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

no code implementations • 14 Mar 2023 • Han Zheng, Xufang Luo, Pengfei Wei, Xuan Song, Dongsheng Li, Jing Jiang

In this paper, we consider an offline-to-online setting where the agent is first learned from the offline dataset and then trained online, and propose a framework called Adaptive Policy Learning for effectively taking advantage of offline and online data.

Continuous Control Offline RL +2

Paper
Add Code

Prompting for Multimodal Hateful Meme Classification

no code implementations • 8 Feb 2023 • Rui Cao, Roy Ka-Wei Lee, Wen-Haw Chong, Jing Jiang

Specifically, we construct simple prompts and provide a few in-context examples to exploit the implicit knowledge in the pre-trained RoBERTa language model for hateful meme classification.

Ranked #3 on Hateful Meme Classification on HarMeme

Classification Hateful Meme Classification +1

Paper
Add Code

Voting from Nearest Tasks: Meta-Vote Pruning of Pre-trained Models for Downstream Tasks

no code implementations • 27 Jan 2023 • Haiyan Zhao, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

To address these challenges, we create a small model for a new task from the pruned models of similar tasks.

Paper
Add Code

Prompt Federated Learning for Weather Forecasting: Toward Foundation Models on Meteorological Data

1 code implementation • 22 Jan 2023 • Shengchao Chen, Guodong Long, Tao Shen, Jing Jiang

To relieve the data exposure concern across regions, a novel federated learning approach has been proposed to collaboratively learn a brand-new spatio-temporal Transformer-based foundation model across participants with heterogeneous meteorological data.

Federated Learning Time Series +2

Paper
Code

P-Transformer: Towards Better Document-to-Document Neural Machine Translation

no code implementations • 12 Dec 2022 • Yachao Li, Junhui Li, Jing Jiang, Shimin Tao, Hao Yang, Min Zhang

To alleviate this problem, we propose a position-aware Transformer (P-Transformer) to enhance both the absolute and relative position information in both self-attention and cross-attention.

Machine Translation NMT +3

Paper
Add Code

Federated Learning on Non-IID Graphs via Structural Knowledge Sharing

1 code implementation • 23 Nov 2022 • Yue Tan, Yixin Liu, Guodong Long, Jing Jiang, Qinghua Lu, Chengqi Zhang

Inspired by this, we propose FedStar, an FGL framework that extracts and shares the common underlying structure information for inter-graph federated learning tasks.

Federated Learning Graph Learning

Paper
Code

CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification

no code implementations • 11 Nov 2022 • Yang Li, Canran Xu, Guodong Long, Tao Shen, Chongyang Tao, Jing Jiang

Basically, an instance-dependent soft prefix, derived from fact-counterfactual pairs in the label space, is leveraged to complement the language verbalizers in many-class classification.

Classification counterfactual +7

Paper
Add Code

ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

no code implementations • COLING 2022 • Cunxiao Du, Zhaopeng Tu, Longyue Wang, Jing Jiang

Recently, a new training oaxe loss has proven effective to ameliorate the effect of multimodality for non-autoregressive translation (NAT), which removes the penalty of word order errors in the standard cross-entropy loss.

Machine Translation Sentence +1

Paper
Add Code

Federated Learning from Pre-Trained Models: A Contrastive Learning Approach

2 code implementations • 21 Sep 2022 • Yue Tan, Guodong Long, Jie Ma, Lu Liu, Tianyi Zhou, Jing Jiang

To prevent these issues from hindering the deployment of FL systems, we propose a lightweight framework where clients jointly learn to fuse the representations generated by multiple fixed pre-trained models rather than training a large-scale model from scratch.

Contrastive Learning Federated Learning

1,135

Paper
Code

Improving Compositional Generalization in Math Word Problem Solving

1 code implementation • 3 Sep 2022 • Yunshi Lan, Lei Wang, Jing Jiang, Ee-Peng Lim

To improve the compositional generalization in MWP solving, we propose an iterative data augmentation method that includes diverse compositional variation into training data and could collaborate with MWP methods.

Data Augmentation Math +1

Paper
Code

Disentangling Identity and Pose for Facial Expression Recognition

no code implementations • 17 Aug 2022 • Jing Jiang, Weihong Deng

Combining identity and pose feature, a neutral face of input individual should be generated by the decoder.

Disentanglement Face Recognition +2

Paper
Add Code

Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

1 code implementation • NeurIPS 2023 • Pengfei Wei, Lingdong Kong, Xinghua Qu, Yi Ren, Zhiqiang Xu, Jing Jiang, Xiang Yin

Specifically, we consider the generation of cross-domain videos from two sets of latent factors, one encoding the static information and another encoding the dynamic information.

Action Recognition Disentanglement +1

119

Paper
Code

Boosting Facial Expression Recognition by A Semi-Supervised Progressive Teacher

no code implementations • 28 May 2022 • Jing Jiang, Weihong Deng

On the one hand, PT introduces semi-supervised learning method to relieve the shortage of data in FER.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Add Code

FedNoiL: A Simple Two-Level Sampling Method for Federated Learning with Noisy Labels

no code implementations • 20 May 2022 • Zhuowei Wang, Tianyi Zhou, Guodong Long, Bo Han, Jing Jiang

Federated learning (FL) aims at training a global model on the server side while the training data are collected and located at the local devices.

Federated Learning Learning with noisy labels

Paper
Add Code

An Empirical Study of Memorization in NLP

1 code implementation • ACL 2022 • Xiaosen Zheng, Jing Jiang

We empirically show that our memorization attribution method is faithful, and share our interesting finding that the top-memorized parts of a training instance tend to be features negatively correlated with the class label.

Memorization

Paper
Code

Personalized Federated Learning With Graph

1 code implementation • 2 Mar 2022 • Fengwen Chen, Guodong Long, Zonghan Wu, Tianyi Zhou, Jing Jiang

We propose a novel structured federated learning (SFL) framework to learn both the global and personalized models simultaneously using client-wise relation graphs and clients' private data.

Personalized Federated Learning Relation

Paper
Code

Exploring and Adapting Chinese GPT to Pinyin Input Method

1 code implementation • ACL 2022 • Minghuan Tan, Yong Dai, Duyu Tang, Zhangyin Feng, Guoping Huang, Jing Jiang, Jiwei Li, Shuming Shi

We find that a frozen GPT achieves state-of-the-art performance on perfect pinyin.

Text Generation

Paper
Code

Applications of blockchain and artificial intelligence technologies for enabling prosumers in smart grids: A review

no code implementations • 21 Feb 2022 • Weiqi Hua, Ying Chen, Meysam Qadrdan, Jing Jiang, Hongjian Sun, Jianzhong Wu

The blockchain and artificial intelligence (AI) are innovative technologies to fulfil these two factors, by which the blockchain provides decentralised trading platforms for energy markets and the AI supports the optimal operational control of power systems.

Decision Making

Paper
Add Code

On the Convergence of Clustered Federated Learning

1 code implementation • 13 Feb 2022 • Jie Ma, Guodong Long, Tianyi Zhou, Jing Jiang, Chengqi Zhang

Knowledge sharing and model personalization are essential components to tackle the non-IID challenge in federated learning (FL).

Federated Learning

Paper
Code

CO-PILOT: COllaborative Planning and reInforcement Learning On sub-Task curriculum

1 code implementation • NeurIPS 2021 • Shuang Ao, Tianyi Zhou, Guodong Long, Qinghua Lu, Liming Zhu, Jing Jiang

Next, a bottom-up traversal of the tree trains the RL agent from easier sub-tasks with denser rewards on bottom layers to harder ones on top layers and collects its cost on each sub-task train the planner in the next episode.

Continuous Control reinforcement-learning +1

Paper
Code

Handling Inter-class and Intra-class Imbalance in Class-imbalanced Learning

1 code implementation • 24 Nov 2021 • Zhining Liu, Pengfei Wei, Zhepei Wei, Boyang Yu, Jing Jiang, Wei Cao, Jiang Bian, Yi Chang

Class-imbalance is a common problem in machine learning practice.

Ensemble Learning

Paper
Code

False Correlation Reduction for Offline Reinforcement Learning

1 code implementation • 24 Oct 2021 • Zhihong Deng, Zuyue Fu, Lingxiao Wang, Zhuoran Yang, Chenjia Bai, Tianyi Zhou, Zhaoran Wang, Jing Jiang

Offline reinforcement learning (RL) harnesses the power of massive datasets for resolving sequential decision problems.

D4RL Decision Making +3

Paper
Code

Adaptive Q-learning for Interaction-Limited Reinforcement Learning

no code implementations • 29 Sep 2021 • Han Zheng, Xufang Luo, Pengfei Wei, Xuan Song, Dongsheng Li, Jing Jiang

Specifically, we explicitly consider the difference between the online and offline data and apply an adaptive update scheme accordingly, i. e., a pessimistic update strategy for the offline dataset and a greedy or no pessimistic update scheme for the online dataset.

Offline RL Q-Learning +2

Paper
Add Code

EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning

no code implementations • 29 Sep 2021 • Shuang Ao, Tianyi Zhou, Jing Jiang, Guodong Long, Xuan Song, Chengqi Zhang

They are complementary in acquiring more informative feedback for RL: the planning policy provides dense reward of finishing easier sub-tasks while the environment policy modifies these sub-tasks to be adequately challenging and diverse so the RL agent can quickly adapt to different tasks/environments.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Digital Twins based Day-ahead Integrated Energy System Scheduling under Load and Renewable Energy Uncertainties

no code implementations • 29 Sep 2021 • Minglei You, Qian Wang, Hongjian Sun, Ivan Castro, Jing Jiang

By constructing digital twins (DT) of an integrated energy system (IES), one can benefit from DT's predictive capabilities to improve coordinations among various energy converters, hence enhancing energy efficiency, cost savings and carbon emission reduction.

Scheduling

Paper
Add Code

Uncertainty Regularized Policy Learning for Offline Reinforcement Learning

no code implementations • 29 Sep 2021 • Han Zheng, Jing Jiang, Pengfei Wei, Guodong Long, Xuan Song, Chengqi Zhang

URPL adds an uncertainty regularization term in the policy learning objective to enforce to learn a more stable policy under the offline setting.

D4RL Offline RL +2

Paper
Add Code

Pareto Policy Pool for Model-based Offline Reinforcement Learning

no code implementations • ICLR 2022 • Yijun Yang, Jing Jiang, Tianyi Zhou, Jie Ma, Yuhui Shi

Model-based offline RL instead trains an environment model using a dataset of pre-collected experiences so online RL methods can learn in an offline manner by solely interacting with the model.

D4RL Offline RL +2

Paper
Add Code

Vote for Nearest Neighbors Meta-Pruning of Self-Supervised Networks

no code implementations • 29 Sep 2021 • Haiyan Zhao, Tianyi Zhou, Guodong Long, Jing Jiang, Liming Zhu, Chengqi Zhang

Can we find a better initialization for a new task, e. g., a much smaller network closer to the final pruned model, by exploiting its similar tasks?

Paper
Add Code

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

1 code implementation • Findings (EMNLP) 2021 • Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim

While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects.

Graph Question Answering Question Answering

Paper
Code

Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision

no code implementations • Findings (NAACL) 2022 • Yang Li, Guodong Long, Tao Shen, Jing Jiang

It consists of (1) a pairwise type-enriched sentence encoding module injecting both context-free and -related backgrounds to alleviate sentence-level wrong labeling, and (2) a hierarchical type-sentence alignment module enriching a sentence with the triple fact's basic attributes to support long-tail relations.

Attribute Knowledge Graphs +4

Paper
Add Code

Sequential Diagnosis Prediction with Transformer and Ontological Representation

1 code implementation • 7 Sep 2021 • Xueping Peng, Guodong Long, Tao Shen, Sen Wang, Jing Jiang

Sequential diagnosis prediction on the Electronic Health Record (EHR) has been proven crucial for predictive analytics in the medical domain.

Sequential Diagnosis

Paper
Code

Federated Learning for Open Banking

no code implementations • 24 Aug 2021 • Guodong Long, Yue Tan, Jing Jiang, Chengqi Zhang

In the near future, it is foreseeable to have decentralized data ownership in the finance sector using federated learning.

Federated Learning

Paper
Add Code

Federated Learning for Privacy-Preserving Open Innovation Future on Digital Health

no code implementations • 24 Aug 2021 • Guodong Long, Tao Shen, Yue Tan, Leah Gerrard, Allison Clarke, Jing Jiang

Implementing an open innovation framework in the healthcare industry, namely open health, is to enhance innovation and creative capability of health-related organisations by building a next-generation collaborative framework with partner organisations and the research community.

Federated Learning Privacy Preserving

Paper
Add Code

Multi-Center Federated Learning: Clients Clustering for Better Personalization

1 code implementation • 19 Aug 2021 • Guodong Long, Ming Xie, Tao Shen, Tianyi Zhou, Xianzhi Wang, Jing Jiang, Chengqi Zhang

By comparison, a mixture of multiple global models could capture the heterogeneity across various clients if assigning the client to different global models (i. e., centers) in FL.

Clustering Decision Making +1

Paper
Code

Complex Knowledge Base Question Answering: A Survey

1 code implementation • 15 Aug 2021 • Yunshi Lan, Gaole He, Jinhao Jiang, Jing Jiang, Wayne Xin Zhao, Ji-Rong Wen

Knowledge base question answering (KBQA) aims to answer a question over a knowledge base (KB).

Information Retrieval Knowledge Base Question Answering +2

165

Paper
Code

Disentangling Hate in Online Memes

no code implementations • 9 Aug 2021 • Rui Cao, Ziqing Fan, Roy Ka-Wei Lee, Wen-Haw Chong, Jing Jiang

Our experiment results show that DisMultiHate is able to outperform state-of-the-art unimodal and multimodal baselines in the hateful meme classification task.

Ranked #4 on Hateful Meme Classification on HarMeme

Classification Hateful Meme Classification

Paper
Add Code

Modeling Transitions of Focal Entities for Conversational Knowledge Base Question Answering

1 code implementation • ACL 2021 • Yunshi Lan, Jing Jiang

We propose a novel graph-based model to capture the transitions of focal entities and apply a graph neural network to derive a probability distribution of focal entities for each question, which is then combined with a standard KBQA module to perform answer ranking.

Knowledge Base Question Answering

Paper
Code

COSY: COunterfactual SYntax for Cross-Lingual Understanding

1 code implementation • ACL 2021 • Sicheng Yu, Hao Zhang, Yulei Niu, Qianru Sun, Jing Jiang

Pre-trained multilingual language models, e. g., multilingual-BERT, are widely used in cross-lingual tasks, yielding the state-of-the-art performance.

counterfactual Natural Language Inference +3

Paper
Code

MIPO: Mutual Integration of Patient Journey and Medical Ontology for Healthcare Representation Learning

1 code implementation • 20 Jul 2021 • Xueping Peng, Guodong Long, Sen Wang, Jing Jiang, Allison Clarke, Clement Schlegel, Chengqi Zhang

Hence, some recent works train healthcare representations by incorporating medical ontology, by self-supervised tasks like diagnosis prediction, but (1) the small-scale, monotonous ontology is insufficient for robust learning, and (2) critical contexts or dependencies underlying patient journeys are barely exploited to enhance ontology learning.

Graph Embedding Ontology Embedding +1

Paper
Code

Beyond Low-pass Filtering: Graph Convolutional Networks with Automatic Filtering

1 code implementation • 10 Jul 2021 • Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Chengqi Zhang

Second, the bandwidth of existing graph convolutional filters is fixed.

Paper
Code

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

1 code implementation • 9 Jun 2021 • Cunxiao Du, Zhaopeng Tu, Jing Jiang

We propose a new training objective named order-agnostic cross entropy (OaXE) for fully non-autoregressive translation (NAT) models.

Machine Translation Translation

Paper
Code

A Survey on Complex Knowledge Base Question Answering: Methods, Challenges and Solutions

no code implementations • 25 May 2021 • Yunshi Lan, Gaole He, Jinhao Jiang, Jing Jiang, Wayne Xin Zhao, Ji-Rong Wen

In this paper, we elaborately summarize the typical challenges and solutions for complex KBQA.

Information Retrieval Knowledge Base Question Answering +2

Paper
Add Code

Investigating Math Word Problems using Pretrained Multilingual Language Models

1 code implementation • 19 May 2021 • Minghuan Tan, Lei Wang, Lingxiao Jiang, Jing Jiang

In this paper, we revisit math word problems~(MWPs) from the cross-lingual and multilingual perspective.

Machine Translation Math +2

Paper
Code

FedProto: Federated Prototype Learning across Heterogeneous Clients

4 code implementations • 1 May 2021 • Yue Tan, Guodong Long, Lu Liu, Tianyi Zhou, Qinghua Lu, Jing Jiang, Chengqi Zhang

Heterogeneity across clients in federated learning (FL) usually hinders the optimization convergence and generalization performance when the aggregation of clients' knowledge occurs in the gradient space.

Federated Learning

1,135

Paper
Code

Cross-Topic Rumor Detection using Topic-Mixtures

no code implementations • EACL 2021 • Xiaoying Ren, Jing Jiang, Ling Min Serena Khoo, Hai Leong Chieu

After deriving a vector representation for each topic, given an instance, we derive a {``}topic mixture{''} vector for the instance based on its topic distribution.

Paper
Add Code

A Low-Complexity ADMM-based Massive MIMO Detectors via Deep Neural Networks

no code implementations • 27 Feb 2021 • Isayiyas Nigatu Tiba, Quan Zhang, Jing Jiang, Yongchao Wang

An alternate direction method of multipliers (ADMM)-based detectors can achieve good performance in both small and large-scale multiple-input multiple-output (MIMO) systems.

Paper
Add Code

Isometric Propagation Network for Generalized Zero-shot Learning

no code implementations • ICLR 2021 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Xuanyi Dong, Chengqi Zhang

To resolve this problem, we propose Isometric Propagation Network (IPN), which learns to strengthen the relation between classes within each space and align the class dependency in the two spaces.

Generalized Zero-Shot Learning

Paper
Add Code

Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task

no code implementations • 24 Jan 2021 • Xiaohan Zhang, Lu Liu, Guodong Long, Jing Jiang, Shenquan Liu

Typical methods to study cognitive function are to record the electrical activities of animal neurons during the training of animals performing behavioral tasks.

Decision Making Hippocampus +3

Paper
Add Code

Attention-Guided Black-box Adversarial Attacks with Large-Scale Multiobjective Evolutionary Optimization

no code implementations • ICML Workshop AML 2021 • Jie Wang, Zhaoxia Yin, Jing Jiang, Yang Du

In this paper, we propose an attention-guided black-box adversarial attack based on the large-scale multiobjective evolutionary optimization, termed as LMOA.

Adversarial Attack

Paper
Add Code

PICA: A Pixel Correlation-based Attentional Black-box Adversarial Attack

no code implementations • 19 Jan 2021 • Jie Wang, Zhaoxia Yin, Jin Tang, Jing Jiang, Bin Luo

The studies on black-box adversarial attacks have become increasingly prevalent due to the intractable acquisition of the structural knowledge of deep neural networks (DNNs).

Adversarial Attack

Paper
Add Code

Improving Multi-hop Knowledge Base Question Answering by Learning Intermediate Supervision Signals

1 code implementation • 11 Jan 2021 • Gaole He, Yunshi Lan, Jing Jiang, Wayne Xin Zhao, Ji-Rong Wen

In our approach, the student network aims to find the correct answer to the query, while the teacher network tries to learn intermediate supervision signals for improving the reasoning capacity of the student network.

Ranked #2 on Semantic Parsing on WebQuestionsSP

Knowledge Base Question Answering Semantic Parsing

119

Paper
Code

Extract Local Inference Chains of Deep Neural Nets

no code implementations • 1 Jan 2021 • Haiyan Zhao, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

In this paper, we introduce an efficient method, \name, to extract the local inference chains by optimizing a differentiable sparse scoring for the filters and layers to preserve the outputs on given data from a local region.

Interpretable Machine Learning Network Pruning

Paper
Add Code

MASP: Model-Agnostic Sample Propagation for Few-shot learning

no code implementations • 1 Jan 2021 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Xuanyi Dong, Chengqi Zhang

Few-shot learning aims to train a classifier given only a few samples per class that are highly insufficient to describe the whole data distribution.

Few-Shot Learning

Paper
Add Code

SemiNLL: A Framework of Noisy-Label Learning by Semi-Supervised Learning

no code implementations • 2 Dec 2020 • Zhuowei Wang, Jing Jiang, Bo Han, Lei Feng, Bo An, Gang Niu, Guodong Long

We also instantiate our framework with different combinations, which set the new state of the art on benchmark-simulated and real-world datasets with noisy labels.

Learning with noisy labels

Paper
Add Code

Confusable Learning for Large-class Few-Shot Classification

no code implementations • 6 Nov 2020 • Bingcong Li, Bo Han, Zhuowei Wang, Jing Jiang, Guodong Long

Specifically, our method maintains a dynamically updating confusion matrix, which analyzes confusable classes in the dataset.

Classification Few-Shot Image Classification +2

Paper
Add Code

A BERT-based Dual Embedding Model for Chinese Idiom Prediction

1 code implementation • COLING 2020 • Minghuan Tan, Jing Jiang

Specifically, we first match the embedding of each candidate idiom with the hidden representation corresponding to the blank in the context.

Cloze Test

Paper
Code

Cooperative Heterogeneous Deep Reinforcement Learning

no code implementations • NeurIPS 2020 • Han Zheng, Pengfei Wei, Jing Jiang, Guodong Long, Qinghua Lu, Chengqi Zhang

Numerous deep reinforcement learning agents have been proposed, and each of them has its strengths and flaws.

Continuous Control Evolutionary Algorithms +2

Paper
Add Code

MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler

2 code implementations • NeurIPS 2020 • Zhining Liu, Pengfei Wei, Jing Jiang, Wei Cao, Jiang Bian, Yi Chang

This makes MESA generally applicable to most of the existing learning models and the meta-sampler can be efficiently applied to new tasks.

imbalanced classification Meta-Learning

103

Paper
Code

Counterfactual Variable Control for Robust and Interpretable Question Answering

1 code implementation • 12 Oct 2020 • Sicheng Yu, Yulei Niu, Shuohang Wang, Jing Jiang, Qianru Sun

We then conduct two novel CVC inference methods (on trained models) to capture the effect of comprehensive reasoning as the final prediction.

Causal Inference counterfactual +3

Paper
Code

RatE: Relation-Adaptive Translating Embedding for Knowledge Graph Completion

no code implementations • COLING 2020 • Hao Huang, Guodong Long, Tao Shen, Jing Jiang, Chengqi Zhang

Many graph embedding approaches have been proposed for knowledge graph completion via link prediction.

Graph Embedding Knowledge Graph Completion +3

Paper
Add Code

Improving Long-Tail Relation Extraction with Collaborating Relation-Augmented Attention

2 code implementations • COLING 2020 • Yang Li, Tao Shen, Guodong Long, Jing Jiang, Tianyi Zhou, Chengqi Zhang

Then, facilitated by the proposed base model, we introduce collaborating relation features shared among relations in the hierarchies to promote the relation-augmenting process and balance the training data for long-tail relations.

Relation Relation Extraction +1

Paper
Code

Cross-Thought for Sentence Encoder Pre-training

1 code implementation • EMNLP 2020 • Shuohang Wang, Yuwei Fang, Siqi Sun, Zhe Gan, Yu Cheng, Jing Jiang, Jingjing Liu

In this paper, we propose Cross-Thought, a novel approach to pre-training sequence encoder, which is instrumental in building reusable sequence embeddings for large-scale NLP tasks such as question answering.

Information Retrieval Language Modelling +5

Paper
Code

Context Modeling with Evidence Filter for Multiple Choice Question Answering

no code implementations • 6 Oct 2020 • Sicheng Yu, Hao Zhang, Wei Jing, Jing Jiang

In addition to the effective reduction of human efforts of our approach compared, through extensive experiments on OpenbookQA, we show that the proposed approach outperforms the models that use the same backbone and more training data; and our parameter analysis also demonstrates the interpretability of our approach.

Machine Reading Comprehension Multiple-choice +1

Paper
Add Code

Attribute Propagation Network for Graph Zero-shot Learning

no code implementations • 24 Sep 2020 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

To address this challenging task, most ZSL methods relate unseen test classes to seen(training) classes via a pre-defined set of attributes that can describe all classes in the same semantic space, so the knowledge learned on the training classes can be adapted to unseen classes.

Attribute Meta-Learning +1

Paper
Add Code

BiteNet: Bidirectional Temporal Encoder Network to Predict Medical Outcomes

1 code implementation • 24 Sep 2020 • Xueping Peng, Guodong Long, Tao Shen, Sen Wang, Jing Jiang, Chengqi Zhang

Electronic health records (EHRs) are longitudinal records of a patient's interactions with healthcare systems.

Clustering

Paper
Code

Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer

no code implementations • ACL 2020 • Jianfei Yu, Jing Jiang, Li Yang, Rui Xia

To tackle the first issue, we propose a multimodal interaction module to obtain both image-aware word representations and word-aware visual representations.

Multi-modal Named Entity Recognition named-entity-recognition +1

Paper
Add Code

Query Graph Generation for Answering Multi-hop Complex Questions from Knowledge Bases

no code implementations • ACL 2020 • Yunshi Lan, Jing Jiang

Previous work on answering complex questions from knowledge bases usually separately addresses two types of complexity: questions with constraints and questions with multiple hops of relations.

Graph Generation

Paper
Add Code

Many-Class Few-Shot Learning on Multi-Granularity Class Hierarchy

1 code implementation • 28 Jun 2020 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

We study many-class few-shot (MCFS) problem in both supervised learning and meta-learning settings.

Few-Shot Learning

Paper
Code

A Universal Representation Transformer Layer for Few-Shot Image Classification

1 code implementation • ICLR 2021 • Lu Liu, William Hamilton, Guodong Long, Jing Jiang, Hugo Larochelle

We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data sources.

Ranked #1 on Few-Shot Image Classification on Meta-Dataset Rank

Classification Domain Generalization +2

103

Paper
Code

Self-Attention Enhanced Patient Journey Understanding in Healthcare System

1 code implementation • 15 Jun 2020 • Xueping Peng, Guodong Long, Tao Shen, Sen Wang, Jing Jiang

The key challenge of patient journey understanding is to design an effective encoding mechanism which can properly tackle the aforementioned multi-level structured patient journey data with temporal sequential visits and a set of medical codes.

Paper
Code

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

2 code implementations • 24 May 2020 • Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Xiaojun Chang, Chengqi Zhang

Modeling multivariate time series has long been a subject that has attracted researchers from a diverse range of fields including economics, finance, and traffic.

Ranked #1 on Univariate Time Series Forecasting on Electricity

Graph Learning Multivariate Time Series Forecasting +3

2,481

Paper
Code

Online non-convex learning for river pollution source identification

no code implementations • 22 May 2020 • Wenjie Huang, Jing Jiang, Xiao Liu

In this paper, novel gradient-based online learning algorithms are developed to investigate an important environmental application: real-time river pollution source identification, which aims at estimating the released mass, location, and time of a river pollution source based on downstream sensor data monitoring the pollution concentration.

Paper
Add Code

Multi-Center Federated Learning: Clients Clustering for Better Personalization

3 code implementations • 3 May 2020 • Guodong Long, Ming Xie, Tao Shen, Tianyi Zhou, Xianzhi Wang, Jing Jiang, Chengqi Zhang

However, due to the diverse nature of user behaviors, assigning users' gradients to different global models (i. e., centers) can better capture the heterogeneity of data distributions across users.

Clustering Federated Learning

Paper
Code

Aspect and Opinion Aware Abstractive Review Summarization with Reinforced Hard Typed Decoder

no code implementations • 13 Apr 2020 • Yufei Tian, Jianfei Yu, Jing Jiang

In this paper, we study abstractive review summarization. Observing that review summaries often consist of aspect words, opinion words and context words, we propose a two-stage reinforcement learning approach, which first predicts the output word type from the three types, and then leverages the predicted word type to generate the final word distribution. Experimental results on two Amazon product review datasets demonstrate that our method can consistently outperform several strong baseline approaches based on ROUGE scores.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification

3 code implementations • ICLR 2022 • Wensi Tang, Guodong Long, Lu Liu, Tianyi Zhou, Michael Blumenstein, Jing Jiang

Particularly, it is a set of kernel sizes that can efficiently cover the best RF size across different datasets via consisting of multiple prime numbers according to the length of the time series.

General Classification Time Series +2

4,651

Paper
Code

Interpretable Rumor Detection in Microblogs by Attending to User Interactions

1 code implementation • 29 Jan 2020 • Ling Min Serena Khoo, Hai Leong Chieu, Zhong Qian, Jing Jiang

We propose a post-level attention model (PLAN) to model long distance interactions between tweets with the multi-head attention mechanism in a transformer network.

Sentence

Paper
Code

Multi-level Head-wise Match and Aggregation in Transformer for Textual Sequence Matching

no code implementations • 20 Jan 2020 • Shuohang Wang, Yunshi Lan, Yi Tay, Jing Jiang, Jingjing Liu

Transformer has been successfully applied to many natural language processing tasks.

QQP

Paper
Add Code

Multimodal Story Generation on Plural Images

no code implementations • 16 Jan 2020 • Jing Jiang

In this work, we propose the architecture to use images instead of text as the input of the text generation model, called StoryGen.

Story Generation

Paper
Add Code

Self-Attention Enhanced Selective Gate with Entity-Aware Embedding for Distantly Supervised Relation Extraction

no code implementations • 27 Nov 2019 • Yang Li, Guodong Long, Tao Shen, Tianyi Zhou, Lina Yao, Huan Huo, Jing Jiang

Distantly supervised relation extraction intrinsically suffers from noisy labels due to the strong assumption of distant supervision.

Entity Embeddings Relation +3

Paper
Add Code

Two-Headed Monster And Crossed Co-Attention Networks

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Yaoyiran Li, Jing Jiang

This paper presents some preliminary investigations of a new co-attention mechanism in neural transduction models.

Translation Vocal Bursts Valence Prediction

Paper
Add Code

What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?

no code implementations • 28 Oct 2019 • Chenglei Si, Shuohang Wang, Min-Yen Kan, Jing Jiang

Based on our experiments on the 5 key MCRC datasets - RACE, MCTest, MCScript, MCScript2. 0, DREAM - we observe that 1) fine-tuned BERT mainly learns how keywords lead to correct prediction, instead of learning semantic understanding and reasoning; and 2) BERT does not need correct syntactic information to solve the task; 3) there exists artifacts in these datasets such that they can be solved even without the full context.

Multiple-choice Reading Comprehension

Paper
Add Code

Temporal Self-Attention Network for Medical Concept Embedding

1 code implementation • 15 Sep 2019 • Xueping Peng, Guodong Long, Tao Shen, Sen Wang, Jing Jiang, Michael Blumenstein

In this paper, we propose a medical concept embedding method based on applying a self-attention mechanism to represent each medical concept.

Clustering

Paper
Code

Learning to Propagate for Graph Meta-Learning

1 code implementation • NeurIPS 2019 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

It can significantly improve tasks that suffer from insufficient training data, e. g., few shot learning.

Few-Shot Image Classification Few-Shot Learning

Paper
Code

Effective Search of Logical Forms for Weakly Supervised Knowledge-Based Question Answering

no code implementations • 6 Sep 2019 • Tao Shen, Xiubo Geng, Tao Qin, Guodong Long, Jing Jiang, Daxin Jiang

These two problems lead to a poorly-trained semantic parsing model.

Question Answering Semantic Parsing +1

Paper
Add Code

Attributed Graph Clustering: A Deep Attentional Embedding Approach

2 code implementations • 15 Jun 2019 • Chun Wang, Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, Chengqi Zhang

Graph clustering is a fundamental task which discovers communities or groups in networks.

Ranked #8 on Node Clustering on Cora

Clustering Graph Clustering +2

Paper
Code

Graph WaveNet for Deep Spatial-Temporal Graph Modeling

8 code implementations • 31 May 2019 • Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Chengqi Zhang

Spatial-temporal graph modeling is an important task to analyze the spatial relations and temporal trends of components in a system.

Ranked #5 on Traffic Prediction on NE-BJ

Relation Temporal Sequences +1

544

Paper
Code

Prototype Propagation Networks (PPN) for Weakly-supervised Few-shot Learning on Category Graph

2 code implementations • 10 May 2019 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Lina Yao, Chengqi Zhang

The resulting graph of prototypes can be continually re-used and updated for new tasks and classes.

Few-Shot Learning General Classification

Paper
Code

MahiNet: A Neural Network for Many-Class Few-Shot Learning with Class Hierarchy

no code implementations • ICLR 2019 • Lu Liu, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

It addresses the ``many-class'' problem by exploring the class hierarchy, e. g., the coarse-class label that covers a subset of fine classes, which helps to narrow down the candidates for the fine class and is cheaper to obtain.

Few-Shot Learning General Classification

Paper
Add Code

DAGCN: Dual Attention Graph Convolutional Networks

1 code implementation • 4 Apr 2019 • Fengwen Chen, Shirui Pan, Jing Jiang, Huan Huo, Guodong Long

In this paper, we propose a novel framework called, dual attention graph convolutional networks (DAGCN) to address these problems.

Ranked #25 on Graph Classification on NCI1

General Classification Graph Classification +1

Paper
Code

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning

no code implementations • NAACL 2019 • Shuohang Wang, Sheng Zhang, Yelong Shen, Xiaodong Liu, Jingjing Liu, Jianfeng Gao, Jing Jiang

Commonsense reasoning is fundamental to natural language understanding.

Ranked #2 on Natural Language Understanding on PDP60

Common Sense Reasoning Coreference Resolution +2

Paper
Add Code

Learning Graph Embedding with Adversarial Training Methods

no code implementations • 4 Jan 2019 • Shirui Pan, Ruiqi Hu, Sai-fu Fung, Guodong Long, Jing Jiang, Chengqi Zhang

Based on this framework, we derive two variants of adversarial models, the adversarially regularized graph autoencoder (ARGA) and its variational version, adversarially regularized variational graph autoencoder (ARVGA), to learn the graph embedding effectively.

Ranked #7 on Node Clustering on Cora

Clustering Graph Clustering +3

Paper
Add Code

Learning Private Neural Language Modeling with Attentive Aggregation

4 code implementations • 17 Dec 2018 • Shaoxiong Ji, Shirui Pan, Guodong Long, Xue Li, Jing Jiang, Zi Huang

Federated learning (FL) provides a promising approach to learning private language modeling for intelligent personalized keyboard suggestion by training models in distributed clients rather than training in a central server.

Federated Learning Language Modelling

1,142

Paper
Code

Global Inference for Aspect and Opinion Terms Co-Extraction Based on Multi-Task Neural Networks

no code implementations • IEEE 2018 • Jianfei Yu, Jing Jiang, Rui Xia

However, most existing methods fail to explicitly consider the syntactic relations among aspect terms and opinion terms, which may lead to the inconsistencies between the model predictions and the syntactic constraints.

Aspect Term Extraction and Sentiment Classification Multi-Task Learning +2

Paper
Add Code

Improving Multi-label Emotion Classification via Sentiment Classification with Dual Attention Transfer Network

no code implementations • EMNLP 2018 • Jianfei Yu, Lu{\'\i}s Marujo, Jing Jiang, Pradeep Karuturi, William Brendel

In this paper, we target at improving the performance of multi-label emotion classification with the help of sentiment classification.

Classification Emotion Classification +7

Paper
Add Code

Did you take the pill? - Detecting Personal Intake of Medicine from Twitter

no code implementations • 3 Aug 2018 • Debanjan Mahata, Jasper Friedrichs, Rajiv Ratn Shah, Jing Jiang

We believe that the developed classifier has direct uses in the areas of psychology, health informatics, pharmacovigilance and affective computing for tracking moods, emotions and sentiments of patients expressing intake of medicine in social media.

Paper
Add Code

Embedding WordNet Knowledge for Textual Entailment

no code implementations • COLING 2018 • Yunshi Lan, Jing Jiang

In this paper, we study how we can improve a deep learning approach to textual entailment by incorporating lexical entailment relations from WordNet.

Feature Engineering Lexical Entailment +1

Paper
Add Code

A Co-Matching Model for Multi-choice Reading Comprehension

1 code implementation • ACL 2018 • Shuohang Wang, Mo Yu, Shiyu Chang, Jing Jiang

Multi-choice reading comprehension is a challenging task, which involves the matching between a passage and a question-answer pair.

Reading Comprehension

Paper
Code

Tensorized Self-Attention: Efficiently Modeling Pairwise and Global Dependencies Together

2 code implementations • NAACL 2019 • Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

Neural networks equipped with self-attention have parallelizable computation, light-weight structure, and the ability to capture both long-range and local dependencies.

313

Paper
Code

Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling

1 code implementation • ICLR 2018 • Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

In this paper, we propose a model, called "bi-directional block self-attention network (Bi-BloSAN)", for RNN/CNN-free sequence encoding.

124

Paper
Code

Adversarially Regularized Graph Autoencoder for Graph Embedding

4 code implementations • 13 Feb 2018 • Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, Lina Yao, Chengqi Zhang

Graph embedding is an effective method to represent graph data in a low dimensional space for graph analytics.

Ranked #5 on Link Prediction on Pubmed

Clustering Graph Clustering +2

183

Paper
Code

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

1 code implementation • 31 Jan 2018 • Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Sen Wang, Chengqi Zhang

In this paper, we integrate both soft and hard attention into one context fusion model, "reinforced self-attention (ReSA)", for the mutual benefit of each other.

Ranked #56 on Natural Language Inference on SNLI

Hard Attention Natural Language Inference +1

313

Paper
Code

Modelling Domain Relationships for Transfer Learning on Retrieval-based Question Answering Systems in E-commerce

1 code implementation • 23 Nov 2017 • Jianfei Yu, Minghui Qiu, Jing Jiang, Jun Huang, Shuangyong Song, Wei Chu, Haiqing Chen

In this paper, we study transfer learning for the PI and NLI problems, aiming to propose a general framework, which can effectively and efficiently adapt the shared knowledge learned from a resource-rich source domain to a resource- poor target domain.

Chatbot Natural Language Inference +5

Paper
Code

Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering

1 code implementation • ICLR 2018 • Shuohang Wang, Mo Yu, Jing Jiang, Wei zhang, Xiaoxiao Guo, Shiyu Chang, Zhiguo Wang, Tim Klinger, Gerald Tesauro, Murray Campbell

We propose two methods, namely, strength-based re-ranking and coverage-based re-ranking, to make use of the aggregated evidence from different passages to better determine the answer.

Ranked #1 on Open-Domain Question Answering on Quasar

Open-Domain Question Answering Reading Comprehension +2

Paper
Code

Marginalized graph autoencoder for graph clustering

no code implementations • CIKM '17 Proceedings of the 2017 ACM on Conference on Information and Knowledge Management 2017 • Chun Wang, Shirui Pan, Guodong Long, Xingquan Zhu, Jing Jiang

In this paper, we propose a novel marginalized graph autoencoder (MGAE) algorithm for graph clustering.

Clustering Graph Clustering +1

Paper
Add Code

Leveraging Auxiliary Tasks for Document-Level Cross-Domain Sentiment Classification

no code implementations • IJCNLP 2017 • Jianfei Yu, Jing Jiang

In this paper, we study domain adaptation with a state-of-the-art hierarchical neural network for document-level sentiment classification.

Classification Denoising +7

Paper
Add Code

DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding

3 code implementations • 14 Sep 2017 • Tao Shen, Tianyi Zhou, Guodong Long, Jing Jiang, Shirui Pan, Chengqi Zhang

Recurrent neural nets (RNN) and convolutional neural nets (CNN) are widely used on NLP tasks to capture the long-term and local dependencies, respectively.

Ranked #68 on Natural Language Inference on SNLI

Natural Language Inference Sentence +2

313

Paper
Code

R$^3$: Reinforced Reader-Ranker for Open-Domain Question Answering

1 code implementation • 31 Aug 2017 • Shuohang Wang, Mo Yu, Xiaoxiao Guo, Zhiguo Wang, Tim Klinger, Wei zhang, Shiyu Chang, Gerald Tesauro, Bo-Wen Zhou, Jing Jiang

Second, we propose a novel method that jointly trains the Ranker along with an answer-generation Reader model, based on reinforcement learning.

Ranked #4 on Open-Domain Question Answering on Quasar

Answer Generation Information Retrieval +3

Paper
Code

Can Syntax Help? Improving an LSTM-based Sentence Compression Model for New Domains

no code implementations • ACL 2017 • Liangguo Wang, Jing Jiang, Hai Leong Chieu, Chen Hui Ong, D. Song, an, Lejian Liao

In this paper, we study how to improve the domain adaptability of a deletion-based Long Short-Term Memory (LSTM) neural network model for sentence compression.

Ranked #6 on Sentence Compression on Google Dataset