Search Results for author: Lu Chen

Found 106 papers, 43 papers with code

Glyph Enhanced Chinese Character Pre-Training for Lexical Sememe Prediction

1 code implementation • Findings (EMNLP) 2021 • Boer Lyu, Lu Chen, Kai Yu

Sememes are defined as the atomic units to describe the semantic meaning of concepts.

Paper
Code

Towards Reliable and Empathetic Depression-Diagnosis-Oriented Chats

no code implementations • 7 Apr 2024 • Kunyao Lan, Cong Ming, Binwei Yao, Lu Chen, Mengyue Wu

Nevertheless, the blend of task-oriented and chit-chat in diagnosis-related dialogues necessitates professional expertise and empathy.

Paper
Add Code

Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind

2 code implementations • 6 Apr 2024 • Hongchuan Zeng, Hongshen Xu, Lu Chen, Kai Yu

MBS overcomes the English-centric limitations of existing methods by sampling calibration data from various languages proportionally to the language distribution of the model training datasets.

Model Compression

144

Paper
Code

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

no code implementations • 27 Mar 2024 • Hongshen Xu, Zichen Zhu, Situo Zhang, Da Ma, Shuai Fan, Lu Chen, Kai Yu

Large Language Models (LLMs) often generate erroneous outputs, known as hallucinations, due to their limitations in discerning questions beyond their knowledge scope.

Hallucination

Paper
Add Code

ChatCite: LLM Agent with Human Workflow Guidance for Comparative Literature Summary

no code implementations • 5 Mar 2024 • Yutong Li, Lu Chen, Aiwei Liu, Kai Yu, Lijie Wen

In this work, we firstly focus on the independent literature summarization step and introduce ChatCite, an LLM agent with human workflow guidance for comparative literature summary.

Retrieval

Paper
Add Code

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

1 code implementation • 28 Feb 2024 • Hongshen Xu, Ruisheng Cao, Su Zhu, Sheng Jiang, Hanchong Zhang, Lu Chen, Kai Yu

Previous work on spoken language understanding (SLU) mainly focuses on single-intent settings, where each input utterance merely contains one user intent.

Graph Attention Spoken Language Understanding

Paper
Code

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

1 code implementation • 28 Feb 2024 • Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu

Additionally, we propose several pre-training tasks to model the interaction among text, structure, and image modalities effectively.

document understanding Information Retrieval +1

Paper
Code

Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution

no code implementations • 18 Feb 2024 • Nuo Xu, Jun Zhao, Can Zu, Sixian Li, Lu Chen, Zhihao Zhang, Rui Zheng, Shihan Dou, Wenjuan Qin, Tao Gui, Qi Zhang, Xuanjing Huang

To address this issue, we propose a cost-effective preference learning strategy, optimizing reward models by distinguishing between human and machine translations.

Machine Translation Translation

Paper
Add Code

A Unified Causal View of Instruction Tuning

no code implementations • 9 Feb 2024 • Lu Chen, Wei Huang, Ruqing Zhang, Wei Chen, Jiafeng Guo, Xueqi Cheng

The key idea is to learn task-required causal factors and only use those to make predictions for a given task.

Paper
Add Code

MULTI: Multimodal Understanding Leaderboard with Text and Images

no code implementations • 5 Feb 2024 • Zichen Zhu, Yang Xu, Lu Chen, Jingkai Yang, Yichuan Ma, Yiming Sun, Hailin Wen, Jiaqi Liu, Jinyu Cai, Yingzi Ma, Situo Zhang, Zihan Zhao, Liangtai Sun, Kai Yu

Rapid progress in multimodal large language models (MLLMs) highlights the need to introduce challenging yet realistic benchmarks to the academic community, while existing benchmarks primarily focus on understanding simple natural images and short context.

In-Context Learning

Paper
Add Code

Seeing is not always believing: The Space of Harmless Perturbations

no code implementations • 3 Feb 2024 • Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo

In the context of deep neural networks, we expose the existence of a harmless perturbation space, where perturbations leave the network output entirely unaltered.

Privacy Preserving

Paper
Add Code

A Comprehensive Survey on 3D Content Generation

1 code implementation • 2 Feb 2024 • Jian Liu, Xiaoshui Huang, Tianyu Huang, Lu Chen, Yuenan Hou, Shixiang Tang, Ziwei Liu, Wanli Ouyang, WangMeng Zuo, Junjun Jiang, Xianming Liu

Recent years have witnessed remarkable advances in artificial intelligence generated content(AIGC), with diverse input modalities, e. g., text, image, video, audio and 3D.

367

Paper
Code

MouSi: Poly-Visual-Expert Vision-Language Models

1 code implementation • 30 Jan 2024 • Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

This technique introduces a fusion network to unify the processing of outputs from different visual experts, while bridging the gap between image encoders and pre-trained LLMs.

Ranked #40 on Visual Question Answering on MM-Vet

Image Segmentation Image-text matching +4

Paper
Code

Defining and Extracting generalizable interaction primitives from DNNs

no code implementations • 29 Jan 2024 • Lu Chen, Siyu Lou, Benhao Huang, Quanshi Zhang

Faithfully summarizing the knowledge encoded by a deep neural network (DNN) into a few symbolic primitive patterns without losing much information represents a core challenge in explainable AI.

Paper
Add Code

ChemDFM: Dialogue Foundation Model for Chemistry

no code implementations • 26 Jan 2024 • Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li, Hongshen Xu, Zichen Zhu, Su Zhu, Shuai Fan, Guodong Shen, Xin Chen, Kai Yu

To this end, we develop ChemDFM, the first LLM towards CGI.

Paper
Add Code

FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis

no code implementations • 19 Jan 2024 • Chao Zhang, YUREN MAO, Yijiang Fan, Yu Mi, Yunjun Gao, Lu Chen, Dongfang Lou, Jinshu Lin

Text-to-SQL, which provides zero-code interface for operating relational databases, has gained much attention in financial analysis; because, financial professionals may not well-skilled in SQL programming.

Language Modelling Large Language Model +1

Paper
Add Code

Secrets of RLHF in Large Language Models Part II: Reward Modeling

1 code implementation • 11 Jan 2024 • Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang

We introduce a series of novel methods to mitigate the influence of incorrect and ambiguous preferences in the dataset and fully leverage high-quality preference data.

Contrastive Learning Meta-Learning +1

1,160

Paper
Code

Text Classification Based on Knowledge Graphs and Improved Attention Mechanism

no code implementations • 7 Jan 2024 • Siyu Li, Lu Chen, Chenwei Song, Xinyi Liu

To resolve the semantic ambiguity in texts, we propose a model, which innovatively combines a knowledge graph with an improved attention mechanism.

Knowledge Graphs text-classification +1

Paper
Add Code

MUST: An Effective and Scalable Framework for Multimodal Search of Target Modality

1 code implementation • 11 Dec 2023 • Mengzhao Wang, Xiangyu Ke, Xiaoliang Xu, Lu Chen, Yunjun Gao, Pinpin Huang, Runkai Zhu

We investigate the problem of multimodal search of target modality, where the task involves enhancing a query in a specific target modality by integrating information from auxiliary modalities.

Information Retrieval

Paper
Code

Dynamic Fault Analysis in Substations Based on Knowledge Graphs

no code implementations • 22 Nov 2023 • Weiwei Li, Xing Liu, Wei Wang, Lu Chen, Sizhe Li, Hui Fan

To address the challenge of identifying hidden danger in substations from unstructured text, a novel dynamic analysis method is proposed.

Knowledge Graphs

Paper
Add Code

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL

no code implementations • 28 Oct 2023 • Ruisheng Cao, Hanchong Zhang, Hongshen Xu, Jieyu Li, Da Ma, Lu Chen, Kai Yu

Text-to-SQL aims to generate an executable SQL program given the user utterance and the corresponding database schema.

Text-To-SQL

Paper
Add Code

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought

1 code implementation • 26 Oct 2023 • Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu

Recently Large Language Models (LLMs) have been proven to have strong abilities in various domains and tasks.

In-Context Learning Text-To-SQL

Paper
Code

SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research

1 code implementation • 25 Aug 2023 • Liangtai Sun, Yang Han, Zihan Zhao, Da Ma, Zhennan Shen, Baocai Chen, Lu Chen, Kai Yu

This design suffers from data leakage problem and lacks the evaluation of subjective Q/A ability.

Language Modelling Large Language Model

Paper
Code

Inducing Causal Structure for Abstractive Text Summarization

1 code implementation • 24 Aug 2023 • Lu Chen, Ruqing Zhang, Wei Huang, Wei Chen, Jiafeng Guo, Xueqi Cheng

The key idea is to reformulate the Variational Auto-encoder (VAE) to fit the joint distribution of the document and summary variables from the training corpus.

Abstractive Text Summarization

Paper
Code

MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching

1 code implementation • 2 Aug 2023 • Xiaocan Zeng, Pengfei Wang, YUREN MAO, Lu Chen, Xiaoze Liu, Yunjun Gao

Traditional unsupervised EM assumes that all entities come from two tables; however, it is more common to match entities from multiple tables in practical applications, that is, multi-table entity matching (multi-table EM).

Management

Paper
Code

C3: Zero-shot Text-to-SQL with ChatGPT

1 code implementation • 14 Jul 2023 • XueMei Dong, Chao Zhang, Yuhang Ge, YUREN MAO, Yunjun Gao, Lu Chen, Jinshu Lin, Dongfang Lou

This paper proposes a ChatGPT-based zero-shot Text-to-SQL method, dubbed C3, which achieves 82. 3\% in terms of execution accuracy on the holdout test set of Spider and becomes the state-of-the-art zero-shot Text-to-SQL method on the Spider Challenge.

Ranked #4 on Text-To-SQL on spider

Text-To-SQL

Paper
Code

Secrets of RLHF in Large Language Models Part I: PPO

1 code implementation • 11 Jul 2023 • Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang

Therefore, we explore the PPO-max, an advanced version of PPO algorithm, to efficiently improve the training stability of the policy model.

1,160

Paper
Code

Real-time Workload Pattern Analysis for Large-scale Cloud Databases

no code implementations • 5 Jul 2023 • Jiaqi Wang, Tianyi Li, Anni Wang, Xiaoze Liu, Lu Chen, Jie Chen, Jianye Liu, Junyang Wu, Feifei Li, Yunjun Gao

This has led to the increasing volume of database workloads, which provides the opportunity for pattern analysis.

Paper
Add Code

MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators

no code implementations • 19 Jun 2023 • Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, Nenghai Yu, Wanli Ouyang

Generating realistic human motion from given action descriptions has experienced significant advancements because of the emerging requirement of digital humans.

Paper
Add Code

Large Language Models Are Semi-Parametric Reinforcement Learning Agents

1 code implementation • NeurIPS 2023 • Danyang Zhang, Lu Chen, Situo Zhang, Hongshen Xu, Zihan Zhao, Kai Yu

By equipping the LLM with a long-term experience memory, REMEMBERER is capable of exploiting the experiences from the past episodes even for different task goals, which excels an LLM-based agent with fixed exemplars or equipped with a transient working memory.

Language Modelling Large Language Model +1

Paper
Code

CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset

1 code implementation • 25 May 2023 • Hanchong Zhang, Jieyu Li, Lu Chen, Ruisheng Cao, Yunyan Zhang, Yu Huang, Yefeng Zheng, Kai Yu

Furthermore, we present CSS, a large-scale CrosS-Schema Chinese text-to-SQL dataset, to carry on corresponding studies.

Benchmarking Text-To-SQL

Paper
Code

Mobile-Env: An Evaluation Platform and Benchmark for LLM-GUI Interaction

1 code implementation • 14 May 2023 • Danyang Zhang, Hongshen Xu, Zihan Zhao, Lu Chen, Ruisheng Cao, Kai Yu

A GUI task set based on WikiHow app is collected on Mobile-Env to form a benchmark covering a range of GUI interaction capabilities.

Language Modelling

Paper
Code

Knowledge-refined Denoising Network for Robust Recommendation

1 code implementation • 28 Apr 2023 • Xinjun Zhu, Yuntao Du, YUREN MAO, Lu Chen, Yujia Hu, Yunjun Gao

Knowledge graph (KG), which contains rich side information, becomes an essential part to boost the recommendation performance and improve its explainability.

Denoising Knowledge-Aware Recommendation +1

Paper
Code

Towards Explainable Collaborative Filtering with Taste Clusters Learning

1 code implementation • 27 Apr 2023 • Yuntao Du, Jianxun Lian, Jing Yao, Xiting Wang, Mingqi Wu, Lu Chen, Yunjun Gao, Xing Xie

In recent decades, there have been significant advancements in latent embedding-based CF methods for improved accuracy, such as matrix factorization, neural collaborative filtering, and LightGCN.

Collaborative Filtering Decision Making +3

Paper
Code

SEA: A Scalable Entity Alignment System

1 code implementation • 14 Apr 2023 • Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao, Ziheng Wei

To enhance the usability of GNN-based EA models in real-world applications, we present SEA, a scalable entity alignment system that enables to (i) train large-scale GNNs for EA, (ii) speed up the normalization and the evaluation process, and (iii) report clear results for users to estimate different models and parameter settings.

Entity Alignment Knowledge Graphs

Paper
Code

HarsanyiNet: Computing Accurate Shapley Values in a Single Forward Propagation

1 code implementation • 4 Apr 2023 • Lu Chen, Siyu Lou, Keyan Zhang, Jin Huang, Quanshi Zhang

The HarsanyiNet is designed on the theoretical foundation that the Shapley value can be reformulated as the redistribution of Harsanyi interactions encoded by the network.

Paper
Code

SparDL: Distributed Deep Learning Training with Efficient Sparse Communication

no code implementations • 3 Apr 2023 • Minjun Zhao, Yichen Yin, YUREN MAO, Qing Liu, Lu Chen, Yunjun Gao

Recently, a few methods have been put forward to handle the SGA dilemma.

Paper
Add Code

Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation

no code implementations • 3 Feb 2023 • Ronghao Dang, Lu Chen, Liuyi Wang, Zongtao He, Chengju Liu, Qijun Chen

We propose a meta-ability decoupling (MAD) paradigm, which brings together various object navigation methods in an architecture system, allowing them to mutually enhance each other and evolve together.

Object

Paper
Add Code

Unsupervised Entity Alignment for Temporal Knowledge Graphs

1 code implementation • 1 Feb 2023 • Xiaoze Liu, Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao

State-of-the-art time-aware EA studies have suggested that the temporal information of TKGs facilitates the performance of EA.

Ranked #1 on Entity Alignment on YAGO-WIKI50K

Entity Alignment Graph Matching +1

Paper
Code

On the Structural Generalization in Text-to-SQL

no code implementations • 12 Jan 2023 • Jieyu Li, Lu Chen, Ruisheng Cao, Su Zhu, Hongshen Xu, Zhi Chen, Hanchong Zhang, Kai Yu

Exploring the generalization of a text-to-SQL parser is essential for a system to automatically adapt the real-world databases.

Text-To-SQL

Paper
Add Code

Estimator: An Effective and Scalable Framework for Transportation Mode Classification over Trajectories

no code implementations • 11 Dec 2022 • Danlei Hu, Ziquan Fang, Hanxi Fang, Tianyi Li, Chunhui Shen, Lu Chen, Yunjun Gao

Transportation mode classification, the process of predicting the class labels of moving objects transportation modes, has been widely applied to a variety of real world applications, such as traffic management, urban computing, and behavior study.

Classification Management

Paper
Add Code

OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue

no code implementations • 10 Sep 2022 • Zhi Chen, Yuncong Liu, Lu Chen, Su Zhu, Mengyue Wu, Kai Yu

The second phase is to fine-tune the pretrained model on the TOD data.

Language Modelling Text Generation

Paper
Add Code

DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images

1 code implementation • 15 Aug 2022 • Bing Wang, Lu Chen, Bo Yang

In this paper, we study the problem of 3D scene geometry decomposition and manipulation from 2D views.

Object

243

Paper
Code

DFM: Dialogue Foundation Model for Universal Large-Scale Dialogue-Oriented Task Learning

no code implementations • 25 May 2022 • Zhi Chen, Jijia Bao, Lu Chen, Yuncong Liu, Da Ma, Bei Chen, Mengyue Wu, Su Zhu, Xin Dong, Fujiang Ge, Qingliang Miao, Jian-Guang Lou, Kai Yu

In this work, we aim to build a unified dialogue foundation model (DFM) which can be used to solve massive diverse dialogue tasks.

Dialogue Generation Knowledge Distillation

Paper
Add Code

D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat

no code implementations • 24 May 2022 • Binwei Yao, Chao Shi, Likai Zou, Lingfeng Dai, Mengyue Wu, Lu Chen, Zhen Wang, Kai Yu

In a depression-diagnosis-directed clinical session, doctors initiate a conversation with ample emotional support that guides the patients to expose their symptoms based on clinical diagnosis criteria.

Response Generation

Paper
Add Code

META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI

no code implementations • 23 May 2022 • Liangtai Sun, Xingyu Chen, Lu Chen, Tianle Dai, Zichen Zhu, Kai Yu

However, this API-based architecture greatly limits the information-searching capability of intelligent assistants and may even lead to task failure if TOD-specific APIs are not available or the task is too complicated to be executed by the provided APIs.

Scheduling

Paper
Add Code

ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities

2 code implementations • 20 May 2022 • Yunjun Gao, Xiaoze Liu, Junyang Wu, Tianyi Li, Pengfei Wang, Lu Chen

To tackle this challenge, we present ClusterEA, a general framework that is capable of scaling up EA models and enhancing their results by leveraging normalization methods on mini-batches with a high entity equivalent rate.

Ranked #2 on Entity Alignment on DBP1M DE-EN

Entity Alignment Entity Embeddings +1

Paper
Code

TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages

1 code implementation • NAACL 2022 • Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu

Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests.

Graph Attention Language Modelling +2

Paper
Code

Self-Guided Learning to Denoise for Robust Recommendation

2 code implementations • 14 Apr 2022 • Yunjun Gao, Yuntao Du, Yujia Hu, Lu Chen, Xinjun Zhu, Ziquan Fang, Baihua Zheng

Besides, our method can automatically switch its learning phase at the memorization point from memorization to self-guided learning, and select clean and informative memorized data via a novel adaptive denoising scheduler to improve the robustness.

Denoising Memorization +2

Paper
Code

HAKG: Hierarchy-Aware Knowledge Gated Network for Recommendation

1 code implementation • 11 Apr 2022 • Yuntao Du, Xinjun Zhu, Lu Chen, Baihua Zheng, Yunjun Gao

Furthermore, we propose a dual item embeddings design to represent and propagate collaborative signals and knowledge associations separately, and leverage the gated aggregation to distill discriminative information for better capturing user behavior patterns.

Ranked #1 on Recommendation Systems on Alibaba-iFashion

Knowledge-Aware Recommendation

Paper
Code

UniDU: Towards A Unified Generative Dialogue Understanding Framework

no code implementations • SIGDIAL (ACL) 2022 • Zhi Chen, Lu Chen, Bei Chen, Libo Qin, Yuncong Liu, Su Zhu, Jian-Guang Lou, Kai Yu

With the development of pre-trained language models, remarkable success has been witnessed in dialogue understanding (DU).

Dialogue State Tracking dialogue summary +3

Paper
Add Code

MetaKG: Meta-learning on Knowledge Graph for Cold-start Recommendation

1 code implementation • 8 Feb 2022 • Yuntao Du, Xinjun Zhu, Lu Chen, Ziquan Fang, Yunjun Gao

Inspired by the success of meta-learning on scarce training samples, we propose a novel meta-learning based framework called MetaKG, which encompasses a collaborative-aware meta learner and a knowledge-aware meta learner, to capture meta users' preference and entities' knowledge for cold-start recommendations.

Meta-Learning

Paper
Code

Linear Array Network for Low-light Image Enhancement

no code implementations • 22 Jan 2022 • Keqi Wang, Ziteng Cui, Jieru Jia, Hao Xu, Ge Wu, Yin Zhuang, Lu Chen, Zhiguo Hu, Yuhua Qian

However, the convolution operation is based on a local sliding window mechanism, which is difficult to construct the long-range dependencies of the feature maps.

Low-Light Image Enhancement

Paper
Add Code

RFormer: Transformer-based Generative Adversarial Network for Real Fundus Image Restoration on A New Clinical Benchmark

1 code implementation • 3 Jan 2022 • Zhuo Deng, Yuanhao Cai, Lu Chen, Zheng Gong, Qiqi Bao, Xue Yao, Dong Fang, Shaochong Zhang, Lan Ma

In this paper, we investigate the real clinical fundus image restoration problem.

Generative Adversarial Network Image Restoration

Paper
Code

Deep Spatially and Temporally Aware Similarity Computation for Road Network Constrained Trajectories

1 code implementation • 17 Dec 2021 • Ziquan Fang, Yuntao Du, Xinjun Zhu, Lu Chen, Yunjun Gao, Christian S. Jensen

Trajectory similarity computation has drawn massive attention, as it is core functionality in a wide range of applications such as ride-sharing, traffic analysis, and social recommendation.

Representation Learning

Paper
Code

Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF

no code implementations • 9 Dec 2021 • Su Zhu, Lu Chen, Ruisheng Cao, Zhi Chen, Qingliang Miao, Kai Yu

In this paper, we propose to improve prototypical networks with vector projection distance and abstract triangular Conditional Random Field (CRF) for the few-shot NLU.

intent-classification Intent Classification +5

Paper
Add Code

FastSGD: A Fast Compressed SGD Framework for Distributed Machine Learning

no code implementations • 8 Dec 2021 • Keyu Yang, Lu Chen, Zhihao Zeng, Yunjun Gao

Distributed ML models trained by SGD involve large amounts of gradient communication, which limits the scalability of distributed ML.

BIG-bench Machine Learning Quantization

Paper
Add Code

Towards a Unified Game-Theoretic View of Adversarial Perturbations and Robustness

1 code implementation • NeurIPS 2021 • Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi Zhang

This paper provides a unified view to explain different adversarial attacks and defense methods, i. e. the view of multi-order interactions between input variables of DNNs.

Adversarial Robustness

Paper
Code

Machine Learning-Based Soft Sensors for Vacuum Distillation Unit

no code implementations • 19 Nov 2021 • Kamil Oster, Stefan Güttel, Lu Chen, Jonathan L. Shapiro, Megan Jobson

Firstly, it is important to enhance the quality of both sets of data (laboratory measurements and physical sensors) in a data pre-processing stage (as described in Methodology section).

BIG-bench Machine Learning Chemical Process

Paper
Add Code

A Unified Game-Theoretic Interpretation of Adversarial Robustness

1 code implementation • 5 Nov 2021 • Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi Zhang

This paper provides a unified view to explain different adversarial attacks and defense methods, \emph{i. e.} the view of multi-order interactions between input variables of DNNs.

Adversarial Robustness

Paper
Code

Finding Materialized Models for Model Reuse

1 code implementation • 13 Oct 2021 • Minjun Zhao, Lu Chen, Keyu Yang, Yuntao Du, Yunjun Gao

It uses a Gaussian mixture-based metric called separation degree to rank materialized models.

Model Selection Transfer Learning

Paper
Code

Dissecting Local Properties of Adversarial Examples

no code implementations • 29 Sep 2021 • Lu Chen, Renjie Chen, Hang Guo, Yuan Luo, Quanshi Zhang, Yisen Wang

Adversarial examples have attracted significant attention over the years, yet a sufficient understanding is in lack, especially when analyzing their performances in combination with adversarial training.

Adversarial Robustness

Paper
Add Code

Copy-Move Image Forgery Detection Based on Evolving Circular Domains Coverage

no code implementations • 9 Sep 2021 • Shilin Lu, Xinghong Hu, Chengyou Wang, Lu Chen, Shulu Han, Yuejia Han

The aim of this paper is to improve the accuracy of copy-move forgery detection (CMFD) in image forensics by proposing a novel scheme and the main contribution is evolving circular domains coverage (ECDC) algorithm.

Image Forensics Image Forgery Detection

Paper
Add Code

Pre-treatment of outliers and anomalies in plant data: Methodology and case study of a Vacuum Distillation Unit

no code implementations • 17 Jun 2021 • Kamil Oster, Stefan Güttel, Jonathan L. Shapiro, Lu Chen, Megan Jobson

In this case, we used principal component analysis (PCA) with Hotelling's $T^2$ statistics to identify the long-term outliers.

Time Series Analysis

Paper
Add Code

Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL

no code implementations • Findings (ACL) 2021 • Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu

A dual learning approach is also proposed for the utterance rewrite model to address the data sparsity problem.

Semantic Parsing SQL Parsing +1

Paper
Add Code

LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations

1 code implementation • ACL 2021 • Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu, Kai Yu

This work aims to tackle the challenging heterogeneous graph encoding problem in the text-to-SQL task.

Text-To-SQL

143

Paper
Code

ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser

no code implementations • NAACL 2021 • Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, Kai Yu

Given a database schema, Text-to-SQL aims to translate a natural language question into the corresponding SQL query.

Semantic Parsing Text-To-SQL

Paper
Add Code

A Unified Game-Theoretic Interpretation of Adversarial Robustness

1 code implementation • 12 Mar 2021 • Jie Ren, Die Zhang, Yisen Wang, Lu Chen, Zhanpeng Zhou, Yiting Chen, Xu Cheng, Xin Wang, Meng Zhou, Jie Shi, Quanshi Zhang

This paper provides a unified view to explain different adversarial attacks and defense methods, i. e. the view of multi-order interactions between input variables of DNNs.

Adversarial Robustness

Paper
Code

LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching

1 code implementation • 25 Feb 2021 • Boer Lyu, Lu Chen, Su Zhu, Kai Yu

Additionally, we adopt the word lattice graph as input to maintain multi-granularity information.

Text Matching

Paper
Code

WebSRC: A Dataset for Web-Based Structural Reading Comprehension

1 code implementation • EMNLP 2021 • Xingyu Chen, Zihan Zhao, Lu Chen, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu

In this paper, we introduce the task of structural reading comprehension (SRC) on web.

Reading Comprehension

Paper
Code

Comparison and Improvement for Delay Analysis Approaches: Theoretical Models and Experimental Tests

no code implementations • 21 Jan 2021 • Yue Hong Gao, Xiao Hong, Hao Tian Yang, Lu Chen, Xiao Nan Zhang

The test results are compared with the theoretical results, analyzed and corrected, in order to verify the feasibility of our analysis model for the performance analysis of the actual network.

Networking and Internet Architecture Performance

Paper
Add Code

Searching Personalized $k$-wing in Large and Dynamic Bipartite Graphs

no code implementations • 4 Jan 2021 • Aman Abidi, Lu Chen, Rui Zhou, Chengfei Liu

By exploiting the discoveries, we propose novel algorithms for maintaining the two indices, which substantially reduces the cost of maintenance.

Paper
Add Code

FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) Systems

1 code implementation • 15 Dec 2020 • Lu Chen, Jiao Sun, Wei Xu

In both letter-level and word-level attacks, our experiments show that in addition to natural appearance, FAWA achieves a 100% attack success rate with 60% less perturbations and 78% fewer iterations on average.

Optical Character Recognition Optical Character Recognition (OCR)

Paper
Code

An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models

no code implementations • 14 Oct 2020 • Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu

Recently, pre-trained language models like BERT have shown promising performance on multiple natural language processing tasks.

Clustering Quantization

Paper
Add Code

SOUP: Spatial-Temporal Demand Forecasting and Competitive Supply

no code implementations • 24 Sep 2020 • Bolong Zheng, Qi Hu, Lingfeng Ming, Jilin Hu, Lu Chen, Kai Zheng, Christian S. Jensen

In this setting, an assignment authority is to assign agents to requests such that the average idle time of the agents is minimized.

Databases Signal Processing

Paper
Add Code

Structured Hierarchical Dialogue Policy with Graph Neural Networks

no code implementations • 22 Sep 2020 • Zhi Chen, Xiaoyuan Liu, Lu Chen, Kai Yu

A novel ComNet is proposed to model the structure of a hierarchical agent.

Paper
Add Code

Deep Reinforcement Learning for On-line Dialogue State Tracking

no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Xiang Zhou, Kai Yu

To the best of our knowledge, this is the first effort to optimize the DST module within DRL framework for on-line task-oriented spoken dialogue systems.

Dialogue Management Dialogue State Tracking +4

Paper
Add Code

CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking

no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Zihan Xu, Yanbin Zhao, Su Zhu, Kai Yu

In dialogue systems, a dialogue state tracker aims to accurately find a compact representation of the current dialogue status, based on the entire dialogue history.

Dialogue State Tracking

Paper
Add Code

Dual Learning for Dialogue State Tracking

no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Yanbin Zhao, Su Zhu, Kai Yu

In task-oriented multi-turn dialogue systems, dialogue state refers to a compact representation of the user goal in the context of dialogue history.

Dialogue State Tracking Sentence

Paper
Add Code

Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management

no code implementations • 22 Sep 2020 • Zhi Chen, Lu Chen, Xiaoyuan Liu, Kai Yu

The task-oriented spoken dialogue system (SDS) aims to assist a human user in accomplishing a specific task (e. g., hotel booking).

Decision Making Dialogue Management +3

Paper
Add Code

Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding

1 code implementation • 21 Sep 2020 • Su Zhu, Ruisheng Cao, Lu Chen, Kai Yu

Few-shot slot tagging becomes appealing for rapid domain transfer and adaptation, motivated by the tremendous development of conversational dialogue systems.

Few-Shot Learning Natural Language Understanding +2

Paper
Code

Robust Spoken Language Understanding with RL-based Value Error Recovery

no code implementations • 7 Sep 2020 • Chen Liu, Su Zhu, Lu Chen, Kai Yu

The framework consists of a slot tagging model and a rule-based value error recovery module.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Neural Graph Matching Networks for Chinese Short Text Matching

no code implementations • ACL 2020 • Lu Chen, Yanbin Zhao, Boer Lyu, Lesheng Jin, Zhi Chen, Su Zhu, Kai Yu

Chinese short text matching usually employs word sequences rather than character sequences to get better performance.

Chinese Word Segmentation Graph Matching +3

Paper
Add Code

Line Graph Enhanced AMR-to-Text Generation with Mix-Order Graph Attention Networks

no code implementations • ACL 2020 • Yanbin Zhao, Lu Chen, Zhi Chen, Ruisheng Cao, Su Zhu, Kai Yu

We also adopt graph attention networks with higher-order neighborhood information to encode the rich structure in AMR graphs.

AMR-to-Text Generation Graph Attention +2

Paper
Add Code

Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing

1 code implementation • ACL 2020 • Ruisheng Cao, Su Zhu, Chenyu Yang, Chen Liu, Rao Ma, Yanbin Zhao, Lu Chen, Kai Yu

One daunting problem for semantic parsing is the scarcity of annotation.

Semantic Parsing Vocal Bursts Valence Prediction

Paper
Code

Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding

1 code implementation • 24 May 2020 • Chen Liu, Su Zhu, Zijian Zhao, Ruisheng Cao, Lu Chen, Kai Yu

In this paper, a novel BERT based SLU model (WCN-BERT SLU) is proposed to encode WCNs and the dialogue context jointly.

Spoken Language Understanding

Paper
Code

Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders

no code implementations • 30 Apr 2020 • Yanbin Zhao, Lu Chen, Zhi Chen, Kai Yu

When modeling simple and complex sentences with autoencoders, we introduce different types of noise into the training process.

Denoising Language Modelling +4

Paper
Add Code

CrowdTSC: Crowd-based Neural Networks for Text Sentiment Classification

no code implementations • 26 Apr 2020 • Keyu Yang, Yunjun Gao, Lei Liang, Song Bian, Lu Chen, Baihua Zheng

We propose Crowd-based neural networks for Text Sentiment Classification (CrowdTSC for short).

Clustering General Classification +4

Paper
Add Code

Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking

no code implementations • Findings of the Association for Computational Linguistics 2020 • Su Zhu, Jieyu Li, Lu Chen, Kai Yu

In this paper, a novel context and schema fusion network is proposed to encode the dialogue context and schema graph by using internal and external attention mechanisms.

Ranked #8 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.0

Dialogue State Tracking Multi-domain Dialogue State Tracking

Paper
Add Code

Schema-Guided Multi-Domain Dialogue State Tracking with Graph Attention Neural Networks

no code implementations • 3 Apr 2020 • Lu Chen, Boer Lv, Chi Wang, Su Zhu, Bowen Tan, Kai Yu

For multi-domain DST, the data sparsity problem is also a major obstacle due to the increased number of state candidates.

Ranked #12 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.1

Dialogue State Tracking Graph Attention +1

Paper
Add Code

Index-based Solutions for Efficient Density Peak Clustering

no code implementations • 8 Feb 2020 • Zafaryab Rasool, Rui Zhou, Lu Chen, Chengfei Liu, Jiajie Xu

Efficient query algorithms are proposed for these indices which significantly avoids irrelevant comparisons at the cost of space.

Clustering

Paper
Add Code

Attacking Optical Character Recognition (OCR) Systems with Adversarial Watermarks

no code implementations • 8 Feb 2020 • Lu Chen, Wei Xu

Optical character recognition (OCR) is widely applied in real applications serving as a key preprocessing tool.

Optical Character Recognition Optical Character Recognition (OCR)

Paper
Add Code

AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning

no code implementations • 27 May 2019 • Lu Chen, Zhi Chen, Bowen Tan, Sishan Long, Milica Gasic, Kai Yu

Experiments show that AgentGraph models significantly outperform traditional reinforcement learning approaches on most of the 18 tasks of the PyDial benchmark.

Dialogue Management Management +4

Paper
Add Code

Recurrent Multi-Graph Neural Networks for Travel Cost Prediction

no code implementations • 13 Nov 2018 • Jilin Hu, Chenjuan Guo, Bin Yang, Christian S. Jensen, Lu Chen

Origin-destination (OD) matrices are often used in urban planning, where a city is partitioned into regions and an element (i, j) in an OD matrix records the cost (e. g., travel time, fuel consumption, or travel speed) from region i to region j.

Paper
Add Code

DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction

1 code implementation • ACL 2019 • Shun Zheng, Xu Han, Yankai Lin, Peilin Yu, Lu Chen, Ling Huang, Zhiyuan Liu, Wei Xu

To demonstrate the effectiveness of DIAG-NRE, we apply it to two real-world datasets and present both significant and interpretable improvements over state-of-the-art methods.

Relation Relation Extraction

Paper
Code

Towards Universal Dialogue State Tracking

1 code implementation • EMNLP 2018 • Liliang Ren, Kaige Xie, Lu Chen, Kai Yu

Dialogue state tracking is the core part of a spoken dialogue system.

Ranked #2 on Dialogue State Tracking on Second dialogue state tracking challenge

Dialogue State Tracking

Paper
Code

Structured Dialogue Policy with Graph Neural Networks

no code implementations • COLING 2018 • Lu Chen, Bowen Tan, Sishan Long, Kai Yu

The proposed structured deep reinforcement learning is based on graph neural networks (GNN), which consists of some sub-networks, each one for a node on a directed graph.

Automatic Speech Recognition (ASR) Decision Making +5

Paper
Add Code

Cost-Sensitive Active Learning for Dialogue State Tracking

no code implementations • WS 2018 • Kaige Xie, Cheng Chang, Liliang Ren, Lu Chen, Kai Yu

Dialogue state tracking (DST), when formulated as a supervised learning problem, relies on labelled data.

Active Learning Dialogue State Tracking

Paper
Add Code

Affordable On-line Dialogue Policy Learning

no code implementations • EMNLP 2017 • Cheng Chang, Runzhe Yang, Lu Chen, Xiang Zhou, Kai Yu

The key to building an evolvable dialogue system in real-world scenarios is to ensure an affordable on-line dialogue policy learning, which requires the on-line learning process to be safe, efficient and economical.

Dialogue Management