Search Results for author: Yue Yu

Found 116 papers, 51 papers with code

A Mandarin-English Code-Switching Corpus

no code implementations • LREC 2012 • Ying Li, Yue Yu, Pascale Fung

Generally the existing monolingual corpora are not suitable for large vocabulary continuous speech recognition (LVCSR) of code-switching speech.

Boundary Detection Language Identification +4

Paper
Add Code

Representing Clinical Diagnostic Criteria in Quality Data Model Using Natural Language Processing

no code implementations • WS 2015 • Na Hong, Dingcheng Li, Yue Yu, Hongfang Liu, Christopher G. Chute, Guoqian Jiang

Decision Making

Paper
Add Code

Fast Stochastic Variance Reduced ADMM for Stochastic Composition Optimization

no code implementations • 11 May 2017 • Yue Yu, Longbo Huang

We consider the stochastic composition optimization problem proposed in \cite{wang2017stochastic}, which has applications ranging from estimation to statistical and machine learning.

BIG-bench Machine Learning

Paper
Add Code

Optimal Cooperative Inference

no code implementations • 24 May 2017 • Scott Cheng-Hsin Yang, Yue Yu, Arash Givchi, Pei Wang, Wai Keen Vong, Patrick Shafto

Cooperative transmission of data fosters rapid accumulation of knowledge by efficiently combining experiences across learners.

BIG-bench Machine Learning

Paper
Add Code

Aurora: Providing Trusted System Services for Enclaves On an Untrusted System

1 code implementation • 10 Feb 2018 • Hongliang Liang, Mingyu Li, Qiong Zhang, Yue Yu, Lin Jiang, Yixiu Chen

Intel SGX provisions shielded executions for security-sensitive computation, but lacks support for trusted system services (TSS), such as clock, network and filesystem.

Cryptography and Security

Paper
Code

Double Quantization for Communication-Efficient Distributed Optimization

no code implementations • NeurIPS 2019 • Yue Yu, Jiaxiang Wu, Longbo Huang

In this paper, to reduce the communication complexity, we propose \emph{double quantization}, a general scheme for quantizing both model parameters and gradients.

Distributed Optimization Quantization

Paper
Add Code

Characterizing Malicious Edges targeting on Graph Neural Networks

no code implementations • 27 Sep 2018 • Xiaojun Xu, Yue Yu, Bo Li, Le Song, Chengfeng Liu, Carl Gunter

Extensive experiments are conducted to show that the proposed detection mechanism can achieve AUC above 90% against the two attack strategies on both Cora and Citeseer datasets.

Graph Generation Link Prediction +1

Paper
Add Code

DAG-GNN: DAG Structure Learning with Graph Neural Networks

3 code implementations • 22 Apr 2019 • Yue Yu, Jie Chen, Tian Gao, Mo Yu

Learning a faithful directed acyclic graph (DAG) from samples of a joint distribution is a challenging combinatorial problem, owing to the intractable search space superexponential in the number of graph nodes.

272

Paper
Code

GumDrop at the DISRPT2019 Shared Task: A Model Stacking Approach to Discourse Unit Segmentation and Connective Detection

1 code implementation • WS 2019 • Yue Yu, YIlun Zhu, Yang Liu, Yan Liu, Siyao Peng, Mackenzie Gong, Amir Zeldes

In this paper we present GumDrop, Georgetown University's entry at the DISRPT 2019 Shared Task on automatic discourse unit segmentation and connective detection.

Connective Detection Discourse Segmentation +2

Paper
Code

Modeling Long-Range Context for Concurrent Dialogue Acts Recognition

no code implementations • 2 Sep 2019 • Yue Yu, Siyao Peng, Grace Hui Yang

Previous work on DA recognition either assumes one DA per utterance or fails to realize the sequential nature of dialogues.

Sentence

Paper
Add Code

Acutum: When Generalization Meets Adaptability

no code implementations • 25 Sep 2019 • Xunpeng Huang, Zhengyang Liu, Zhe Wang, Yue Yu, Lei LI

To the best of our knowledge, Acutum is the first adaptive gradient method without second moments.

BIG-bench Machine Learning

Paper
Add Code

SPAN: A Stochastic Projected Approximate Newton Method

no code implementations • 10 Feb 2020 • Xunpeng Huang, Xianfeng Liang, Zhengyang Liu, Yitan Li, Linyun Yu, Yue Yu, Lei LI

SPAN computes the inverse of the Hessian matrix via low-rank approximation and stochastic Hessian-vector products.

Paper
Add Code

Urban Anomaly Analytics: Description, Detection, and Prediction

no code implementations • 25 Apr 2020 • Mingyang Zhang, Tong Li, Yue Yu, Yong Li, Pan Hui, Yu Zheng

Urban anomalies may result in loss of life or property if not handled properly.

Paper
Add Code

Data-driven learning of robust nonlocal physics from high-fidelity synthetic data

no code implementations • 17 May 2020 • Huaiqian You, Yue Yu, Nathaniel Trask, Mamikon Gulian, Marta D'Elia

A key challenge to nonlocal models is the analytical complexity of deriving them from first principles, and frequently their use is justified a posteriori.

Paper
Add Code

STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths

1 code implementation • 18 Jun 2020 • Yue Yu, Yinghao Li, Jiaming Shen, Hao Feng, Jimeng Sun, Chao Zhang

We propose a self-supervised taxonomy expansion model named STEAM, which leverages natural supervision in the existing taxonomy for expansion.

Taxonomy Expansion

Paper
Code

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision

1 code implementation • 28 Jun 2020 • Chen Liang, Yue Yu, Haoming Jiang, Siawpeng Er, Ruijia Wang, Tuo Zhao, Chao Zhang

We study the open-domain named entity recognition (NER) problem under distant supervision.

Ranked #1 on Weakly-Supervised Named Entity Recognition on CoNLL03

Language Modelling named-entity-recognition +3

290

Paper
Code

SumGNN: Multi-typed Drug Interaction Prediction via Efficient Knowledge Graph Summarization

1 code implementation • 4 Oct 2020 • Yue Yu, Kexin Huang, Chao Zhang, Lucas M. Glass, Jimeng Sun, Cao Xiao

Furthermore, most previous works focus on binary DDI prediction whereas the multi-typed DDI pharmacological effect prediction is a more meaningful but harder task.

Data Integration Knowledge Graphs

Paper
Code

SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup

1 code implementation • EMNLP 2020 • Rongzhi Zhang, Yue Yu, Chao Zhang

Our method, SeqMix, simply augments the queried samples by generating extra labeled sequences in each iteration.

Active Learning Data Augmentation +4

Paper
Code

MCMH: Learning Multi-Chain Multi-Hop Rules for Knowledge Graph Reasoning

no code implementations • Findings of the Association for Computational Linguistics 2020 • Lu Zhang, Mo Yu, Tian Gao, Yue Yu

Multi-hop reasoning approaches over knowledge graphs infer a missing relationship between entities with a multi-hop rule, which corresponds to a chain of relationships.

Knowledge Graphs Relation

Paper
Add Code

Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach

1 code implementation • NAACL 2021 • Yue Yu, Simiao Zuo, Haoming Jiang, Wendi Ren, Tuo Zhao, Chao Zhang

To address this problem, we develop a contrastive self-training framework, COSINE, to enable fine-tuning LMs with weak supervision.

Ranked #1 on Word Sense Disambiguation on Words in Context

Language Modelling Sentence +2

198

Paper
Code

DAGs with No Fears: A Closer Look at Continuous Optimization for Learning Bayesian Networks

1 code implementation • NeurIPS 2020 • Dennis Wei, Tian Gao, Yue Yu

This paper re-examines a continuous optimization framework dubbed NOTEARS for learning Bayesian networks.

Paper
Code

Data-driven learning of nonlocal models: from high-fidelity simulations to constitutive laws

no code implementations • 8 Dec 2020 • Huaiqian You, Yue Yu, Stewart Silling, Marta D'Elia

We show that machine learning can improve the accuracy of simulations of stress waves in one-dimensional composite materials.

Paper
Add Code

An asymptotically compatible treatment of traction loading in linearly elastic peridynamic fracture

no code implementations • 5 Jan 2021 • Yue Yu, Huaiqian You, Nathaniel Trask

In the absence of fracture, when a corresponding classical continuum mechanics model exists, our improvements provide asymptotically compatible convergence to corresponding local solutions, eliminating surface effects and issues with traction loading which have historically plagued peridynamic discretizations.

Numerical Analysis Computational Engineering, Finance, and Science Numerical Analysis Analysis of PDEs

Paper
Add Code

On Controllability and Persistency of Excitation in Data-Driven Control: Extensions of Willems' Fundamental Lemma

no code implementations • 5 Feb 2021 • Yue Yu, Shahriar Talebi, Henk J. van Waarde, Ufuk Topcu, Mehran Mesbahi, Behçet Açıkmeşe

Willems' fundamental lemma asserts that all trajectories of a linear time-invariant system can be obtained from a finite number of measured ones, assuming that controllability and a persistency of excitation condition hold.

LEMMA Model Predictive Control

Paper
Add Code

deGraphCS: Embedding Variable-based Flow Graph for Neural Code Search

1 code implementation • 24 Mar 2021 • Chen Zeng, Yue Yu, Shanshan Li, Xin Xia, Zhiming Wang, Mingyang Geng, Bailin Xiao, Wei Dong, Xiangke Liao

With the rapid increase in the amount of public code repositories, developers maintain a great desire to retrieve precise code snippets by using natural language.

Code Search

Paper
Code

PanGu-$α$: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation

4 code implementations • 26 Apr 2021 • Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, ZhenZhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, YaoWei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian

To enhance the generalization ability of PanGu-$\alpha$, we collect 1. 1TB high-quality Chinese data from a wide range of domains to pretrain the model.

Ranked #1 on Reading Comprehension (One-Shot) on DuReader

Cloze (multi-choices) (Few-Shot) Cloze (multi-choices) (One-Shot) +19

219

Paper
Code

DAGs with No Curl: An Efficient DAG Structure Learning Approach

1 code implementation • 14 Jun 2021 • Yue Yu, Tian Gao, Naiyu Yin, Qiang Ji

To further improve efficiency, we propose a novel learning framework to model and learn the weighted adjacency matrices in the DAG space directly.

Paper
Code

GGT: Graph-Guided Testing for Adversarial Sample Detection of Deep Neural Network

no code implementations • 9 Jul 2021 • Zuohui Chen, Renxuan Wang, Jingyang Xiang, Yue Yu, Xin Xia, Shouling Ji, Qi Xuan, Xiaoniu Yang

Deep Neural Networks (DNN) are known to be vulnerable to adversarial samples, the detection of which is crucial for the wide application of these DNN models.

Paper
Add Code

A data-driven peridynamic continuum model for upscaling molecular dynamics

no code implementations • 4 Aug 2021 • Huaiqian You, Yue Yu, Stewart Silling, Marta D'Elia

Nonlocal models, including peridynamics, often use integral operators that embed lengthscales in their definition.

Paper
Add Code

Advances in Trajectory Optimization for Space Vehicle Control

1 code implementation • 5 Aug 2021 • Danylo Malyuta, Yue Yu, Purnanand Elango, Behcet Acikmese

Space mission design places a premium on cost and operational efficiency.

Model Predictive Control

Paper
Code

A physics-informed variational DeepONet for predicting the crack path in brittle materials

no code implementations • 16 Aug 2021 • Somdatta Goswami, Minglang Yin, Yue Yu, George Karniadakis

We propose a physics-informed variational formulation of DeepONet (V-DeepONet) for brittle fracture analysis.

Paper
Add Code

Self-Training with Differentiable Teacher

no code implementations • Findings (NAACL) 2022 • Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha

In self-training, the student contributes to the prediction performance, and the teacher controls the training process by generating pseudo-labels.

named-entity-recognition Named Entity Recognition +3

Paper
Add Code

WRENCH: A Comprehensive Benchmark for Weak Supervision

1 code implementation • 23 Sep 2021 • Jieyu Zhang, Yue Yu, Yinghao Li, Yujing Wang, Yaming Yang, Mao Yang, Alexander Ratner

To address these problems, we introduce a benchmark platform, WRENCH, for thorough and standardized evaluation of WS approaches.

211

Paper
Code

A revised comparison between FF five-factor model and three-factor model,based on China's A-share market

no code implementations • 16 Oct 2021 • Zhijing Zhang, Yue Yu, Qinghua Ma, Haixiang Yao

In allusion to some contradicting results in existing research, this paper selects China's latest stock data from 2005 to 2020 for empirical analysis.

regression

Paper
Add Code

Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding

1 code implementation • 4 Dec 2021 • Deze Wang, Zhouyang Jia, Shanshan Li, Yue Yu, Yun Xiong, Wei Dong, Xiangke Liao

In this paper, we propose an approach to bridge pre-trained models and code-related tasks.

Clone Detection Code Search

Paper
Code

AcTune: Uncertainty-aware Active Self-Training for Semi-Supervised Active Learning with Pretrained Language Models

1 code implementation • 16 Dec 2021 • Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang

We propose {\ours}, a new framework that leverages unlabeled data to improve the label efficiency of active PLM fine-tuning.

Active Learning Language Modelling +2

Paper
Code

NIP: Neuron-level Inverse Perturbation Against Adversarial Attacks

no code implementations • 24 Dec 2021 • Ruoxi Chen, Haibo Jin, Jinyin Chen, Haibin Zheng, Yue Yu, Shouling Ji

From the perspective of image feature space, some of them cannot reach satisfying results due to the shift of features.

Paper
Add Code

CatchBackdoor: Backdoor Testing by Critical Trojan Neural Path Identification via Differential Fuzzing

no code implementations • 24 Dec 2021 • Haibo Jin, Ruoxi Chen, Jinyin Chen, Yao Cheng, Chong Fu, Ting Wang, Yue Yu, Zhaoyan Ming

Existing DNN testing methods are mainly designed to find incorrect corner case behaviors in adversarial settings but fail to discover the backdoors crafted by strong trojan attacks.

DNN Testing

Paper
Add Code

NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification

1 code implementation • 25 Dec 2021 • Haibin Zheng, Zhiqing Chen, Tianyu Du, Xuhong Zhang, Yao Cheng, Shouling Ji, Jingyi Wang, Yue Yu, Jinyin Chen

To overcome the challenges, we propose NeuronFair, a new DNN fairness testing framework that differs from previous work in several key aspects: (1) interpretable - it quantitatively interprets DNNs' fairness violations for the biased decision; (2) effective - it uses the interpretation results to guide the generation of more diverse instances in less time; (3) generic - it can handle both structured and unstructured data.

Fairness

Paper
Code

Nonlocal Kernel Network (NKN): a Stable and Resolution-Independent Deep Neural Network

no code implementations • 6 Jan 2022 • Huaiqian You, Yue Yu, Marta D'Elia, Tian Gao, Stewart Silling

In this work, we propose a novel nonlocal neural operator, which we refer to as nonlocal kernel network (NKN), that is resolution independent, characterized by deep neural networks, and capable of handling a variety of tasks such as learning governing equations and classifying images.

Image Classification

Paper
Add Code

Multi-task Joint Strategies of Self-supervised Representation Learning on Biomedical Networks for Drug Discovery

2 code implementations • 12 Jan 2022 • Xiaoqi Wang, Yingjie Cheng, Yaning Yang, Yue Yu, Fei Li, Shaoliang Peng

Therefore, we conjecture that the multimodal and local-global combination strategies can be treated as the guideline of multi-task SSL for drug discovery.

Drug Discovery Graph Attention +2

Paper
Code

A Survey on Programmatic Weak Supervision

1 code implementation • 11 Feb 2022 • Jieyu Zhang, Cheng-Yu Hsieh, Yue Yu, Chao Zhang, Alexander Ratner

Labeling training data has become one of the major roadblocks to using machine learning.

170

Paper
Code

Excitement Surfeited Turns to Errors: Deep Learning Testing Framework Based on Excitable Neurons

1 code implementation • 12 Feb 2022 • Haibo Jin, Ruoxi Chen, Haibin Zheng, Jinyin Chen, Yao Cheng, Yue Yu, Xianglong Liu

By maximizing the number of excitable neurons concerning various wrong behaviors of models, DeepSensor can generate testing examples that effectively trigger more errors due to adversarial inputs, polluted data and incomplete training.

Image Classification Speaker Recognition

Paper
Code

Interfacing Finite Elements with Deep Neural Operators for Fast Multiscale Modeling of Mechanics Problems

no code implementations • 25 Feb 2022 • Minglang Yin, Enrui Zhang, Yue Yu, George Em Karniadakis

In this work, we explore the idea of multiscale modeling with machine learning and employ DeepONet, a neural operator, as an efficient surrogate of the expensive solver.

Paper
Add Code

Smart Tracking Tray System for A Smart and Sustainable Wet Lab Community

no code implementations • 2 Mar 2022 • Nan Xu, Jingchen Li, Yue Yu, Yang Li, Jinglei Yang

Positive feedback has been collected from pilot tests in several labs.

Data Visualization Experimental Design

Paper
Add Code

Learning Deep Implicit Fourier Neural Operators (IFNOs) with Applications to Heterogeneous Material Modeling

1 code implementation • 15 Mar 2022 • Huaiqian You, Quinn Zhang, Colton J. Ross, Chung-Hao Lee, Yue Yu

In this work, we propose to use data-driven modeling, which directly utilizes high-fidelity simulation and/or experimental measurements to predict a material's response without using conventional constitutive models.

Paper
Code

PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning

1 code implementation • 18 Mar 2022 • Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, Chao Zhang

Weakly-supervised learning (WSL) has shown promising results in addressing label scarcity on many NLP tasks, but manually designing a comprehensive, high-quality labeling rule set is tedious and difficult.

Weakly-supervised Learning

Paper
Code

A Physics-Guided Neural Operator Learning Approach to Model Biological Tissues from Digital Image Correlation Measurements

1 code implementation • 1 Apr 2022 • Huaiqian You, Quinn Zhang, Colton J. Ross, Chung-Hao Lee, Ming-Chen Hsu, Yue Yu

To improve the generalizability of our framework, we propose a physics-guided neural operator learning model via imposing partial physics knowledge.

Operator learning

Paper
Code

Modality-Balanced Embedding for Video Retrieval

no code implementations • 18 Apr 2022 • Xun Wang, Bingqing Ke, Xuanping Li, Fangyu Liu, Mingyu Zhang, Xiao Liang, Qiushi Xiao, Cheng Luo, Yue Yu

This modality imbalanceresults from a) modality gap: the relevance between a query and a video text is much easier to learn as the query is also a piece of text, with the same modality as the video text; b) data bias: most training samples can be solved solely by text matching.

Retrieval Text Matching +1

Paper
Add Code

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

1 code implementation • 19 May 2022 • Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, dianhai yu

We took natural language processing (NLP) as an example to show how Nebula-I works in different training phases that include: a) pre-training a multilingual language model using two remote clusters; and b) fine-tuning a machine translation model using knowledge distilled from pre-trained models, which run through the most popular paradigm of recent deep learning.

Cross-Lingual Natural Language Inference Distributed Computing +2

21,583

Paper
Code

Nonparametric learning of kernels in nonlocal operators

no code implementations • 23 May 2022 • Fei Lu, Qingci An, Yue Yu

In this work, we provide a rigorous identifiability analysis and convergence study for the learning of kernels in nonlocal operators.

Paper
Add Code

MetaNOR: A Meta-Learnt Nonlocal Operator Regression Approach for Metamaterial Modeling

no code implementations • 4 Jun 2022 • Lu Zhang, Huaiqian You, Yue Yu

We propose MetaNOR, a meta-learnt approach for transfer-learning operators based on the nonlocal operator regression.

regression Transfer Learning

Paper
Add Code

PolyU-BPCoMa: A Dataset and Benchmark Towards Mobile Colorized Mapping Using a Backpack Multisensorial System

1 code implementation • 15 Jun 2022 • Wenzhong Shi, Pengxin Chen, Muyang Wang, Sheng Bao, Haodong Xiang, Yue Yu, Daping Yang

Color checker boards are pasted in each surveyed area as targets and ground truth data are collected by an advanced terrestrial laser scanner (TLS).

Colorization

Paper
Code

Physics-Informed Deep Neural Operator Networks

no code implementations • 8 Jul 2022 • Somdatta Goswami, Aniruddha Bora, Yue Yu, George Em Karniadakis

Standard neural networks can approximate general nonlinear operators, represented either explicitly by a combination of mathematical operators, e. g., in an advection-diffusion-reaction partial differential equation, or simply as a black box, e. g., a system-of-systems.

Uncertainty Quantification

Paper
Add Code

OpenMedIA: Open-Source Medical Image Analysis Toolbox and Benchmark under Heterogeneous AI Computing Platforms

no code implementations • 11 Aug 2022 • Jia-Xin Zhuang, Xiansong Huang, Yang Yang, Jiancong Chen, Yue Yu, Wei Gao, Ge Li, Jie Chen, Tong Zhang

In this paper, we present OpenMedIA, an open-source toolbox library containing a rich set of deep learning methods for medical image analysis under heterogeneous Artificial Intelligence (AI) computing platforms.

Image Classification Medical Image Classification +2

Paper
Add Code

Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach

1 code implementation • 15 Sep 2022 • Yue Yu, Rongzhi Zhang, ran Xu, Jieyu Zhang, Jiaming Shen, Chao Zhang

Large Language Models have demonstrated remarkable few-shot performance, but the performance can be sensitive to the selection of few-shot instances.

Language Modelling Text Classification

Paper
Code

Online Poisoning Attacks Against Data-Driven Predictive Control

no code implementations • 19 Sep 2022 • Yue Yu, Ruihan Zhao, Sandeep Chinchali, Ufuk Topcu

Data-driven predictive control (DPC) is a feedback control method for systems with unknown dynamics.

Paper
Add Code

ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select

1 code implementation • 26 Oct 2022 • Yuchen Zhuang, Yinghao Li, Jerry Junyang Cheung, Yue Yu, Yingjun Mou, Xiang Chen, Le Song, Chao Zhang

We study the problem of extracting N-ary relation tuples from scientific articles.

Relation Extraction Retrieval

Paper
Code

COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning

1 code implementation • 27 Oct 2022 • Yue Yu, Chenyan Xiong, Si Sun, Chao Zhang, Arnold Overwijk

We present a new zero-shot dense retrieval (ZeroDR) method, COCO-DR, to improve the generalization ability of dense retrieval by combating the distribution shifts between source training tasks and target scenarios.

Ranked #1 on Zero-shot Text Search on CQADupStack

Language Modelling Retrieval +2

Paper
Code

Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives

1 code implementation • 31 Oct 2022 • Si Sun, Chenyan Xiong, Yue Yu, Arnold Overwijk, Zhiyuan Liu, Jie Bao

In this paper, we investigate the instability in the standard dense retrieval training, which iterates between model training and hard negative selection using the being-trained model.

Retrieval

Paper
Code

Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks

1 code implementation • 1 Nov 2022 • Yue Yu, Xuan Kan, Hejie Cui, ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang

To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on \underline{T}ask-aware \underline{B}rain connectivity \underline{D}AG (short for Directed Acyclic Graph) \underline{S}tructure generation for fMRI analysis.

Time Series Time Series Analysis

Paper
Code

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind

1 code implementation • 9 Nov 2022 • Mo Yu, Qiujing Wang, Shunchi Zhang, Yisi Sang, Kangsheng Pu, Zekai Wei, Han Wang, Liyan Xu, Jing Li, Yue Yu, Jie zhou

Our dataset consists of ~1, 000 parsed movie scripts, each corresponding to a few-shot character understanding task that requires models to mimic humans' ability of fast digesting characters with a few starting scenes in a new movie.

Meta-Learning Metric Learning

Paper
Code

Transferability Estimation Based On Principal Gradient Expectation

no code implementations • 29 Nov 2022 • Huiyan Qi, Lechao Cheng, Jingjing Chen, Yue Yu, Xue Song, Zunlei Feng, Yu-Gang Jiang

Transfer learning aims to improve the performance of target tasks by transferring knowledge acquired in source tasks.

Transfer Learning

Paper
Add Code

EDoG: Adversarial Edge Detection For Graph Neural Networks

no code implementations • 27 Dec 2022 • Xiaojun Xu, Yue Yu, Hanzhang Wang, Alok Lal, Carl A. Gunter, Bo Li

In this paper, we propose a general adversarial edge detection pipeline EDoG without requiring knowledge of the attack strategies based on graph generation.

Edge Detection Graph Generation +2

Paper
Add Code

INO: Invariant Neural Operators for Learning Complex Physical Systems with Momentum Conservation

no code implementations • 29 Dec 2022 • Ning Liu, Yue Yu, Huaiqian You, Neeraj Tatikola

Neural operators, which emerge as implicit solution operators of hidden governing equations, have recently become popular tools for learning responses of complex real-world physical systems.

Paper
Add Code

Neighborhood-Regularized Self-Training for Learning with Few Labels

1 code implementation • 10 Jan 2023 • ran Xu, Yue Yu, Hejie Cui, Xuan Kan, Yanqiao Zhu, Joyce Ho, Chao Zhang, Carl Yang

Our further analysis demonstrates that our proposed data selection strategy reduces the noise of pseudo labels by 36. 8% and saves 57. 3% of the time when compared with the best baseline.

Paper
Code

Towards a unified nonlocal, peridynamics framework for the coarse-graining of molecular dynamics data with fractures

no code implementations • 11 Jan 2023 • Huaiqian You, Xiao Xu, Yue Yu, Stewart Silling, Marta D'Elia, John Foster

Then, based on the coarse-grained MD data, a two-phase optimization-based learning approach is proposed to infer the optimal peridynamics model with damage criterion.

Paper
Add Code

MetaNO: How to Transfer Your Knowledge on Learning Hidden Physics

no code implementations • 28 Jan 2023 • Lu Zhang, Huaiqian You, Tian Gao, Mo Yu, Chung-Hao Lee, Yue Yu

Gradient-based meta-learning methods have primarily been applied to classical machine learning tasks such as image classification.

Image Classification Meta-Learning

Paper
Add Code

Stochastic Clustered Federated Learning

no code implementations • 2 Mar 2023 • Dun Zeng, Xiangjing Hu, Shiyu Liu, Yue Yu, Qifan Wang, Zenglin Xu

Federated learning is a distributed learning framework that takes full advantage of private data samples kept on edge devices.

Federated Learning

Paper
Add Code

Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and Inverse Learning

1 code implementation • 31 Mar 2023 • Shenghui Chen, Yue Yu, David Fridovich-Keil, Ufuk Topcu

Markov games model interactions among multiple players in a stochastic, dynamic environment.

OpenAI Gym

Paper
Code

Dynamic Routing in Stochastic Urban Air Mobility Networks: A Markov Decision Process Approach

no code implementations • 11 May 2023 • Qinshuang Wei, Yue Yu, Ufuk Topcu

Urban air mobility (UAM) is an emerging concept in short-range aviation transportation, where the aircraft will take off, land, and charge their batteries at a set of vertistops, and travel only through a set of flight corridors connecting these vertistops.

Decision Making

Paper
Add Code

ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval

1 code implementation • 18 May 2023 • Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen, Chao Zhang

With the development of large language models (LLMs), zero-shot learning has attracted much attention for various NLP tasks.

Ranked #1 on Zero-Shot Text Classification on AG News

Descriptive Retrieval +6

Paper
Code

Higher Order Gauge Equivariant CNNs on Riemannian Manifolds and Applications

no code implementations • 26 May 2023 • Gianfranco Cortes, Yue Yu, Robin Chen, Melissa Armstrong, David Vaillancourt, Baba C. Vemuri

With the advent of group equivariant convolutions in deep networks literature, spherical CNNs with $\mathsf{SO}(3)$-equivariant layers have been developed to cope with data that are samples of signals on the sphere $S^2$.

Paper
Add Code

DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling

1 code implementation • 30 May 2023 • Yuchen Zhuang, Yue Yu, Lingkai Kong, Xiang Chen, Chao Zhang

Most existing methods for learning from noisy labels use static input features for denoising, but these methods are limited by the information they can provide on true label distributions and can result in biased or incorrect predictions.

Denoising

Paper
Code

R-Mixup: Riemannian Mixup for Biological Networks

no code implementations • 5 Jun 2023 • Xuan Kan, Zimu Li, Hejie Cui, Yue Yu, ran Xu, Shaojun Yu, Zilong Zhang, Ying Guo, Carl Yang

Biological networks are commonly used in biomedical and healthcare domains to effectively model the structure of complex biological systems with interactions linking biological entities.

Data Augmentation

Paper
Add Code

Local Boosting for Weakly-Supervised Learning

no code implementations • 5 Jun 2023 • Rongzhi Zhang, Yue Yu, Jiaming Shen, Xiquan Cui, Chao Zhang

In this work, we show that the standard implementation of the convex combination of base learners can hardly work due to the presence of noisy labels.

Weakly-supervised Learning

Paper
Add Code

A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises

no code implementations • 7 Jun 2023 • Hejie Cui, Jiaying Lu, Shiyu Wang, ran Xu, Wenjing Ma, Shaojun Yu, Yue Yu, Xuan Kan, Chen Ling, Tianfan Fu, Liang Zhao, Joyce Ho, Fei Wang, Carl Yang

This work aims to serve as a valuable resource for understanding the potential and opportunities of HKG in health research.

Knowledge Graphs

Paper
Add Code

Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage Training

1 code implementation • 12 Jun 2023 • ran Xu, Yue Yu, Joyce C. Ho, Carl Yang

To address this challenge, we propose a weakly-supervised approach for scientific document classification using label names only.

Document Classification Retrieval

Paper
Code

MUBen: Benchmarking the Uncertainty of Molecular Representation Models

2 code implementations • 14 Jun 2023 • Yinghao Li, Lingkai Kong, Yuanqi Du, Yue Yu, Yuchen Zhuang, Wenhao Mu, Chao Zhang

While some studies have included UQ to improve molecular pre-trained models, the process of selecting suitable backbone and UQ methods for reliable molecular uncertainty estimation remains underexplored.

Benchmarking Drug Discovery +4

Paper
Code

ToolQA: A Dataset for LLM Question Answering with External Tools

1 code implementation • NeurIPS 2023 • Yuchen Zhuang, Yue Yu, Kuan Wang, Haotian Sun, Chao Zhang

To address this issue, we introduce a new dataset called ToolQA, which is designed to faithfully evaluate LLMs' ability to use external tools for question answering.

Hallucination Question Answering

206

Paper
Code

Practical Privacy-Preserving Gaussian Process Regression via Secret Sharing

no code implementations • 26 Jun 2023 • Jinglong Luo, Yehong Zhang, JiaQi Zhang, Shuang Qin, Hui Wang, Yue Yu, Zenglin Xu

In contrast to existing studies that protect the data privacy of GPR via homomorphic encryption, differential privacy, or federated learning, our proposed method is more practical and can be used to preserve the data privacy of both the model inputs and outputs for various data-sharing scenarios (e. g., horizontally/vertically-partitioned data).

Federated Learning GPR +2

Paper
Add Code

Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias

1 code implementation • NeurIPS 2023 • Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen, Chao Zhang

Large language models (LLMs) have been recently leveraged as training data generators for various natural language processing (NLP) tasks.

Attribute Language Modelling +1

114

Paper
Code

Personalized Federated Learning via Amortized Bayesian Meta-Learning

no code implementations • 5 Jul 2023 • Shiyu Liu, Shaogao Lv, Dun Zeng, Zenglin Xu, Hui Wang, Yue Yu

Federated learning is a decentralized and privacy-preserving technique that enables multiple clients to collaborate with a server to learn a global model without exposing their private data.

Meta-Learning Personalized Federated Learning +2

Paper
Add Code

Model Provenance via Model DNA

no code implementations • 4 Aug 2023 • Xin Mu, Yu Wang, Yehong Zhang, JiaQi Zhang, Hui Wang, Yang Xiang, Yue Yu

Understanding the life cycle of the machine learning (ML) model is an intriguing area of research (e. g., understanding where the model comes from, how it is trained, and how it is used).

Representation Learning

Paper
Add Code

Intelligence-Endogenous Management Platform for Computing and Network Convergence

no code implementations • 7 Aug 2023 • Zicong Hong, Xiaoyu Qiu, Jian Lin, Wuhui Chen, Yue Yu, Hui Wang, Song Guo, Wen Gao

Therefore, in this article, we present the concept of an intelligence-endogenous management platform for CNCs called \emph{CNC brain} based on artificial intelligence technologies.

Management Scheduling

Paper
Add Code

Active Inverse Learning in Stackelberg Trajectory Games

no code implementations • 15 Aug 2023 • Yue Yu, Jacob Levy, Negar Mehr, David Fridovich-Keil, Ufuk Topcu

We formulate an inverse learning problem in a Stackelberg game between a leader and a follower, where each player's action is the trajectory of a dynamical system.

Paper
Add Code

Provable Training for Graph Contrastive Learning

1 code implementation • NeurIPS 2023 • Yue Yu, Xiao Wang, Mengmei Zhang, Nian Liu, Chuan Shi

To this end, we propose the PrOvable Training (POT) for GCL, which regularizes the training of GCL to encode node embeddings that follows the GCL principle better.

Contrastive Learning

Paper
Code

At Which Training Stage Does Code Data Help LLMs Reasoning?

1 code implementation • 28 Sep 2023 • Yingwei Ma, Yue Liu, Yue Yu, Yuanliang Zhang, Yu Jiang, Changjian Wang, Shanshan Li

Inspired by the great success of code data in training LLMs, we naturally wonder at which training stage introducing code data can really help LLMs reasoning.

Question Answering

Paper
Code

Bridging Code Semantic and LLMs: Semantic Chain-of-Thought Prompting for Code Generation

no code implementations • 16 Oct 2023 • Yingwei Ma, Yue Yu, Shanshan Li, Yu Jiang, Yong Guo, Yuanliang Zhang, Yutao Xie, Xiangke Liao

Meanwhile, while traditional techniques leveraging such semantic information require complex static or dynamic code analysis to obtain features such as data flow and control flow, SeCoT demonstrates that this process can be fully automated via the intrinsic capabilities of LLMs (i. e., in-context learning), while being generalizable and applicable to challenging domains.

Code Generation In-Context Learning

Paper
Add Code

Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models

1 code implementation • 1 Nov 2023 • ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, Wei Jin, Joyce Ho, Carl Yang

Clinical natural language processing requires methods that can address domain-specific challenges, such as complex medical terminology and clinical contexts.

Clinical Knowledge Knowledge Graphs +1

Paper
Code

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning

no code implementations • 13 Nov 2023 • Yue Yu, Jiaming Shen, Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, Michael Bendersky

To fully unleash the power of explanations, we propose EASE, an Explanation-Aware Soft Ensemble framework to empower in-context learning with LLMs.

In-Context Learning Language Modelling +2

Paper
Add Code

On What Basis? Predicting Text Preference Via Structured Comparative Reasoning

no code implementations • 14 Nov 2023 • Jing Nathan Yan, Tianqi Liu, Justin T Chiu, Jiaming Shen, Zhen Qin, Yue Yu, Yao Zhao, Charu Lakshmanan, Yair Kurzion, Alexander M. Rush, Jialu Liu, Michael Bendersky

Comparative reasoning plays a crucial role in text preference prediction; however, large language models (LLMs) often demonstrate inconsistencies in their reasoning.

Hallucination Retrieval

Paper
Add Code

How Many Validation Labels Do You Need? Exploring the Design Space of Label-Efficient Model Ranking

1 code implementation • 4 Dec 2023 • Zhengyu Hu, Jieyu Zhang, Yue Yu, Yuchen Zhuang, Hui Xiong

This paper presents LEMR (Label-Efficient Model Ranking) and introduces the MoraBench Benchmark.

Model Selection

Paper
Code

Harnessing the Power of Neural Operators with Automatically Encoded Conservation Laws

no code implementations • 18 Dec 2023 • Ning Liu, Yiming Fan, Xianyi Zeng, Milan Klöwer, Yue Yu

In this work, we introduce conservation law-encoded neural operators (clawNOs), a suite of NOs that endow inference with automatic satisfaction of such conservation laws.

Paper
Add Code

EncryIP: A Practical Encryption-Based Framework for Model Intellectual Property Protection

no code implementations • 19 Dec 2023 • Xin Mu, Yu Wang, Zhengan Huang, Junzuo Lai, Yehong Zhang, Hui Wang, Yue Yu

In the rapidly growing digital economy, protecting intellectual property (IP) associated with digital products has become increasingly important.

Paper
Add Code

Towards Fair Graph Federated Learning via Incentive Mechanisms

1 code implementation • 20 Dec 2023 • Chenglu Pan, Jiarong Xu, Yue Yu, Ziqi Yang, Qingbiao Wu, Chunping Wang, Lei Chen, Yang Yang

Extensive experiments show that our model achieves the best trade-off between accuracy and the fairness of model gradient, as well as superior payoff fairness.

Fairness Federated Learning +1

Paper
Code

Causal Discovery under Identifiable Heteroscedastic Noise Model

no code implementations • 20 Dec 2023 • Naiyu Yin, Tian Gao, Yue Yu, Qiang Ji

We then propose an effective two-phase iterative DAG learning algorithm to address the increasing optimization difficulties and to learn a causal DAG from data with heteroscedastic variable noise under varying variance.

Causal Discovery

Paper
Add Code

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles

no code implementations • 30 Dec 2023 • Yuanzhao Zhai, Han Zhang, Yu Lei, Yue Yu, Kele Xu, Dawei Feng, Bo Ding, Huaimin Wang

Reinforcement learning from human feedback (RLHF) emerges as a promising paradigm for aligning large language models (LLMs).

Uncertainty Quantification

Paper
Add Code

SecFormer: Towards Fast and Accurate Privacy-Preserving Inference for Large Language Models

no code implementations • 1 Jan 2024 • Jinglong Luo, Yehong Zhang, JiaQi Zhang, Xin Mu, Hui Wang, Yue Yu, Zenglin Xu

However, the application of SMPC in Privacy-Preserving Inference (PPI) for large language models, particularly those based on the Transformer architecture, often leads to considerable slowdowns or declines in performance.

Knowledge Distillation Privacy Preserving

Paper
Add Code

Noise-Aware and Equitable Urban Air Traffic Management: An Optimization Approach

no code implementations • 1 Jan 2024 • Zhenyu Gao, Yue Yu, Qinshuang Wei, Ufuk Topcu, John-Paul Clarke

Urban air mobility (UAM), a transformative concept for the transport of passengers and cargo, faces several integration challenges in complex urban environments.

Fairness Management

Paper
Add Code

Training and Serving System of Foundation Models: A Comprehensive Survey

no code implementations • 5 Jan 2024 • Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, Zibin Zheng

Additionally, the paper summarizes the challenges and presents a perspective on the future development direction of foundation model systems.

Paper
Add Code

Peridynamic Neural Operators: A Data-Driven Nonlocal Constitutive Model for Complex Material Responses

no code implementations • 11 Jan 2024 • Siavash Jafarzadeh, Stewart Silling, Ning Liu, Zhongqiang Zhang, Yue Yu

In this work, we introduce a novel integral neural operator architecture called the Peridynamic Neural Operator (PNO) that learns a nonlocal constitutive law from data.

Paper
Add Code

EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records

1 code implementation • 13 Jan 2024 • Wenqi Shi, ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce Ho, Carl Yang, May D. Wang

Large language models (LLMs) have demonstrated exceptional capabilities in planning and tool utilization as autonomous agents, but few have been developed for medical problem-solving.

Code Generation Few-Shot Learning +1

Paper
Code

HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification

no code implementations • 24 Jan 2024 • Vidit Jain, Mukund Rungta, Yuchen Zhuang, Yue Yu, Zeyu Wang, Mu Gao, Jeffrey Skolnick, Chao Zhang

The best-performing models aim to learn a static representation by combining document and hierarchical label information.

Language Modelling Multi Label Text Classification +3

Paper
Add Code

Endowing Pre-trained Graph Models with Provable Fairness

1 code implementation • 19 Feb 2024 • Zhongjian Zhang, Mengmei Zhang, Yue Yu, Cheng Yang, Jiawei Liu, Chuan Shi

Furthermore, with GraphPAR, we quantify whether the fairness of each node is provable, i. e., predictions are always fair within a certain range of sensitive attribute semantics.

Attribute Fairness +1

Paper
Code

ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling

no code implementations • 21 Feb 2024 • Lingxi Zhang, Yue Yu, Kuan Wang, Chao Zhang

Retrieval-augmented generation enhances large language models (LLMs) by incorporating relevant information from external knowledge sources.

Retrieval Transfer Learning +1

Paper
Add Code

COPR: Continual Human Preference Learning via Optimal Policy Regularization

no code implementations • 22 Feb 2024 • Han Zhang, Lin Gui, Yu Lei, Yuanzhao Zhai, Yehong Zhang, Yulan He, Hui Wang, Yue Yu, Kam-Fai Wong, Bin Liang, Ruifeng Xu

Reinforcement Learning from Human Feedback (RLHF) is commonly utilized to improve the alignment of Large Language Models (LLMs) with human preferences.

Continual Learning

Paper
Add Code

Multi-modal Stance Detection: New Datasets and Model

no code implementations • 22 Feb 2024 • Bin Liang, Ang Li, Jingqian Zhao, Lin Gui, Min Yang, Yue Yu, Kam-Fai Wong, Ruifeng Xu

Stance detection is a challenging task that aims to identify public opinion from social media platforms with respect to specific targets.

Stance Detection

Paper
Add Code

RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records

no code implementations • 25 Feb 2024 • ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Bowen Jin, May D. Wang, Joyce C. Ho, Carl Yang

We present RAM-EHR, a Retrieval AugMentation pipeline to improve clinical predictions on Electronic Health Records (EHRs).

Retrieval

Paper
Add Code

ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine Translation

no code implementations • 11 Mar 2024 • Shaojie Dai, Xin Liu, Ping Luo, Yue Yu

Large language model (LLM) has achieved promising performance in multilingual machine translation tasks through zero/few-shot prompts or prompt-tuning.

Language Modelling Large Language Model +2

Paper
Add Code

VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation

1 code implementation • 14 Mar 2024 • Mingya Zhang, Yue Yu, Limei Gu, Tingsheng Lin, Xianping Tao

In the field of medical image segmentation, models based on both CNN and Transformer have been thoroughly investigated.

Image Segmentation Medical Image Segmentation +1

Paper
Code

ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models

1 code implementation • 17 Mar 2024 • Yuzhao Heng, Chunyuan Deng, Yitong Li, Yue Yu, Yinghao Li, Rongzhi Zhang, Chao Zhang

Although Large Language Models (LLMs) exhibit remarkable adaptability across domains, these models often fall short in structured knowledge extraction tasks such as named entity recognition (NER).

Attribute named-entity-recognition +2

Paper
Code

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

1 code implementation • 18 Mar 2024 • Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang

To tackle these challenges, we present GenView, a controllable framework that augments the diversity of positive views leveraging the power of pretrained generative models while preserving semantics.

Contrastive Learning Data Augmentation +1

Paper
Code

Heterogeneous Peridynamic Neural Operators: Discover Biotissue Constitutive Law and Microstructure From Digital Image Correlation Measurements

no code implementations • 27 Mar 2024 • Siavash Jafarzadeh, Stewart Silling, Lu Zhang, Colton Ross, Chung-Hao Lee, S. M. Rakibur Rahman, Shuodao Wang, Yue Yu

Then, in the second phase we reinitialize the learnt bond force and the kernel function, and training them together with a fiber orientation field for each material point.

Paper
Add Code

Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning

1 code implementation • ACL 2022 • Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, Chao Zhang

Weakly-supervised Learning

Paper
Code

AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models

1 code implementation • NAACL 2022 • Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang

We develop AcTune, a new framework that improves the label efficiency of active PLM fine-tuning by unleashing the power of unlabeled data via self-training.

Active Learning text-classification +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.