Search Results for author: Jing Gao

Found 55 papers, 14 papers with code

Profanity-Avoiding Training Framework for Seq2seq Models with Certified Robustness

no code implementations EMNLP 2021 Hengtong Zhang, Tianhang Zheng, Yaliang Li, Jing Gao, Lu Su, Bo Li

To address this problem, we propose a training framework with certified robustness to eliminate the causes that trigger the generation of profanity.

Dialogue Generation Style Transfer

Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs

1 code implementation 1 Apr 2024 Xiaoze Liu, Feijie Wu, Tianyang Xu, Zhuo Chen, Yichi Zhang, Xiaoqian Wang, Jing Gao

In this paper, we propose GraphEval to evaluate an LLM's performance using a substantially large test dataset.

Knowledge Graphs

Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset

no code implementations 14 Mar 2024 Ning Cheng, You Li, Jing Gao, Bin Fang, Jinan Xu, Wenjuan Han

Tactility provides crucial support and enhancement for the perception and interaction capabilities of both humans and robots.

Sentence

Knowledge-Reuse Transfer Learning Methods in Molecular and Material Science

no code implementations 2 Mar 2024 An Chen, Zhilong Wang, Karl Luigi Loza Vidaurre, Yanqiang Han, Simin Ye, Kehao Tao, Shiwei Wang, Jing Gao, Jinjin Li

We focus on the application of transfer learning methods for the discovery of advanced molecules/materials, particularly, the construction of transfer learning frameworks for different systems, and how transfer learning can enhance the performance of models.

Transfer Learning

$C^3$: Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding

no code implementations 25 Feb 2024 Taixi Lu, Haoyu Wang, Huajie Shao, Jing Gao, Huaxiu Yao

Existing model cascade methods seek to enhance inference efficiency by greedily selecting the lightest model capable of processing the current input from a variety of models, based on model confidence scores.

Natural Language Understanding

BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

no code implementations 16 Feb 2024 Haoyu Wang, Tuo Zhao, Jing Gao

Retrieval-augmented Large Language Models (LLMs) offer substantial benefits in enhancing performance across knowledge-intensive scenarios.

Open-Domain Question Answering Retrieval

Advanced Unstructured Data Processing for ESG Reports: A Methodology for Structured Transformation and Enhanced Analysis

1 code implementation 4 Jan 2024 Jiahui Peng, Jing Gao, Xin Tong, Jing Guo, Hang Yang, Jianchuan Qi, Ruiqiao Li, Nan Li, Ming Xu

In the evolving field of corporate sustainability, analyzing unstructured Environmental, Social, and Governance (ESG) reports is a complex challenge due to their varied formats and intricate content.

Towards Poisoning Fair Representations

no code implementations 28 Sep 2023 Tianci Liu, Haoyu Wang, Feijie Wu, Hengtong Zhang, Pan Li, Lu Su, Jing Gao

Fair machine learning seeks to mitigate model prediction bias against certain demographic subgroups, such as the elderly and women.

Bilevel Optimization Data Poisoning +2

Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

no code implementations 2 Jun 2023 Zhuo Wang, Rongzhen Li, Bowen Dong, Jie Wang, Xiuxing Li, Ning Liu, Chenhui Mao, Wei Zhang, Liling Dong, Jing Gao, Jianyong Wang

In this paper, we explore the potential of LLMs such as GPT-4 to outperform traditional AI tools in dementia diagnosis.

Behavioral Machine Learning? Computer Predictions of Corporate Earnings also Overreact

no code implementations 25 Mar 2023 Murray Z. Frank, Jing Gao, Keer Yang

There is considerable evidence that machine learning algorithms have better predictive abilities than humans in various financial settings.

SimFair: A Unified Framework for Fairness-Aware Multi-Label Classification

no code implementations 19 Feb 2023 Tianci Liu, Haoyu Wang, Yaqing Wang, Xiaoqian Wang, Lu Su, Jing Gao

This new framework utilizes data that have similar labels when estimating fairness on a particular label group for better stability, and can unify DP and EOp.

Classification Fairness +1

Multi-rater Prism: Learning self-calibrated medical image segmentation from multiple raters

no code implementations 1 Dec 2022 Junde Wu, Huihui Fang, Yehui Yang, Yuanpei Liu, Jing Gao, Lixin Duan, Weihua Yang, Yanwu Xu

In this paper, we propose a novel neural network framework, called Multi-Rater Prism (MrPrism) to learn the medical image segmentation from multiple labels.

Image Segmentation Medical Image Segmentation +2

Towards Reliable Item Sampling for Recommendation Evaluation

no code implementations 28 Nov 2022 Dong Li, Ruoming Jin, Zhenming Liu, Bin Ren, Jing Gao, Zhi Liu

Since Rendle and Krichene argued that commonly used sampling-based evaluation metrics are "inconsistent" with respect to the global metrics (even in expectation), there have been a few studies on the sampling-based recommender system evaluation.

Recommendation Systems

AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning

1 code implementation 31 Oct 2022 Yaqing Wang, Sahaj Agarwal, Subhabrata Mukherjee, Xiaodong Liu, Jing Gao, Ahmed Hassan Awadallah, Jianfeng Gao

Standard fine-tuning of large pre-trained language models (PLMs) for downstream tasks requires updating hundreds of millions to billions of parameters, and storing a large copy of the PLM weights for every task resulting in increased cost for storing, sharing and serving the models.

Temporal Spatial Decomposition and Fusion Network for Time Series Forecasting

no code implementations 6 Oct 2022 Liwang Zhou, Jing Gao

One decomposition approach often cannot be used for numerous forecasting tasks since the standard time series decomposition lacks flexibility and robustness.

Feature Engineering feature selection +3

An Efficient Person Clustering Algorithm for Open Checkout-free Groceries

1 code implementation 5 Aug 2022 Junde Wu, Yu Zhang, Rao Fu, Yuanpei Liu, Jing Gao

Then, to ensure that the method adapts to the dynamic and unseen person flow, we propose Graph Convolutional Network (GCN) with a simple Nearest Neighbor (NN) strategy to accurately cluster the instances of CSG.

Clustering

Anchor Sampling for Federated Learning with Partial Client Participation

1 code implementation 13 Jun 2022 Feijie Wu, Song Guo, Zhihao Qu, Shiqi He, Ziming Liu, Jing Gao

The lack of inactive clients' updates in partial client participation makes it more likely for the model aggregation to deviate from the aggregation based on full client participation.

Federated Learning

SeATrans: Learning Segmentation-Assisted diagnosis model via Transformer

no code implementations 12 Jun 2022 Junde Wu, Huihui Fang, Fangxin Shang, Dalu Yang, Zhaowei Wang, Jing Gao, Yehui Yang, Yanwu Xu

To model the segmentation-diagnosis interaction, SeA-block first embeds the diagnosis feature based on the segmentation information via the encoder, and then transfers the embedding back to the diagnosis feature space by a decoder.

Melanoma Diagnosis Segmentation

AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning

1 code implementation 24 May 2022 Yaqing Wang, Sahaj Agarwal, Subhabrata Mukherjee, Xiaodong Liu, Jing Gao, Ahmed Hassan Awadallah, Jianfeng Gao

Standard fine-tuning of large pre-trained language models (PLMs) for downstream tasks requires updating hundreds of millions to billions of parameters, and storing a large copy of the PLM weights for every task resulting in increased cost for storing, sharing and serving the models.

Natural Language Understanding Sparse Learning

Label a Herd in Minutes: Individual Holstein-Friesian Cattle Identification

1 code implementation 22 Apr 2022 Jing Gao, Tilo Burghardt, Neill W. Campbell

In particular, for the task of automatic identification of individual Holstein-Friesians in real-world farm CCTV, we show that self-supervision, metric learning, cluster analysis, and active learning can complement each other to significantly reduce the annotation requirements usually needed to train cattle identification frameworks.

Active Learning Metric Learning

Deep Learning for Spatiotemporal Modeling of Urbanization

no code implementations 17 Dec 2021 Tang Li, Jing Gao, Xi Peng

Here we explore the capacity of deep spatial learning for the predictive modeling of urbanization.

BIG-bench Machine Learning

Path-specific Causal Fair Prediction via Auxiliary Graph Structure Learning

no code implementations 29 Sep 2021 Liuyi Yao, Yaliang Li, Bolin Ding, Jingren Zhou, Jinduo Liu, Mengdi Huai, Jing Gao

To tackle these challenges, we propose a novel causal graph based fair prediction framework, which integrates graph structure learning into fair prediction to ensure that unfair pathways are excluded in the causal graph.

Fairness Graph structure learning

On the regularization landscape for the linear recommendation models

no code implementations 29 Sep 2021 Dong Li, Zhenming Liu, Ruoming Jin, Zhi Liu, Jing Gao, Bin Ren

Recently, a wide range of recommendation algorithms inspired by deep learning techniques have emerged as the performance leaders on several standard recommendation benchmarks.

Multimodal Emergent Fake News Detection via Meta Neural Process Networks

no code implementations 22 Jun 2021 Yaqing Wang, Fenglong Ma, Haoyu Wang, Kishlay Jha, Jing Gao

The experimental results show our proposed MetaFEND model can detect fake news on never-seen events effectively and outperform the state-of-the-art methods.

Fake News Detection Hard Attention +1

On Sampling Top-K Recommendation Evaluation

no code implementations 20 Jun 2021 Dong Li, Ruoming Jin, Jing Gao, Zhi Liu

Recently, Rendle has warned that the use of sampling-based top-$k$ metrics might not suffice.

Towards a Better Understanding of Linear Models for Recommendation

no code implementations 27 May 2021 Ruoming Jin, Dong Li, Jing Gao, Zhi Liu, Li Chen, Yang Zhou

Through the derivation and analysis of the closed-form solutions for two basic regression and matrix factorization approaches, we found these two approaches are indeed inherently related but also diverge in how they "scale-down" the singular values of the original user-item interaction matrix.

regression

Towards Self-Supervision for Video Identification of Individual Holstein-Friesian Cattle: The Cows2021 Dataset

2 code implementations 5 May 2021 Jing Gao, Tilo Burghardt, William Andrew, Andrew W. Dowsey, Neill W. Campbell

Motivated by the labelling burden involved in constructing visual cattle identification systems, we propose exploiting the temporal coat pattern appearance across videos as a self-supervision signal for animal identity learning.

Contrastive Learning

Fairness-aware Outlier Ensemble

no code implementations 17 Mar 2021 Haoyu Liu, Fenglong Ma, Shibo He, Jiming Chen, Jing Gao

Meanwhile, we propose a post-processing framework to tune the original ensemble results through a stacking process so that we can achieve a trade-off between fairness and detection performance.

Fairness Fraud Detection +1

On Estimating Recommendation Evaluation Metrics under Sampling

no code implementations 2 Mar 2021 Ruoming Jin, Dong Li, Benjamin Mudrak, Jing Gao, Zhi Liu

The proposed approaches either are rather uninformative (linking sampling to metric evaluation) or can only work on simple metrics, such as Recall/Precision (Krichene and Rendle 2020; Li et al. 2020).

Adaptive Self-training for Neural Sequence Labeling with Few Labels

no code implementations 1 Jan 2021 Yaqing Wang, Subhabrata Mukherjee, Haoda Chu, Yuancheng Tu, Ming Wu, Jing Gao, Ahmed Hassan Awadallah

Neural sequence labeling is an important technique employed for many Natural Language Processing (NLP) tasks, such as Named Entity Recognition (NER), slot tagging for dialog systems and semantic parsing.

Meta-Learning named-entity-recognition +3

Adaptive Self-training for Few-shot Neural Sequence Labeling

no code implementations 7 Oct 2020 Yaqing Wang, Subhabrata Mukherjee, Haoda Chu, Yuancheng Tu, Ming Wu, Jing Gao, Ahmed Hassan Awadallah

While self-training serves as an effective mechanism to learn from large amounts of unlabeled data, meta-learning helps in adaptive sample re-weighting to mitigate error propagation from noisy pseudo-labels.

Meta-Learning named-entity-recognition +3

Efficient Knowledge Graph Validation via Cross-Graph Representation Learning

no code implementations 16 Aug 2020 Yaqing Wang, Fenglong Ma, Jing Gao

To tackle this challenging task, we propose a cross-graph representation learning framework, i.e., CrossVal, which can leverage an external KG to validate the facts in the target KG efficiently.

Graph Representation Learning Knowledge Graphs

Visual Identification of Individual Holstein-Friesian Cattle via Deep Metric Learning

2 code implementations 16 Jun 2020 William Andrew, Jing Gao, Siobhan Mullan, Neill Campbell, Andrew W Dowsey, Tilo Burghardt

Holstein-Friesian cattle exhibit individually-characteristic black and white coat patterns visually akin to those arising from Turing's reaction-diffusion systems.

Metric Learning

Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data

no code implementations 15 Jun 2020 Yaqing Wang, Yifan Ethan Xu, Xi-An Li, Xin Luna Dong, Jing Gao

We formalize the problem of validating the textual attribute values of products from a variety of categories as a natural language inference task in the few-shot learning setting, and propose a meta-learning latent variable model to jointly process the signals obtained from product profiles and textual attribute values.

Attribute Few-Shot Learning +1

Decomposed Adversarial Learned Inference

no code implementations 21 Apr 2020 Alexander Hanbo Li, Yaqing Wang, Changyou Chen, Jing Gao

Effective inference for a generative adversarial model remains an important and challenging problem.

Practical Data Poisoning Attack against Next-Item Recommendation

no code implementations 7 Apr 2020 Hengtong Zhang, Yaliang Li, Bolin Ding, Jing Gao

In real-world recommendation systems, the cost of retraining recommendation models is high, and the interaction frequency between users and a recommendation system is restricted. Given these real-world restrictions, we propose to let the agent interact with a recommender simulator instead of the target recommendation system and leverage the transferability of the generated adversarial samples to poison the target system.

Data Poisoning Recommendation Systems

A Survey on Causal Inference

1 code implementation 5 Feb 2020 Liuyi Yao, Zhixuan Chu, Sheng Li, Yaliang Li, Jing Gao, Aidong Zhang

Embraced with the rapidly developed machine learning area, various causal effect estimation methods for observational data have sprung up.

BIG-bench Machine Learning Causal Inference

Weak Supervision for Fake News Detection via Reinforcement Learning

1 code implementation 28 Dec 2019 Yaqing Wang, Weifeng Yang, Fenglong Ma, Jin Xu, Bin Zhong, Qiang Deng, Jing Gao

In order to tackle this challenge, we propose a reinforced weakly-supervised fake news detection framework, i.e., WeFEND, which can leverage users' reports as weak supervision to enlarge the amount of training data for fake news detection.

Fake News Detection reinforcement-learning +1

Atmospheric turbulence removal using convolutional neural network

no code implementations 22 Dec 2019 Jing Gao, N. Anantrasirichai, David Bull

This paper describes a novel deep learning-based method for mitigating the effects of atmospheric distortion.

Data Poisoning Attack against Knowledge Graph Embedding

no code implementations 26 Apr 2019 Hengtong Zhang, Tianhang Zheng, Jing Gao, Chenglin Miao, Lu Su, Yaliang Li, Kui Ren

Knowledge graph embedding (KGE) is a technique for learning continuous embeddings for entities and relations in the knowledge graph. Due to its benefit to a variety of downstream tasks such as knowledge graph completion, question answering and recommendation, KGE has gained significant attention recently.

Data Poisoning Knowledge Graph Completion +2

Representation Learning for Treatment Effect Estimation from Observational Data

1 code implementation NeurIPS 2018 Liuyi Yao, Sheng Li, Yaliang Li, Mengdi Huai, Jing Gao, Aidong Zhang

Estimating individual treatment effect (ITE) is a challenging problem in causal inference, due to the missing counterfactuals and the selection bias.

Causal Inference Representation Learning +1

Finding Similar Medical Questions from Question Answering Websites

no code implementations 14 Oct 2018 Yaliang Li, Liuyi Yao, Nan Du, Jing Gao, Qi Li, Chuishi Meng, Chenwei Zhang, Wei Fan

Patients who have medical information demands tend to post questions about their health conditions on these crowdsourced Q&A websites and get answers from other users.

Question Answering Retrieval

Towards Differentially Private Truth Discovery for Crowd Sensing Systems

no code implementations 10 Oct 2018 Yaliang Li, Houping Xiao, Zhan Qin, Chenglin Miao, Lu Su, Jing Gao, Kui Ren, Bolin Ding

To better utilize sensory data, the problem of truth discovery, whose goal is to estimate user quality and infer reliable aggregated results through quality-aware data aggregation, has emerged as a hot topic.

Privacy Preserving

AIM: Adversarial Inference by Matching Priors and Conditionals

no code implementations 27 Sep 2018 Hanbo Li, Yaqing Wang, Changyou Chen, Jing Gao

We propose a novel approach, Adversarial Inference by Matching priors and conditionals (AIM), which explicitly matches prior and conditional distributions in both data and code spaces, and puts a direct constraint on the dependency structure of the generative model.

Long-Term Memory Networks for Question Answering

no code implementations 6 Jul 2017 Fenglong Ma, Radha Chitta, Saurabh Kataria, Jing Zhou, Palghat Ramesh, Tong Sun, Jing Gao

Question answering is an important and difficult task in the natural language processing domain, because many basic natural language processing tasks can be cast into a question answering task.

Question Answering

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks

no code implementations 19 Jun 2017 Fenglong Ma, Radha Chitta, Jing Zhou, Quanzeng You, Tong Sun, Jing Gao

Existing work solves this problem by employing recurrent neural networks (RNNs) to model EHR data and utilizing a simple attention mechanism to interpret the results.

Multi-source Hierarchical Prediction Consolidation

no code implementations 11 Aug 2016 Chenwei Zhang, Sihong Xie, Yaliang Li, Jing Gao, Wei Fan, Philip S. Yu

We propose a novel multi-source hierarchical prediction consolidation method to effectively exploit the complicated hierarchical label structures to resolve the noisy and conflicting information that inherently originates from multiple imperfect sources.

Multilabel Consensus Classification

no code implementations 16 Oct 2013 Sihong Xie, Xiangnan Kong, Jing Gao, Wei Fan, Philip S. Yu

Nonetheless, data nowadays are usually multilabeled, such that more than one label has to be predicted at the same time.

Classification General Classification

Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models

no code implementations NeurIPS 2009 Jing Gao, Feng Liang, Wei Fan, Yizhou Sun, Jiawei Han

First, we can boost the diversity of classification ensemble by incorporating multiple clustering outputs, each of which provides grouping constraints for the joint label predictions of a set of related objects.

Clustering General Classification
