Search Results for author: Sai Wu

Found 16 papers, 9 papers with code

Pre-Trained Model Recommendation for Downstream Fine-tuning

no code implementations • 11 Mar 2024 • Jiameng Bai, Sai Wu, Jie Song, Junbo Zhao, Gang Chen

As a fundamental problem in transfer learning, model selection aims to rank off-the-shelf pre-trained models and select the most suitable one for the new target task.

Inductive Bias Model Selection +1

Paper
Add Code

FL-GUARD: A Holistic Framework for Run-Time Detection and Recovery of Negative Federated Learning

no code implementations • 7 Mar 2024 • Hong Lin, Lidan Shou, Ke Chen, Gang Chen, Sai Wu

Federated learning (FL) is a promising approach for learning a model from data distributed on massive clients without exposing data privacy.

Federated Learning

Paper
Add Code

ModelGiF: Gradient Fields for Model Functional Distance

1 code implementation • ICCV 2023 • Jie Song, Zhengqi Xu, Sai Wu, Gang Chen, Mingli Song

The last decade has witnessed the success of deep learning and the surge of publicly released trained models, which necessitates the quantification of the model functional distance for various purposes.

Paper
Code

Towards Cross-Table Masked Pretraining for Web Data Mining

1 code implementation • 10 Jul 2023 • Chao Ye, Guoshan Lu, Haobo Wang, Liyao Li, Sai Wu, Gang Chen, Junbo Zhao

Tabular data pervades the landscape of the World Wide Web, playing a foundational role in the digital architecture that underpins online information.

Contrastive Learning

Paper
Code

Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility

1 code implementation • 15 May 2023 • Wentao Ye, Mingfeng Ou, Tianyi Li, Yipeng chen, Xuetao Ma, Yifan Yanggong, Sai Wu, Jie Fu, Gang Chen, Haobo Wang, Junbo Zhao

With most of the related literature in the era of LLM uncharted, we propose an automated workflow that copes with an upscaled number of queries/responses.

Memorization

Paper
Code

Controllable Textual Inversion for Personalized Text-to-Image Generation

1 code implementation • 11 Apr 2023 • Jianan Yang, Haobo Wang, YanMing Zhang, Ruixuan Xiao, Sai Wu, Gang Chen, Junbo Zhao

The recent large-scale generative modeling has attained unprecedented performance especially in producing high-fidelity images driven by text prompts.

Active Learning Text-to-Image Generation

Paper
Code

Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting

1 code implementation • 13 Feb 2023 • Yuchen Liu, Chen Chen, Lingjuan Lyu, Fangzhao Wu, Sai Wu, Gang Chen

In order to address this issue, we propose GAS, a \shorten approach that can successfully adapt existing robust AGRs to non-IID settings.

Federated Learning

Paper
Code

Comparison Knowledge Translation for Generalizable Image Classification

1 code implementation • 7 May 2022 • Zunlei Feng, Tian Qiu, Sai Wu, Xiaotuan Jin, Zengliang He, Mingli Song, Huiqiong Wang

In this paper, we attempt to build a generalizable framework that emulates the humans' recognition mechanism in the image classification task, hoping to improve the classification performance on unseen categories with the support of annotations of other categories.

Classification Image Classification +1

Paper
Code

Towards Unifying the Label Space for Aspect- and Sentence-based Sentiment Analysis

1 code implementation • Findings (ACL) 2022 • Yiming Zhang, Min Zhang, Sai Wu, Junbo Zhao

The aspect-based sentiment analysis (ABSA) is a fine-grained task that aims to determine the sentiment polarity towards targeted aspect terms occurring in the sentence.

Ranked #4 on Aspect-Based Sentiment Analysis (ABSA) on SemEval-2014 Task-4 (using extra training data)

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +3

Paper
Code

Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers

1 code implementation • 9 Dec 2021 • Zunlei Feng, Jiacong Hu, Sai Wu, Xiaotian Yu, Jie Song, Mingli Song

The aggregate gradient strategy is a versatile module for mainstream CNN classifiers.

Paper
Code

Joining datasets via data augmentation in the label space for neural networks

no code implementations • 17 Jun 2021 • Jake Zhao, Mingfeng Ou, Linji Xue, Yunkai Cui, Sai Wu, Gang Chen

Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference.

Data Augmentation text-classification +1

Paper
Add Code

A critical look at the current train/test split in machine learning

no code implementations • 8 Jun 2021 • Jimin Tan, Jianan Yang, Sai Wu, Gang Chen, Jake Zhao

The establishment of these split protocols are based on two assumptions: (i)-fixing the dataset to be eternally static so we could evaluate different machine learning algorithms or models; (ii)-there is a complete set of annotated data available to researchers or industrial practitioners.

Active Learning Benchmarking +2

Paper
Add Code

Effective Slot Filling via Weakly-Supervised Dual-Model Learning

1 code implementation • AAAI 2021 • Jue Wang, Ke Chen, Lidan Shou, Sai Wu, Gang Chen

By using some particular weakly-labeled data, namely the plain phrases included in sentences, we propose a weaklysupervised slot filling approach.

slot-filling Slot Filling +1

Paper
Code

LINDT: Tackling Negative Federated Learning with Local Adaptation

no code implementations • 23 Nov 2020 • Hong Lin, Lidan Shou, Ke Chen, Gang Chen, Sai Wu

On occasion of NFL recovery, the framework makes adaptation to the federated model on each client's local data by learning a Layer-wise Intertwined Dual-model.

Federated Learning

Paper
Add Code

BERT-JAM: Boosting BERT-Enhanced Neural Machine Translation with Joint Attention

no code implementations • 9 Nov 2020 • Zhebin Zhang, Sai Wu, Dawei Jiang, Gang Chen

In this work, we propose a novel BERT-enhanced NMT model called BERT-JAM which improves upon existing models from two aspects: 1) BERT-JAM uses joint-attention modules to allow the encoder/decoder layers to dynamically allocate attention between different representations, and 2) BERT-JAM allows the encoder/decoder layers to make use of BERT's intermediate representations by composing them using a gated linear unit (GLU).

Machine Translation NMT +1

Paper
Add Code

Semi-Supervised Few-Shot Learning for Dual Question-Answer Extraction

no code implementations • 8 Apr 2019 • Jue Wang, Ke Chen, Lidan Shou, Sai Wu, Sharad Mehrotra

In this paper, we redefine the problem as question-answer extraction, and present SAMIE: Self-Asking Model for Information Ixtraction, a semi-supervised model which dually learns to ask and to answer questions by itself.

Clustering Few-Shot Learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.