Search Results for author: Lydia Y. Chen

Found 30 papers, 6 papers with code

Duwak: Dual Watermarks in Large Language Models

no code implementations12 Mar 2024 Chaoyi Zhu, Jeroen Galjaard, Pin-Yu Chen, Lydia Y. Chen

As large language models (LLM) are increasingly used for text generation tasks, it is critical to audit their usages, govern their applications, and mitigate their potential harms.

Text Generation

Quantifying and Mitigating Privacy Risks for Tabular Generative Models

no code implementations12 Mar 2024 Chaoyi Zhu, Jiayi Tang, Hans Brouwer, Juan F. Pérez, Marten van Dijk, Lydia Y. Chen

The backbone technology of tabular synthesizers is rooted in image generative models, ranging from Generative Adversarial Networks (GANs) to recent diffusion models.

Privacy Preserving

The Rise of Diffusion Models in Time-Series Forecasting

2 code implementations5 Jan 2024 Caspar Meijer, Lydia Y. Chen

This survey delves into the application of diffusion models in time-series forecasting.

Time Series Time Series Forecasting

CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model

no code implementations3 Nov 2023 Jui-Yi Tsai, Ya-Wen Teng, Ho Chiok Yew, De-Nian Yang, Lydia Y. Chen

The social graphs synthesized by the generative models are increasingly in demand due to data scarcity and concerns over user privacy.

Denoising

BatMan-CLR: Making Few-shots Meta-Learners Resilient Against Label Noise

no code implementations12 Sep 2023 Jeroen M. Galjaard, Robert Birke, Juan Perez, Lydia Y. Chen

We show that the accuracy of Reptile, iMAML, and foMAML drops by up to 42% on the Omniglot and CifarFS datasets when meta-training is affected by label noise.

Meta-Learning

GTV: Generating Tabular Data via Vertical Federated Learning

no code implementations3 Feb 2023 Zilong Zhao, Han Wu, Aad van Moorsel, Lydia Y. Chen

Conditional vector for tabular GANs is a valuable tool to control specific features of generated data.

Privacy Preserving Vertical Federated Learning

Permutation-Invariant Tabular Data Synthesis

no code implementations17 Nov 2022 Yujin Zhu, Zilong Zhao, Robert Birke, Lydia Y. Chen

We show that changing the input column order worsens the statistical difference between real and synthetic data by up to 38. 67% due to the encoding of tabular data and the network architectures.

Aergia: Leveraging Heterogeneity in Federated Learning Systems

1 code implementation12 Oct 2022 Bart Cox, Lydia Y. Chen, Jérémie Decouchant

Federated Learning (FL) is a popular approach for distributed deep learning that prevents the pooling of large amounts of data in a central server.

Federated Learning

FCT-GAN: Enhancing Table Synthesis via Fourier Transform

no code implementations12 Oct 2022 Zilong Zhao, Robert Birke, Lydia Y. Chen

Mainstream state-of-the-art tabular data synthesizers draw methodologies from Generative Adversarial Networks (GANs), which are composed of a generator and a discriminator.

Generative Adversarial Network

AGIC: Approximate Gradient Inversion Attack on Federated Learning

no code implementations28 Apr 2022 Jin Xu, Chi Hong, Jiyue Huang, Lydia Y. Chen, Jérémie Decouchant

Recent reconstruction attacks apply a gradient inversion optimization on the gradient update of a single minibatch to reconstruct the private data used by clients during training.

Federated Learning

Federated Geometric Monte Carlo Clustering to Counter Non-IID Datasets

no code implementations23 Apr 2022 Federico Lucchetti, Jérémie Decouchant, Maria Fernandes, Lydia Y. Chen, Marcus Völp

Federated learning allows clients to collaboratively train models on datasets that are acquired in different locations and that cannot be exchanged because of their size or regulations.

Clustering Federated Learning

CTAB-GAN+: Enhancing Tabular Data Synthesis

2 code implementations1 Apr 2022 Zilong Zhao, Aditya Kunar, Robert Birke, Lydia Y. Chen

We extensively evaluate CTAB-GAN+ on data similarity and analysis utility against state-of-the-art tabular GANs.

Privacy Preserving

Fabricated Flips: Poisoning Federated Learning without Data

no code implementations7 Feb 2022 Jiyue Huang, Zilong Zhao, Lydia Y. Chen, Stefanie Roos

Consequently, we design REFD, a defense specifically crafted to protect against data-free attacks.

Federated Learning

MEGA: Model Stealing via Collaborative Generator-Substitute Networks

no code implementations31 Jan 2022 Chi Hong, Jiyue Huang, Lydia Y. Chen

However, they are all based on competing generator-substitute networks and hence encounter training instability. In this paper we propose a data-free model stealing frame-work, MEGA, which is based on collaborative generator-substitute networks and only requires the target model toprovide label prediction for synthetic query examples.

Attacks and Defenses for Free-Riders in Multi-Discriminator GAN

no code implementations24 Jan 2022 Zilong Zhao, Jiyue Huang, Stefanie Roos, Lydia Y. Chen

To mitigate the model degradation, we propose a defense strategy against free-riders in MD-GAN, termed DFG.

LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision

no code implementations18 Dec 2021 Rui Han, Qinglong Zhang, Chi Harold Liu, Guoren Wang, Jian Tang, Lydia Y. Chen

The prior art sheds light on exploring the accuracy-resource tradeoff by scaling the model sizes in accordance to resource dynamics.

Knowledge Distillation Model Compression +1

Confident Data-free Model Stealing for Black-box Adversarial Attacks

no code implementations29 Sep 2021 Chi Hong, Jiyue Huang, Lydia Y. Chen

Deep machine learning models are increasingly deployed in the wild, subject to adversarial attacks.

Fed-TGAN: Federated Learning Framework for Synthesizing Tabular Data

1 code implementation18 Aug 2021 Zilong Zhao, Robert Birke, Aditya Kunar, Lydia Y. Chen

And, while learning GANs to synthesize images on FL systems has just been demonstrated, it is unknown if GANs for tabular data can be learned from decentralized data sources.

Federated Learning Privacy Preserving

Is Shapley Value fair? Improving Client Selection for Mavericks in Federated Learning

no code implementations20 Jun 2021 Jiyue Huang, Chi Hong, Lydia Y. Chen, Stefanie Roos

Shapley Value is commonly adopted to measure and incentivize client participation in federated learning.

Federated Learning

Enhancing Robustness of On-line Learning Models on Highly Noisy Data

1 code implementation19 Mar 2021 Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

Classification algorithms have been widely adopted to detect anomalies for various systems, e. g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i. e., features and labels are correctly set.

Anomaly Detection Face Recognition

CTAB-GAN: Effective Table Data Synthesizing

1 code implementation16 Feb 2021 Zilong Zhao, Aditya Kunar, Hiek Van der Scheer, Robert Birke, Lydia Y. Chen

In this paper, we develop CTAB-GAN, a novel conditional table GAN architecture that can effectively model diverse data types, including a mix of continuous and categorical variables.

Robust Learning via Golden Symmetric Loss of (un)Trusted Labels

no code implementations1 Jan 2021 Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

In this paper, we propose to construct a golden symmetric loss (GSL) based on the estimated confusion matrix as to avoid overfitting to noisy labels and learn effectively from hard classes.

End-to-End Learning from Noisy Crowd to Supervised Machine Learning Models

no code implementations13 Nov 2020 Taraneh Younesian, Chi Hong, Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

Furthermore, relabeling only 10% of the data using the expert's results in over 90% classification accuracy with SVM.

BIG-bench Machine Learning

Active Learning for Noisy Data Streams Using Weak and Strong Labelers

no code implementations27 Oct 2020 Taraneh Younesian, Dick Epema, Lydia Y. Chen

Labeling data correctly is an expensive and challenging task in machine learning, especially for on-line data streams.

Active Learning Image Classification +1

TrustNet: Learning from Trusted Data Against (A)symmetric Label Noise

no code implementations13 Jul 2020 Amirmasoud Ghiassi, Taraneh Younesian, Robert Birke, Lydia Y. Chen

Based on the insights, we design TrustNet that first adversely learns the pattern of noise corruption, being it both symmetric or asymmetric, from a small set of trusted data.

ExpertNet: Adversarial Learning and Recovery Against Noisy Labels

no code implementations10 Jul 2020 Amirmasoud Ghiassi, Robert Birke, Rui Han, Lydia Y. Chen

Today's available datasets in the wild, e. g., from social media and open platforms, present tremendous opportunities and challenges for deep learning, as there is a significant portion of tagged images, but often with noisy, i. e. erroneous, labels.

Robust classification

QActor: On-line Active Learning for Noisy Labeled Stream Data

no code implementations28 Jan 2020 Taraneh Younesian, Zilong Zhao, Amirmasoud Ghiassi, Robert Birke, Lydia Y. Chen

A central feature of QActor is to dynamically adjust the query limit according to the learning loss for each data batch.

Active Learning

RAD: On-line Anomaly Detection for Highly Unreliable Data

no code implementations11 Nov 2019 Zilong Zhao, Robert Birke, Rui Han, Bogdan Robu, Sara Bouchenak, Sonia Ben Mokhtar, Lydia Y. Chen

Classification algorithms have been widely adopted to detect anomalies for various systems, e. g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i. e., features and labels are correctly set.

Anomaly Detection Face Recognition

Online Label Aggregation: A Variational Bayesian Approach

no code implementations19 Jul 2018 Chi Hong, Amirmasoud Ghiassi, Yichi Zhou, Robert Birke, Lydia Y. Chen

Our evaluation results on various online scenarios show that BiLA can effectively infer the true labels, with an error rate reduction of at least 10 to 1. 5 percent points for synthetic and real-world datasets, respectively.

Bayesian Inference Stochastic Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.