no code implementations • 9 Oct 2024 • Hongbin Liu, Youzheng Chen, Arun Narayanan, Athula Balachandran, Pedro J. Moreno, Lun Wang
Recent advances in text-to-speech (TTS) systems, particularly those with voice cloning capabilities, have made voice impersonation readily accessible, raising ethical and legal concerns due to potential misuse for malicious activities like misinformation campaigns and fraud.
no code implementations • 15 Sep 2024 • Yiyi Tao, Zhuoyue Wang, Hang Zhang, Lun Wang
In noise-adaptive learning, we estimate the noise probability of each image-text pair based on the transformer's memorization effect and employ noise-adaptive regularization on image-text contrastive learning to condition cross-modal alignment.
no code implementations • 29 Aug 2024 • Lun Wang
Under this assumption, the convergence analysis shows that micro-batch clipping can improve the convergence rate asymptotically at the cost of an additional constant bias that does not diminish with more training iterations.
no code implementations • 18 Jun 2024 • Shaohuang Wang, Lun Wang, Yunhan Bu, Tianwei Huang
Large Language Models (LLMs) have achieved remarkable progress in language understanding and generation.
1 code implementation • 11 Jun 2024 • Hongbin Liu, Moyang Guo, Zhengyuan Jiang, Lun Wang, Neil Zhenqiang Gong
The increasing realism of synthetic speech, driven by advancements in text-to-speech models, raises ethical concerns regarding impersonation and disinformation.
no code implementations • 4 Jun 2024 • Lun Wang, Om Thakkar, Zhong Meng, Nicole Rafidi, Rohit Prabhavalkar, Arun Narayanan
Gradient clipping plays a vital role in training large-scale automatic speech recognition (ASR) models.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 2 Apr 2024 • Matthew Jagielski, Om Thakkar, Lun Wang
Our method fine-tunes the encoder to produce an ASR model, and then performs noise masking on this model, which we find recovers private information from the pretraining data, despite the model never having seen transcripts at pretraining time!
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 9 Feb 2024 • Xuanzhong Chen, Xiaohao Mao, Qihan Guo, Lun Wang, Shuyang Zhang, Ting Chen
Meanwhile, we have compiled the largest open-source dataset on rare disease patients, establishing a benchmark for future studies in this domain.
no code implementations • 22 Nov 2023 • Yunming Liao, Yang Xu, Hongli Xu, Lun Wang, Zhiwei Yao, Chunming Qiao
Recently, federated learning (FL) has emerged as a popular technique for edge AI to mine valuable knowledge in edge computing (EC) systems.
no code implementations • 18 Oct 2023 • Lun Wang, Om Thakkar, Rajiv Mathews
We empirically show that clipping each example's gradient can mitigate memorization for sped-up training examples with up to 16 repetitions in the training set.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 19 Feb 2023 • Arun Ganesh, Mahdi Haghifam, Milad Nasr, Sewoong Oh, Thomas Steinke, Om Thakkar, Abhradeep Thakurta, Lun Wang
To explain this phenomenon, we hypothesize that the non-convex loss landscape of a model training necessitates an optimization algorithm to go through two phases.
2 code implementations • 24 May 2022 • Banghua Zhu, Lun Wang, Qi Pang, Shuai Wang, Jiantao Jiao, Dawn Song, Michael I. Jordan
In contrast to prior work, our proposed protocols improve the dimension dependence and achieve a tight statistical rate in terms of all the parameters for strongly convex losses.
no code implementations • 29 Sep 2021 • Lun Wang, Qi Pang, Shuai Wang, Dawn Song
In the present work, we propose a federated learning protocol with bi-directional security guarantees.
no code implementations • 29 Sep 2021 • Lun Wang, Qi Pang, Shuai Wang, Dawn Song
In this paper, we propose the first secure federated $\chi^2$-test protocol, FED-$\chi^2$.
no code implementations • 2 May 2021 • Lun Wang, Zaynah Javed, Xian Wu, Wenbo Guo, Xinyu Xing, Dawn Song
Recent research has confirmed the feasibility of backdoor attacks in deep reinforcement learning (RL) systems.
2 code implementations • 2 Mar 2021 • Wenxiao Wang, Tianhao Wang, Lun Wang, Nanqing Luo, Pan Zhou, Dawn Song, Ruoxi Jia
Deep learning techniques have achieved remarkable performance in wide-ranging tasks.
no code implementations • 1 Jan 2021 • Lun Wang, Qi Pang, Shuai Wang, Dawn Song
In this paper, we present F^2ed-Learning, the first federated learning protocol simultaneously defending against both semi-honest server and Byzantine malicious clients.
no code implementations • 1 Jan 2021 • Lun Wang, Ruoxi Jia, Dawn Song
We provide complete analysis of the privacy guarantee, communication cost and convergence rate of D2p-fed.
no code implementations • 2 Oct 2020 • Lun Wang, Qi Pang, Shuai Wang, Dawn Song
At one end of the spectrum, some work uses secure aggregation techniques to hide the individual client's updates and only reveal the aggregated global update to a malicious server that strives to infer the clients' privacy from their updates.
no code implementations • 22 Jun 2020 • Lun Wang, Ruoxi Jia, Dawn Song
In this paper, we propose the discrete Gaussian based differentially private federated learning (D2P-Fed), a unified scheme to achieve both differential privacy (DP) and communication efficiency in federated learning (FL).
no code implementations • NeurIPS 2020 • Lun Wang, Qi Pang, Dawn Song
Causal graph discovery refers to the process of discovering causal relation graphs from purely observational data.
1 code implementation • 2 Aug 2019 • Wenbo Guo, Lun Wang, Xinyu Xing, Min Du, Dawn Song
As such, given a deep neural network model and clean input samples, it is very challenging to inspect and determine the existence of a trojan backdoor.