Search Results for author: Yiren Zhao

Found 42 papers, 14 papers with code

Dynamic Channel Pruning: Feature Boosting and Suppression

2 code implementations • ICLR 2019 • Xitong Gao, Yiren Zhao, Łukasz Dudziak, Robert Mullins, Cheng-Zhong Xu

Making deep convolutional neural networks more accurate typically comes at the cost of increased computational and memory resources.

Model Compression Network Pruning

113

Paper
Code

Focused Quantization for Sparse CNNs

1 code implementation • NeurIPS 2019 • Yiren Zhao, Xitong Gao, Daniel Bates, Robert Mullins, Cheng-Zhong Xu

In ResNet-50, we achieved a 18. 08x CR with only 0. 24% loss in top-5 accuracy, outperforming existing compression methods.

Neural Network Compression Quantization

113

Paper
Code

Markpainting: Adversarial Machine Learning meets Inpainting

1 code implementation • 1 Jun 2021 • David Khachaturov, Ilia Shumailov, Yiren Zhao, Nicolas Papernot, Ross Anderson

Inpainting is a learned interpolation technique that is based on generative modeling and used to populate masked or missing pieces in an image; it has wide applications in picture editing and retouching.

BIG-bench Machine Learning

Paper
Code

Sponge Examples: Energy-Latency Attacks on Neural Networks

2 code implementations • 5 Jun 2020 • Ilia Shumailov, Yiren Zhao, Daniel Bates, Nicolas Papernot, Robert Mullins, Ross Anderson

The high energy costs of neural network training and inference led to the use of acceleration hardware such as GPUs and TPUs.

Autonomous Vehicles

Paper
Code

Genomic Interpreter: A Hierarchical Genomic Deep Neural Network with 1D Shifted Window Transformer

1 code implementation • 8 Jun 2023 • Zehui Li, Akashaditya Das, William A V Beardall, Yiren Zhao, Guy-Bart Stan

This work presents Genomic Interpreter: a novel architecture for genomic assay prediction.

Interpretable Machine Learning

Paper
Code

Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?

1 code implementation • 8 Oct 2023 • Cheng Zhang, Jianyi Cheng, Ilia Shumailov, George A. Constantinides, Yiren Zhao

In this work, we explore the statistical and learning properties of the LLM layer and attribute the bottleneck of LLM quantisation to numerical scaling offsets.

Attribute

Paper
Code

Architectural Backdoors in Neural Networks

2 code implementations • CVPR 2023 • Mikel Bober-Irizar, Ilia Shumailov, Yiren Zhao, Robert Mullins, Nicolas Papernot

Machine learning is vulnerable to adversarial manipulation.

Inductive Bias

Paper
Code

Flareon: Stealthy any2any Backdoor Injection via Poisoned Augmentation

1 code implementation • 20 Dec 2022 • Tianrui Qin, Xianghuan He, Xitong Gao, Yiren Zhao, Kejiang Ye, Cheng-Zhong Xu

Open software supply chain attacks, once successful, can exact heavy costs in mission-critical applications.

Data Augmentation

Paper
Code

Latent Diffusion Model for DNA Sequence Generation

1 code implementation • 9 Oct 2023 • Zehui Li, Yuhao Ni, Tim August B. Huygelen, Akashaditya Das, Guoxuan Xia, Guy-Bart Stan, Yiren Zhao

On the other hand, Diffusion Models are a promising new class of generative models that are not burdened with these problems, enabling them to reach the state-of-the-art in domains such as image generation.

Text Generation

Paper
Code

Revisiting Automated Prompting: Are We Actually Doing Better?

1 code implementation • 7 Apr 2023 • Yulin Zhou, Yiren Zhao, Ilia Shumailov, Robert Mullins, Yarin Gal

Current literature demonstrates that Large Language Models (LLMs) are great few-shot learners, and prompting significantly increases their performance on a range of downstream tasks in a few-shot learning setting.

Few-Shot Learning

Paper
Code

The Curse of Recursion: Training on Generated Data Makes Models Forget

1 code implementation • 27 May 2023 • Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Yarin Gal, Nicolas Papernot, Ross Anderson

It is now clear that large language models (LLMs) are here to stay, and will bring about drastic change in the whole ecosystem of online text and images.

Descriptive

Paper
Code

Manipulating SGD with Data Ordering Attacks

1 code implementation • NeurIPS 2021 • Ilia Shumailov, Zakhar Shumaylov, Dmitry Kazhdan, Yiren Zhao, Nicolas Papernot, Murat A. Erdogdu, Ross Anderson

Machine learning is vulnerable to a wide variety of attacks.

Paper
Code

Augmentation Backdoors

1 code implementation • 29 Sep 2022 • Joseph Rance, Yiren Zhao, Ilia Shumailov, Robert Mullins

It is well known that backdoors can be inserted into machine learning models through serving a modified dataset to train on.

Data Augmentation

Paper
Code

Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration

1 code implementation • 27 Jan 2023 • Xiangyu Zhao, Hannes Stärk, Dominique Beaini, Yiren Zhao, Pietro Liò

Existing GNN benchmarking methods for molecular representation learning focus on comparing the GNNs' performances on some node/graph classification/regression tasks on certain datasets.

Benchmarking Graph Classification +3

Paper
Code

To compress or not to compress: Understanding the Interactions between Adversarial Attacks and Neural Network Compression

no code implementations • 29 Sep 2018 • Yiren Zhao, Ilia Shumailov, Robert Mullins, Ross Anderson

We, therefore, investigate the extent to which adversarial samples are transferable between uncompressed and compressed DNNs.

Neural Network Compression

Paper
Add Code

The Taboo Trap: Behavioural Detection of Adversarial Samples

no code implementations • 18 Nov 2018 • Ilia Shumailov, Yiren Zhao, Robert Mullins, Ross Anderson

Most existing detection mechanisms against adversarial attacksimpose significant costs, either by using additional classifiers to spot adversarial samples, or by requiring the DNN to be restructured.

Paper
Add Code

Sitatapatra: Blocking the Transfer of Adversarial Samples

no code implementations • 23 Jan 2019 • Ilia Shumailov, Xitong Gao, Yiren Zhao, Robert Mullins, Ross Anderson, Cheng-Zhong Xu

Convolutional Neural Networks (CNNs) are widely used to solve classification tasks in computer vision.

Blocking General Classification

Paper
Add Code

Blackbox Attacks on Reinforcement Learning Agents Using Approximated Temporal Information

no code implementations • 6 Sep 2019 • Yiren Zhao, Ilia Shumailov, Han Cui, Xitong Gao, Robert Mullins, Ross Anderson

In this work, we show how such samples can be generalised from White-box and Grey-box attacks to a strong Black-box case, where the attacker has no knowledge of the agents, their training parameters and their training methods.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Automatic Generation of Multi-precision Multi-arithmetic CNN Accelerators for FPGAs

no code implementations • 21 Oct 2019 • Yiren Zhao, Xitong Gao, Xuan Guo, Junyi Liu, Erwei Wang, Robert Mullins, Peter Y. K. Cheung, George Constantinides, Cheng-Zhong Xu

Furthermore, we show how Tomato produces implementations of networks with various sizes running on single or multiple FPGAs.

General Classification

Paper
Add Code

Towards Certifiable Adversarial Sample Detection

no code implementations • 20 Feb 2020 • Ilia Shumailov, Yiren Zhao, Robert Mullins, Ross Anderson

Convolutional Neural Networks (CNNs) are deployed in more and more classification systems, but adversarial samples can be maliciously crafted to trick them, and are becoming a real threat.

Adversarial Robustness

Paper
Add Code

Probabilistic Dual Network Architecture Search on Graphs

no code implementations • 21 Mar 2020 • Yiren Zhao, Duo Wang, Xitong Gao, Robert Mullins, Pietro Lio, Mateja Jamnik

We present the first differentiable Network Architecture Search (NAS) for Graph Neural Networks (GNNs).

Paper
Add Code

Pay Attention to Features, Transfer Learn Faster CNNs

no code implementations • ICLR 2020 • Kafeng Wang, Xitong Gao, Yiren Zhao, Xingjian Li, Dejing Dou, Cheng-Zhong Xu

Deep convolutional neural networks are now widely deployed in vision applications, but a limited size of training data can restrict their task performance.

Transfer Learning

Paper
Add Code

Learned Low Precision Graph Neural Networks

no code implementations • 19 Sep 2020 • Yiren Zhao, Duo Wang, Daniel Bates, Robert Mullins, Mateja Jamnik, Pietro Lio

LPGNAS learns the optimal architecture coupled with the best quantisation strategy for different components in the GNN automatically using back-propagation in a single search round.

Paper
Add Code

Nudge Attacks on Point-Cloud DNNs

no code implementations • 22 Nov 2020 • Yiren Zhao, Ilia Shumailov, Robert Mullins, Ross Anderson

The wide adaption of 3D point-cloud data in safety-critical applications such as autonomous driving makes adversarial samples a real threat.

Autonomous Driving

Paper
Add Code

Rapid Model Architecture Adaption for Meta-Learning

no code implementations • 10 Sep 2021 • Yiren Zhao, Xitong Gao, Ilia Shumailov, Nicolo Fusi, Robert Mullins

H-Meta-NAS shows a Pareto dominance compared to a variety of NAS and manual baselines in popular few-shot learning benchmarks with various hardware platforms and constraints.

Few-Shot Learning

Paper
Add Code

FedDrop: Trajectory-weighted Dropout for Efficient Federated Learning

no code implementations • 29 Sep 2021 • Dongping Liao, Xitong Gao, Yiren Zhao, Hao Dai, Li Li, Kafeng Wang, Kejiang Ye, Yang Wang, Cheng-Zhong Xu

Federated learning (FL) enables edge clients to train collaboratively while preserving individual's data privacy.

Federated Learning

Paper
Add Code

DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning

no code implementations • 31 Oct 2021 • Robert Hönig, Yiren Zhao, Robert Mullins

First, we introduce a time-adaptive quantization algorithm that increases the quantization level as training progresses.

Federated Learning Privacy Preserving +1

Paper
Add Code

Model Architecture Adaption for Bayesian Neural Networks

no code implementations • 9 Feb 2022 • Duo Wang, Yiren Zhao, Ilia Shumailov, Robert Mullins

Bayesian Neural Networks (BNNs) offer a mathematically grounded framework to quantify the uncertainty of model predictions but come with a prohibitive computation cost for both training and inference.

Uncertainty Quantification

Paper
Add Code

Efficient Adversarial Training With Data Pruning

no code implementations • 1 Jul 2022 • Maximilian Kaufmann, Yiren Zhao, Ilia Shumailov, Robert Mullins, Nicolas Papernot

In this paper we demonstrate data pruning-a method for increasing adversarial training efficiency through data sub-sampling. We empirically show that data pruning leads to improvements in convergence and reliability of adversarial training, albeit with different levels of utility degradation.

Paper
Add Code

DARTFormer: Finding The Best Type Of Attention

no code implementations • 2 Oct 2022 • Jason Ross Brown, Yiren Zhao, Ilia Shumailov, Robert D Mullins

Given the wide and ever growing range of different efficient Transformer attention mechanisms, it is important to identify which attention is most effective when given a task.

ListOps Neural Architecture Search +3

Paper
Add Code

Wide Attention Is The Way Forward For Transformers?

no code implementations • 2 Oct 2022 • Jason Ross Brown, Yiren Zhao, Ilia Shumailov, Robert D Mullins

We demonstrate that wide single layer Transformer models can compete with or outperform deeper ones in a variety of Natural Language Processing (NLP) tasks when both are trained from scratch.

text-classification Text Classification

Paper
Add Code

ImpNet: Imperceptible and blackbox-undetectable backdoors in compiled neural networks

no code implementations • 30 Sep 2022 • Tim Clifford, Ilia Shumailov, Yiren Zhao, Ross Anderson, Robert Mullins

These backdoors are impossible to detect during the training or data preparation processes, because they are not yet present.

Paper
Add Code

Revisiting Structured Dropout

no code implementations • 5 Oct 2022 • Yiren Zhao, Oluwatomisin Dada, Xitong Gao, Robert D Mullins

Large neural networks are often overparameterised and prone to overfitting, Dropout is a widely used regularization technique to combat overfitting and improve model generalization.

Scheduling

Paper
Add Code

Dynamic Stashing Quantization for Efficient Transformer Training

no code implementations • 9 Mar 2023 • Guo Yang, Daniel Lo, Robert Mullins, Yiren Zhao

Large Language Models (LLMs) have demonstrated impressive performance on a range of Natural Language Processing (NLP) tasks.

Quantization

Paper
Add Code

Adaptive Channel Sparsity for Federated Learning Under System Heterogeneity

no code implementations • CVPR 2023 • Dongping Liao, Xitong Gao, Yiren Zhao, Cheng-Zhong Xu

Owing to the non-i. i. d.

Federated Learning

Paper
Add Code

Hybrid Graph: A Unified Graph Representation with Datasets and Benchmarks for Complex Graphs

no code implementations • 8 Jun 2023 • Zehui Li, Xiangyu Zhao, Mingzhu Shen, Guy-Bart Stan, Pietro Liò, Yiren Zhao

Additionally, though many Graph Neural Networks (GNNs) have been proposed for representation learning on higher-order graphs, they are usually only evaluated on simple graph datasets.

Graph Learning Representation Learning

Paper
Add Code

Will More Expressive Graph Neural Networks do Better on Generative Tasks?

no code implementations • 23 Aug 2023 • Xiandong Zou, Xiangyu Zhao, Pietro Liò, Yiren Zhao

Moreover, we show that GCPN and GraphAF with advanced GNNs can achieve state-of-the-art results across 17 other non-GNN-based graph generative approaches, such as variational autoencoders and Bayesian optimisation models, on the proposed molecular generative objectives (DRD2, Median1, Median2), which are impor- tant metrics for de-novo molecular design.

Bayesian Optimisation Graph Generation +1

Paper
Add Code

LLM4DV: Using Large Language Models for Hardware Test Stimuli Generation

no code implementations • 6 Oct 2023 • Zixi Zhang, Greg Chadwick, Hugo McNally, Yiren Zhao, Robert Mullins

Test stimuli generation has been a crucial but labor-intensive task in hardware design verification.

Benchmarking Mathematical Reasoning

Paper
Add Code

LQER: Low-Rank Quantization Error Reconstruction for LLMs

no code implementations • 4 Feb 2024 • Cheng Zhang, Jianyi Cheng, George A. Constantinides, Yiren Zhao

Post-training quantization of Large Language Models (LLMs) is challenging.

Knowledge Distillation Quantization

Paper
Add Code

DiscDiff: Latent Diffusion Model for DNA Sequence Generation

no code implementations • 8 Feb 2024 • Zehui Li, Yuhao Ni, William A V Beardall, Guoxuan Xia, Akashaditya Das, Guy-Bart Stan, Yiren Zhao

This paper introduces a novel framework for DNA sequence generation, comprising two key components: DiscDiff, a Latent Diffusion Model (LDM) tailored for generating discrete DNA sequences, and Absorb-Escape, a post-training algorithm designed to refine these sequences.

Paper
Add Code

Architectural Neural Backdoors from First Principles

no code implementations • 10 Feb 2024 • Harry Langford, Ilia Shumailov, Yiren Zhao, Robert Mullins, Nicolas Papernot

In this work we construct an arbitrary trigger detector which can be used to backdoor an architecture with no human supervision.

Paper
Add Code

Enhancing Real-World Complex Network Representations with Hyperedge Augmentation

no code implementations • 20 Feb 2024 • Xiangyu Zhao, Zehui Li, Mingzhu Shen, Guy-Bart Stan, Pietro Liò, Yiren Zhao

These methods cannot fully address the complexities of real-world large-scale networks that often involve higher-order node relations beyond only being pairwise.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.