no code implementations • 26 Mar 2024 • Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Kyunggeun Lee, Jun Ma, Harris Teague
Large generative models such as large language models (LLMs) and diffusion models have revolutionized the fields of NLP and computer vision, respectively.
no code implementations • 26 Sep 2023 • Kartikeya Bhardwaj, Hsin-Pai Cheng, Sweta Priyadarshi, Zhuojin Li
To solve this problem, we propose ZiCo-BC, a novel bias correction for ZiCo.
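The abstract does not spell out the correction itself. Purely as a hypothetical illustration of how a zero-shot proxy can be debiased, the sketch below regresses raw proxy scores against a confounding architecture statistic (parameter count is an assumption, not the paper's choice) over a candidate pool and subtracts the fitted trend; none of this is the actual ZiCo-BC formulation.

```python
import numpy as np

def debias_proxy(raw_scores, confounder):
    """Hypothetical bias correction: remove the linear trend that a
    confounding statistic (e.g., #params) induces in a zero-shot proxy.
    An illustrative sketch, NOT the actual ZiCo-BC method."""
    slope, intercept = np.polyfit(confounder, raw_scores, deg=1)
    return raw_scores - (slope * confounder + intercept)

# Example: raw scores for 5 candidate architectures and their #params.
scores = np.array([3.1, 4.2, 5.0, 5.9, 6.7])
params = np.array([1e6, 2e6, 3e6, 4e6, 5e6])
corrected = debias_proxy(scores, params)  # residual score after debiasing
```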
1 code implementation • 5 Jul 2023 • Guihong Li, Duc Hoang, Kartikeya Bhardwaj, Ming Lin, Zhangyang Wang, Radu Marculescu
Recently, zero-shot (or training-free) Neural Architecture Search (NAS) approaches have been proposed to liberate NAS from the expensive training process.
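For readers unfamiliar with the setting, a minimal sketch of the generic training-free NAS loop follows; `search_space.sample()` and `proxy_fn` are placeholders for a concrete search space and a concrete zero-shot proxy, not any particular paper's API.

```python
import random

def zero_shot_nas(search_space, proxy_fn, n_candidates=100):
    """Generic training-free NAS: rank randomly sampled architectures by a
    zero-shot proxy and return the best one. `sample()` and `proxy_fn` are
    placeholders for a concrete search space and proxy (e.g., ZiCo)."""
    best_arch, best_score = None, float("-inf")
    for _ in range(n_candidates):
        arch = search_space.sample()   # draw a candidate architecture
        score = proxy_fn(arch)         # score it WITHOUT any training
        if score > best_score:
            best_arch, best_score = arch, score
    return best_arch
```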
no code implementations • 13 May 2023 • Guihong Li, Kartikeya Bhardwaj, Yuedong Yang, Radu Marculescu
Anytime neural networks (AnytimeNNs) are a promising solution to adaptively adjust the model complexity at runtime under various hardware resource constraints.
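A minimal sketch of the general early-exit idea behind anytime inference is shown below; the architecture and exit policy are illustrative assumptions, not the paper's specific design.

```python
import torch
import torch.nn as nn

class AnytimeMLP(nn.Module):
    """Illustrative anytime network: intermediate classifier heads let
    inference stop early under a tight compute budget."""
    def __init__(self, in_dim=784, hidden=256, n_classes=10, n_blocks=3):
        super().__init__()
        self.blocks = nn.ModuleList()
        self.exits = nn.ModuleList()
        dim = in_dim
        for _ in range(n_blocks):
            self.blocks.append(nn.Sequential(nn.Linear(dim, hidden), nn.ReLU()))
            self.exits.append(nn.Linear(hidden, n_classes))
            dim = hidden

    def forward(self, x, budget=None):
        preds = []
        for i, (block, head) in enumerate(zip(self.blocks, self.exits)):
            x = block(x)
            preds.append(head(x))
            if budget is not None and i + 1 >= budget:
                break  # stop once the runtime budget is exhausted
        return preds  # one prediction per evaluated exit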
1 code implementation • 26 Jan 2023 • Guihong Li, Yuedong Yang, Kartikeya Bhardwaj, Radu Marculescu
Based on this theoretical analysis, we propose a new zero-shot proxy, ZiCo, the first proxy that works consistently better than #Params.
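A simplified sketch of a ZiCo-style proxy is shown below: it scores a network from the per-parameter mean and standard deviation of gradients across a few mini-batches (high mean, low variance suggesting good trainability). Constants, layer grouping, and other details differ from the official ZiCo implementation.

```python
import torch

def zico_style_proxy(model, loader, loss_fn, n_batches=2, eps=1e-8):
    """Simplified ZiCo-style proxy from per-parameter gradient statistics
    across a few mini-batches; a sketch, not the official implementation."""
    grads = {n: [] for n, p in model.named_parameters() if p.requires_grad}
    it = iter(loader)
    for _ in range(n_batches):
        x, y = next(it)
        model.zero_grad()
        loss_fn(model(x), y).backward()
        for name, p in model.named_parameters():
            if p.grad is not None:
                grads[name].append(p.grad.detach().flatten().clone())
    score = 0.0
    for g in grads.values():
        if not g:
            continue
        g = torch.stack(g)                       # [n_batches, n_params]
        ratio = g.mean(0).abs() / (g.std(0) + eps)
        score += torch.log(ratio.sum() + eps).item()
    return score
```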
1 code implementation • 17 Aug 2022 • Kartikeya Bhardwaj, James Ward, Caleb Tung, Dibakar Gope, Lingchuan Meng, Igor Fedorov, Alex Chalfin, Paul Whatmough, Danny Loh
To address this question, we propose a new paradigm called Restructurable Activation Networks (RANs) that manipulate the amount of non-linearity in models to improve their hardware-awareness and efficiency.
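To make the notion of "manipulating the amount of non-linearity" concrete, here is a conceptual sketch (not the RAN paper's exact construction): when a block's activation is swapped for the identity, the surrounding convolutions form a purely linear chain and can be folded into a single convolution at inference time.

```python
import torch.nn as nn

class RestructurableBlock(nn.Module):
    """Illustrative block whose non-linearity can be toggled. With the
    identity activation, conv1 and conv2 become a linear chain that can
    be collapsed into one convolution for efficient inference."""
    def __init__(self, channels, nonlinear=True):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.act = nn.ReLU() if nonlinear else nn.Identity()
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x):
        return self.conv2(self.act(self.conv1(x)))
```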
1 code implementation • 29 Dec 2021 • Kartikeya Bhardwaj, Dibakar Gope, James Ward, Paul Whatmough, Danny Loh
Autonomous systems are highly vulnerable to a variety of adversarial attacks on Deep Neural Networks (DNNs).
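As background on the threat model (not this paper's contribution), the Fast Gradient Sign Method of Goodfellow et al. (2015) is the canonical example of such an attack:

```python
import torch

def fgsm_attack(model, loss_fn, x, y, eps=0.03):
    """Fast Gradient Sign Method: perturb the input along the sign of the
    loss gradient. Shown only to illustrate the kind of adversarial attack
    the paper defends against."""
    x = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x), y)
    loss.backward()
    x_adv = x + eps * x.grad.sign()        # perturb along the gradient sign
    return x_adv.clamp(0.0, 1.0).detach()  # keep pixels in valid range
```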
3 code implementations • 17 Mar 2021 • Kartikeya Bhardwaj, Milos Milosavljevic, Liam O'Neil, Dibakar Gope, Ramon Matas, Alex Chalfin, Naveen Suda, Lingchuan Meng, Danny Loh
Our results highlight the challenges faced by super resolution on AI accelerators and demonstrate that SESR is significantly faster (e.g., 6x-8x higher FPS) than existing models on mobile NPUs.
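SESR builds on collapsible linear blocks: an overparameterized, purely linear pair of convolutions used at training time collapses into a single small convolution at inference. The sketch below demonstrates the collapse for an assumed 3x3-then-1x1 layout (the paper's exact block layout may differ) and verifies the two forms agree numerically.

```python
import torch
import torch.nn as nn

# Train-time: expanded, purely linear 3x3 -> 1x1 conv pair (no activation
# in between). Inference-time: fold both into one 3x3 conv.
C, E = 16, 64                                   # narrow I/O, wide expansion
conv3 = nn.Conv2d(C, E, 3, padding=1, bias=False)
conv1 = nn.Conv2d(E, C, 1, bias=False)

collapsed = nn.Conv2d(C, C, 3, padding=1, bias=False)
with torch.no_grad():
    # Compose kernels: W[o,i] = sum_e W1x1[o,e] * W3x3[e,i]
    collapsed.weight.copy_(
        torch.einsum("oe,eikl->oikl", conv1.weight[:, :, 0, 0], conv3.weight)
    )

x = torch.randn(1, C, 32, 32)
assert torch.allclose(conv1(conv3(x)), collapsed(x), atol=1e-5)
```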
no code implementations • 1 Jan 2021 • Kartikeya Bhardwaj, Guihong Li, Radu Marculescu
(ii) Can certain topological characteristics of deep networks indicate a priori (i.e., without training) which models, with a different number of parameters/FLOPs/layers, achieve a similar accuracy?
no code implementations • 25 Aug 2020 • Kartikeya Bhardwaj, Wei Chen, Radu Marculescu
In this paper, we first highlight three major challenges to large-scale adoption of deep learning at the edge: (i) hardware-constrained IoT devices, (ii) data security and privacy in the IoT era, and (iii) the lack of network-aware deep learning algorithms for distributed inference across multiple IoT devices.
1 code implementation • 7 Apr 2020 • Wei Chen, Kartikeya Bhardwaj, Radu Marculescu
In this paper, we identify a new phenomenon called activation-divergence which occurs in Federated Learning (FL) due to data heterogeneity (i.e., data being non-IID) across multiple users.
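A minimal sketch of the kind of remedy this line of work (FedMAX) proposes: regularize activations at a chosen layer toward high entropy so they diverge less across non-IID clients. The exact loss weighting and layer choice below are assumptions, not the paper's settings.

```python
import torch.nn.functional as F

def max_entropy_regularizer(activations):
    """Penalize low-entropy activations: minimizing the returned value
    maximizes the entropy of softmax(activations), spreading activation
    mass and reducing divergence across clients. An illustrative sketch."""
    log_p = F.log_softmax(activations, dim=1)
    entropy = -(log_p.exp() * log_p).sum(dim=1).mean()
    return -entropy

# Usage (hypothetical): total = F.cross_entropy(logits, y) + beta * max_entropy_regularizer(acts)
```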
no code implementations • 23 Oct 2019 • Kartikeya Bhardwaj, Naveen Suda, Radu Marculescu
The significant computational requirements of deep learning present a major bottleneck for its large-scale adoption on hardware-constrained IoT devices.
2 code implementations • CVPR 2021 • Kartikeya Bhardwaj, Guihong Li, Radu Marculescu
In this paper, we reveal that the topology of concatenation-type skip connections is closely related to gradient propagation, which, in turn, makes the test performance of DNNs predictable.
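For concreteness, the connectivity pattern in question is the DenseNet-style concatenation skip, sketched below; this illustrates the topology the paper analyzes, not its predictive metric.

```python
import torch
import torch.nn as nn

class ConcatSkipBlock(nn.Module):
    """Minimal DenseNet-style block: each layer receives the concatenation
    of all earlier feature maps, so the skip-connection topology directly
    shapes how gradients propagate back to early layers."""
    def __init__(self, in_ch=16, growth=16, n_layers=3):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.Conv2d(in_ch + i * growth, growth, 3, padding=1)
            for i in range(n_layers)
        )

    def forward(self, x):
        features = [x]
        for layer in self.layers:
            out = torch.relu(layer(torch.cat(features, dim=1)))
            features.append(out)
        return torch.cat(features, dim=1)
```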
no code implementations • 26 Jul 2019 • Kartikeya Bhardwaj, Chingyi Lin, Anderson Sartor, Radu Marculescu
Therefore, we propose Network of Neural Networks (NoNN), a new distributed IoT learning paradigm that compresses a large pretrained 'teacher' deep network into several disjoint and highly-compressed 'student' modules, without loss of accuracy.
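Each NoNN student is trained by knowledge distillation from its partition of the teacher; the sketch below shows only the standard distillation loss (Hinton et al.), while NoNN's actual contribution, partitioning the teacher's final-layer knowledge across disjoint students, is not shown.

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    """Standard knowledge-distillation loss: temperature-softened teacher
    targets plus hard-label cross-entropy. NoNN applies this style of
    training to each disjoint student module."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```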
no code implementations • 17 May 2019 • Kartikeya Bhardwaj, Naveen Suda, Radu Marculescu
Model compression is eminently suited for deploying deep learning on IoT devices.
no code implementations • 20 Jan 2019 • Brian Davis, Umang Bhatt, Kartikeya Bhardwaj, Radu Marculescu, José M. F. Moura
In this paper, we present a new approach to interpreting deep learning models.
no code implementations • 1 Dec 2018 • Jiqian Dong, Gopaljee Atulya, Kartikeya Bhardwaj, Radu Marculescu
To this end, we propose a new network science- and representation learning-based approach that can quantify economic indicators and visualize the growth of various regions.