Search Results for author: Hyesoon Kim

Found 8 papers, 0 papers with code

Hydro: Adaptive Query Processing of ML Queries

no code implementations • 22 Mar 2024 • Gaurav Tarlok Kakkar, Jiashen Cao, Aubhro Sengupta, Joy Arulraj, Hyesoon Kim

Second, the optimal query plan for ML queries is data-dependent, requiring DBMSs to adapt the query plan on the fly during execution.
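
The data-dependent adaptation described here can be pictured with a small sketch (hypothetical Python, not Hydro's actual code): filters are reordered mid-stream based on the cost and selectivity observed so far, so cheap, highly selective predicates run first.

    import time

    def adaptive_filter(rows, predicates, reorder_every=1000):
        """Apply predicates to each row, periodically reordering them by
        observed time spent per dropped row (lower is better)."""
        stats = [{"time": 1e-9, "dropped": 0} for _ in predicates]
        order = list(range(len(predicates)))
        for i, row in enumerate(rows):
            keep = True
            for j in order:
                t0 = time.perf_counter()
                ok = predicates[j](row)
                stats[j]["time"] += time.perf_counter() - t0
                if not ok:
                    stats[j]["dropped"] += 1
                    keep = False
                    break
            if keep:
                yield row
            if (i + 1) % reorder_every == 0:
                # re-rank predicates: cheapest cost per dropped row first
                order.sort(key=lambda j: stats[j]["time"] / max(stats[j]["dropped"], 1))

Ranking by time spent per dropped row is the classic adaptive-reordering heuristic; it is shown only to illustrate the kind of on-the-fly plan change the abstract refers to.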

VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs

no code implementations • 17 Feb 2023 • Geonhwa Jeong, Sana Damani, Abhimanyu Rajeshkumar Bambhaniya, Eric Qin, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna

Therefore, as DL workloads embrace sparsity to reduce the computation and memory footprint of models, it is also imperative for CPUs to add support for sparsity to avoid under-utilizing the dense matrix engine and making inefficient use of the caches and registers.
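
To see why skipping zeros matters, here is a rough back-of-the-envelope comparison (a sketch assuming NumPy and SciPy; VEGETA's tile extensions are hardware features, not library calls):

    import numpy as np
    from scipy.sparse import random as sparse_random

    n = 512
    A = sparse_random(n, n, density=0.1, format="csr")  # 90% of entries are zero
    B = np.random.rand(n, n)

    dense_macs = n * n * n      # a dense engine pays for every zero
    sparse_macs = A.nnz * n     # a sparse kernel touches only the nonzeros
    C = A @ B                   # SciPy dispatches a CSR kernel here
    print(f"dense MACs: {dense_macs:,}  sparse MACs: {sparse_macs:,}")

At 10% density the sparse path does roughly a tenth of the multiply-accumulates, which is the under-utilization gap the paper targets in the matrix engine itself.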

RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU

no code implementations • 5 Oct 2021 • Geonhwa Jeong, Eric Qin, Ananda Samajdar, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna

As AI-based applications become pervasive, CPU vendors are starting to incorporate matrix engines within the datapath to boost efficiency.
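
A matrix engine of this kind is typically a systolic array. The toy simulation below is illustrative only (it models none of RASA's register-aware pipelining): operands are skewed in time so that each processing element accumulates exactly one output element as data streams past.

    import numpy as np

    def systolic_matmul(A, B):
        """Output-stationary systolic array: at cycle t, PE (i, j) consumes
        A[i, k] from the left and B[k, j] from above, where k = t - i - j."""
        n = A.shape[0]
        C = np.zeros((n, n))
        for t in range(3 * n - 2):          # total cycles with skewed injection
            for i in range(n):
                for j in range(n):
                    k = t - i - j
                    if 0 <= k < n:
                        C[i, j] += A[i, k] * B[k, j]
        return C

    A = np.random.rand(4, 4); B = np.random.rand(4, 4)
    assert np.allclose(systolic_matmul(A, B), A @ B)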

Reducing Inference Latency with Concurrent Architectures for Image Recognition

no code implementations • 13 Nov 2020 • Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim

Satisfying the high computation demands of modern deep learning architectures while achieving low inference latency is challenging.

Neural Architecture Search
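
Concurrent architectures expose branch-level parallelism that sequential models lack. A minimal sketch of that idea, using plain NumPy and threads rather than the paper's method:

    import numpy as np
    from concurrent.futures import ThreadPoolExecutor

    def branch(W, x):
        return np.maximum(W @ x, 0.0)   # one ReLU layer standing in for a branch

    rng = np.random.default_rng(0)
    x = rng.standard_normal(512)
    weights = [rng.standard_normal((512, 512)) for _ in range(4)]

    # independent branches overlap in time; only the merge point synchronizes
    with ThreadPoolExecutor(max_workers=4) as pool:
        outputs = list(pool.map(lambda W: branch(W, x), weights))
    y = np.concatenate(outputs)         # merge point

NumPy's matrix multiply releases the GIL, so the four branches genuinely run concurrently; latency is set by the slowest branch rather than the sum of all of them.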

LCP: A Low-Communication Parallelization Method for Fast Neural Network Inference in Image Recognition

no code implementations • 13 Mar 2020 • Ramyad Hadidi, Bahar Asgari, Jiashen Cao, Younmin Bae, Da Eun Shim, Hyojong Kim, Sung-Kyu Lim, Michael S. Ryoo, Hyesoon Kim

To benefit from the available compute resources, we propose the first DNN parallelization method designed to reduce communication overhead in a distributed system.

Quantization
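
One way to picture low-communication parallelization (a simplified sketch, not LCP's actual partitioning scheme): split a layer by output channels so each device computes its slice independently, and only the small output slices, never the weights, cross the network.

    import numpy as np

    def partitioned_matmul(W, x, n_devices):
        """Each 'device' holds a row-slice of W and computes its share of
        W @ x; the only traffic is gathering the output slices."""
        slices = np.array_split(W, n_devices, axis=0)
        partial = [w @ x for w in slices]   # runs independently per device
        return np.concatenate(partial)

    W = np.random.rand(1024, 512)
    x = np.random.rand(512)
    assert np.allclose(partitioned_matmul(W, x, 4), W @ x)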

A Case Study: Exploiting Neural Machine Translation to Translate CUDA to OpenCL

no code implementations • 18 May 2019 • Yonghae Kim, Hyesoon Kim

The sequence-to-sequence (seq2seq) model for neural machine translation has significantly improved the accuracy of language translation.

Machine Translation • Translation
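
For flavor, these are the kinds of token correspondences such a translator has to capture. The lookup table below is hand-written purely for illustration, whereas the paper learns the mapping from data with a seq2seq model:

    # Hand-written CUDA -> OpenCL correspondences, shown only to illustrate
    # what a seq2seq model must learn; real translation is far less local.
    CUDA_TO_OPENCL = {
        "__global__": "__kernel",
        "__shared__": "__local",
        "__syncthreads()": "barrier(CLK_LOCAL_MEM_FENCE)",
        "threadIdx.x": "get_local_id(0)",
        "blockIdx.x": "get_group_id(0)",
        "blockDim.x": "get_local_size(0)",
    }

    def translate(cuda_src):
        # longest-first replacement so longer tokens win over their prefixes
        for k in sorted(CUDA_TO_OPENCL, key=len, reverse=True):
            cuda_src = cuda_src.replace(k, CUDA_TO_OPENCL[k])
        return cuda_src

    print(translate("__global__ void add(float *a) { a[threadIdx.x] += 1; }"))
    # -> __kernel void add(float *a) { a[get_local_id(0)] += 1; }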

Collaborative Execution of Deep Neural Networks on Internet of Things Devices

no code implementations • 8 Jan 2019 • Ramyad Hadidi, Jiashen Cao, Michael S. Ryoo, Hyesoon Kim

In this paper, we propose an approach that utilizes the aggregated computing power of existing Internet of Things (IoT) devices in an environment by creating a collaborative network.
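
A minimal sketch of the underlying idea, layer-wise partitioning of a DNN across devices (the paper's actual distribution strategy is more elaborate):

    import numpy as np

    class Device:
        """Stand-in for one IoT node hosting a contiguous slice of layers."""
        def __init__(self, layers):
            self.layers = layers
        def forward(self, x):
            for W in self.layers:
                x = np.maximum(W @ x, 0.0)
            return x  # only this activation crosses the network

    rng = np.random.default_rng(0)
    layers = [rng.standard_normal((64, 64)) for _ in range(6)]
    devices = [Device(layers[0:2]), Device(layers[2:4]), Device(layers[4:6])]

    x = rng.standard_normal(64)
    for d in devices:       # each hop ships a 64-float activation,
        x = d.forward(x)    # never the layer weights themselves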
