Search Results for author: Woongkyu Lee

Found 3 papers, 1 paper with code

LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System

no code implementations • 28 Dec 2024 • Hyucksung Kwon, Kyungmo Koo, Janghyeon Kim, Woongkyu Lee, Minjae Lee, Hyungdeok Lee, Yousub Jung, JaeHan Park, Yosub Song, Byeongsu Yang, Haerang Choi, Guhyun Kim, Jongsoon Won, Woojae Shin, Changhyun Kim, Gyeongcheol Shin, Yongkee Kwon, Ilkon Kim, Euicheol Lim, John Kim, Jungwook Choi

Processing-in-Memory (PIM) maximizes memory bandwidth by moving compute to the data and can address the memory bandwidth challenges; however, PIM is not necessarily scalable to accelerate long-context LLM because of limited per-module memory capacity and the inflexibility of fixed-functional unit PIM architecture and static memory management.

Management

Learning from distinctive candidates to optimize reduced-precision convolution program on tensor cores

no code implementations • 11 Feb 2022 • Junkyeong Choi, Hyucksung Kwon, Woongkyu Lee, Jungwook Choi, Jieun Lim

In this method, we devise a search space that explores the thread tile and warp sizes to increase the data reuse despite a large matrix operand of reduced-precision MMA.

Scheduling

Thermal Face Detection for High-Speed AI Thermometer

1 code implementation • 2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC) 2022 • Woongkyu Lee, Hyucksung Kwon, Jungwook Choi

However, the computation-demanding nature of DNNs, along with the time-consuming fusion of video and thermal camera frames, raises hurdles for the cost-effective deployment of such AI thermometer systems.

Face Detection, object-detection (+2 more)
