Search Results for author: Jinho Lee

Found 25 papers, 9 papers with code

Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System

1 code implementation • 11 Mar 2024 • Hongsun Jang, Jaeyong Song, Jaewon Jung, Jaeyoung Park, Youngsok Kim, Jinho Lee

Our work, Smart-Infinity, addresses the storage bandwidth bottleneck of storage-offloaded LLM training using near-storage processing devices on a real system.

Language Modelling • Large Language Model

PeerAiD: Improving Adversarial Distillation from a Specialized Peer Tutor

1 code implementation • 11 Mar 2024 • Jaewon Jung, Hongsun Jang, Jaeyong Song, Jinho Lee

In this situation, adversarial distillation is a promising option that aims to distill the robustness of the teacher network into a small student network.

Adversarial Robustness
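As a point of reference, below is a minimal sketch of a generic adversarial distillation loss, not PeerAiD's specific peer-tutor scheme; the student and teacher models and the adversarial batch x_adv are assumed to be given.

```python
import torch
import torch.nn.functional as F

def adversarial_distillation_loss(student, teacher, x_adv, temperature=4.0):
    # Generic adversarial distillation: match the student's predictions on
    # adversarial examples to the (robust) teacher's soft predictions.
    with torch.no_grad():
        teacher_logits = teacher(x_adv)
    student_logits = student(x_adv)
    # KL divergence between temperature-softened distributions,
    # scaled by T^2 as is conventional in distillation.
    return F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * temperature ** 2
```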

GraNNDis: Efficient Unified Distributed Training Framework for Deep GNNs on Large Clusters

no code implementations • 12 Nov 2023 • Jaeyong Song, Hongsun Jang, Jaewon Jung, Youngsok Kim, Jinho Lee

With the growth of the datasets and model sizes used for GNNs, an important problem is that it becomes nearly impossible to keep the whole network in GPU memory.

Pipe-BD: Pipelined Parallel Blockwise Distillation

1 code implementation • 29 Jan 2023 • Hongsun Jang, Jaewon Jung, Jaeyong Song, Joonsang Yu, Youngsok Kim, Jinho Lee

However, this results in a high overhead of redundant teacher execution, low GPU utilization, and extra data loading.

SGCN: Exploiting Compressed-Sparse Features in Deep Graph Convolutional Network Accelerators

1 code implementation • 25 Jan 2023 • Mingi Yoo, Jaeyong Song, Jounghoo Lee, Namhyung Kim, Youngsok Kim, Jinho Lee

A GCN takes as input an arbitrarily structured graph and executes a series of layers which exploit the graph's structure to calculate their output features.

Feature Compression
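For context, a single GCN layer of the kind described above can be sketched as H' = ReLU(A·H·W); the normalized adjacency matrix adj_norm (typically sparse in practice) is assumed given here, and this is plain dense math, not SGCN's compressed-sparse format.

```python
import torch

def gcn_layer(adj_norm, features, weight):
    # One GCN layer: aggregate neighbor features through the normalized
    # adjacency matrix, project them, and apply a nonlinearity.
    return torch.relu(adj_norm @ features @ weight)
```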

Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression

no code implementations • 24 Jan 2023 • Jaeyong Song, Jinkyu Yim, Jaewon Jung, Hongsun Jang, Hyung-Jin Kim, Youngsok Kim, Jinho Lee

Compressing the communication is one way to mitigate the overhead by reducing the inter-node traffic volume; however, the existing compression techniques have critical limitations when applied to NLP models with 3D parallelism: 1) they target only the data-parallelism traffic, and 2) they harm the model quality too much.
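As a hedged illustration of what "compressing the communication" means, here is a generic top-k gradient compression sketch; it is not Optimus-CC's actual scheme, which is designed for 3D-parallelism traffic beyond data parallelism.

```python
import torch

def topk_compress(grad, ratio=0.01):
    # Keep only the largest-magnitude entries; transmitting (indices, values)
    # instead of the dense tensor reduces inter-node traffic volume.
    flat = grad.flatten()
    k = max(1, int(flat.numel() * ratio))
    _, indices = torch.topk(flat.abs(), k)
    return indices, flat[indices]

def topk_decompress(indices, values, shape):
    # Rebuild a dense (mostly zero) gradient on the receiving side.
    flat = torch.zeros(shape).reshape(-1)
    flat[indices] = values
    return flat.reshape(shape)
```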

Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network Accelerators

no code implementations • 24 Jan 2023 • Mingi Yoo, Jaeyong Song, Hyeyoon Lee, Jounghoo Lee, Namhyung Kim, Youngsok Kim, Jinho Lee

Graph convolutional networks (GCNs) are becoming increasingly popular as they can process a wide variety of data formats that prior deep neural networks cannot easily support.

Enabling Hard Constraints in Differentiable Neural Network and Accelerator Co-Exploration

no code implementations • 23 Jan 2023 • Deokki Hong, Kanghyun Choi, Hye Yoon Lee, Joonsang Yu, Noseong Park, Youngsok Kim, Jinho Lee

Co-exploration of an optimal neural architecture and its hardware accelerator is an approach of rising interest that addresses the computational cost problem, especially in low-profile systems.

Neural Architecture Search

ETF Portfolio Construction via Neural Network trained on Financial Statement Data

no code implementations • 4 Jul 2022 • Jinho Lee, Sungwoo Park, Jungyu Ahn, Jonghun Kwak

Therefore, we train our neural networks on individual-stock data to predict the future performance of individual stocks, and use these predictions together with the portfolio deposit file (PDF) to construct a portfolio of ETFs.

Asset Management
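A minimal sketch of the described pipeline, with hypothetical shapes: stock_scores holds per-stock predictions from the trained networks, and pdf[j, i] is ETF j's weight in stock i taken from its portfolio deposit file.

```python
import numpy as np

def construct_etf_portfolio(stock_scores, pdf, top_n=5):
    # Aggregate per-stock predictions into per-ETF scores via PDF weights,
    # then equal-weight the top-scoring ETFs (a simple illustrative rule).
    etf_scores = pdf @ stock_scores
    chosen = np.argsort(etf_scores)[-top_n:]
    weights = np.zeros_like(etf_scores)
    weights[chosen] = 1.0 / top_n
    return weights
```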

Shai-am: A Machine Learning Platform for Investment Strategies

no code implementations • 1 Jul 2022 • Jonghun Kwak, Jungyu Ahn, Jinho Lee, Sungwoo Park

The finance industry has adopted machine learning (ML) as a form of quantitative research to support better investment decisions, yet there are several challenges often overlooked in practice.

BIG-bench Machine Learning

ConCoDE: Hard-constrained Differentiable Co-Exploration Method for Neural Architectures and Hardware Accelerators

no code implementations • 29 Sep 2021 • Deokki Hong, Kanghyun Choi, Hye Yoon Lee, Joonsang Yu, Youngsok Kim, Noseong Park, Jinho Lee

To handle the hard constraint problem of differentiable co-exploration, we propose ConCoDE, which searches for hard-constrained solutions without compromising the global design objectives.

Neural Architecture Search

An Attention Module for Convolutional Neural Networks

no code implementations • 18 Aug 2021 • Zhu Baozhou, Peter Hofstee, Jinho Lee, Zaid Al-Ars

To solve the two problems together, we initially propose an attention module for convolutional neural networks by developing an AW-convolution, where the shape of the attention maps matches that of the weights rather than that of the activations.

Image Classification • Object • +2
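The key shape constraint can be sketched as follows; this only illustrates attention shaped like the kernel, with the attention tensor taken as given rather than computed from the input as in the paper.

```python
import torch.nn.functional as F

def aw_convolution(x, weight, attention, bias=None, padding=1):
    # attention has the same shape as weight (out_ch, in_ch, kH, kW),
    # not the activation map, and modulates the kernel before convolving.
    assert attention.shape == weight.shape
    return F.conv2d(x, weight * attention, bias=bias, padding=padding)
```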

GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent

no code implementations • 15 Feb 2021 • Heesu Kim, Hanmin Park, Taehyun Kim, Kwanheum Cho, Eojin Lee, Soojung Ryu, Hyuk-Jae Lee, Kiyoung Choi, Jinho Lee

In this paper, we present GradPIM, a processing-in-memory architecture that accelerates the parameter updates of deep neural network training.
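For context, the parameter update being accelerated is the bandwidth-bound, element-wise optimizer step; a plain SGD-with-momentum update is sketched below (GradPIM performs this class of operation inside DRAM, which the sketch does not model).

```python
import torch

def sgd_momentum_update(param, grad, momentum_buf, lr=0.01, momentum=0.9):
    # Every parameter is streamed through memory once per step,
    # which is why the update is memory-bandwidth bound.
    momentum_buf.mul_(momentum).add_(grad)   # m = beta * m + g
    param.add_(momentum_buf, alpha=-lr)      # w = w - lr * m
    return param, momentum_buf
```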

DANCE: Differentiable Accelerator/Network Co-Exploration

no code implementations • 14 Sep 2020 • Kanghyun Choi, Deokki Hong, Hojae Yoon, Joonsang Yu, Youngsok Kim, Jinho Lee

In such circumstances, this work presents DANCE, a differentiable approach towards the co-exploration of the hardware accelerator and network architecture design.

Neural Architecture Search

SoFAr: Shortcut-based Fractal Architectures for Binary Convolutional Neural Networks

no code implementations • 11 Sep 2020 • Zhu Baozhou, Peter Hofstee, Jinho Lee, Zaid Al-Ars

Inspired by shortcuts and fractal architectures, we propose two Shortcut-based Fractal Architectures (SoFAr) specifically designed for BCNNs: 1) residual-connection-based fractal architectures for binary ResNet, and 2) dense-connection-based fractal architectures for binary DenseNet.

Binarization
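A loose sketch of a fractal expansion with a residual-style shortcut follows; it is an assumption-laden illustration (full precision, averaging join), not the paper's exact SoFAr blocks, and a real BCNN would binarize the weights and activations.

```python
import torch.nn as nn

class FractalBlock(nn.Module):
    # f_1(x) = conv(x); f_{k+1}(x) = join(conv(x), f_k(f_k(x))),
    # joined by averaging, with an identity shortcut in the spirit
    # of the residual variant.
    def __init__(self, channels, depth):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )
        self.sub = FractalBlock(channels, depth - 1) if depth > 1 else None

    def forward(self, x):
        out = self.conv(x)
        if self.sub is not None:
            out = 0.5 * (out + self.sub(self.sub(x)))  # join shallow/deep paths
        return out + x  # shortcut connection
```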

MAPS: Multi-agent Reinforcement Learning-based Portfolio Management System

no code implementations • 10 Jul 2020 • Jinho Lee, Raehyun Kim, Seok-Won Yi, Jaewoo Kang

Generating an investment strategy using advanced deep learning methods in stock markets has recently been a topic of interest.

Management • Multi-agent Reinforcement Learning • +2

SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of Autoencoders

no code implementations • 14 Jan 2020 • Inseok Hwang, Jinho Lee, Frank Liu, Minsik Cho

Our intuition is that the more similar the unknown data samples are to the known data an autoencoder was trained on, the better the chance that the autoencoder uses its trained knowledge and reconstructs output samples closer to the originals.

Data Augmentation
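That intuition can be sketched as scoring similarity by reconstruction error; the autoencoder and the score mapping below are illustrative assumptions, not SimEx's exact procedure.

```python
import torch
import torch.nn.functional as F

def dataset_similarity(autoencoder, unknown_batch):
    # An autoencoder trained on a known dataset should reconstruct
    # similar samples well, so low reconstruction error on unknown
    # samples signals high inter-dataset similarity.
    with torch.no_grad():
        recon = autoencoder(unknown_batch)
        error = F.mse_loss(recon, unknown_batch).item()
    return 1.0 / (1.0 + error)  # hypothetical mapping: higher = more similar
```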

MUTE: Data-Similarity Driven Multi-hot Target Encoding for Neural Network Design

no code implementations • 15 Oct 2019 • Mayoore S. Jaiswal, Bumboo Kang, Jinho Lee, Minsik Cho

Target encoding is an effective technique to deliver better performance for conventional machine learning methods, and recently, for deep neural networks as well.

General Classification • Image Classification
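To make the term concrete, a toy 4-class example contrasting one-hot with a multi-hot target code is shown below; the specific codewords are made up for illustration, whereas MUTE derives them from data similarity.

```python
import numpy as np

one_hot = np.eye(4)          # one active bit per class, 4 bits
multi_hot = np.array([       # 3 bits, several active per class;
    [1, 1, 0],               # similar classes can share active bits
    [1, 0, 1],
    [0, 1, 1],
    [1, 1, 1],
])
# Codewords must stay distinguishable to serve as targets.
assert len({tuple(row) for row in multi_hot.tolist()}) == 4
```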

Global Stock Market Prediction Based on Stock Chart Images Using Deep Q-Network

1 code implementation • 28 Feb 2019 • Jinho Lee, Raehyun Kim, Yookyung Koh, Jaewoo Kang

Moreover, the results show that future stock prices can be predicted even if the training and testing procedures are done in different countries.

Stock Market Prediction

Globally Optimal Object Tracking with Fully Convolutional Networks

no code implementations • 25 Dec 2016 • Jinho Lee, Brian Kenji Iwana, Shouta Ide, Seiichi Uchida

Thus, we propose a new and robust tracking method that uses a Fully Convolutional Network (FCN) to obtain an object probability map and Dynamic Programming (DP) to seek the globally optimal path through all frames of a video.

Object • Object Tracking
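A minimal sketch of the DP step follows, reduced to one spatial dimension for brevity; prob_maps[t, x] stands in for the FCN's per-frame object probability, and the motion bound max_move is an assumption.

```python
import numpy as np

def globally_optimal_path(prob_maps, max_move=1):
    # Accumulate the best total probability for each (frame, position),
    # allowing at most max_move pixels of motion between frames,
    # then backtrack from the best final position.
    T, W = prob_maps.shape
    score = prob_maps.astype(float).copy()
    back = np.zeros((T, W), dtype=int)
    for t in range(1, T):
        for x in range(W):
            lo, hi = max(0, x - max_move), min(W, x + max_move + 1)
            prev = lo + int(np.argmax(score[t - 1, lo:hi]))
            back[t, x] = prev
            score[t, x] += score[t - 1, prev]
    path = [int(np.argmax(score[-1]))]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]  # object position per frame
```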
