no code implementations • 15 Dec 2023 • Yu-Han Sun, Chiang Lo-Hsuan Lee, Tian-Sheuan Chang
Image prefiltering with just noticeable distortion (JND) improves coding efficiency in a visually lossless way by filtering out perceptually redundant information prior to compression.
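As background, a JND prefilter suppresses detail whose deviation from the original stays below a perceptual visibility threshold. The sketch below is not the paper's method: it uses a classic Chou–Li-style luminance-masking threshold as an illustrative stand-in for a full JND profile, and a flat mean image as a toy smoothing target.

```python
import numpy as np

def luminance_jnd(bg):
    # Luminance-masking visibility threshold (Chou-Li style piecewise model,
    # used here only as an illustrative stand-in for a full JND profile).
    bg = np.asarray(bg, dtype=np.float64)
    return np.where(bg <= 127,
                    17.0 * (1.0 - np.sqrt(bg / 127.0)) + 3.0,
                    3.0 / 128.0 * (bg - 127.0) + 3.0)

def jnd_prefilter(img, smoothed):
    # Move each pixel toward its smoothed value, but never by more than its
    # JND threshold, so the change stays perceptually invisible.
    img = np.asarray(img, dtype=np.float64)
    jnd = luminance_jnd(img)
    return img + np.clip(smoothed - img, -jnd, jnd)

img = np.array([[10.0, 200.0], [120.0, 130.0]])
smoothed = np.full_like(img, img.mean())   # toy smoothing target
out = jnd_prefilter(img, smoothed)
```

The clipping step is the key property: every output pixel is guaranteed to lie within one JND of the original, while detail the encoder would otherwise spend bits on is flattened.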
no code implementations • 13 Dec 2023 • Yuan YAO, Tian-Sheuan Chang
Furthermore, the hardware architecture scales effectively, with only a sublinear increase in area cost.
no code implementations • 30 Aug 2023 • Tun-Hao Yang, Tian-Sheuan Chang
This challenge leads many accelerators to opt for simpler, shallower models like FSRCNN, trading quality for real-time performance, especially on resource-limited edge devices.
no code implementations • 10 May 2022 • Jien-De Sui, Wei-Han Chen, Tzyy-Yuang Shiang, Tian-Sheuan Chang
Previous gait phase detection methods that frame the task as convolutional neural network (CNN) based classification require cumbersome manual tuning of time delays or heavily overlapped sliding windows to classify each phase accurately across test cases, which is unsuitable for streaming inertial measurement unit (IMU) sensor data and fails to adapt to different scenarios.
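One common way to get a per-sample prediction on streaming sensor data without overlapped windows is a causal 1-D convolution, which left-pads the signal so each output depends only on current and past samples. The sketch below illustrates this general idea with a hypothetical kernel and threshold; it is not the paper's model.

```python
import numpy as np

def causal_conv1d(x, w, b):
    # Left-pad so output t depends only on samples up to time t, giving one
    # prediction per incoming sample -- no sliding-window re-computation.
    k = len(w)
    xp = np.concatenate([np.zeros(k - 1), x])
    return np.array([xp[t:t + k] @ w for t in range(len(x))]) + b

# Toy 1-D signal standing in for one IMU channel.
x = np.array([0.0, 1.0, 2.0, 3.0, 2.0, 1.0])
w = np.array([0.25, 0.5, 0.25])    # hypothetical learned kernel
y = causal_conv1d(x, w, 0.0)
labels = (y > 1.0).astype(int)     # toy per-sample phase decision
```

Because each new sample yields exactly one new output, this formulation streams naturally: the accelerator only needs a buffer of the last k-1 samples rather than a full overlapping window.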
no code implementations • 10 May 2022 • Yu-Hsiang Chiang, Tian-Sheuan Chang, Shyh Jye Jou
Keyword spotting has gained popularity as a natural way to interact with consumer devices in recent years.
no code implementations • 9 May 2022 • An-Jung Huang, Kai-Chieh Hsu, Tian-Sheuan Chang
Deep-learning-based super-resolution achieves high-quality results, but its heavy computational workload, large buffers, and high external memory bandwidth inhibit its use on mobile devices.
no code implementations • 9 May 2022 • Hong-Yi Wang, Tian-Sheuan Chang
Following its success in natural language processing, the transformer has attracted significant attention for vision applications in recent years due to its excellent performance.
no code implementations • 9 May 2022 • Yu-Hsiang Chiang, Cheng En Ni, Yun Sung, Tuo-Hung Hou, Tian-Sheuan Chang, Shyh Jye Jou
However, in-RRAM computing (IRC) suffers from large device variation and numerous nonideal hardware effects.
no code implementations • 6 May 2022 • Jien-De Sui, Tian-Sheuan Chang
The proposed model achieves a better average percent error of 4.78% on running and walking stride-length regression and 99.83% accuracy on running and walking classification, compared to the previous approach's 7.44% stride-length error.
no code implementations • 2 May 2022 • Hsu-Tung Shih, Tian-Sheuan Chang
The large memory bandwidth between the local buffer and external DRAM has become the speed bottleneck of CNN hardware accelerators, especially for activation maps.
no code implementations • 2 May 2022 • Yi-An Chen, Jien-De Sui, Tian-Sheuan Chang
Gait phase detection with convolutional neural networks provides accurate classification but demands high computational cost, which inhibits real-time, low-power on-sensor processing.
no code implementations • 2 May 2022 • Dun-Hao Yang, Tian-Sheuan Chang
Convolutional neural network (CNN) based super-resolution models are increasingly proposed for better reconstruction results, but their large model size and complicated structure inhibit real-time hardware implementation.
no code implementations • 2 May 2022 • Hong-Han Lien, Tian-Sheuan Chang
Spiking neural networks (SNNs), which are inspired by the human brain, have recently gained popularity due to their relatively simple and low-power hardware for transmitting binary spikes and highly sparse activation maps.
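The hardware advantage of binary spikes is that multiply-accumulate units reduce to accumulators: when an input is 0 or 1, a layer's output is just the sum of the weight columns of the neurons that fired. A minimal sketch of this equivalence (random toy weights, not any specific SNN accelerator):

```python
import numpy as np

rng = np.random.default_rng(0)
weights = rng.standard_normal((4, 8))            # 4 outputs, 8 inputs
spikes = (rng.random(8) > 0.8).astype(np.int8)   # sparse binary spike vector

# Dense MAC formulation: every weight is multiplied by a 0/1 spike.
dense = weights @ spikes

# Event-driven formulation: skip multipliers entirely and accumulate only
# the weight columns of neurons that actually fired.
active = np.flatnonzero(spikes)
sparse = weights[:, active].sum(axis=1) if active.size else np.zeros(4)
```

With highly sparse activation maps, the event-driven form touches only a small fraction of the weights, which is what enables the simple, low-power hardware the snippet above refers to.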
no code implementations • 2 May 2022 • Shu-Hung Kuo, Tian-Sheuan Chang
Computing-in-memory (CIM) has attracted significant attention in recent years due to its massive parallelism and low power consumption.
no code implementations • 2 May 2022 • Chih-Chyau Yang, Tian-Sheuan Chang
With the popularity of deep neural networks (DNNs), hardware accelerators are in demand for real-time execution.
no code implementations • 2 May 2022 • Kuo-Wei Chang, Hsu-Tung Shih, Tian-Sheuan Chang, Shang-Hong Tsai, Chih-Chyau Yang, Chien-Ming Wu, Chun-Ming Huang
Memory bandwidth has become the real-time bottleneck of current deep learning accelerators (DLA), particularly for high definition (HD) object detection.
no code implementations • 2 May 2022 • Kuo-Wei Chang, Tian-Sheuan Chang
Hardware acceleration for dilated and transposed convolution enables real-time execution of related tasks like segmentation, but current designs are either specific to these convolution types or suffer from complex control in reconfigurable designs.
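A unified accelerator is plausible because both variants reduce to the standard (cross-correlation) convolution a datapath already performs: a dilated convolution is a standard convolution with zeros inserted between kernel taps, and a transposed convolution is a standard convolution over a zero-stuffed, padded input with a flipped kernel. A 1-D sketch of these textbook equivalences (not any particular design's dataflow):

```python
import numpy as np

def corr1d(x, w):
    # "valid" cross-correlation: the basic op a conv accelerator performs
    k = len(w)
    return np.array([x[i:i + k] @ w for i in range(len(x) - k + 1)])

def dilate_kernel(w, d):
    # dilated conv = standard conv with d-1 zeros between kernel taps
    out = np.zeros((len(w) - 1) * d + 1)
    out[::d] = w
    return out

def zero_stuff(x, s):
    # insert s-1 zeros between input samples
    out = np.zeros((len(x) - 1) * s + 1)
    out[::s] = x
    return out

def transposed_conv1d(x, w, s):
    # transposed conv (stride s) = zero-stuff input, pad by k-1 on both
    # sides, then standard correlation with the flipped kernel
    k = len(w)
    xp = np.pad(zero_stuff(x, s), (k - 1, k - 1))
    return corr1d(xp, w[::-1])
```

Mapping both variants onto the same `corr1d` engine via zero insertion is what a unified design exploits; the control complexity lies in skipping the inserted zeros instead of multiplying by them.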
no code implementations • 16 Dec 2017 • Chih-Cheng Chang, Pin-Chun Chen, Teyuh Chou, I-Ting Wang, Boris Hudec, Che-Chia Chang, Chia-Ming Tsai, Tian-Sheuan Chang, Tuo-Hung Hou
Asymmetric nonlinear weight update is considered as one of the major obstacles for realizing hardware neural networks based on analog resistive synapses because it significantly compromises the online training capability.