Search Results for author: Tian-Sheuan Chang

Found 18 papers, 0 papers with code

IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding

no code implementations15 Dec 2023 Yu-Han Sun, Chiang Lo-Hsuan Lee, Tian-Sheuan Chang

Image prefiltering with just noticeable distortion (JND) improves coding efficiency in a visual lossless way by filtering the perceptually redundant information prior to compression.

Image Quality Assessment Quantization

ASC: Adaptive Scale Feature Map Compression for Deep Neural Network

no code implementations13 Dec 2023 Yuan YAO, Tian-Sheuan Chang

Furthermore, the hardware architecture scales effectively, with only a sublinear increase in area cost.

ACNPU: A 4.75TOPS/W 1080P@30FPS Super Resolution Accelerator with Decoupled Asymmetric Convolution

no code implementations30 Aug 2023 Tun-Hao Yang, Tian-Sheuan Chang

This challenge leads many accelerators to opt for simpler and shallow models like FSRCNN, compromising performance for real-time needs, especially for resource-limited edge devices.

Super-Resolution

Real-Time Wearable Gait Phase Segmentation For Running And Walking

no code implementations10 May 2022 Jien-De Sui, Wei-Han Chen, Tzyy-Yuang Shiang, Tian-Sheuan Chang

Previous gait phase detection as convolutional neural network (CNN) based classification task requires cumbersome manual setting of time delay or heavy overlapped sliding windows to accurately classify each phase under different test cases, which is not suitable for streaming Inertial-Measurement-Unit (IMU) sensor data and fails to adapt to different scenarios.

A Real Time Super Resolution Accelerator with Tilted Layer Fusion

no code implementations9 May 2022 An-Jung Huang, Kai-Chieh Hsu, Tian-Sheuan Chang

Deep learning based superresolution achieves high-quality results, but its heavy computational workload, large buffer, and high external memory bandwidth inhibit its usage in mobile devices.

Super-Resolution

Row-wise Accelerator for Vision Transformer

no code implementations9 May 2022 Hong-Yi Wang, Tian-Sheuan Chang

Following the success of the natural language processing, the transformer for vision applications has attracted significant attention in recent years due to its excellent performance.

Scheduling

Hardware-Robust In-RRAM-Computing for Object Detection

no code implementations9 May 2022 Yu-Hsiang Chiang, Cheng En Ni, Yun Sung, Tuo-Hung Hou, Tian-Sheuan Chang, Shyh Jye Jou

However, in-RRAM computing (IRC) suffered from large device variation and numerous nonideal effects in hardware.

Object object-detection +1

IMU Based Deep Stride Length Estimation With Self-Supervised Learning

no code implementations6 May 2022 Jien-De Sui, Tian-Sheuan Chang

The proposed model can achieve better average percent error, 4. 78\%, on running and walking stride length regression and 99. 83\% accuracy on running and walking classification, when compared to the previous approach, 7. 44\% on the stride length estimation.

Self-Supervised Learning

Zebra: Memory Bandwidth Reduction for CNN Accelerators With Zero Block Regularization of Activation Maps

no code implementations2 May 2022 Hsu-Tung Shih, Tian-Sheuan Chang

The large amount of memory bandwidth between local buffer and external DRAM has become the speedup bottleneck of CNN hardware accelerators, especially for activation maps.

Real Time On Sensor Gait Phase Detection with 0.5KB Deep Learning Model

no code implementations2 May 2022 Yi-An Chen, Jien-De Sui, Tian-Sheuan Chang

Gait phase detection with convolution neural network provides accurate classification but demands high computational cost, which inhibits real time low power on-sensor processing.

BSRA: Block-based Super Resolution Accelerator with Hardware Efficient Pixel Attention

no code implementations2 May 2022 Dun-Hao Yang, Tian-Sheuan Chang

Increasingly, convolution neural network (CNN) based super resolution models have been proposed for better reconstruction results, but their large model size and complicated structure inhibit their real-time hardware implementation.

Image Reconstruction Super-Resolution

Sparse Compressed Spiking Neural Network Accelerator for Object Detection

no code implementations2 May 2022 Hong-Han Lien, Tian-Sheuan Chang

Spiking neural networks (SNNs), which are inspired by the human brain, have recently gained popularity due to their relatively simple and low-power hardware for transmitting binary spikes and highly sparse activation maps.

Object object-detection +1

PSCNN: A 885.86 TOPS/W Programmable SRAM-based Computing-In-Memory Processor for Keyword Spotting

no code implementations2 May 2022 Shu-Hung Kuo, Tian-Sheuan Chang

Computing-in-memory (CIM) has attracted significant attentions in recent years due to its massive parallelism and low power consumption.

Keyword Spotting

Pre-RTL DNN Hardware Evaluator With Fused Layer Support

no code implementations2 May 2022 Chih-Chyau Yang, Tian-Sheuan Chang

With the popularity of the deep neural network (DNN), hardware accelerators are demanded for real time execution.

A Real Time 1280x720 Object Detection Chip With 585MB/s Memory Traffic

no code implementations2 May 2022 Kuo-Wei Chang, Hsu-Tung Shih, Tian-Sheuan Chang, Shang-Hong Tsai, Chih-Chyau Yang, Chien-Ming Wu, Chun-Ming Huang

Memory bandwidth has become the real-time bottleneck of current deep learning accelerators (DLA), particularly for high definition (HD) object detection.

MORPH Object +2

Efficient Accelerator for Dilated and Transposed Convolution with Decomposition

no code implementations2 May 2022 Kuo-Wei Chang, Tian-Sheuan Chang

Hardware acceleration for dilated and transposed convolution enables real time execution of related tasks like segmentation, but current designs are specific for these convolutional types or suffer from complex control for reconfigurable designs.

Mitigating Asymmetric Nonlinear Weight Update Effects in Hardware Neural Network based on Analog Resistive Synapse

no code implementations16 Dec 2017 Chih-Cheng Chang, Pin-Chun Chen, Teyuh Chou, I-Ting Wang, Boris Hudec, Che-Chia Chang, Chia-Ming Tsai, Tian-Sheuan Chang, Tuo-Hung Hou

Asymmetric nonlinear weight update is considered as one of the major obstacles for realizing hardware neural networks based on analog resistive synapses because it significantly compromises the online training capability.

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.