Search Results for author: Hong-Han Shuai

Found 34 papers, 16 papers with code

Character-Preserving Coherent Story Visualization

2 code implementations ECCV 2020 Yun-Zhu Song, Zhi Rui Tam, Hung-Jen Chen, Huiao-Han Lu, Hong-Han Shuai

Different from video generation that focuses on maintaining the continuity of generated images (frames), story visualization emphasizes preserving the global consistency of characters and scenes across different story pictures, which is very challenging since story sentences only provide sparse signals for generating images.

Ranked #2 on Story Visualization on Pororo (using extra training data)

Representation Learning Sentence +1

EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning

1 code implementation25 Apr 2024 HongXia Xie, Chu-Jun Peng, Yu-Wen Tseng, Hung-Jen Chen, Chan-Feng Hsu, Hong-Han Shuai, Wen-Huang Cheng

Visual Instruction Tuning represents a novel learning paradigm involving the fine-tuning of pre-trained language models using task-specific instructions.

Training-and-prompt-free General Painterly Harmonization Using Image-wise Attention Sharing

1 code implementation19 Apr 2024 Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai

To surmount these hurdles, we design a Training-and-prompt-Free General Painterly Harmonization method using image-wise attention sharing (TF-GPH), which integrates a novel "share-attention module".

Image Harmonization

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

no code implementations8 Apr 2024 Hou-I Liu, Marco Galindo, HongXia Xie, Lai-Kuan Wong, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng

Over the past decade, the dominance of deep learning has prevailed across various domains of artificial intelligence, including natural language processing, computer vision, and biomedical signal processing.

DQ-DETR: DETR with Dynamic Query for Tiny Object Detection

no code implementations4 Apr 2024 Yi-Xin Huang, Hou-I Liu, Hong-Han Shuai, Wen-Huang Cheng

Despite previous DETR-like methods having performed successfully in generic object detection, tiny object detection is still a challenging task for them since the positional information of object queries is not customized for detecting tiny objects, whose scale is extraordinarily smaller than general objects.

Object object-detection +1

An Improved Traditional Chinese Evaluation Suite for Foundation Model

no code implementations4 Mar 2024 Zhi-Rui Tam, Ya-Ting Pai, Yen-Wei Lee, Sega Cheng, Hong-Han Shuai

We included benchmark results in TMMLU+ from closed-source models and 24 open-weight Chinese large language models of parameters ranging from 1. 8B to 72B.

Multiple-choice Question Answering

SINC: Self-Supervised In-Context Learning for Vision-Language Tasks

no code implementations ICCV 2023 Yi-Syuan Chen, Yun-Zhu Song, Cheng Yu Yeo, Bei Liu, Jianlong Fu, Hong-Han Shuai

To this end, we raise a question: ``How can we enable in-context learning without relying on the intrinsic in-context ability of large language models?".

Hallucination In-Context Learning

Shilling Black-box Review-based Recommender Systems through Fake Review Generation

no code implementations27 Jun 2023 Hung-Yun Chiang, Yi-Syuan Chen, Yun-Zhu Song, Hong-Han Shuai, Jason S. Chang

Review-Based Recommender Systems (RBRS) have attracted increasing research interest due to their ability to alleviate well-known cold-start problems.

Recommendation Systems Review Generation

SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization

no code implementations24 Mar 2023 Yi-Syuan Chen, Yun-Zhu Song, Hong-Han Shuai

The generated summaries could therefore be constrained by the preference bias in the training set, especially under low-resource settings.

Abstractive Text Summarization Few-Shot Learning +1

Size Does Matter: Size-aware Virtual Try-on via Clothing-oriented Transformation Try-on Network

1 code implementation ICCV 2023 Chieh-Yun Chen, Yi-Chung Chen, Hong-Han Shuai, Wen-Huang Cheng

COTTON leverages clothing structure with landmarks and segmentation to design a novel landmark-guided transformation for precisely deforming clothes, allowing for size adjustment during try-on.

Virtual Try-on

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton

1 code implementation2 Dec 2021 Wei-Yao Wang, Hong-Han Shuai, Kai-Shiang Chang, Wen-Chih Peng

The increasing demand for analyzing the insights in sports has stimulated a line of productive studies from a variety of perspectives, e. g., health state monitoring, outcome prediction.

Position

Attack as the Best Defense: Nullifying Image-to-image Translation GANs via Limit-aware Adversarial Attack

1 code implementation ICCV 2021 Chin-Yuan Yeh, Hsi-Wen Chen, Hong-Han Shuai, De-Nian Yang, Ming-Syan Chen

To improve efficiency, we introduce the limit-aware random gradient-free estimation and the gradient sliding mechanism to estimate the gradient that adheres to the adversarial limit, i. e., the pixel value limitations of the adversarial example.

Adversarial Attack Face Swapping +2

Live Multi-Streaming and Donation Recommendations via Coupled Donation-Response Tensor Factorization

no code implementations5 Oct 2021 Hsu-Chao Lai, Jui-Yi Tsai, Hong-Han Shuai, Jiun-Long Huang, Wang-Chien Lee, De-Nian Yang

In contrast to traditional online videos, live multi-streaming supports real-time social interactions between multiple streamers and viewers, such as donations.

Recommendation Systems

Mask or Non-Mask? Robust Face Mask Detector via Triplet-Consistency Representation Learning

no code implementations1 Oct 2021 Chun-Wei Yang, Thanh-Hai Phung, Hong-Han Shuai, Wen-Huang Cheng

To automate the monitoring process, one of the promising solutions is to leverage existing object detection models to detect the faces with or without masks.

object-detection Object Detection +1

Gradient Normalization for Generative Adversarial Networks

2 code implementations ICCV 2021 Yi-Lun Wu, Hong-Han Shuai, Zhi-Rui Tam, Hong-Yu Chiu

In this paper, we propose a novel normalization method called gradient normalization (GN) to tackle the training instability of Generative Adversarial Networks (GANs) caused by the sharp gradient space.

Technical Report for Valence-Arousal Estimation in ABAW2 Challenge

no code implementations8 Jul 2021 Hong-Xia Xie, I-Hsuan Li, Ling Lo, Hong-Han Shuai, Wen-Huang Cheng

In this work, we describe our method for tackling the valence-arousal estimation challenge from ABAW2 ICCV-2021 Competition.

Arousal Estimation

Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting

1 code implementation25 May 2021 Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua, Jinyoung Moon, Hong-Han Shuai

This companion paper supports the replication of the fashion trend forecasting experiments with the KERN (Knowledge Enhanced Recurrent Network) method that we presented in the ICMR 2020.

Meta-Transfer Learning for Low-Resource Abstractive Summarization

1 code implementation18 Feb 2021 Yi-Syuan Chen, Hong-Han Shuai

Neural abstractive summarization has been studied in many pieces of literature and achieves great success with the aid of large corpora.

Abstractive Text Summarization Transfer Learning

Template-Free Try-on Image Synthesis via Semantic-guided Optimization

no code implementations6 Feb 2021 Chien-Lung Chou, Chieh-Yun Chen, Chia-Wei Hsieh, Hong-Han Shuai, Jiaying Liu, Wen-Huang Cheng

Afterward, given an in-shop clothing image, a user image, and a synthesized pose, we propose a novel model for synthesizing a human try-on image with the target clothing in the best fitting pose.

Image Generation Virtual Try-on

Spatiotemporal Dilated Convolution with Uncertain Matching for Video-based Crowd Estimation

1 code implementation29 Jan 2021 Yu-Jen Ma, Hong-Han Shuai, Wen-Huang Cheng

In this paper, we propose a novel SpatioTemporal convolutional Dense Network (STDNet) to address the video-based crowd counting problem, which contains the decomposition of 3D convolution and the 3D spatiotemporal dilated dense convolution to alleviate the rapid growth of the model size caused by the Conv3D layer.

Crowd Counting

FashionMirror: Co-Attention Feature-Remapping Virtual Try-On With Sequential Template Poses

1 code implementation ICCV 2021 Chieh-Yun Chen, Ling Lo, Pin-Jui Huang, Hong-Han Shuai, Wen-Huang Cheng

In the second stage, we first remove the clothes on the source human via the removed mask and warp the clothing features conditioning on the try-on clothing mask to fit the next frame human.

Segmentation Semantic Segmentation +1

Domain-Adaptive Object Detection via Uncertainty-Aware Distribution Alignment

1 code implementation31 Oct 2020 Dang-Khoa Nguyen, Wei-Lun Tseng, Hong-Han Shuai

Domain adaptation aims to transfer knowledge from the sourcedata with annotations to scarcely-labeled data in the target domain, which has attracted a lot of attention in recent years and facilitatedmany multimedia applications.

Object object-detection +2

Optimizing Item and Subgroup Configurations for Social-Aware VR Shopping

1 code implementation11 Feb 2020 Shao-Heng Ko, Hsu-Chao Lai, Hong-Han Shuai, De-Nian Yang, Wang-Chien Lee, Philip S. Yu

Shopping in VR malls has been regarded as a paradigm shift for E-commerce, but most of the conventional VR shopping platforms are designed for a single user.

Data Structures and Algorithms

Attractive or Faithful? Popularity-Reinforced Learning for Inspired Headline Generation

1 code implementation6 Feb 2020 Yun-Zhu Song, Hong-Han Shuai, Sung-Lin Yeh, Yi-Lun Wu, Lun-Wei Ku, Wen-Chih Peng

To generate inspired headlines, we propose a novel framework called POpularity-Reinforced Learning for inspired Headline Generation (PORL-HG).

Headline Generation Reinforcement Learning (RL) +1

Communications and Networking Technologies for Intelligent Drone Cruisers

no code implementations25 Sep 2019 Li-Chun Wang, Chuan-Chi Lai, Hong-Han Shuai, Hsin-Piao Lin, Chi-Yu Li, Teng-Hu Cheng, Chiun-Hsun Chen

Therefore, we propose to develop an "Artificial Intelligence (AI) Drone-Cruiser" base station that can help 5G mobile communication systems and beyond quickly recover the network after a disaster and handle the instant communications by the flash crowd.

Cannot find the paper you are looking for? You can Submit a new open access paper.