no code implementations • 5 Dec 2023 • Shao-Yu Chang, Hwann-Tzong Chen, Tyng-Luh Liu
Despite the success in image editing, diffusion models still encounter significant hindrances when it comes to video editing due to the challenge of maintaining spatiotemporal consistency in the object's appearance across frames.
no code implementations • CVPR 2024 • Cheng Sun, Wei-En Tai, Yu-Lin Shih, Kuan-Wei Chen, Yong-Jing Syu, Kent Selwyn The, Yu-Chiang Frank Wang, Hwann-Tzong Chen
State-of-the-art single-view 360-degree room layout reconstruction methods formulate the problem as a high-level 1D (per-column) regression task.
no code implementations • ICCV 2023 • Cheng-Hung Chan, Cheng-Yang Yuan, Cheng Sun, Hwann-Tzong Chen
We present a video decomposition method that facilitates layer-based editing of videos with spatiotemporally varying lighting and motion effects.
1 code implementation • ICML 2023 • Yu-Min Chu, Chieh Liu, Ting-I Hsieh, Hwann-Tzong Chen, Tyng-Luh Liu
We present a shape-guided expert-learning framework to tackle the problem of unsupervised 3D anomaly detection.
Ranked #1 on 3D Anomaly Detection and Segmentation on MVTEC 3D-AD
3D Anomaly Detection • 3D Anomaly Detection and Segmentation
1 code implementation • 2 Aug 2022 • Chih-Jung Tsai, Cheng Sun, Hwann-Tzong Chen
This paper aims to address a new task of image morphing under a multiview setting, which takes two sets of multiview images as the input and generates intermediate renderings that not only exhibit smooth transitions between the two input sets but also ensure visual consistency across different views at any transition state.
1 code implementation • 31 May 2022 • Li-Jen Chang, Yu-Cheng Liao, Chia-Hui Lin, Hwann-Tzong Chen
We present a self-trainable method, Mask2Hand, which learns to solve the challenging task of predicting 3D hand pose and shape from a 2D binary mask of hand silhouette/shadow without additional manually annotated data.
no code implementations • 30 Mar 2022 • Hao-Wen Ting, Cheng Sun, Hwann-Tzong Chen
We present the first self-supervised method to train panoramic room layout estimation models without any labeled data.
no code implementations • 23 Dec 2021 • Ta-Ying Cheng, Hsuan-ru Yang, Niki Trigoni, Hwann-Tzong Chen, Tyng-Luh Liu
We present a pose adaptive few-shot learning procedure and a two-stage data interpolation regularization, termed Pose Adaptive Dual Mixup (PADMix), for single-image 3D reconstruction.
2 code implementations • CVPR 2022 • Cheng Sun, Min Sun, Hwann-Tzong Chen
Finally, evaluation on five inward-facing benchmarks shows that our method matches, if not surpasses, NeRF's quality, yet it only takes about 15 minutes to train from scratch for a new scene.
no code implementations • ICCV 2021 • Chi-Wei Hsiao, Cheng Sun, Hwann-Tzong Chen, Min Sun
We present a novel pyramidal output representation to ensure parsimony with our "specialize and fuse" process for semantic segmentation.
1 code implementation • CVPR 2021 • Cheng Sun, Chi-Wei Hsiao, Ning-Hsu Wang, Min Sun, Hwann-Tzong Chen
An indoor panorama typically consists of human-made structures parallel or perpendicular to gravity.
1 code implementation • 21 Jun 2021 • Ching-Yu Hsu, Cheng Sun, Hwann-Tzong Chen
We present Omnidirectional Neural Radiance Fields (OmniNeRF), the first method for parallax-enabled novel panoramic view synthesis.
1 code implementation • 13 Apr 2021 • Ting-I Hsieh, Esther Robb, Hwann-Tzong Chen, Jia-Bin Huang
Based on this insight, we develop DropLoss -- a novel adaptive loss to compensate for this imbalance without a trade-off between rare and frequent categories.
1 code implementation • CVPR 2021 • Cheng Sun, Min Sun, Hwann-Tzong Chen
We present HoHoNet, a versatile and efficient framework for holistic understanding of an indoor 360-degree panorama using a Latent Horizontal Feature (LHFeat).
3D Room Layouts From A Single RGB Panorama • Depth Estimation
1 code implementation • ECCV 2020 • Ke-Chi Chang, Ren Wang, Hung-Jin Lin, Yu-Lun Liu, Chia-Ping Chen, Yu-Lin Chang, Hwann-Tzong Chen
Modeling imaging sensor noise is a fundamental problem for image processing and computer vision applications.
1 code implementation • 20 Jul 2020 • Shih-Hung Liu, Shang-Yi Yu, Shao-Chi Wu, Hwann-Tzong Chen, Tyng-Luh Liu
This paper presents a novel method for instance segmentation of 3D point clouds.
Ranked #9 on 3D Instance Segmentation on S3DIS (mPrec metric)
no code implementations • 28 Dec 2019 • Kuo-Wei Lee, Shih-Hung Liu, Hwann-Tzong Chen, Koichi Ito
3D hand pose estimation has received a lot of attention for its wide range of applications and has made great progress owing to the development of deep learning.
2 code implementations • NeurIPS 2019 • Ting-I Hsieh, Yi-Chen Lo, Hwann-Tzong Chen, Tyng-Luh Liu
This paper aims to tackle the challenging problem of one-shot object detection.
Ranked #3 on One-Shot Object Detection on MS COCO
no code implementations • 29 May 2019 • Chi-Wei Hsiao, Cheng Sun, Min Sun, Hwann-Tzong Chen
This paper also constructs a benchmark for validating the performance on general layout topologies, where Flat2Layout achieves good performance on general room types.
no code implementations • ICLR 2019 • Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen
The fact that patches are generated independently of each other inspires a wide range of new applications: first, "Patch-Inspired Image Generation" enables us to generate an entire image based on a single patch.
1 code implementation • CVPR 2019 • Songhao Jia, Ding-Jie Chen, Hwann-Tzong Chen
This paper presents a normalization mechanism called Instance-Level Meta Normalization (ILM Norm) to address a learning-to-normalize problem.
no code implementations • 6 Apr 2019 • Chih-Yao Chiu, Hwann-Tzong Chen, Tyng-Luh Liu
This paper describes a channel-selection approach for simplifying deep neural networks.
3 code implementations • ICCV 2019 • Tsun-Hsuan Wang, Yen-Chi Cheng, Chieh Hubert Lin, Hwann-Tzong Chen, Min Sun
We introduce point-to-point video generation that controls the generation process with two control points: the targeted start- and end-frames.
1 code implementation • ICCV 2019 • Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen
On the computation side, COCO-GAN has a built-in divide-and-conquer paradigm that reduces memory requirements during training and inference, provides high parallelism, and can generate parts of images on demand.
Ranked #1 on Image Generation on CelebA-HQ 64x64
no code implementations • 28 Jan 2019 • Rosaura G. VidalMata, Sreya Banerjee, Brandon RichardWebster, Michael Albright, Pedro Davalos, Scott McCloskey, Ben Miller, Asong Tambo, Sushobhan Ghosh, Sudarshan Nagesh, Ye Yuan, Yueyu Hu, Junru Wu, Wenhan Yang, Xiaoshuai Zhang, Jiaying Liu, Zhangyang Wang, Hwann-Tzong Chen, Tzu-Wei Huang, Wen-Chi Chin, Yi-Chun Li, Mahmoud Lababidi, Charles Otto, Walter J. Scheirer
From the observed results, it is evident that we are in the early days of building a bridge between computational photography and visual recognition, leaving many opportunities for innovation in this area.
1 code implementation • CVPR 2019 • Cheng Sun, Chi-Wei Hsiao, Min Sun, Hwann-Tzong Chen
We present a new approach to the problem of estimating the 3D room layout from a single panoramic image.
3D Room Layouts From A Single RGB Panorama • Data Augmentation
3 code implementations • 20 Dec 2018 • Ding-Jie Chen, Jui-Ting Chien, Hwann-Tzong Chen, Tyng-Luh Liu
This paper presents a "learning to learn" approach to figure-ground image segmentation.
no code implementations • 18 Dec 2018 • Ding-Jie Chen, Hwann-Tzong Chen, Long-Wen Chang
At each round of interaction, the user is presented with only a small number of informative query seeds that are far apart from each other.
no code implementations • 25 Nov 2018 • Shou-Yao Roy Tseng, Hwann-Tzong Chen, Shao-Heng Tai, Tyng-Luh Liu
We present a generic and flexible module that encodes region proposals by both their intrinsic features and the extrinsic correlations to the others.
1 code implementation • ECCV 2018 • Chia-Che Chang, Chieh Hubert Lin, Che-Rung Lee, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen
Generative adversarial networks (GANs) often suffer from unpredictable mode-collapsing during training.
Ranked #29 on Image Generation on CelebA 64x64
no code implementations • 14 Jul 2018 • Shou-Yao Roy Tseng, Hwann-Tzong Chen, Shao-Heng Tai, Tyng-Luh Liu
We introduce the concept of Non-Local RoI (NL-RoI) Block as a generic and flexible module that can be seamlessly adapted into different Mask R-CNN heads for various tasks.
1 code implementation • 8 Jul 2017 • Chia-Jung Chou, Jui-Ting Chien, Hwann-Tzong Chen
This paper presents a deep-learning-based approach to the problem of human pose estimation.
Ranked #5 on Pose Estimation on Leeds Sports Poses
1 code implementation • 5 Jan 2017 • Yi-Ling Chen, Tzu-Wei Huang, Kai-Han Chang, Yu-Chen Tsai, Hwann-Tzong Chen, Bing-Yu Chen
Automatic photo cropping is an important tool for improving the visual quality of digital photos without resorting to tedious manual selection.