Search Results for author: Winston H. Hsu

Found 48 papers, 22 papers with code

TelTrans: Applying Multi-Type Telecom Data to Transportation Evaluation and Prediction via Multifaceted Graph Modeling

no code implementations 6 Jan 2024 ChungYi Lin, Shen-Lung Tung, Hung-Ting Su, Winston H. Hsu

To address the limitations of traffic prediction from location-bound detectors, we present Geographical Cellular Traffic (GCT) flow, a novel data source that leverages the extensive coverage of cellular traffic to capture mobility patterns.

Traffic Prediction

WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection

1 code implementation 5 Oct 2023 Tsung-Lin Tsou, Tsung-Han Wu, Winston H. Hsu

In the field of domain adaptation (DA) on 3D object detection, most of the work is dedicated to unsupervised domain adaptation (UDA).

3D Object Detection object-detection +1

Unsupervised Adversarial Detection without Extra Model: Training Loss Should Change

1 code implementation 7 Aug 2023 Chien Cheng Chyou, Hung-Ting Su, Winston H. Hsu

Adversarial robustness poses a critical challenge in the deployment of deep learning models for real-world applications.

Adversarial Robustness

Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering

no code implementations 7 Apr 2023 Hung-Ting Su, Yulei Niu, Xudong Lin, Winston H. Hsu, Shih-Fu Chang

Causal Video Question Answering (CVidQA) queries not only association or temporal relations but also causal relations in a video.

Question Answering Question Generation +3

MuRAL: Multi-Scale Region-based Active Learning for Object Detection

no code implementations 29 Mar 2023 Yi-Syuan Liou, Tsung-Han Wu, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu

MuRAL identifies informative regions of various scales to reduce annotation costs for well-learned objects and improve training performance.

Active Learning Object +2

Free-form 3D Scene Inpainting with Dual-stream GAN

1 code implementation 16 Dec 2022 Ru-Fen Jheng, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu

Thus, we present a novel task named free-form 3D scene inpainting.

Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling

1 code implementation 8 Oct 2022 Hsin-Ying Lee, Hung-Ting Su, Bing-Chen Tsai, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu

While recent large-scale video-language pre-training made great progress in video question answering, the design of spatial modeling of video-language models is less fine-grained than that of image-language models; existing practices of temporal modeling also suffer from weak and noisy alignment between modalities.

Language Modelling Question Answering +1

Coarse-to-Fine Point Cloud Registration with SE(3)-Equivariant Representations

1 code implementation 5 Oct 2022 Cheng-Wei Lin, Tung-I Chen, Hsin-Ying Lee, Wen-Chin Chen, Winston H. Hsu

As global feature alignment requires the features to preserve the poses of input point clouds and local feature matching expects the features to be invariant to these poses, we propose an SE(3)-equivariant feature extractor to simultaneously generate two types of features.

Point Cloud Registration

CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection

1 code implementation 27 Sep 2022 Ching-Yu Tseng, Yi-Rong Chen, Hsin-Ying Lee, Tsung-Han Wu, Wen-Chin Chen, Winston H. Hsu

To achieve accurate 3D object detection at a low cost for autonomous driving, many multi-camera methods have been proposed and solved the occlusion problem of monocular approaches.

3D Object Detection Autonomous Driving +5

GenISP: Neural ISP for Low-Light Machine Cognition

2 code implementations 7 May 2022 Igor Morawski, Yu-An Chen, Yu-Sheng Lin, Shusil Dangi, Kai He, Winston H. Hsu

We propose to improve generalization to unseen camera sensors by implementing a minimal neural ISP pipeline for machine cognition, named GenISP, that explicitly incorporates Color Space Transformation to a device-independent color space.

Benchmarking Image Restoration +3

MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer

1 code implementation CVPR 2022 Kuan-Chih Huang, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu

Moreover, different from conventional pixel-wise positional encodings, we introduce a novel depth positional encoding (DPE) to inject depth positional hints into transformers.

Autonomous Driving Monocular 3D Object Detection +2

3rd Place Solution for NeurIPS 2021 Shifts Challenge: Vehicle Motion Prediction

no code implementations 2 Dec 2021 Ching-Yu Tseng, Po-Shao Lin, Yu-Jia Liou, Kuan-Chih Huang, Winston H. Hsu

Shifts Challenge: Robustness and Uncertainty under Real-World Distributional Shift is a competition held by NeurIPS 2021.

motion prediction

Anomaly-Aware Semantic Segmentation by Leveraging Synthetic-Unknown Data

no code implementations 29 Nov 2021 Guan-Rong Lu, Yueh-Cheng Liu, Tung-I Chen, Hung-Ting Su, Tsung-Han Wu, Winston H. Hsu

We design a new Masked Gradient Update (MGU) module to generate auxiliary data along the boundary of in-distribution data points.

Anomaly Detection Autonomous Driving +3

Multi-Stream Attention Learning for Monocular Vehicle Velocity and Inter-Vehicle Distance Estimation

no code implementations 22 Oct 2021 Kuan-Chih Huang, Yu-Kai Huang, Winston H. Hsu

Vehicle velocity and inter-vehicle distance estimation are essential for ADAS (Advanced driver-assistance systems) and autonomous vehicles.

Autonomous Vehicles object-detection +1

Multivariate and Propagation Graph Attention Network for Spatial-Temporal Prediction with Outdoor Cellular Traffic

1 code implementation 18 Aug 2021 Chung-Yi Lin, Hung-Ting Su, Shen-Lung Tung, Winston H. Hsu

Furthermore, we propose a new model for multivariate spatial-temporal prediction, mainly consisting of two extending graph attention networks (GAT).

Graph Attention

TrUMAn: Trope Understanding in Movies and Animations

no code implementations 10 Aug 2021 Hung-Ting Su, Po-Wei Shen, Bing-Chen Tsai, Wen-Feng Cheng, Ke-Jyun Wang, Winston H. Hsu

By coping with the trope understanding task and enabling the deep cognition skills of machines, data mining applications and algorithms could be taken to the next level.

Recommendation Systems

ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation

1 code implementation ICCV 2021 Tsung-Han Wu, Yueh-Cheng Liu, Yu-Kai Huang, Hsin-Ying Lee, Hung-Ting Su, Ping-Chia Huang, Winston H. Hsu

Despite the success of deep learning on supervised point cloud semantic segmentation, obtaining large-scale point-by-point manual annotations is still a significant challenge.

Active Learning Scene Understanding +1

Class-agnostic-Few-shot-Object-Counting

1 code implementation WACV 2021 Shuo-Diao Yang, Hung-Ting Su, Winston H. Hsu, Wen-Chin Chen

Instead of counting a pre-defined class, our model is able to count instances based on input reference images and reduces the huge cost of data collection, training and parameter tuning for each new object class.

Object Object Counting

OCID-Ref: A 3D Robotic Dataset with Embodied Language for Clutter Scene Grounding

1 code implementation NAACL 2021 Ke-Jyun Wang, Yun-Hsuan Liu, Hung-Ting Su, Jen-Wei Wang, Yu-Siang Wang, Winston H. Hsu, Wen-Chin Chen

To effectively apply robots in working environments and assist humans, it is essential to develop and evaluate how visual grounding (VG) can affect machine performance on occluded objects.

Referring Expression Referring Expression Segmentation +1

Dual-Awareness Attention for Few-Shot Object Detection

1 code implementation 24 Feb 2021 Tung-I Chen, Yueh-Cheng Liu, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu

While recent progress has significantly boosted few-shot classification (FSC) performance, few-shot object detection (FSOD) remains challenging for modern learning systems.

Few-Shot Learning Few-Shot Object Detection +2

Situation and Behavior Understanding by Trope Detection on Films

1 code implementation 19 Jan 2021 Chen-Hsi Chang, Hung-Ting Su, Jui-heng Hsu, Yu-Siang Wang, Yu-Cheng Chang, Zhe Yu Liu, Ya-Liang Chang, Wen-Feng Cheng, Ke-Jyun Wang, Winston H. Hsu

Experimental result demonstrates that modern models including BERT contextual embedding, movie tag prediction systems, and relational networks, perform at most 37% of human performance (23.97/64.87) in terms of F1 score.

Reading Comprehension Sentence +1

Raw Image Deblurring

1 code implementation 8 Dec 2020 Chih-Hung Liang, Yu-An Chen, Yueh-Cheng Liu, Winston H. Hsu

Therefore, we built a new dataset containing both RAW images and processed sRGB images and design a new model to utilize the unique characteristics of RAW images.

Blind Image Deblurring Image Deblurring +1

GDN: A Coarse-To-Fine (C2F) Representation for End-To-End 6-DoF Grasp Detection

no code implementations 21 Oct 2020 Kuang-Yu Jeng, Yueh-Cheng Liu, Zhe Yu Liu, Jen-Wei Wang, Ya-Liang Chang, Hung-Ting Su, Winston H. Hsu

We proposed an end-to-end grasp detection network, Grasp Detection Network (GDN), cooperated with a novel coarse-to-fine (C2F) grasp representation design to detect diverse and accurate 6-DoF grasps based on point clouds.

Efficient and Phase-aware Video Super-resolution for Cardiac MRI

no code implementations 21 May 2020 Jhih-Yuan Lin, Yu-Cheng Chang, Winston H. Hsu

Cardiac Magnetic Resonance Imaging (CMR) is widely used since it can illustrate the structure and function of heart in a non-invasive and painless way.

Video Super-Resolution

Expanding Sparse Guidance for Stereo Matching

no code implementations 24 Apr 2020 Yu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Winston H. Hsu

The performance of image based stereo estimation suffers from lighting variations, repetitive patterns and homogeneous appearance.

Domain Adaptation Stereo Matching

Indoor Depth Completion with Boundary Consistency and Self-Attention

3 code implementations 22 Aug 2019 Yu-Kai Huang, Tsung-Han Wu, Yueh-Cheng Liu, Winston H. Hsu

We utilize self-attention mechanism, previously used in image inpainting fields, to extract more useful information in each layer of convolution so that the complete depth map is enhanced.

Depth Completion Depth Estimation +1

A Unified Point-Based Framework for 3D Segmentation

1 code implementation 1 Aug 2019 Hung-Yueh Chiang, Yen-Liang Lin, Yueh-Cheng Liu, Winston H. Hsu

We present a new unified point-based framework for 3D point cloud segmentation that effectively optimizes pixel-level features, geometrical structures and global context priors of an entire scene.

Point Cloud Segmentation Segmentation

Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

no code implementations 30 Jul 2019 Sebastian Agethen, Winston H. Hsu

Herein, we propose a new enhancement to convolutional LSTM networks that supports accommodation of multiple convolutional kernels and layers.

Action Recognition

Super-Identity Convolutional Neural Network for Face Hallucination

no code implementations ECCV 2018 Kaipeng Zhang, Zhanpeng Zhang, Chia-Wen Cheng, Winston H. Hsu, Yu Qiao, Wei Liu, Tong Zhang

Face hallucination is a generative task to super-resolve the facial image with low resolution while human perception of face heavily relies on identity information.

Face Generation Face Hallucination +1

Drone-based Object Counting by Spatially Regularized Regional Proposal Network

no code implementations ICCV 2017 Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu

Existing counting methods often adopt regression-based approaches and cannot precisely localize the target objects, which hinders the further analysis (e.g., high-level understanding and fine-grained classification).

Object Counting Region Proposal

Photo Filter Recommendation by Category-Aware Aesthetic Learning

no code implementations 18 Aug 2016 Wei-Tse Sun, Ting-Hsuan Chao, Yin-Hsi Kuo, Winston H. Hsu

We conduct experiments on the collected FACD for filter recommendation, and the results show that our proposed category-aware aesthetic learning outperforms aesthetic classification methods (e.g., 12% relative improvement).

General Classification

De-Hashing: Server-Side Context-Aware Feature Reconstruction for Mobile Visual Search

no code implementations 29 Jun 2016 Yin-Hsi Kuo, Winston H. Hsu

Based on the hashed binary codes, we propose a de-hashing process that reconstructs BoW by leveraging the computing power of remote servers.

Retrieval Video Retrieval

Mediated Experts for Deep Convolutional Networks

no code implementations 19 Nov 2015 Sebastian Agethen, Winston H. Hsu

Our architecture achieves this with the help of expert networks: A network is trained on a disjoint subset of a given dataset and then run in parallel to other experts during deployment.

Incremental Learning

Who are the Devils Wearing Prada in New York City?

no code implementations 19 Aug 2015 Kuan-Ting Chen, Kezhen Chen, Peizhong Cong, Winston H. Hsu, Jiebo Luo

To answer this question, we design a novel system that consists of three major components: (1) constructing a large dataset from the New York Fashion Shows and New York street chic in order to understand the likely clothing fashion trends in New York, (2) utilizing a learning-based approach to discover fashion attributes as the representative characteristics of fashion trends, and (3) comparing the analysis results from the New York Fashion Shows and street-chic images to verify whether the fashion shows have actual influence on the people in New York City.

Scalable Object Detection by Filter Compression With Regularized Sparse Coding

no code implementations CVPR 2015 Ting-Hsuan Chao, Yen-Liang Lin, Yin-Hsi Kuo, Winston H. Hsu

Our method can reconstruct filters by minimizing score map error, while sparse coding reconstructs filters by minimizing appearance error.

object-detection Object Detection

Transfer Learning for Video Recognition with Scarce Training Data for Deep Convolutional Neural Network

no code implementations 15 Sep 2014 Yu-Chuan Su, Tzu-Hsuan Chiu, Chun-Yen Yeh, Hsin-Fu Huang, Winston H. Hsu

The same lack-of-training-sample problem limits the usage of deep models on a wide range of computer vision problems where obtaining training data is difficult.

4k Transfer Learning +1
