Search Results for author: Jia-Hong Huang

Found 18 papers, 7 papers with code

Conditional Modeling Based Automatic Video Summarization

no code implementations 20 Nov 2023 Jia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring

The aim of video summarization is to shorten videos automatically while retaining the key information necessary to convey the overall story.

Video Summarization

Query-based Video Summarization with Pseudo Label Supervision

no code implementations 4 Jul 2023 Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring

Existing datasets for manually labelled query-based video summarization are costly and thus small, limiting the performance of supervised deep video summarization models.

Pseudo Label, Video Summarization

Improving Visual Question Answering Models through Robustness Analysis and In-Context Learning with a Chain of Basic Questions

no code implementations 6 Apr 2023 Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring

This work proposes a new method that utilizes semantically related questions, referred to as basic questions, acting as noise to evaluate the robustness of VQA models.

In-Context Learning, Question Answering +1

The Dawn of Quantum Natural Language Processing

2 code implementations 13 Oct 2021 Riccardo Di Sipio, Jia-Hong Huang, Samuel Yen-Chi Chen, Stefano Mangini, Marcel Worring

In this paper, we discuss the initial attempts at boosting the understanding of human language based on deep-learning models with quantum computing.

Sentiment Analysis

Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning"

no code implementations 30 May 2021 Jia-Hong Huang, Ting-Wei Wu, Chao-Han Huck Yang, Marcel Worring

Automatically generating medical reports for retinal images is one of the promising ways to help ophthalmologists reduce their workload and improve work efficiency.

Avg Image Captioning +1

Contextualized Keyword Representations for Multi-modal Retinal Image Captioning

no code implementations 26 Apr 2021 Jia-Hong Huang, Ting-Wei Wu, Marcel Worring

A traditional medical image captioning model creates a medical description based only on a single medical image input.

Avg Image Captioning

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization

2 code implementations 26 Apr 2021 Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring

Traditional video summarization methods generate fixed video representations regardless of user interest.

Video Summarization

Query-controllable Video Summarization

1 code implementation 7 Apr 2020 Jia-Hong Huang, Marcel Worring

In this work, we introduce a method which takes a text-based query as input and generates a video summary corresponding to it.

Video Summarization

Assessing the Robustness of Visual Question Answering Models

no code implementations 30 Nov 2019 Jia-Hong Huang, Modar Alfadly, Bernard Ghanem, Marcel Worring

In this work, we propose a new method that uses semantically related questions, dubbed basic questions, acting as noise to evaluate the robustness of VQA models.

Question Answering, Visual Question Answering

Auto-Classification of Retinal Diseases in the Limit of Sparse Data Using a Two-Streams Machine Learning Model

1 code implementation 16 Aug 2018 C.-H. Huck Yang, Fangyu Liu, Jia-Hong Huang, Meng Tian, Hiromasa Morikawa, I-Hung Lin, Yi-Chieh Liu, Hao-Hsiang Yang, Jesper Tegner

Automatic clinical diagnosis of retinal diseases has emerged as a promising approach to facilitate discovery in areas with limited access to specialists.

General Classification

A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal Diseases

1 code implementation 17 Jun 2018 C.-H. Huck Yang, Jia-Hong Huang, Fangyu Liu, Fang-Yi Chiu, Mengya Gao, Weifeng Lyu, I-Hung Lin M. D., Jesper Tegner

Automatic clinical diagnosis of retinal diseases has emerged as a promising approach to facilitate discovery in areas with limited access to specialists.

BIG-bench Machine Learning, General Classification

A Novel Framework for Robustness Analysis of Visual QA Models

no code implementations 16 Nov 2017 Jia-Hong Huang, Cuong Duc Dao, Modar Alfadly, Bernard Ghanem

In VQA, adversarial attacks can target the image and/or the proposed main question, yet there is a lack of proper analysis of the latter.

Question Answering, Visual Question Answering

VQABQ: Visual Question Answering by Basic Questions

no code implementations 19 Mar 2017 Jia-Hong Huang, Modar Alfadly, Bernard Ghanem

Given a natural language question about an image, the first module takes the question as input and then outputs the basic questions of the main given question.

Question Answering, Visual Question Answering
