Search Results for author: Ming Jiang

Found 34 papers, 13 papers with code

Attention in Reasoning: Dataset, Analysis, and Modeling

1 code implementation 20 Apr 2022 Shi Chen, Ming Jiang, Jinhui Yang, Qi Zhao

In this work, we propose an Attention with Reasoning capability (AiR) framework that uses attention to understand and improve the process leading to task outcomes.

Question Answering Visual Question Answering

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface

no code implementations 16 Mar 2022 Diu Khue Luu, Anh Tuan Nguyen, Ming Jiang, Markus W. Drealan, Jian Xu, Tong Wu, Wing-kin Tam, Wenfeng Zhao, Brian Z. H. Lim, Cynthia K. Overstreet, Qi Zhao, Jonathan Cheng, Edward W. Keefer, Zhi Yang

Objective: The next-generation prosthetic hand that moves and feels like a real hand requires a robust neural interconnection between human minds and machines.

Query and Attention Augmentation for Knowledge-Based Explainable Reasoning

1 code implementation CVPR 2022 Yifeng Zhang, Ming Jiang, Qi Zhao

Explainable visual question answering (VQA) models have been developed with neural modules and query-based knowledge incorporation to answer knowledge-requiring questions.

Question Answering Visual Question Answering +1

VisualHow: Multimodal Problem Solving

1 code implementation CVPR 2022 Jinhui Yang, Xianyu Chen, Ming Jiang, Shi Chen, Louis Wang, Qi Zhao

With an overarching goal of developing intelligent systems to assist humans in various daily activities, we propose VisualHow, a free-form and open-ended research effort that focuses on understanding a real-life problem and deriving its solution by incorporating key components across multiple modalities.

Natural Language Processing

A Speaker-aware Parallel Hierarchical Attentive Encoder-Decoder Model for Multi-turn Dialogue Generation

no code implementations 13 Oct 2021 ZiHao Wang, Ming Jiang, Junli Wang

Differing from prior work that solely relies on the content of conversation history to generate a response, we argue that capturing relative social relations among utterances (i.e., generated by either the same speaker or different persons) benefits the machine capturing fine-grained context information from a conversation history to improve context coherence in the generated response.

Dialogue Generation

Predicting Human Scanpaths in Visual Question Answering

no code implementations CVPR 2021 Xianyu Chen, Ming Jiang, Qi Zhao

Conditioned on a task guidance map, the proposed model learns question-specific attention patterns to generate scanpaths.

Question Answering Scanpath prediction +2

Explicit Knowledge Incorporation for Visual Reasoning

no code implementations CVPR 2021 Yifeng Zhang, Ming Jiang, Qi Zhao

Existing explainable and explicit visual reasoning methods only perform reasoning based on visual evidence but do not take into account knowledge beyond what is in the visual scene.

Visual Reasoning

A Portable, Self-Contained Neuroprosthetic Hand with Deep Learning-Based Finger Control

no code implementations 24 Mar 2021 Anh Tuan Nguyen, Markus W. Drealan, Diu Khue Luu, Ming Jiang, Jian Xu, Jonathan Cheng, Qi Zhao, Edward W. Keefer, Zhi Yang

This enables the implementation of the neuroprosthetic hand as a portable and self-contained unit with real-time control of individual finger movements.

Edge-computing

Unified Quality Assessment of In-the-Wild Videos with Mixed Datasets Training

1 code implementation 9 Nov 2020 Dingquan Li, Tingting Jiang, Ming Jiang

We focus on automatically assessing the quality of in-the-wild videos, which is a challenging problem due to the absence of reference videos, the complexity of distortions, and the diversity of video contents.

Video Quality Assessment VQA

Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

1 code implementation 10 Aug 2020 Dingquan Li, Tingting Jiang, Ming Jiang

Experiments on two relevant datasets (KonIQ-10k and CLIVE) show that, compared to MAE or MSE loss, the new loss enables the IQA model to converge about 10 times faster and the final model achieves better performance.

Blind Image Quality Assessment No-Reference Image Quality Assessment +1

AiR: Attention with Reasoning Capability

1 code implementation ECCV 2020 Shi Chen, Ming Jiang, Jinhui Yang, Qi Zhao

In this work, we propose an Attention with Reasoning capability (AiR) framework that uses attention to understand and improve the process leading to task outcomes.

Saliency Prediction with External Knowledge

no code implementations 27 Jul 2020 Yifeng Zhang, Ming Jiang, Qi Zhao

At the core of the method is a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge.

Graph Attention Saliency Prediction

Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection

no code implementations 23 Jul 2020 Xianyu Chen, Ming Jiang, Qi Zhao

Few-shot object detection aims at detecting objects with few annotated examples, which remains a challenging research problem yet to be explored.

Few-Shot Learning Few-Shot Object Detection +1

Fantastic Answers and Where to Find Them: Immersive Question-Directed Visual Attention

no code implementations CVPR 2020 Ming Jiang, Shi Chen, Jinhui Yang, Qi Zhao

The Immersive Question-directed Visual Attention (IQVA) dataset features visual attention and corresponding task performance (i.e., answer correctness).

Decision Making

Improving Scholarly Knowledge Representation: Evaluating BERT-based Models for Scientific Relation Classification

no code implementations 13 Apr 2020 Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie

With the rapid growth of research publications, there is a vast amount of scholarly knowledge that needs to be organized in digital libraries.

Classification General Classification +1

LabelFool: A Trick in the Label Space

no code implementations 25 Sep 2019 Yujia Liu, Tingting Jiang, Ming Jiang

It is widely known that well-designed perturbations can cause state-of-the-art machine learning classifiers to mis-label an image, with sufficiently small perturbations that are imperceptible to the human eyes.

REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning

1 code implementation IJCNLP 2019 Ming Jiang, Junjie Hu, Qiuyuan Huang, Lei Zhang, Jana Diesner, Jianfeng Gao

In this study, we present a fine-grained evaluation method REO for automatically measuring the performance of image captioning systems.

Image Captioning

Quality Assessment of In-the-Wild Videos

2 code implementations 1 Aug 2019 Dingquan Li, Tingting Jiang, Ming Jiang

We propose an objective no-reference video quality assessment method by integrating both effects into a deep neural network.

Image Classification Video Quality Assessment

Single Image Blind Deblurring Using Multi-Scale Latent Structure Prior

no code implementations 11 Jun 2019 Yuanchao Bai, Huizhu Jia, Ming Jiang, Xian-Ming Liu, Xiaodong Xie, Wen Gao

Blind image deblurring is a challenging problem in computer vision, which aims to restore both the blur kernel and the latent sharp image from only a blurry observation.

Blind Image Deblurring Image Deblurring +3

Parsing R-CNN for Instance-Level Human Analysis

2 code implementations CVPR 2019 Lu Yang, Qing Song, Zhihui Wang, Ming Jiang

Models need to distinguish different human instances in the image panel and learn rich features to represent the details of each instance.

Human Part Segmentation Multi-Human Parsing +1

Quality Assessment for Tone-Mapped HDR Images Using Multi-Scale and Multi-Layer Information

1 code implementation 19 Oct 2018 Qin He, Dingquan Li, Tingting Jiang, Ming Jiang

So we propose a new no-reference method of tone-mapped image quality assessment based on multi-scale and multi-layer features that are extracted from a pre-trained deep convolutional neural network model.

Blind Image Quality Assessment No-Reference Image Quality Assessment Multimedia

Exploiting High-Level Semantics for No-Reference Image Quality Assessment of Realistic Blur Images

1 code implementation 18 Oct 2018 Dingquan Li, Tingting Jiang, Ming Jiang

To guarantee a satisfying Quality of Experience (QoE) for consumers, it is required to measure image quality efficiently and reliably.

Blind Image Quality Assessment Image Quality Estimation +1

Which Has Better Visual Quality: The Clear Blue Sky or a Blurry Animal?

1 code implementation IEEE Transactions on Multimedia 2018 Dingquan Li, Tingting Jiang, Weisi Lin, Ming Jiang

The proposed method, SFA, is compared with nine representative blur-specific NR-IQA methods, two general-purpose NR-IQA methods, and two extra full-reference IQA methods on Gaussian blur images (with and without Gaussian noise/JPEG compression) and realistic blur images from multiple databases, including LIVE, TID2008, TID2013, MLIVE1, MLIVE2, BID, and CLIVE.

Blind Image Quality Assessment Image Classification +2

A Neural Network Aided Approach for LDPC Coded DCO-OFDM with Clipping Distortion

no code implementations 4 Sep 2018 Yuan He, Ming Jiang, Chunming Zhao

In this paper, a neural network-aided bit-interleaved coded modulation (NN-BICM) receiver is designed to mitigate the nonlinear clipping distortion in the LDPC coded direct current-biased optical orthogonal frequency division multiplexing (DCO-OFDM) systems.

Emotional Attention: A Study of Image Sentiment and Visual Attention

no code implementations CVPR 2018 Shaojing Fan, Zhiqi Shen, Ming Jiang, Bryan L. Koenig, Juan Xu, Mohan S. Kankanhalli, Qi Zhao

In this paper, we present the first study to focus on the relation between emotional properties of an image and visual attention.

Saliency Prediction

Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy

no code implementations CVPR 2018 Guanglu Song, Yu Liu, Ming Jiang, Yujie Wang, Junjie Yan, Biao Leng

Fully convolutional neural network (FCN) has been dominating the game of face detection task for a few years with its congenital capability of sliding-window-searching with shared kernels, which boiled down all the redundant calculation, and most recent state-of-the-art methods such as Faster-RCNN, SSD, YOLO and FPN use FCN as their backbone.

Face Detection Philosophy

Learning Visual Attention to Identify People With Autism Spectrum Disorder

no code implementations ICCV 2017 Ming Jiang, Qi Zhao

This paper presents a novel method for quantitative and objective diagnoses of Autism Spectrum Disorder (ASD) using eye tracking and deep neural networks.

Says Who...? Identification of Expert versus Layman Critics' Reviews of Documentary Films

no code implementations COLING 2016 Ming Jiang, Jana Diesner

We extend classic review mining work by building a binary classifier that predicts whether a review of a documentary film was written by an expert or a layman with 90.70% accuracy (F1 score), and compare the characteristics of the predicted classes.

Decision Making Recommendation Systems

SALICON: Saliency in Context

no code implementations CVPR 2015 Ming Jiang, Shengsheng Huang, Juanyong Duan, Qi Zhao

Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention.

Saliency Prediction
