Search Results for author: Hanli Wang

Found 17 papers, 6 papers with code

Misalignment-Robust Frequency Distribution Loss for Image Transformation

1 code implementation • 28 Feb 2024 • Zhangkai Ni, Juncheng Wu, Zian Wang, Wenhan Yang, Hanli Wang, Lin Ma

This paper aims to address a common challenge in deep learning-based image transformation methods, such as image enhancement and super-resolution, which heavily rely on precisely aligned paired datasets with pixel-level alignments.

Image Enhancement Style Transfer +1

ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field

1 code implementation • 14 Dec 2023 • Zhangkai Ni, Peiqi Yang, Wenhan Yang, Hanli Wang, Lin Ma, Sam Kwong

Through this, we construct a novel collaborative module that aligns information from various views and meanwhile imposes self-supervised constraints to ensure multi-view consistency in both geometry and appearance.

Novel View Synthesis

Glow in the Dark: Low-Light Image Enhancement with External Memory

1 code implementation • IEEE Transactions on Multimedia 2023 • Dongjie Ye, Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Sam Kwong

Benefiting from the learned memory, more complex distributions of reference images in the entire dataset can be “remembered” to facilitate the adjustment of the testing samples more adaptively.

Ranked #2 on Low-Light Image Enhancement on LOL-v2

Low-Light Image Enhancement

Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning

no code implementations • 30 Jan 2023 • Jian Zhu, Hanli Wang, Miaojing Shi

On the other hand, BLIP-2 as an MLLM is employed to process images and texts, and the referring expressions in texts involving specific visual objects are modified with linguistic object labels to serve as comprehensible MLLM inputs.

Language Modelling Large Language Model +1

Just Noticeable Difference Modeling for Face Recognition System

no code implementations • 13 Sep 2022 • Yu Tian, Zhangkai Ni, Baoliang Chen, Shurun Wang, Shiqi Wang, Hanli Wang, Sam Kwong

In particular, to maximize redundancy removal without impairing robust identity information, we apply the encoder with multiple feature extraction and attention-based feature decomposition modules to progressively decompose face features into two uncorrelated components, i.e., identity and residual features, via self-supervised learning.

Face Recognition Self-Supervised Learning

High Dynamic Range Image Quality Assessment Based on Frequency Disparity

1 code implementation • 6 Sep 2022 • Yue Liu, Zhangkai Ni, Shiqi Wang, Hanli Wang, Sam Kwong

In this paper, a novel and effective image quality assessment (IQA) algorithm based on frequency disparity for high dynamic range (HDR) images is proposed, termed the local-global frequency feature-based model (LGFM).

Image Quality Assessment Vocal Bursts Intensity Prediction

Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention

no code implementations • IEEE Transactions on Circuits and Systems for Video Technology 2022 • Jinjing Gu, Hanli Wang

In this work, a coherent visual storytelling (CoVS) framework is designed to address the above-mentioned problems.

Ranked #4 on Visual Storytelling on VIST

Sentence Visual Storytelling

Cycle-Interactive Generative Adversarial Network for Robust Unsupervised Low-Light Enhancement

no code implementations • 3 Jul 2022 • Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Lin Ma, Sam Kwong

Freed from the fundamental limitations of fitting to paired training data, recent unsupervised low-light enhancement methods excel in adjusting the illumination and contrast of images.

Generative Adversarial Network Low-Light Image Enhancement

Taking an Emotional Look at Video Paragraph Captioning

no code implementations • 12 Mar 2022 • Qinyu Li, Tengpeng Li, Hanli Wang, Chang Wen Chen

In this work, a comprehensive study is conducted on video paragraph captioning, with the goal of generating paragraph-level descriptions for a given video.

Image Captioning

Two-stream Hierarchical Similarity Reasoning for Image-text Matching

no code implementations • 10 Mar 2022 • Ran Chen, Hanli Wang, Lei Wang, Sam Kwong

Second, previous approaches only consider learning single-stream similarity alignment (i.e., image-to-text level or text-to-image level), which is inadequate to fully use similarity information for image-text matching.

Image-text matching Text Matching +1

Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling

no code implementations • 10 Mar 2022 • Tengpeng Li, Hanli Wang, Bin He, Chang Wen Chen

Third, a unified one-stage story generation model with encoder-decoder structure is proposed to simultaneously train and infer the knowledge-enriched attention network, group-wise semantic module and multi-modal story generation decoder in an end-to-end fashion.

Visual Storytelling

Generalized Visual Quality Assessment of GAN-Generated Face Images

no code implementations • 28 Jan 2022 • Yu Tian, Zhangkai Ni, Baoliang Chen, Shiqi Wang, Hanli Wang, Sam Kwong

However, little work has been dedicated to automatic quality assessment of such GAN-generated face images (GFIs), and even less has been devoted to generalized and robust quality assessment of GFIs generated by unseen GAN models.

Face Generation Image Quality Assessment +1

Visual Storytelling with Hierarchical BERT Semantic Guidance

no code implementations • ACM Multimedia Asia 2022 • Ruichao Fan, Hanli Wang, Jinjing Gu, Xianhui Liu

As there is no ground-truth topic information, a pre-trained BERT model based on visual contents and annotated stories is utilized to mine topics.

Ranked #2 on Visual Storytelling on VIST

Sentence Visual Storytelling

CSformer: Bridging Convolution and Transformer for Compressive Sensing

1 code implementation • 31 Dec 2021 • Dongjie Ye, Zhangkai Ni, Hanli Wang, Jian Zhang, Shiqi Wang, Sam Kwong

The proposed approach is an end-to-end compressive image sensing method, composed of adaptive sampling and recovery.

Compressive Sensing Inductive Bias +1

Categorizing Concepts With Basic Level for Vision-to-Language

no code implementations • CVPR 2018 • Hanzhang Wang, Hanli Wang, Kaisheng Xu

Vision-to-language tasks require a unified semantic understanding of visual content.

Clustering Image Captioning +3

Real-time Action Recognition with Enhanced Motion Vector CNNs

1 code implementation • CVPR 2016 • Bowen Zhang, Li-Min Wang, Zhe Wang, Yu Qiao, Hanli Wang

The deep two-stream architecture exhibited excellent performance on video-based action recognition.

Ranked #74 on Action Recognition on UCF101

Action Recognition Optical Flow Estimation +1

A new network-based algorithm for human activity recognition in video

no code implementations • 21 Feb 2015 • Weiyao Lin, Yuanzhe Chen, Jianxin Wu, Hanli Wang, Bin Sheng, Hongxiang Li

Based on this network, we further model people in the scene as packages while human activities can be modeled as the process of package transmission in the network.

Activity Detection Activity Recognition In Videos +2
