Search Results for author: Hanli Wang

Found 17 papers, 6 papers with code

Misalignment-Robust Frequency Distribution Loss for Image Transformation

1 code implementation28 Feb 2024 Zhangkai Ni, Juncheng Wu, Zian Wang, Wenhan Yang, Hanli Wang, Lin Ma

This paper aims to address a common challenge in deep learning-based image transformation methods, such as image enhancement and super-resolution, which heavily rely on precisely aligned paired datasets with pixel-level alignments.

Image Enhancement Style Transfer +1

ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field

1 code implementation14 Dec 2023 Zhangkai Ni, Peiqi Yang, Wenhan Yang, Hanli Wang, Lin Ma, Sam Kwong

Through this, we construct a novel collaborative module that aligns information from various views and meanwhile imposes self-supervised constraints to ensure multi-view consistency in both geometry and appearance.

Novel View Synthesis

Glow in the Dark: Low-Light Image Enhancement with External Memory

1 code implementation IEEE Transactions on Multimedia 2023 Dongjie Ye, Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Sam Kwong

Benefiting from the learned memory, more complex distributions of reference images in the entire dataset can be “remembered” to facilitate the adjustment of the testing samples more adaptively.

Low-Light Image Enhancement

Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning

no code implementations30 Jan 2023 Jian Zhu, Hanli Wang, Miaojing Shi

On the other hand, BLIP-2 as an MLLM is employed to process images and texts, and the referring expressions in texts involving specific visual objects are modified with linguistic object labels to serve as comprehensible MLLM inputs.

Language Modelling Large Language Model +1

Just Noticeable Difference Modeling for Face Recognition System

no code implementations13 Sep 2022 Yu Tian, Zhangkai Ni, Baoliang Chen, Shurun Wang, Shiqi Wang, Hanli Wang, Sam Kwong

In particular, in order to maximum redundancy removal without impairment of robust identity information, we apply the encoder with multiple feature extraction and attention-based feature decomposition modules to progressively decompose face features into two uncorrelated components, i. e., identity and residual features, via self-supervised learning.

Face Recognition Self-Supervised Learning

High Dynamic Range Image Quality Assessment Based on Frequency Disparity

1 code implementation6 Sep 2022 Yue Liu, Zhangkai Ni, Shiqi Wang, Hanli Wang, Sam Kwong

In this paper, a novel and effective image quality assessment (IQA) algorithm based on frequency disparity for high dynamic range (HDR) images is proposed, termed as local-global frequency feature-based model (LGFM).

Image Quality Assessment Vocal Bursts Intensity Prediction

Cycle-Interactive Generative Adversarial Network for Robust Unsupervised Low-Light Enhancement

no code implementations3 Jul 2022 Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Lin Ma, Sam Kwong

Getting rid of the fundamental limitations in fitting to the paired training data, recent unsupervised low-light enhancement methods excel in adjusting illumination and contrast of images.

Generative Adversarial Network Low-Light Image Enhancement

Taking an Emotional Look at Video Paragraph Captioning

no code implementations12 Mar 2022 Qinyu Li, Tengpeng Li, Hanli Wang, Chang Wen Chen

In this work, a comprehensive study is conducted on video paragraph captioning, with the goal to generate paragraph-level descriptions for a given video.

Image Captioning

Two-stream Hierarchical Similarity Reasoning for Image-text Matching

no code implementations10 Mar 2022 Ran Chen, Hanli Wang, Lei Wang, Sam Kwong

Second, previous approaches only consider learning single-stream similarity alignment (i. e., image-to-text level or text-to-image level), which is inadequate to fully use similarity information for image-text matching.

Image-text matching Text Matching +1

Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling

no code implementations10 Mar 2022 Tengpeng Li, Hanli Wang, Bin He, Chang Wen Chen

Third, a unified one-stage story generation model with encoder-decoder structure is proposed to simultaneously train and infer the knowledge-enriched attention network, group-wise semantic module and multi-modal story generation decoder in an end-to-end fashion.

Visual Storytelling

Generalized Visual Quality Assessment of GAN-Generated Face Images

no code implementations28 Jan 2022 Yu Tian, Zhangkai Ni, Baoliang Chen, Shiqi Wang, Hanli Wang, Sam Kwong

However, little work has been dedicated to automatic quality assessment of such GAN-generated face images (GFIs), even less have been devoted to generalized and robust quality assessment of GFIs generated with unseen GAN model.

Face Generation Image Quality Assessment +1

Visual Storytelling with Hierarchical BERT Semantic Guidance

no code implementations ACM Multimedia Asia 2022 Ruichao Fan, Hanli Wang, Jinjing Gu, and Xianhui Liu

As there is no ground-truth topic information, a pre-trained BERT model based on visual contents and annotated stories is utilized to mine topics.

Sentence Visual Storytelling

CSformer: Bridging Convolution and Transformer for Compressive Sensing

1 code implementation31 Dec 2021 Dongjie Ye, Zhangkai Ni, Hanli Wang, Jian Zhang, Shiqi Wang, Sam Kwong

The proposed approach is an end-to-end compressive image sensing method, composed of adaptive sampling and recovery.

Compressive Sensing Inductive Bias +1

A new network-based algorithm for human activity recognition in video

no code implementations21 Feb 2015 Weiyao Lin, Yuanzhe Chen, Jianxin Wu, Hanli Wang, Bin Sheng, Hongxiang Li

Based on this network, we further model people in the scene as packages while human activities can be modeled as the process of package transmission in the network.

Activity Detection Activity Recognition In Videos +2

Cannot find the paper you are looking for? You can Submit a new open access paper.