Search Results for author: Liang Liao

Found 30 papers, 23 papers with code

Towards Open-ended Visual Quality Comparison

no code implementations • 26 Feb 2024 • HaoNing Wu, Hanwei Zhu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Annan Wang, Wenxiu Sun, Qiong Yan, Xiaohong Liu, Guangtao Zhai, Shiqi Wang, Weisi Lin

Comparative settings (e. g. pairwise choice, listwise ranking) have been adopted by a wide range of subjective studies for image quality assessment (IQA), as it inherently standardizes the evaluation criteria across different observers and offer more clear-cut responses.

Image Quality Assessment

Paper
Add Code

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels

1 code implementation • 28 Dec 2023 • HaoNing Wu, ZiCheng Zhang, Weixia Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Yixuan Gao, Annan Wang, Erli Zhang, Wenxiu Sun, Qiong Yan, Xiongkuo Min, Guangtao Zhai, Weisi Lin

The explosion of visual content available online underscores the requirement for an accurate machine assessor to robustly evaluate scores across diverse types of visual contents.

Ranked #1 on Video Quality Assessment on LIVE-FB LSVQ

Aesthetics Quality Assessment Video Quality Assessment +1

137

Paper
Code

Iterative Token Evaluation and Refinement for Real-World Super-Resolution

1 code implementation • 9 Dec 2023 • Chaofeng Chen, Shangchen Zhou, Liang Liao, HaoNing Wu, Wenxiu Sun, Qiong Yan, Weisi Lin

Distortion removal involves simple HQ token prediction with LQ images, while texture generation uses a discrete diffusion model to iteratively refine the distortion removal output with a token refinement network.

Image Super-Resolution Texture Synthesis

Paper
Code

Enhancing Diffusion Models with Text-Encoder Reinforcement Learning

1 code implementation • 27 Nov 2023 • Chaofeng Chen, Annan Wang, HaoNing Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin

While fine-tuning the U-Net can partially improve performance, it remains suffering from the suboptimal text encoder.

reinforcement-learning

Paper
Code

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

1 code implementation • 12 Nov 2023 • HaoNing Wu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Annan Wang, Kaixin Xu, Chunyi Li, Jingwen Hou, Guangtao Zhai, Geng Xue, Wenxiu Sun, Qiong Yan, Weisi Lin

Multi-modality foundation models, as represented by GPT-4V, have brought a new paradigm for low-level visual perception and understanding tasks, that can respond to a broad range of natural human instructions in a model.

157

Paper
Code

Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation

1 code implementation • 31 Oct 2023 • Liang Liao, Liang Wan, Mingsheng Liu, Shusheng Li

To be precise, we use the Dual-Guided Attention (DGA) module we proposed to replace some multi-scale transformations with the calculation of attention which means we only use several attention layers of near linear complexity to achieve performance comparable to frequently-used multi-layer fusion.

Real-Time Semantic Segmentation

Paper
Code

Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision

1 code implementation • 25 Sep 2023 • HaoNing Wu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Annan Wang, Chunyi Li, Wenxiu Sun, Qiong Yan, Guangtao Zhai, Weisi Lin

To address this gap, we present Q-Bench, a holistic benchmark crafted to systematically evaluate potential abilities of MLLMs on three realms: low-level visual perception, low-level visual description, and overall visual quality assessment.

Image Quality Assessment

188

Paper
Code

Local Distortion Aware Efficient Transformer Adaptation for Image Quality Assessment

no code implementations • 23 Aug 2023 • Kangmin Xu, Liang Liao, Jing Xiao, Chaofeng Chen, HaoNing Wu, Qiong Yan, Weisi Lin

Further, we propose a local distortion extractor to obtain local distortion features from the pretrained CNN and a local distortion injector to inject the local distortion features into ViT.

Image Quality Assessment Inductive Bias +1

Paper
Add Code

TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment

1 code implementation • 6 Aug 2023 • Chaofeng Chen, Jiadi Mo, Jingwen Hou, HaoNing Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin

Our approach to IQA involves the design of a heuristic coarse-to-fine network (CFANet) that leverages multi-scale features and progressively propagates multi-level semantic information to low-level representations in a top-down manner.

Ranked #11 on Video Quality Assessment on MSU SR-QA Dataset

Image Quality Assessment Local Distortion +2

1,442

Paper
Code

Color Image Recovery Using Generalized Matrix Completion over Higher-Order Finite Dimensional Algebra

no code implementations • 4 Aug 2023 • Liang Liao, Zhuang Guo, Qi Gao, Yan Wang, Fajun Yu, Qifeng Zhao, Stephen Johh Maybank

To improve the accuracy of color image completion with missing entries, we present a recovery method based on generalized higher-order scalars.

Matrix Completion

Paper
Add Code

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

1 code implementation • 20 Jun 2023 • Liang Liao, Taorong Liu, Delin Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

For precise utilization of the reference features for guidance, a reference-patch alignment (Ref-PA) module is proposed to align the patch features of the reference and corrupted images and harmonize their style differences, while a reference-patch transformer (Ref-PT) module is proposed to refine the embedded reference feature.

Image Inpainting Image Restoration

Paper
Code

Towards Explainable In-the-Wild Video Quality Assessment: A Database and a Language-Prompted Approach

1 code implementation • 22 May 2023 • HaoNing Wu, Erli Zhang, Liang Liao, Chaofeng Chen, Jingwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

Though subjective studies have collected overall quality scores for these videos, how the abstract quality scores relate with specific factors is still obscure, hindering VQA methods from more concrete quality evaluations (e. g. sharpness of a video).

Video Quality Assessment Visual Question Answering (VQA)

Paper
Code

GCFAgg: Global and Cross-view Feature Aggregation for Multi-view Clustering

1 code implementation • CVPR 2023 • Weiqing Yan, Yuanyang Zhang, Chenlei Lv, Chang Tang, Guanghui Yue, Liang Liao, Weisi Lin

However, most existing deep clustering methods learn consensus representation or view-specific representations from multiple views via view-wise aggregation way, where they ignore structure relationship of all samples.

Clustering Contrastive Learning +1

Paper
Code

Towards Robust Text-Prompted Semantic Criterion for In-the-Wild Video Quality Assessment

2 code implementations • 28 Apr 2023 • HaoNing Wu, Liang Liao, Annan Wang, Chaofeng Chen, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

The proliferation of videos collected during in-the-wild natural settings has pushed the development of effective Video Quality Assessment (VQA) methodologies.

Video Quality Assessment Visual Question Answering (VQA)

Paper
Code

Exploring Opinion-unaware Video Quality Assessment with Semantic Affinity Criterion

2 code implementations • 26 Feb 2023 • HaoNing Wu, Liang Liao, Jingwen Hou, Chaofeng Chen, Erli Zhang, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

Recent learning-based video quality assessment (VQA) algorithms are expensive to implement due to the cost of data collection of human quality opinions, and are less robust across various scenarios due to the biases of these opinions.

Video Quality Assessment Visual Question Answering (VQA)

Paper
Code

Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives

3 code implementations • ICCV 2023 • HaoNing Wu, Erli Zhang, Liang Liao, Chaofeng Chen, Jingwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

In light of this, we propose the Disentangled Objective Video Quality Evaluator (DOVER) to learn the quality of UGC videos based on the two perspectives.

Ranked #1 on Video Quality Assessment on LIVE-VQC

Disentanglement Video Generation +2

218

Paper
Code

Neighbourhood Representative Sampling for Efficient End-to-end Video Quality Assessment

4 code implementations • 11 Oct 2022 • HaoNing Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Jinwei Gu, Weisi Lin

On the other hand, existing practices, such as resizing and cropping, will change the quality of original videos due to the loss of details and contents, and are therefore harmful to quality assessment.

Ranked #2 on Video Quality Assessment on KoNViD-1k (using extra training data)

Video Quality Assessment Visual Question Answering (VQA)

218

Paper
Code

Reference-Guided Texture and Structure Inference for Image Inpainting

1 code implementation • 29 Jul 2022 • Taorong Liu, Liang Liao, Zheng Wang, Shin'ichi Satoh

Existing learning-based image inpainting methods are still in challenge when facing complex semantic environments and diverse hole patterns.

Image Inpainting

Paper
Code

Exploring the Effectiveness of Video Perceptual Representation in Blind Video Quality Assessment

1 code implementation • 8 Jul 2022 • Liang Liao, Kangmin Xu, HaoNing Wu, Chaofeng Chen, Wenxiu Sun, Qiong Yan, Weisi Lin

Experiments show that the perceptual representation in the HVS is an effective way of predicting subjective temporal quality, and thus TPQI can, for the first time, achieve comparable performance to the spatial quality metric and be even more effective in assessing videos with large temporal variations.

Video Quality Assessment Visual Question Answering (VQA)

Paper
Code

FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

4 code implementations • 6 Jul 2022 • HaoNing Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

Consisting of fragments and FANet, the proposed FrAgment Sample Transformer for VQA (FAST-VQA) enables efficient end-to-end deep VQA and learns effective video-quality-related representations.

Ranked #3 on Video Quality Assessment on LIVE-VQC (using extra training data)

Video Quality Assessment

218

Paper
Code

DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

1 code implementation • 20 Jun 2022 • HaoNing Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

Based on prominent time-series modeling ability of transformers, we propose a novel and effective transformer-based VQA method to tackle these two issues.

Ranked #5 on Video Quality Assessment on KoNViD-1k

Time Series Analysis Video Quality Assessment +1

Paper
Code

Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion

1 code implementation • 10 Jun 2022 • Liang Liao, WenYi Chen, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

Specifically, based on the two discoveries of local spatial similarity and adjacent temporal correspondence of the sequential image data, we propose a novel Target-Domain driven pseudo label Diffusion (TDo-Dif) scheme.

Autonomous Driving Pseudo Label +4

Paper
Code

Spatial-Temporal Space Hand-in-Hand: Spatial-Temporal Video Super-Resolution via Cycle-Projected Mutual Learning

no code implementations • CVPR 2022 • Mengshun Hu, Kui Jiang, Liang Liao, Jing Xiao, Junjun Jiang, Zheng Wang

Specifically, we propose to exploit the mutual information among them via iterative up-and-down projections, where the spatial and temporal features are fully fused and distilled, helping the high-quality video reconstruction.

Video Reconstruction Video Super-Resolution

Paper
Add Code

Approximation of Images via Generalized Higher Order Singular Value Decomposition over Finite-dimensional Commutative Semisimple Algebra

1 code implementation • 1 Feb 2022 • Liang Liao, Sen Lin, Lun Li, Xiuwei Zhang, Song Zhao, Yan Wang, Xinqiang Wang, Qi Gao, Jingyu Wang

Higher order singular value decomposition (HOSVD) extends the SVD and can approximate higher order data using sums of a few rank-one components.

Paper
Code

Generalized Image Reconstruction over T-Algebra

1 code implementation • 17 Jan 2021 • Liang Liao, Xuechun Zhang, Xinqiang Wang, Sen Lin, Xin Liu

We also show in our experiments that the performance of TPCA increases when the order of compounded pixels increases.

Data Compression Dimensionality Reduction +1

Paper
Code

Image Inpainting Guided by Coherence Priors of Semantics and Textures

no code implementations • CVPR 2021 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

In this paper, we introduce coherence priors between the semantics and textures which make it possible to concentrate on completing separate textures in a semantic-wise manner.

Image Inpainting Semantic Segmentation

Paper
Add Code

General Data Analytics with Applications to Visual Information Analysis: A Provable Backward-Compatible Semisimple Paradigm over T-Algebra

1 code implementation • 31 Oct 2020 • Liang Liao, Stephen John Maybank

We consider a novel backward-compatible paradigm of general data analytics over a recently-reported semisimple algebra (called t-algebra).

Paper
Code

Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes

no code implementations • ECCV 2020 • Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh

Completing a corrupted image with correct structures and reasonable textures for a mixed scene remains an elusive challenge.

Image Inpainting Semantic Segmentation +1

Paper
Add Code

Intrinsic Dimension Estimation via Nearest Constrained Subspace Classifier

no code implementations • 8 Feb 2020 • Liang Liao, Stephen John Maybank

We consider the problems of classification and intrinsic dimension estimation on image data.

Classification General Classification

Paper
Add Code

Generalized Visual Information Analysis via Tensorial Algebra

1 code implementation • 31 Jan 2020 • Liang Liao, Stephen John Maybank

Higher order data is modeled using matrices whose entries are numerical arrays of a fixed size.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.