Search Results for author: Qingmin Liao

Found 37 papers, 20 papers with code

Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs

1 code implementation26 Sep 2024 Qinpeng Cui, Yixuan Liu, Xinyi Zhang, Qiqi Bao, Zhongdao Wang, Qingmin Liao, Li Wang, Tian Lu, Emad Barsoum

In this paper, we present DoSSR, a Domain Shift diffusion-based SR model that capitalizes on the generative powers of pretrained diffusion models while significantly enhancing efficiency by initiating the diffusion process with low-resolution (LR) images.

Image Restoration Image Super-Resolution

Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network

no code implementations6 Aug 2024 Xinyi Zhang, Qiqi Bao, Qinpeng Cui, Wenming Yang, Qingmin Liao

This architecture introduces local enhancement with GCN by capturing relationships between neighboring joints, thus producing new representations to complement Mamba's outputs.

3D Human Pose Estimation Computational Efficiency +3

SMPLX-Lite: A Realistic and Drivable Avatar Benchmark with Rich Geometry and Texture Annotations

no code implementations30 May 2024 Yujiao Jiang, Qingmin Liao, Zhaolong Wang, Xiangru Lin, Zongqing Lu, Yuxi Zhao, Hanqing Wei, Jingrui Ye, Yu Zhang, Zhijing Shao

We present SMPLX-Lite dataset, the most comprehensive clothing avatar dataset with multi-view RGB sequences, keypoints annotations, textured scanned meshes, and textured SMPLX-Lite-D models.

BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network

1 code implementation27 May 2024 Zongkai Zhang, Zidong Xu, Wenming Yang, Qingmin Liao, Jing-Hao Xue

To bridge these gaps, we propose a novel binarized deep convolution (BDC) unit that effectively enhances performance while increasing the number of binarized convolutional layers.

Binarization

Modeling User Fatigue for Sequential Recommendation

1 code implementation20 May 2024 Nian Li, Xin Ban, Cheng Ling, Chen Gao, Lantao Hu, Peng Jiang, Kun Gai, Yong Li, Qingmin Liao

In this paper, we propose to model user Fatigue in interest learning for sequential Recommendations (FRec).

Contrastive Learning Sequential Recommendation

OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering

no code implementations12 Apr 2024 Jingrui Ye, Zongkai Zhang, Yujiao Jiang, Qingmin Liao, Wenming Yang, Zongqing Lu

OccGaussian initializes 3D Gaussian distributions in the canonical space, and we perform occlusion feature query at occluded regions, the aggregated pixel-align feature is extracted to compensate for the missing information.

UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling

no code implementations18 Mar 2024 Yujiao Jiang, Qingmin Liao, Xiaoyu Li, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan

Therefore, we propose UV Gaussians, which models the 3D human body by jointly learning mesh deformations and 2D UV-space Gaussian textures.

DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication

no code implementations3 Feb 2024 Yanjun Liu, Wenming Yang, Qingmin Liao

To fill this gap, we introduce DiffVein, a unified diffusion model-based framework which simultaneously addresses vein segmentation and authentication tasks.

Denoising Segmentation +1

UV-SAM: Adapting Segment Anything Model for Urban Village Identification

1 code implementation16 Jan 2024 Xin Zhang, Yu Liu, Yuming Lin, Qingmin Liao, Yong Li

Urban villages, defined as informal residential areas in or around urban centers, are characterized by inadequate infrastructures and poor living conditions, closely related to the Sustainable Development Goals (SDGs) on poverty, adequate housing, and sustainable cities.

Image Classification Semantic Segmentation

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination

1 code implementation23 Dec 2023 Jijia Liu, Chao Yu, Jiaxuan Gao, Yuqing Xie, Qingmin Liao, Yi Wu, Yu Wang

AI agents powered by Large Language Models (LLMs) have made significant advances, enabling them to assist humans in diverse complex tasks and leading to a revolution in human-AI coordination.

Code Generation

DSR-Diff: Depth Map Super-Resolution with Diffusion Model

no code implementations16 Nov 2023 Yuan Shi, Bin Xia, Rui Zhu, Qingmin Liao, Wenming Yang

Color-guided depth map super-resolution (CDSR) improve the spatial resolution of a low-quality depth map with the corresponding high-quality color map, benefiting various applications such as 3D reconstruction, virtual reality, and augmented reality.

3D Reconstruction Depth Map Super-Resolution

EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities

1 code implementation16 Oct 2023 Nian Li, Chen Gao, Mingyu Li, Yong Li, Qingmin Liao

Existing agent modeling typically employs predetermined rules or learning-based neural networks for decision-making.

Decision Making Language Modelling +2

Elucidating the solution space of extended reverse-time SDE for diffusion models

1 code implementation12 Sep 2023 Qinpeng Cui, Xinyi Zhang, Zongqing Lu, Qingmin Liao

In this work, we formulate the sampling process as an extended reverse-time SDE (ER SDE), unifying prior explorations into ODEs and SDEs.

Image Generation

LLA-FLOW: A Lightweight Local Aggregation on Cost Volume for Optical Flow Estimation

no code implementations17 Apr 2023 Jiawei Xu, Zongqing Lu, Qingmin Liao

Lack of texture often causes ambiguity in matching, and handling this issue is an important challenge in optical flow estimation.

Optical Flow Estimation

IRGen: Generative Modeling for Image Retrieval

1 code implementation17 Mar 2023 Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Jingdong Wang, Baining Guo

While generative modeling has become prevalent across numerous research fields, its integration into the realm of image retrieval remains largely unexplored and underjustified.

Image Retrieval Retrieval

Dynamic Ensemble of Low-fidelity Experts: Mitigating NAS "Cold-Start"

1 code implementation2 Feb 2023 Junbo Zhao, Xuefei Ning, Enshu Liu, Binxin Ru, Zixuan Zhou, Tianchen Zhao, Chen Chen, Jiajin Zhang, Qingmin Liao, Yu Wang

In the first step, we train different sub-predictors on different types of available low-fidelity information to extract beneficial knowledge as low-fidelity experts.

Neural Architecture Search

SCS-Co: Self-Consistent Style Contrastive Learning for Image Harmonization

1 code implementation CVPR 2022 Yucheng Hang, Bin Xia, Wenming Yang, Qingmin Liao

In addition, we propose a background-attentional adaptive instance normalization (BAIN) to achieve an attention-weighted background feature distribution according to the foreground-background feature similarity.

Contrastive Learning Image Harmonization

STDAN: Deformable Attention Network for Space-Time Video Super-Resolution

1 code implementation14 Mar 2022 Hai Wang, Xiaoyu Xiang, Yapeng Tian, Wenming Yang, Qingmin Liao

Second, we put forward a spatial-temporal deformable feature aggregation (STDFA) module, in which spatial and temporal contexts in dynamic video frames are adaptively captured and aggregated to enhance SR reconstruction.

Space-time Video Super-resolution Video Super-Resolution

Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-based Super-Resolution

1 code implementation12 Jan 2022 Bin Xia, Yapeng Tian, Yucheng Hang, Wenming Yang, Qingmin Liao, Jie zhou

To improve matching efficiency, we design a novel Embedded PatchMacth scheme with random samples propagation, which involves end-to-end training with asymptotic linear computational cost to the input size.

Reference-based Super-Resolution

Efficient Non-Local Contrastive Attention for Image Super-Resolution

1 code implementation11 Jan 2022 Bin Xia, Yucheng Hang, Yapeng Tian, Wenming Yang, Qingmin Liao, Jie zhou

To demonstrate the effectiveness of ENLCA, we build an architecture called Efficient Non-Local Contrastive Network (ENLCN) by adding a few of our modules in a simple backbone.

Contrastive Learning Feature Correlation +1

ER-IQA: Boosting Perceptual Quality Assessment Using External Reference Images

no code implementations6 May 2021 Jingyu Guo, Wei Wang, Wenming Yang, Qingmin Liao, Jie zhou

In this paper, we introduce a brand new scheme, namely external-reference image quality assessment (ER-IQA), by introducing external reference images to bridge the gap between FR and NR-IQA.

Image Quality Assessment NR-IQA

SCNet: Enhancing Few-Shot Semantic Segmentation by Self-Contrastive Background Prototypes

no code implementations19 Apr 2021 Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao

To this end, we generate self-contrastive background prototypes directly from the query image, with which we enable the construction of complete sample pairs and thus a complementary and auxiliary segmentation task to achieve the training of a better segmentation model.

Few-Shot Semantic Segmentation Metric Learning +2

A region-based descriptor network for uniformly sampled keypoints

no code implementations26 Jan 2021 Kai Lv, Zongqing Lu, Qingmin Liao

By the new descriptor, we can obtain more high confidence matching points without extremum operation.

Attention Cube Network for Image Restoration

1 code implementation13 Sep 2020 Yucheng Hang, Qingmin Liao, Wenming Yang, Yupeng Chen, Jie zhou

The adaptive spatial attention branch (ASAB) and the adaptive channel attention branch (ACAB) constitute the adaptive dual attention module (ADAM), which can capture the long-range spatial and channel-wise contextual information to expand the receptive field and distinguish different types of information for more effective feature representations.

Feature Correlation Image Restoration

Real-MFF: A Large Realistic Multi-focus Image Dataset with Ground Truth

no code implementations28 Mar 2020 Juncheng Zhang, Qingmin Liao, Shaojun Liu, Haoyu Ma, Wenming Yang, Jing-Hao Xue

In this letter, we introduce a large and realistic multi-focus dataset called Real-MFF, which contains 710 pairs of source images with corresponding ground truth images.

XSepConv: Extremely Separated Convolution

no code implementations27 Feb 2020 Jiarong Chen, Zongqing Lu, Jing-Hao Xue, Qingmin Liao

Depthwise convolution has gradually become an indispensable operation for modern efficient neural networks and larger kernel sizes ($\ge5$) have been applied to it recently.

An α-Matte Boundary Defocus Model Based Cascaded Network for Multi-focus Image Fusion

2 code implementations29 Oct 2019 Haoyu Ma, Qingmin Liao, Juncheng Zhang, Shaojun Liu, Jing-Hao Xue

Based on this {\alpha}-matte defocus model and the generated data, a cascaded boundary aware convolutional network termed MMF-Net is proposed and trained, aiming to achieve clearer fusion results around the FDB.

LCSCNet: Linear Compressing Based Skip-Connecting Network for Image Super-Resolution

1 code implementation9 Sep 2019 Wenming Yang, Xuechen Zhang, Yapeng Tian, Wei Wang, Jing-Hao Xue, Qingmin Liao

In this paper, we develop a concise but efficient network architecture called linear compressing based skip-connecting network (LCSCNet) for image super-resolution.

Image Super-Resolution

Boundary Aware Multi-Focus Image Fusion Using Deep Neural Network

no code implementations30 Mar 2019 Haoyu Ma, Juncheng Zhang, Shaojun Liu, Qingmin Liao

Since it is usually difficult to capture an all-in-focus image of a 3D scene directly, various multi-focus image fusion methods are employed to generate it from several images focusing at different depths.

Lightweight Feature Fusion Network for Single Image Super-Resolution

2 code implementations15 Feb 2019 Wenming Yang, Wei Wang, Xuechen Zhang, Shuifa Sun, Qingmin Liao

Specifically, a spindle block is composed of a dimension extension unit, a feature exploration unit and a feature refinement unit.

Image Super-Resolution

Optical Flow Super-Resolution Based on Image Guidence Using Convolutional Neural Network

no code implementations3 Sep 2018 Liping Zhang, Zongqing Lu, Qingmin Liao

With the motivation of various convolutional neural network(CNN) structures succeeded in single image super-resolution(SISR) task, an end-to-end convolutional neural network is proposed to reconstruct the high resolution(HR) optical flow field from initial LR optical flow with the guidence of the first frame used in optical flow estimation.

Image Super-Resolution Optical Flow Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.