Search Results for author: Ao Luo

Found 20 papers, 13 papers with code

RecDiffusion: Rectangling for Image Stitching with Diffusion Models

1 code implementation • 28 Mar 2024 • Tianhao Zhou, Haipeng Li, Ziyi Wang, Ao Luo, Chen-Lin Zhang, Jiajun Li, Bing Zeng, Shuaicheng Liu

Image stitching from different captures often results in non-rectangular boundaries, which is often considered unappealing.

Image Stitching

Paper
Code

Better Explain Transformers by Illuminating Important Information

1 code implementation • 18 Jan 2024 • Linxin Song, Yan Cui, Ao Luo, Freddy Lecue, Irene Li

Transformer-based models excel in various natural language processing (NLP) tasks, attracting countless efforts to explain their inner workings.

Question Answering

Paper
Code

GAFlow: Incorporating Gaussian Attention into Optical Flow

1 code implementation • ICCV 2023 • Ao Luo, Fan Yang, Xin Li, Lang Nie, Chunyu Lin, Haoqiang Fan, Shuaicheng Liu

Moreover, for reliable motion analysis, we provide a new Gaussian-Guided Attention Module (GGAM) which not only inherits properties from Gaussian distribution to instinctively revolve around the neighbor fields of each point but also is empowered to put the emphasis on contextually related regions during matching.

Optical Flow Estimation Representation Learning

Paper
Code

SCP: Spherical-Coordinate-based Learned Point Cloud Compression

no code implementations • 24 Aug 2023 • Ao Luo, Linxin Song, Keisuke Nonaka, Kyohei Unno, Heming Sun, Masayuki Goto, Jiro Katto

In recent years, the task of learned point cloud compression has gained prominence.

Paper
Add Code

Low-Light Image Enhancement with Wavelet-based Diffusion Models

1 code implementation • 1 Jun 2023 • Hai Jiang, Ao Luo, Songchen Han, Haoqiang Fan, Shuaicheng Liu

Diffusion models have achieved promising results in image restoration tasks, yet suffer from time-consuming, excessive computational resource consumption, and unstable restoration.

Ranked #1 on Low-Light Image Enhancement on LOLv2

Denoising Face Detection +2

113

Paper
Code

Learning Optical Flow from Event Camera with Rendered Dataset

no code implementations • ICCV 2023 • Xinglong Luo, Kunming Luo, Ao Luo, Zhengning Wang, Ping Tan, Shuaicheng Liu

Previous datasets are created by either capturing real scenes by event cameras or synthesizing from images with pasted foreground objects.

Optical Flow Estimation

Paper
Add Code

Explicit Motion Disentangling for Efficient Optical Flow Estimation

1 code implementation • ICCV 2023 • Changxing Deng, Ao Luo, Haibin Huang, Shaodan Ma, Jiangyu Liu, Shuaicheng Liu

In this paper, we propose a novel framework for optical flow estimation that achieves a good balance between performance and efficiency.

Motion Estimation Optical Flow Estimation

Paper
Code

RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos

1 code implementation • 22 Jul 2022 • Yunhui Han, Kunming Luo, Ao Luo, Jiangyu Liu, Haoqiang Fan, Guiming Luo, Shuaicheng Liu

Specifically, we first estimate optical flow between a pair of video frames, and then synthesize a new image from this pair based on the predicted flow.

Image Generation Optical Flow Estimation

Paper
Code

Memory-Efficient Learned Image Compression with Pruned Hyperprior Module

no code implementations • 21 Jun 2022 • Ao Luo, Heming Sun, Jinming Liu, Jiro Katto

Learned Image Compression (LIC) gradually became more and more famous in these years.

Image Compression

Paper
Add Code

Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training

1 code implementation • 1 Jun 2022 • Yan Zeng, Wangchunshu Zhou, Ao Luo, Ziming Cheng, Xinsong Zhang

To this end, the cross-view language modeling framework considers both multi-modal data (i. e., image-caption pairs) and multi-lingual data (i. e., parallel sentence pairs) as two different views of the same object, and trains the model to align the two views by maximizing the mutual information between them with conditional masked language modeling and contrastive learning.

Ranked #1 on Zero-Shot Cross-Lingual Visual Question Answering on xGQA

Contrastive Learning Language Modelling +9

Paper
Code

Learning Optical Flow with Adaptive Graph Reasoning

1 code implementation • 8 Feb 2022 • Ao Luo, Fan Yang, Kunming Luo, Xin Li, Haoqiang Fan, Shuaicheng Liu

Our key idea is to decouple the context reasoning from the matching procedure, and exploit scene information to effectively assist motion estimation by learning to reason over the adaptive graph.

Motion Estimation Optical Flow Estimation +1

Paper
Code

Learning Optical Flow With Kernel Patch Attention

1 code implementation • CVPR 2022 • Ao Luo, Fan Yang, Xin Li, Shuaicheng Liu

Optical flow is a fundamental method used for quantitative motion estimation on the image plane.

Motion Estimation Optical Flow Estimation

Paper
Code

Probabilistic Model Distillation for Semantic Correspondence

1 code implementation • CVPR 2021 • Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu

We address this problem with the use of a novel Probabilistic Model Distillation (PMD) approach which transfers knowledge learned by a probabilistic teacher model on synthetic data to a static student model with the use of unlabeled real image pairs.

Representation Learning Semantic correspondence

Paper
Code

ASFlow: Unsupervised Optical Flow Learning with Adaptive Pyramid Sampling

no code implementations • 8 Apr 2021 • Kunming Luo, Ao Luo, Chuan Wang, Haoqiang Fan, Shuaicheng Liu

Equipped with these two modules, our method achieves the best performance for unsupervised optical flow estimation on multiple leading benchmarks, including MPI-SIntel, KITTI 2012 and KITTI 2015.

Optical Flow Estimation

Paper
Add Code

WebSRC: A Dataset for Web-Based Structural Reading Comprehension

1 code implementation • EMNLP 2021 • Xingyu Chen, Zihan Zhao, Lu Chen, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu

In this paper, we introduce the task of structural reading comprehension (SRC) on web.

Reading Comprehension

Paper
Code

Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection

1 code implementation • ICCV 2021 • Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan

Spotting objects that are visually adapted to their surroundings is challenging for both humans and AI.

Object object-detection +2

Paper
Code

Cascade Graph Neural Networks for RGB-D Salient Object Detection

1 code implementation • ECCV 2020 • Ao Luo, Xin Li, Fan Yang, Zhicheng Jiao, Hong Cheng, Siwei Lyu

Current works either simply distill prior knowledge from the corresponding depth map for handling the RGB-image or blindly fuse color and geometric information to generate the coarse depth-aware representations, hindering the performance of RGB-D saliency detectors. In this work, we introduceCascade Graph Neural Networks(Cas-Gnn), a unified framework which is capable of comprehensively distilling and reasoning the mutual benefits between these two data sources through a set of cascade graphs, to learn powerful representations for RGB-D salient object detection.

Ranked #5 on RGB-D Salient Object Detection on NJU2K

Object object-detection +3

Paper
Code

Deep-VFX: Deep Action Recognition Driven VFX for Short Video

no code implementations • 22 Jul 2020 • Ao Luo, Ning Xie, Zhijia Tao, Feng Jiang

In the application, short-form mobile video is so popular all over the world such as Tik Tok.

Action Recognition Template Matching

Paper
Add Code

Hybrid Graph Neural Networks for Crowd Counting

no code implementations • 31 Jan 2020 • Ao Luo, Fan Yang, Xin Li, Dong Nie, Zhicheng Jiao, Shangchen Zhou, Hong Cheng

In this paper, we present a novel network structure called Hybrid Graph Neural Network (HyGnn) which targets to relieve the problem by interweaving the multi-scale features for crowd density as well as its auxiliary task (localization) together and performing joint reasoning over a graph.

Crowd Counting

Paper
Add Code

Fast Portrait Segmentation with Highly Light-weight Network

no code implementations • 19 Oct 2019 • Yuezun Li, Ao Luo, Siwei Lyu

In this paper, we describe a fast and light-weight portrait segmentation method based on a new highly light-weight backbone (HLB) architecture.

Portrait Segmentation Segmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.