Search Results for author: Jiayi Ma

Found 47 papers, 28 papers with code

Geometric Estimation via Robust Subspace Recovery

1 code implementation ECCV 2020 Aoxiang Fan, Xingyu Jiang, Yang Wang, Junjun Jiang, Jiayi Ma

Geometric estimation from image point correspondences is the core procedure of many 3D vision problems, which is prevalently accomplished by random sampling techniques.

Homography Estimation Pose Estimation

MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training

no code implementations17 Apr 2024 Jiayang Li, Junjun Jiang, Pengwei Liang, Jiayi Ma

Instead of being driven by downstream tasks, our model utilizes a pretrained encoder from Masked Autoencoders (MAE), which facilities the omni features extraction for low-level reconstruction and high-level vision tasks, to obtain perception friendly features with a low cost.

Infrared And Visible Image Fusion

Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion

1 code implementation25 Mar 2024 Xunpeng Yi, Han Xu, Hao Zhang, Linfeng Tang, Jiayi Ma

Through the text semantic encoder and semantic interaction fusion decoder, Text-IF is accessible to the all-in-one infrared and visible image degradation-aware processing and the interactive flexible fusion outcomes.

Latent Semantic Consensus For Deterministic Geometric Model Fitting

1 code implementation11 Mar 2024 Guobao Xiao, Jun Yu, Jiayi Ma, Deng-Ping Fan, Ling Shao

The principle of LSC is to preserve the latent semantic consensus in both data points and model hypotheses.

Word length-aware text spotting: Enhancing detection and recognition in dense text image

no code implementations25 Dec 2023 Hao Wang, Huabing Zhou, Yanduo Zhang, Tao Lu, Jiayi Ma

Scene text spotting is essential in various computer vision applications, enabling extracting and interpreting textual information from images.

Text Detection Text Spotting

ResMatch: Residual Attention Learning for Local Feature Matching

1 code implementation11 Jul 2023 Yuxin Deng, Jiayi Ma

In order to facilitate the learning of matching and filtering, we inject the similarity of descriptors and relative positions into cross- and self-attention score, respectively.

Pose Estimation Visual Localization

Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network

no code implementations4 Jul 2023 Zizhuo Li, Jiayi Ma

Then, our Matchable Keypoint-Assisted Context Aggregation Module regards sampled informative keypoints as message bottlenecks and thus constrains each keypoint only to retrieve favorable contextual information from intra- and inter- matchable keypoints, evading the interference of irrelevant and redundant connectivity with non-repeatable ones.

Visual Localization

Image Deblurring by Exploring In-depth Properties of Transformer

1 code implementation24 Mar 2023 Pengwei Liang, Junjun Jiang, Xianming Liu, Jiayi Ma

We demonstrate the effectiveness of transformer properties in improving the perceptual quality while not sacrificing the quantitative scores (PSNR) over the most competitive models, such as Uformer, Restormer, and NAFNet, on defocus deblurring and motion deblurring tasks.

Deblurring Image Deblurring

Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models

no code implementations19 Jan 2023 Jun Yue, Leyuan Fang, Shaobo Xia, Yue Deng, Jiayi Ma

In specific, instead of converting multi-channel images into single-channel data in existing fusion methods, we create the multi-channel data distribution with a denoising network in a latent space with forward and reverse diffusion process.

Denoising Infrared And Visible Image Fusion

LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection

no code implementations CVPR 2023 Jinsheng Xiao, Yuanxu Wu, Yunhua Chen, Shurui Wang, Zhongyuan Wang, Jiayi Ma

We find that context information from the long-term frame and temporal information from the short-term frame are two useful cues for video small object detection.

Object object-detection +1

Sparsely Annotated Semantic Segmentation With Adaptive Gaussian Mixtures

1 code implementation CVPR 2023 Linshan Wu, Zhun Zhong, Leyuan Fang, Xingxin He, Qiang Liu, Jiayi Ma, Hao Chen

Our AGMM can effectively endow reliable supervision for unlabeled pixels based on the distributions of labeled and unlabeled pixels.

Contrastive Learning Semantic Segmentation

Robust and Scalable Gaussian Process Regression and Its Applications

1 code implementation CVPR 2023 Yifan Lu, Jiayi Ma, Leyuan Fang, Xin Tian, Junjun Jiang

This enables the application of Gaussian processes to a wide range of real data, which are often large-scale and contaminated by outliers.

4k GPR +3

Progressive Learning with Cross-Window Consistency for Semi-Supervised Semantic Segmentation

no code implementations22 Nov 2022 Bo Dang, Yansheng Li, Yongjun Zhang, Jiayi Ma

Semi-supervised semantic segmentation focuses on the exploration of a small amount of labeled data and a large amount of unlabeled data, which is more in line with the demands of real-world image understanding applications.

Pseudo Label Semi-Supervised Semantic Segmentation

ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning

no code implementations16 May 2022 Yuxin Deng, Jiayi Ma

Deep-learning-based local feature extraction algorithms that combine detection and description have made significant progress in visible image matching.

Image Registration

Hierarchical Memory Learning for Fine-Grained Scene Graph Generation

no code implementations14 Mar 2022 Youming Deng, Yansheng Li, Yongjun Zhang, Xiang Xiang, Jian Wang, Jingdong Chen, Jiayi Ma

After the autonomous partition of coarse and fine predicates, the model is first trained on the coarse predicates and then learns the fine predicates.

Graph Generation Scene Graph Generation

Coherent Point Drift Revisited for Non-Rigid Shape Matching and Registration

no code implementations CVPR 2022 Aoxiang Fan, Jiayi Ma, Xin Tian, Xiaoguang Mei, Wei Liu

In this paper, we explore a new type of extrinsic method to directly align two geometric shapes with point-to-point correspondences in ambient space by recovering a deformation, which allows more continuous and smooth maps to be obtained.

RFNet: Unsupervised Network for Mutually Reinforcing Multi-Modal Image Registration and Fusion

no code implementations CVPR 2022 Han Xu, Jiayi Ma, Jiteng Yuan, Zhuliang Le, Wei Liu

Specifically, for image registration, we solve the bottlenecks of defining registration metrics applicable for multi-modal images and facilitating the network convergence.

Image Registration

MS2DG-Net: Progressive Correspondence Learning via Multiple Sparse Semantics Dynamic Graph

1 code implementation CVPR 2022 Luanyuan Dai, Yizhang Liu, Jiayi Ma, Lifang Wei, Taotao Lai, Changcai Yang, Riqing Chen

However, most such works ignore similar sparse semantics information between two given images and cannot capture local topology among correspondences well.

Pose Estimation

TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network

no code implementations16 Sep 2021 Yuanzhi Wang, Tao Lu, Yanduo Zhang, Junjun Jiang, JiaMing Wang, Zhongyuan Wang, Jiayi Ma

Recently, face super-resolution (FSR) methods either feed whole face image into convolutional neural networks (CNNs) or utilize extra facial priors (e. g., facial parsing maps, facial landmarks) to focus on facial structure, thereby maintaining the consistency of the facial structure while restoring facial details.

Face Reconstruction Super-Resolution

From Less to More: Spectral Splitting and Aggregation Network for Hyperspectral Face Super-Resolution

no code implementations31 Aug 2021 Junjun Jiang, Chenyang Wang, Xianming Liu, Kui Jiang, Jiayi Ma

By this spectral splitting and aggregation strategy (SSAS), we can divide the original hyperspectral image into multiple samples (\emph{from less to more}) to support the efficient training of the network and effectively exploit the spectral correlations among spectrum.

Image Super-Resolution

Uniformity in Heterogeneity:Diving Deep into Count Interval Partition for Crowd Counting

3 code implementations27 Jul 2021 Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu

Therefore, we propose a novel count interval partition criterion called Uniform Error Partition (UEP), which always keeps the expected counting error contributions equal for all intervals to minimize the prediction risk.

Crowd Counting Quantization

SDGMNet: Statistic-based Dynamic Gradient Modulation for Local Descriptor Learning

1 code implementation8 Jun 2021 Jiayi Ma, Yuxin Deng

Modifications on triplet loss that rescale the back-propagated gradients of special pairs have made significant progress on local descriptor learning.

Retrieval

BaMBNet: A Blur-aware Multi-branch Network for Defocus Deblurring

1 code implementation31 May 2021 Pengwei Liang, Junjun Jiang, Xianming Liu, Jiayi Ma

In particular, we estimate the blur amounts of different regions by the internal geometric constraint of the DP data, which measures the defocus disparity between the left and right views.

Deblurring Image Defocus Deblurring +1

Pan-sharpening via High-pass Modification Convolutional Neural Network

1 code implementation24 May 2021 JiaMing Wang, Zhenfeng Shao, Xiao Huang, Tao Lu, Ruiqian Zhang, Jiayi Ma

Most existing deep learning-based pan-sharpening methods have several widely recognized issues, such as spectral distortion and insufficient spatial texture enhancement, we propose a novel pan-sharpening convolutional neural network based on a high-pass modification block.

Vocal Bursts Intensity Prediction

Omniscient Video Super-Resolution

no code implementations ICCV 2021 Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, Jiayi Ma

Most recent video super-resolution (SR) methods either adopt an iterative manner to deal with low-resolution (LR) frames from a temporally sliding window, or leverage the previously estimated SR output to help reconstruct the current frame recurrently.

Video Super-Resolution

Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation

no code implementations30 Jan 2021 Chengli Peng, Jiayi Ma, Chen Chen, Xiaojie Guo

To verify the efficiency of the proposed bilateral attention decoder, we adopt a lightweight network as the backbone and compare our proposed method with other state-of-the-art real-time semantic segmentation methods on the Cityscapes and Camvid datasets.

Real-Time Semantic Segmentation Segmentation

Uniformity in Heterogeneity: Diving Deep Into Count Interval Partition for Crowd Counting

1 code implementation ICCV 2021 Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu

Therefore, we propose a novel count interval partition criterion called Uniform Error Partition (UEP), which always keeps the expected counting error contributions equal for all intervals to minimize the prediction risk.

Crowd Counting Quantization

Learning Spatial-Spectral Prior for Super-Resolution of Hyperspectral Imagery

2 code implementations18 May 2020 Junjun Jiang, He Sun, Xian-Ming Liu, Jiayi Ma

Recently, single gray/RGB image super-resolution reconstruction task has been extensively studied and made significant progress by leveraging the advanced machine learning techniques based on deep convolutional neural networks (DCNNs).

Hyperspectral Image Super-Resolution Image Super-Resolution

Multi-Scale Progressive Fusion Network for Single Image Deraining

3 code implementations CVPR 2020 Kui Jiang, Zhongyuan Wang, Peng Yi, Chen Chen, Baojin Huang, Yimin Luo, Jiayi Ma, Junjun Jiang

In this work, we explore the multi-scale collaborative representation for rain streaks from the perspective of input image scales and hierarchical deep features in a unified framework, termed multi-scale progressive fusion network (MSPFN) for single image rain streak removal.

Single Image Deraining

LaFIn: Generative Landmark Guided Face Inpainting

1 code implementation26 Nov 2019 Yang Yang, Xiaojie Guo, Jiayi Ma, Lin Ma, Haibin Ling

It is challenging to inpaint face images in the wild, due to the large variation of appearance, such as different poses, expressions and occlusions.

Attribute Facial Inpainting

EDIT: Exemplar-Domain Aware Image-to-Image Translation

1 code implementation24 Nov 2019 Yuanbin Fu, Jiayi Ma, Lin Ma, Xiaojie Guo

The principle behind is that, for images from multiple domains, the content features can be obtained by a uniform extractor, while (re-)stylization is achieved by mapping the extracted features specifically to different purposes (domains and exemplars).

Generative Adversarial Network Image-to-Image Translation +1

Ensemble Super-Resolution with A Reference Dataset

1 code implementation12 May 2019 Junjun Jiang, Yi Yu, Zheng Wang, Suhua Tang, Ruimin Hu, Jiayi Ma

In this paper, we present a simple but effective single image SR method based on ensemble learning, which can produce a better performance than that could be obtained from any of SR methods to be ensembled (or called component super-resolvers).

Ensemble Learning Image Super-Resolution

PFLD: A Practical Facial Landmark Detector

18 code implementations28 Feb 2019 Xiaojie Guo, Siyuan Li, Jinke Yu, Jiawan Zhang, Jiayi Ma, Lin Ma, Wei Liu, Haibin Ling

Being accurate, efficient, and compact is essential to a facial landmark detector for practical use.

Face Alignment Facial Landmark Detection

Hyperspectral Image Classification in the Presence of Noisy Labels

1 code implementation12 Sep 2018 Junjun Jiang, Jiayi Ma, Zheng Wang, Chen Chen, Xian-Ming Liu

The key idea of RLPA is to exploit knowledge (e. g., the superpixel based spectral-spatial constraints) from the observed hyperspectral images and apply it to the process of label propagation.

Classification General Classification +1

Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning

2 code implementations3 Sep 2018 Junjun Jiang, Yi Yu, Suhua Tang, Jiayi Ma, Akiko Aizawa, Kiyoharu Aizawa

To this end, this study incorporates the contextual information of image patch and proposes a powerful and efficient context-patch based face hallucination approach, namely Thresholding Locality-constrained Representation and Reproducing learning (TLcR-RL).

Face Hallucination Hallucination +1

Deep CNN Denoiser and Multi-layer Neighbor Component Embedding for Face Hallucination

1 code implementation28 Jun 2018 Junjun Jiang, Yi Yu, Jinhui Hu, Suhua Tang, Jiayi Ma

Most of the current face hallucination methods, whether they are shallow learning-based or deep learning-based, all try to learn a relationship model between Low-Resolution (LR) and High-Resolution (HR) spaces with the help of a training set.

Face Hallucination Hallucination +1

SuperPCA: A Superpixelwise PCA Approach for Unsupervised Feature Extraction of Hyperspectral Imagery

1 code implementation26 Jun 2018 Junjun Jiang, Jiayi Ma, Chen Chen, Zhongyuan Wang, Zhihua Cai, Lizhe Wang

(1) Unlike the traditional PCA method based on a whole image, SuperPCA takes into account the diversity in different homogeneous regions, that is, different regions should have different projections.

Dimensionality Reduction General Classification

NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction

1 code implementation CVPR 2019 Yuan Gao, Jiayi Ma, Mingbo Zhao, Wei Liu, Alan L. Yuille

In this paper, we propose a novel Convolutional Neural Network (CNN) structure for general-purpose multi-task learning (MTL), which enables automatic feature fusing at every layer from different tasks.

Multi-Task Learning Semantic Segmentation

Semi-Supervised Sparse Representation Based Classification for Face Recognition with Insufficient Labeled Samples

no code implementations12 Sep 2016 Yuan Gao, Jiayi Ma, Alan L. Yuille

This is based on recent work on sparsity where faces are represented in terms of two dictionaries: a gallery dictionary consisting of one or more examples of each person, and a variation dictionary representing linear nuisance variables (e. g., different lighting conditions, different glasses).

Face Recognition General Classification +1

Density-Based Region Search with Arbitrary Shape for Object Localization

no code implementations23 Oct 2014 Ji Zhao, Deyu Meng, Jiayi Ma

Typically, the region search methods project the score of a classifier into an image plane, and then search the region with the maximal score.

Weakly-Supervised Object Localization

Robust Estimation of Nonrigid Transformation for Point Set Registration

no code implementations CVPR 2013 Jiayi Ma, Ji Zhao, Jinwen Tian, Zhuowen Tu, Alan L. Yuille

In the second step, we estimate the transformation using a robust estimator called L 2 E. This is the main novelty of our approach and it enables us to deal with the noise and outliers which arise in the correspondence step.

Cannot find the paper you are looking for? You can Submit a new open access paper.