Search Results for author: Mingbo Zhao

Found 12 papers, 2 papers with code

IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation

no code implementations28 Mar 2024 Jiacui Huang, Hongtao Zhang, Mingbo Zhao, Zhou Wu

To address this challenge, we propose a new method, namely, Instance-aware Visual Language Map (IVLMap), to empower the robot with instance-level and attribute-level semantic mapping, where it is autonomously constructed by fusing the RGBD video data collected from the robot agent with special-designed natural language map indexing in the bird's-in-eye view.

Attribute Language Modelling +4

Attention Disturbance and Dual-Path Constraint Network for Occluded Person Re-identification

no code implementations20 Mar 2023 Jiaer Xia, Lei Tan, Pingyang Dai, Mingbo Zhao, Yongjian Wu, Liujuan Cao

To address this issue, we propose a novel transformer-based Attention Disturbance and Dual-Path Constraint Network (ADP) to enhance the generalization of attention networks.

Person Re-Identification

OSIC: A New One-Stage Image Captioner Coined

no code implementations4 Nov 2022 Bo wang, Zhao Zhang, Mingbo Zhao, Xiaojie Jin, Mingliang Xu, Meng Wang

To obtain rich features, we use the Swin Transformer to calculate multi-level features, and then feed them into a novel dynamic multi-sight embedding module to exploit both global structure and local texture of input images.

Descriptive Language Modelling +2

Human Instance Segmentation and Tracking via Data Association and Single-stage Detector

no code implementations31 Mar 2022 Lu Cheng, Mingbo Zhao

To tracking the instance across the video, we have adopted data association strategy for matching the same instance in the video sequence, where we jointly learn target instance appearances and their affinities in a pair of video frames in an end-to-end fashion.

Human Instance Segmentation Position +5

Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing

no code implementations24 Nov 2021 Yu Liu, Mingbo Zhao, Zhao Zhang, Haijun Zhang, Shuicheng Yan

Based on this dataset, we then propose the Arbitrary Virtual Try-On Network (AVTON) that is utilized for all-type clothes, which can synthesize realistic try-on images by preserving and trading off characteristics of the target clothes and the reference person.

Geometric Matching Virtual Try-on

MFAGAN: A Compression Framework for Memory-Efficient On-Device Super-Resolution GAN

no code implementations27 Jul 2021 Wenlong Cheng, Mingbo Zhao, Zhiling Ye, Shuhang Gu

In this paper, we propose a novel compression framework \textbf{M}ulti-scale \textbf{F}eature \textbf{A}ggregation Net based \textbf{GAN} (MFAGAN) for reducing the memory access cost of the generator.

Hardware Aware Neural Architecture Search Image Super-Resolution +1

A Simple Approach to Automated Spectral Clustering

1 code implementation23 Jul 2021 Jicong Fan, Yiheng Tu, Zhao Zhang, Mingbo Zhao, Haijun Zhang

First, we propose to find the most reliable affinity matrix via grid search or Bayesian optimization among a set of candidates given by different AMC methods with different hyperparameters, where the reliability is quantified by the \textit{relative-eigen-gap} of graph Laplacian introduced in this paper.

Bayesian Optimization Clustering +1

Semi-DerainGAN: A New Semi-supervised Single Image Deraining Network

no code implementations23 Jan 2020 Yanyan Wei, Zhao Zhang, Yang Wang, Haijun Zhang, Mingbo Zhao, Mingliang Xu, Meng Wang

Although supervised deep deraining networks have obtained impressive results on synthetic datasets, they still cannot obtain satisfactory results on real images due to weak generalization of rain removal capacity, i. e., the pre-trained models usually cannot handle new shapes and directions that may lead to over-derained/under-derained results.

Single Image Deraining

Deep Self-representative Concept Factorization Network for Representation Learning

no code implementations13 Dec 2019 Yan Zhang, Zhao Zhang, Zheng Zhang, Mingbo Zhao, Li Zhang, Zheng-Jun Zha, Meng Wang

In this paper, we investigate the unsupervised deep representation learning issue and technically propose a novel framework called Deep Self-representative Concept Factorization Network (DSCF-Net), for clustering deep features.

Clustering Representation Learning

Robust Triple-Matrix-Recovery-Based Auto-Weighted Label Propagation for Classification

no code implementations20 Nov 2019 Huan Zhang, Zhao Zhang, Mingbo Zhao, Qiaolin Ye, Min Zhang, Meng Wang

Our method can jointly re-cover the underlying clean data, clean labels and clean weighting spaces by decomposing the original data, predicted soft labels or weights into a clean part plus an error part by fitting noise.

General Classification

Kernel-Induced Label Propagation by Mapping for Semi-Supervised Classification

no code implementations29 May 2019 Zhao Zhang, Lei Jia, Mingbo Zhao, Guangcan Liu, Meng Wang, Shuicheng Yan

A Kernel-Induced Label Propagation (Kernel-LP) framework by mapping is proposed for high-dimensional data classification using the most informative patterns of data in kernel space.

Classification General Classification

NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction

1 code implementation CVPR 2019 Yuan Gao, Jiayi Ma, Mingbo Zhao, Wei Liu, Alan L. Yuille

In this paper, we propose a novel Convolutional Neural Network (CNN) structure for general-purpose multi-task learning (MTL), which enables automatic feature fusing at every layer from different tasks.

Multi-Task Learning Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.