Search Results for author: Xiaodong Xie

Found 22 papers, 5 papers with code

Correlation Preserving Sparse Coding Over Multi-level Dictionaries for Image Denoising

no code implementations23 Dec 2016 Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao

In this letter, we propose a novel image denoising method based on correlation preserving sparse coding.

Image Denoising

Bayer Demosaicking Using Optimized Mean Curvature over RGB channels

no code implementations17 May 2017 Rui Chen, Huizhu Jia, Xiange Wen, Xiaodong Xie

Color artifacts of demosaicked images are often found at contours due to interpolation across edges and cross-channel aliasing.

Demosaicking

Noise Level Estimation for Overcomplete Dictionary Learning Based on Tight Asymptotic Bounds

no code implementations9 Dec 2017 Rui Chen, Changshui Yang, Huizhu Jia, Xiaodong Xie

In this letter, we address the problem of estimating Gaussian noise level from the trained dictionaries in update stage.

Dictionary Learning

Trajectory Factory: Tracklet Cleaving and Re-connection by Deep Siamese Bi-GRU for Multiple Object Tracking

no code implementations12 Apr 2018 Cong Ma, Changshui Yang, Fan Yang, Yueqing Zhuang, Ziwei Zhang, Huizhu Jia, Xiaodong Xie

In this paper, we propose a novel tracklet processing method to cleave and re-connect tracklets on crowd or long-term occlusion by Siamese Bi-Gated Recurrent Unit (GRU).

Autonomous Driving Multi-Object Tracking +2

Attention Driven Person Re-identification

no code implementations13 Oct 2018 Fan Yang, Ke Yan, Shijian Lu, Huizhu Jia, Xiaodong Xie, Wen Gao

Person re-identification (ReID) is a challenging task due to arbitrary human pose variations, background clutters, etc.

Person Re-Identification

Single Image Blind Deblurring Using Multi-Scale Latent Structure Prior

no code implementations11 Jun 2019 Yuanchao Bai, Huizhu Jia, Ming Jiang, Xian-Ming Liu, Xiaodong Xie, Wen Gao

Blind image deblurring is a challenging problem in computer vision, which aims to restore both the blur kernel and the latent sharp image from only a blurry observation.

Blind Image Deblurring Image Deblurring +3

FFA-Net: Feature Fusion Attention Network for Single Image Dehazing

3 code implementations18 Nov 2019 Xu Qin, Zhilin Wang, Yuanchao Bai, Xiaodong Xie, Huizhu Jia

The FFA-Net architecture consists of three key components: 1) A novel Feature Attention (FA) module combines Channel Attention with Pixel Attention mechanism, considering that different channel-wise features contain totally different weighted information and haze distribution is uneven on the different image pixels.

Image Dehazing Single Image Dehazing

Correlating Edge, Pose with Parsing

1 code implementation CVPR 2020 Ziwei Zhang, Chi Su, Liang Zheng, Xiaodong Xie

Compared with the existing practice of feature concatenation, we find that uncovering the correlation among the three factors is a superior way of leveraging the pivotal contextual cues provided by edges and poses.

Feature Correlation Human Parsing

The Layout Generation Algorithm of Graphic Design Based on Transformer-CVAE

no code implementations8 Oct 2021 Mengxi Guo, Dangqing Huang, Xiaodong Xie

This paper implemented the Transformer model and conditional variational autoencoder (CVAE) to the graphic design layout generation task.

Disentanglement Layout Design

BBA-net: A bi-branch attention network for crowd counting

no code implementations22 Jan 2022 Yi Hou, Chengyang Li, Fan Yang, Cong Ma, Liping Zhu, Yuan Li, Huizhu Jia, Xiaodong Xie

Our method can integrate the pedestrian's head and body information to enhance the feature expression ability of the density map.

Crowd Counting

Enhancing and Dissecting Crowd Counting By Synthetic Data

no code implementations22 Jan 2022 Yi Hou, Chengyang Li, Yuheng Lu, Liping Zhu, Yuan Li, Huizhu Jia, Xiaodong Xie

In this article, we propose a simulated crowd counting dataset CrowdX, which has a large scale, accurate labeling, parameterized realization, and high fidelity.

Crowd Counting

Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning

no code implementations5 Jul 2022 Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang

Current point-cloud detection methods have difficulty detecting the open-vocabulary objects in the real world, due to their limited generalization capability.

Cloud Detection Contrastive Learning

Multi-Agent Automated Machine Learning

no code implementations CVPR 2023 Zhaozhi Wang, Kefan Su, Jian Zhang, Huizhu Jia, Qixiang Ye, Xiaodong Xie, Zongqing Lu

In this paper, we propose multi-agent automated machine learning (MA2ML) with the aim to effectively handle joint optimization of modules in automated machine learning (AutoML).

Data Augmentation Multi-agent Reinforcement Learning +1

Towards Blind Watermarking: Combining Invertible and Non-invertible Mechanisms

1 code implementation24 Dec 2022 Rui Ma, Mengxi Guo, Yi Hou, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie

The CIN is composed of the invertible part to achieve high imperceptibility and the non-invertible part to strengthen the robustness against strong noise attacks.

Open-Vocabulary Point-Cloud Object Detection without 3D Annotation

1 code implementation CVPR 2023 Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang

In this paper, we address open-vocabulary 3D point-cloud detection by a dividing-and-conquering strategy, which involves: 1) developing a point-cloud detector that can learn a general representation for localizing various objects, and 2) connecting textual and point-cloud representations to enable the detector to classify novel object categories based on text prompting.

3D Object Detection 3D Open-Vocabulary Object Detection +3

PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers

no code implementations1 Jul 2023 Peidong Jia, Jiaming Liu, Senqiao Yang, Jiarui Wu, Xiaodong Xie, Shanghang Zhang

PDM comprehensively leverages the prompt memory to extract domain-specific knowledge and explicitly constructs a long-term memory space for the data distribution, which represents better domain diversity compared to existing methods.

object-detection Object Detection

COLE: A Hierarchical Generation Framework for Multi-Layered and Editable Graphic Design

no code implementations28 Nov 2023 Peidong Jia, Chenxuan Li, Yuhui Yuan, Zeyu Liu, Yichao Shen, Bohan Chen, Xingru Chen, Yinglin Zheng, Dong Chen, Ji Li, Xiaodong Xie, Shanghang Zhang, Baining Guo

Our COLE system comprises multiple fine-tuned Large Language Models (LLMs), Large Multimodal Models (LMMs), and Diffusion Models (DMs), each specifically tailored for design-aware layer-wise captioning, layout planning, reasoning, and the task of generating images and text.

Image Generation

FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection

no code implementations22 Dec 2023 Dongmei Zhang, Chang Li, Ray Zhang, Shenghao Xie, Wei Xue, Xiaodong Xie, Shanghang Zhang

In this work, we propose FM-OV3D, a method of Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection, which improves the open-vocabulary localization and recognition abilities of 3D model by blending knowledge from multiple pre-trained foundation models, achieving true open-vocabulary without facing constraints from original 3D datasets.

3D Object Detection 3D Open-Vocabulary Object Detection +2

Cannot find the paper you are looking for? You can Submit a new open access paper.