Search Results for author: Xingjiao Wu

Found 13 papers, 2 papers with code

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection

no code implementations18 Oct 2022 Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He

To address these problems, we construct the homogeneous structure between the point cloud and images to avoid projective information loss by transforming the camera features into the LiDAR 3D space.

3D Object Detection Autonomous Driving +1

Progressive Scene Text Erasing with Self-Supervision

no code implementations23 Jul 2022 Xiangcheng Du, Zhao Zhou, Yingbin Zheng, Xingjiao Wu, Tianlong Ma, Cheng Jin

Scene text erasing seeks to erase text contents from scene images and current state-of-the-art text erasing models are trained on large-scale synthetic data.

Multi-channel Attentive Graph Convolutional Network With Sentiment Fusion For Multimodal Sentiment Analysis

no code implementations25 Jan 2022 Luwei Xiao, Xingjiao Wu, Wen Wu, Jing Yang, Liang He

This paper proposes a Multi-channel Attentive Graph Convolutional Network (MAGCN), consisting of two main components: cross-modality interactive learning and sentimental feature fusion.

Multimodal Sentiment Analysis

Document Layout Analysis with Aesthetic-Guided Image Augmentation

no code implementations27 Nov 2021 Tianlong Ma, Xingjiao Wu, Xin Li, Xiangcheng Du, Zhao Zhou, Liang Xue, Cheng Jin

To measure the proposed image layer modeling method, we propose a manually-labeled non-Manhattan layout fine-grained segmentation dataset named FPD.

Document Layout Analysis Image Augmentation

A Survey of Human-in-the-loop for Machine Learning

no code implementations2 Aug 2021 Xingjiao Wu, Luwei Xiao, Yixuan Sun, Junhang Zhang, Tianlong Ma, Liang He

Humans can provide training data for machine learning applications and directly accomplish tasks that are hard for computers in the pipeline with the help of machine-based approaches.

BIG-bench Machine Learning

Document Layout Analysis via Dynamic Residual Feature Fusion

no code implementations7 Apr 2021 Xingjiao Wu, Ziling Hu, Xiangcheng Du, Jing Yang, Liang He

The document layout analysis (DLA) aims to split the document image into different interest regions and understand the role of each region, which has wide application such as optical character recognition (OCR) systems and document retrieval.

Document Layout Analysis Optical Character Recognition (OCR) +1

Scene Text Recognition with Temporal Convolutional Encoder

no code implementations4 Nov 2019 Xiangcheng Du, Tianlong Ma, Yingbin Zheng, Hao Ye, Xingjiao Wu, Liang He

In this paper, we study text recognition framework by considering the long-term temporal dependencies in the encoder stage.

Scene Text Recognition

Fast Video Crowd Counting with a Temporal Aware Network

no code implementations4 Jul 2019 Xingjiao Wu, Baohan Xu, Yingbin Zheng, Hao Ye, Jing Yang, Liang He

Crowd counting aims to count the number of instantaneous people in a crowded space, and many promising solutions have been proposed for single image crowd counting.

Crowd Counting

Adaptive Scenario Discovery for Crowd Counting

1 code implementation6 Dec 2018 Xingjiao Wu, Yingbin Zheng, Hao Ye, Wenxin Hu, Jing Yang, Liang He

Crowd counting, i. e., estimation number of the pedestrian in crowd images, is emerging as an important research problem with the public security applications.

Crowd Counting

Cannot find the paper you are looking for? You can Submit a new open access paper.