Search Results for author: Xiao Song

Found 13 papers, 1 paper with code

Cross-modal Contrastive Attention Model for Medical Report Generation

no code implementations COLING 2022 Xiao Song, Xiaodan Zhang, Junzhong Ji, Ying Liu, Pengxu Wei

Medical report automatic generation has gained increasing interest recently as a way to help radiologists write reports more efficiently.

Medical Report Generation

SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection

no code implementations 12 Mar 2024 Hongcheng Zhang, Liu Liang, Pengxin Zeng, Xiao Song, Zhe Wang

Sparse 3D detectors have received significant attention since the query-based paradigm embraces low latency without explicit dense BEV feature construction.

3D Object Detection object-detection

AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

no code implementations 9 Dec 2021 Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Yuexin Ma, Zhe Wang, Jianping Shi

Compared to previous methods, our AdaStereo realizes a more standard, complete and effective domain adaptation pipeline.

Domain Adaptation Stereo Matching

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation

no code implementations 17 Aug 2021 Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao

However, two major issues of the fusion between camera and LiDAR hinder its performance, i.e., how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem).

Autonomous Driving LIDAR Semantic Segmentation +1

Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection in Autonomous Driving

no code implementations 27 Nov 2020 Zhenxun Yuan, Xiao Song, Lei Bai, Wengang Zhou, Zhe Wang, Wanli Ouyang

As a special design of this transformer, the information encoded in the encoder is different from that in the decoder, i.e., the encoder encodes temporal-channel information of multiple frames while the decoder decodes the spatial-channel information for the current frame in a voxel-wise manner.

3D Object Detection Autonomous Driving +3

Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation

3 code implementations 4 Aug 2020 Hui Zhou, Xinge Zhu, Xiao Song, Yuexin Ma, Zhe Wang, Hongsheng Li, Dahua Lin

A straightforward solution to tackle the issue of 3D-to-2D projection is to keep the 3D representation and process the points in the 3D space.

3D Semantic Segmentation LIDAR Semantic Segmentation

AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching

no code implementations CVPR 2021 Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi

Compared to previous methods for adaptive stereo matching, our AdaStereo realizes a more standard, complete and effective domain adaptation pipeline.

Domain Adaptation Stereo Matching

EdgeStereo: An Effective Multi-Task Learning Network for Stereo Matching and Edge Detection

no code implementations 5 Mar 2019 Xiao Song, Xu Zhao, Liangji Fang, Hanwen Hu

EdgeStereo also achieves comparable generalization performance for disparity estimation because of the incorporation of edge cues.

Disparity Estimation Edge Detection +3

Discriminative Representation Combinations for Accurate Face Spoofing Detection

no code implementations 27 Aug 2018 Xiao Song, Xu Zhao, Liangji Fang, Tianwei Lin

Secondly we utilize the SSD, which is a deep learning framework for detection, to excavate context cues and conduct end-to-end face presentation attack detection.

Face Presentation Attack Detection

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

no code implementations 14 Mar 2018 Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang

Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on the stereo matching task.

Disparity Estimation Edge Detection +2

Face Spoofing Detection by Fusing Binocular Depth and Spatial Pyramid Coding Micro-Texture Features

no code implementations 13 Mar 2018 Xiao Song, Xu Zhao, Tianwei Lin

The second one is a high-level micro-texture based feature called Spatial Pyramid Coding Micro-Texture (SPMT) feature.
