Search Results for author: Liang Xie

Found 16 papers, 5 papers with code

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

no code implementations • 13 Jul 2024 • Xiaopei Wu, Liang Peng, Liang Xie, Yuenan Hou, Binbin Lin, Xiaoshui Huang, Haifeng Liu, Deng Cai, Wanli Ouyang

In this paper, we propose PatchTeacher, which focuses on partial scene 3D object detection to provide high-quality pseudo labels for the student.

3D Object Detection, Data Augmentation +2

From Redundancy to Relevance: Enhancing Explainability in Multimodal Large Language Models

no code implementations • 4 Jun 2024 • Xiaofeng Zhang, Chen Shen, Xiaosong Yuan, Shaotian Yan, Liang Xie, Wenxiao Wang, Chaochen Gu, Hao Tang, Jieping Ye

To explore the interaction process between image and text in complex reasoning tasks, we introduce the information flow method to visualize the interaction mechanism.

Language Modelling, Large Language Model

Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization

no code implementations • 24 Mar 2024 • Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan, Erwei Yin

Lip reading, the process of interpreting silent speech from visual lip movements, has gained rising attention for its wide range of realistic applications.

Lip Reading

Neural Collapse Inspired Federated Learning with Non-iid Data

no code implementations • 27 Mar 2023 • Chenxi Huang, Liang Xie, Yibo Yang, Wenxiao Wang, Binbin Lin, Deng Cai

One of the challenges in federated learning is the non-independent and identically distributed (non-iid) characteristics between heterogeneous devices, which cause significant differences in local updates and affect the performance of the central server.

Federated Learning

Learning A Simulation-based Visual Policy for Real-world Peg In Unseen Holes

1 code implementation • 9 May 2022 • Liang Xie, Hongxiang Yu, Kechun Xu, Tong Yang, Minhang Wang, Haojian Lu, Rong Xiong, Yue Wang

This paper proposes a learning-based visual peg-in-hole method that enables training with several shapes in simulation and adapting to arbitrary unseen shapes in the real world with minimal sim-to-real cost.

Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion

1 code implementation • CVPR 2022 • Xiaopei Wu, Liang Peng, Honghui Yang, Liang Xie, Chenxi Huang, Chengqi Deng, Haifeng Liu, Deng Cai

Many multi-modal methods are proposed to alleviate this issue, while different representations of images and point clouds make it difficult to fuse them, resulting in suboptimal performance.

3D Object Detection, Data Augmentation +3

X-view: Non-egocentric Multi-View 3D Object Detector

no code implementations • 24 Mar 2021 • Liang Xie, Guodong Xu, Deng Cai, Xiaofei He

3D object detection algorithms for autonomous driving reason about 3D obstacles from the 3D bird's-eye view, the perspective view, or both.

3D Object Detection, Autonomous Driving +3

PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module

no code implementations • 14 Nov 2019 • Liang Xie, Chao Xiang, Zhengxu Yu, Guodong Xu, Zheng Yang, Deng Cai, Xiaofei He

Moreover, based on the PACF module, we propose a 3D multi-sensor multi-task network called Pointcloud-Image RCNN (PI-RCNN for short), which handles the image segmentation and 3D object detection tasks.

3D Object Detection, Image Segmentation +4

Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval

no code implementations • 25 Apr 2019 • Lei Zhu, Zi Huang, Zhihui Li, Liang Xie, Heng Tao Shen

To address the problem, in this paper, we propose a novel hashing approach, dubbed Discrete Semantic Transfer Hashing (DSTH).

Content-Based Image Retrieval, Retrieval
