Search Results for author: Jianbiao Mei

Found 8 papers, 5 papers with code

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition

no code implementations22 Jan 2024 Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong liu

In this paper, we introduce a novel Multimodal, Multi-task CLIP adapting framework named \name to address these challenges, preserving both high supervised performance and robust transferability.

Action Recognition Temporal Action Localization

CR-SFP: Learning Consistent Representation for Soft Filter Pruning

no code implementations17 Dec 2023 Jingyang Xiang, Zhuangzhi Chen, Jianbiao Mei, Siqi Li, Jun Chen, Yong liu

In this paper, we propose to mitigate this gap by learning consistent representation for soft filter pruning, dubbed as CR-SFP.

Camera-based 3D Semantic Scene Completion with Sparse Guidance Network

1 code implementation10 Dec 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Junyu Zhu, Xiangrui Zhao, Jongwon Ra, Laijian Li, Yong liu

Semantic scene completion (SSC) aims to predict the semantic occupancy of each voxel in the entire 3D scene from limited observations, which is an emerging and critical task for autonomous driving.

3D Semantic Scene Completion Autonomous Driving

SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion

1 code implementation27 Jun 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Tianxin Huang, Xuemeng Yang, Yong liu

However, how to effectively exploit the relationships between the semantic context in semantic segmentation and geometric structure in scene completion remains under exploration.

Autonomous Driving Scene Understanding +1

PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

1 code implementation27 Jun 2023 Jianbiao Mei, Yu Yang, Mengmeng Wang, Xiaojun Hou, Laijian Li, Yong liu

Firstly, we propose a non-learning Sparse Instance Proposal (SIP) module with the ``sampling-shifting-grouping" scheme to directly group thing points into instances from the raw point cloud efficiently.

Autonomous Driving Instance Segmentation +2

E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context

1 code implementation17 Jul 2022 Zizhang Li, Mengmeng Wang, Huaijin Pi, Kechun Xu, Jianbiao Mei, Yong liu

However, the redundant parameters within the network structure can cause a large model size when scaling up for desirable performance.

Video Reconstruction

MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation

no code implementations21 Nov 2021 Zizhang Li, Mengmeng Wang, Jianbiao Mei, Yong liu

Referring image segmentation is a typical multi-modal task, which aims at generating a binary mask for referent described in given language expressions.

Image Segmentation Referring Expression Segmentation +2

TransVOS: Video Object Segmentation with Transformers

1 code implementation1 Jun 2021 Jianbiao Mei, Mengmeng Wang, Yeneng Lin, Yi Yuan, Yong liu

Recently, Space-Time Memory Network (STM) based methods have achieved state-of-the-art performance in semi-supervised video object segmentation (VOS).

Object One-shot visual object segmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.