Search Results for author: Jianbiao Mei

Found 8 papers, 5 papers with code

M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition

no code implementations • 22 Jan 2024 • Mengmeng Wang, Jiazheng Xing, Boyuan Jiang, Jun Chen, Jianbiao Mei, Xingxing Zuo, Guang Dai, Jingdong Wang, Yong liu

In this paper, we introduce a novel Multimodal, Multi-task CLIP adapting framework named \name to address these challenges, preserving both high supervised performance and robust transferability.

Action Recognition Temporal Action Localization

Paper
Add Code

CR-SFP: Learning Consistent Representation for Soft Filter Pruning

no code implementations • 17 Dec 2023 • Jingyang Xiang, Zhuangzhi Chen, Jianbiao Mei, Siqi Li, Jun Chen, Yong liu

In this paper, we propose to mitigate this gap by learning consistent representation for soft filter pruning, dubbed as CR-SFP.

Paper
Add Code

Camera-based 3D Semantic Scene Completion with Sparse Guidance Network

1 code implementation • 10 Dec 2023 • Jianbiao Mei, Yu Yang, Mengmeng Wang, Junyu Zhu, Xiangrui Zhao, Jongwon Ra, Laijian Li, Yong liu

Semantic scene completion (SSC) aims to predict the semantic occupancy of each voxel in the entire 3D scene from limited observations, which is an emerging and critical task for autonomous driving.

3D Semantic Scene Completion Autonomous Driving

Paper
Code

SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion

1 code implementation • 27 Jun 2023 • Jianbiao Mei, Yu Yang, Mengmeng Wang, Tianxin Huang, Xuemeng Yang, Yong liu

However, how to effectively exploit the relationships between the semantic context in semantic segmentation and geometric structure in scene completion remains under exploration.

Autonomous Driving Scene Understanding +1

Paper
Code

PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

1 code implementation • 27 Jun 2023 • Jianbiao Mei, Yu Yang, Mengmeng Wang, Xiaojun Hou, Laijian Li, Yong liu

Firstly, we propose a non-learning Sparse Instance Proposal (SIP) module with the ``sampling-shifting-grouping" scheme to directly group thing points into instances from the raw point cloud efficiently.

Autonomous Driving Instance Segmentation +2

Paper
Code

E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context

1 code implementation • 17 Jul 2022 • Zizhang Li, Mengmeng Wang, Huaijin Pi, Kechun Xu, Jianbiao Mei, Yong liu

However, the redundant parameters within the network structure can cause a large model size when scaling up for desirable performance.

Ranked #4 on Video Reconstruction on UVG

Video Reconstruction

Paper
Code

MaIL: A Unified Mask-Image-Language Trimodal Network for Referring Image Segmentation

no code implementations • 21 Nov 2021 • Zizhang Li, Mengmeng Wang, Jianbiao Mei, Yong liu

Referring image segmentation is a typical multi-modal task, which aims at generating a binary mask for referent described in given language expressions.

Ranked #1 on Referring Expression Segmentation on G-Ref test B

Image Segmentation Referring Expression Segmentation +2

Paper
Add Code

TransVOS: Video Object Segmentation with Transformers

1 code implementation • 1 Jun 2021 • Jianbiao Mei, Mengmeng Wang, Yeneng Lin, Yi Yuan, Yong liu

Recently, Space-Time Memory Network (STM) based methods have achieved state-of-the-art performance in semi-supervised video object segmentation (VOS).

Object One-shot visual object segmentation +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.