Search Results for author: Shenglong Ye

Found 8 papers, 6 papers with code

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

no code implementations13 Mar 2025 Weiyun Wang, Zhangwei Gao, Lianjie Chen, Zhe Chen, Jinguo Zhu, Xiangyu Zhao, Yangzhou Liu, Yue Cao, Shenglong Ye, Xizhou Zhu, Lewei Lu, Haodong Duan, Yu Qiao, Jifeng Dai, Wenhai Wang

We introduce VisualPRM, an advanced multimodal Process Reward Model (PRM) with 8B parameters, which improves the reasoning abilities of existing Multimodal Large Language Models (MLLMs) across different model scales and families with Best-of-N (BoN) evaluation strategies.

Multimodal Reasoning

Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance

1 code implementation21 Oct 2024 Zhangwei Gao, Zhe Chen, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang

Multimodal large language models (MLLMs) have demonstrated impressive performance in vision-language tasks across a broad spectrum of domains.

Autonomous Driving

SDAN: Squared Deformable Alignment Network for Learning Misaligned Optical Zoom

1 code implementation2 Apr 2021 Kangfu Mei, Shenglong Ye, Rui Huang

Deep Neural Network (DNN) based super-resolution algorithms have greatly improved the quality of the generated images.

Computational Efficiency Super-Resolution

Cannot find the paper you are looking for? You can Submit a new open access paper.