1 code implementation • 6 Feb 2024 • Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen
We introduce MobileVLM V2, a family of significantly improved vision language models upon MobileVLM, which proves that a delicate orchestration of novel architectural design, an improved training scheme tailored for mobile VLMs, and rich high-quality dataset curation can substantially benefit VLMs' performance.
1 code implementation • 28 Dec 2023 • Xiangxiang Chu, Limeng Qiao, Xinyang Lin, Shuang Xu, Yang Yang, Yiming Hu, Fei Wei, Xinyu Zhang, Bo Zhang, Xiaolin Wei, Chunhua Shen
We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted to run on mobile devices.
2 code implementations • ICCV 2023 • Wenjie Ding, Limeng Qiao, Xi Qiu, Chi Zhang
Furthermore, to supervise the position and topology of the vectorized point predictions, we propose a dynamic vectorized sequence loss.
2 code implementations • 17 Jun 2023 • Limeng Qiao, Yongchao Zheng, Peng Zhang, Wenjie Ding, Xi Qiu, Xing Wei, Chi Zhang
This report introduces the 1st place winning solution for the Autonomous Driving Challenge 2023 - Online HD-map Construction.
1 code implementation • CVPR 2023 • Limeng Qiao, Wenjie Ding, Xi Qiu, Chi Zhang
Vectorized high-definition map (HD-map) construction, which focuses on the perception of centimeter-level environmental information, has attracted significant research interest in the autonomous driving community.
no code implementations • 23 Apr 2022 • Boshu Lei, Wenjie Ding, Limeng Qiao, Xi Qiu
Visual place retrieval aims to search images in the database that depict similar places as the query image.
1 code implementation • ICCV 2021 • Limeng Qiao, Yuxuan Zhao, Zhiyuan Li, Xi Qiu, Jianan Wu, Chi Zhang
Few-shot object detection, which aims at detecting novel objects rapidly from extremely few annotated examples of previously unseen classes, has attracted significant research interest in the community.
Ranked #4 on Few-Shot Object Detection on MS-COCO (1-shot)
1 code implementation • ECCV 2020 • Guangyao Chen, Limeng Qiao, Yemin Shi, Peixi Peng, Jia Li, Tiejun Huang, ShiLiang Pu, Yonghong Tian
In this process, one of the key challenges is to reduce the risk of generalizing the inherent characteristics of numerous unknown samples learned from a small amount of known data.
no code implementations • ICCV 2019 • Limeng Qiao, Yemin Shi, Jia Li, Yao-Wei Wang, Tiejun Huang, Yonghong Tian
By solving the problem with its closed-form solution on the fly with the setup of transduction, our approach efficiently tailors an episodic-wise metric for each task to adapt all features from a shared task-agnostic embedding space into a more discriminative task-specific metric space.