Search Results for author: En Yu

Found 16 papers, 1 papers with code

Delving into the Trajectory Long-tail Distribution for Muti-object Tracking

1 code implementation • 7 Mar 2024 • Sijia Chen, En Yu, Jinyang Li, Wenbing Tao

In this study, we pioneer an exploration into the distribution patterns of tracking data and identify a pronounced long-tail distribution issue within existing MOT datasets.

Data Augmentation Multiple Object Tracking +1

Paper
Code

Small Language Model Meets with Reinforced Vision Vocabulary

no code implementations • 23 Jan 2024 • Haoran Wei, Lingyu Kong, Jinyue Chen, Liang Zhao, Zheng Ge, En Yu, Jianjian Sun, Chunrui Han, Xiangyu Zhang

In Vary-toy, we introduce an improved vision vocabulary, allowing the model to not only possess all features of Vary but also gather more generality.

Ranked #76 on Visual Question Answering on MM-Vet

Language Modelling Large Language Model +3

Paper
Add Code

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification

no code implementations • 17 Dec 2023 • En Yu, Jie Lu, Bin Zhang, Guangquan Zhang

Specifically, OBAL operates in a dual-phase mechanism, in the first of which we design an Adaptive COvariate Shift Adaptation (AdaCOSA) algorithm to construct an initialized ensemble model using archived data from various source streams, thus mitigating the covariate shift while learning the dynamic correlations via an adaptive re-weighting strategy.

Paper
Add Code

Merlin:Empowering Multimodal LLMs with Foresight Minds

no code implementations • 30 Nov 2023 • En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, Wenbing Tao

Then, FIT requires MLLMs to first predict trajectories of related objects and then reason about potential future events based on them.

Ranked #61 on Visual Question Answering on MM-Vet

Visual Question Answering

Paper
Add Code

ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning

no code implementations • 18 Jul 2023 • Liang Zhao, En Yu, Zheng Ge, Jinrong Yang, Haoran Wei, HongYu Zhou, Jianjian Sun, Yuang Peng, Runpei Dong, Chunrui Han, Xiangyu Zhang

Based on precise referring instruction, we propose ChatSpot, a unified end-to-end multimodal large language model that supports diverse forms of interactivity including mouse clicks, drag-and-drop, and drawing boxes, which provides a more flexible and seamless interactive experience.

Instruction Following Language Modelling +1

Paper
Add Code

GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping

no code implementations • 18 Jul 2023 • Zhuoling Li, Chunrui Han, Zheng Ge, Jinrong Yang, En Yu, Haoqian Wang, Hengshuang Zhao, Xiangyu Zhang

Besides, GroupLane with ResNet18 still surpasses PersFormer by 4. 9% F1 score, while the inference speed is nearly 7x faster and the FLOPs is only 13. 3% of it.

3D Lane Detection

Paper
Add Code

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking

no code implementations • 23 May 2023 • En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao

Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer from the conflict between detection and association seriously, resulting in unsatisfactory convergence dynamics.

Denoising Multi-Object Tracking +1

Paper
Add Code

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation

no code implementations • 3 Dec 2022 • En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao

VLM joints the information in the generated visual prompts and the textual prompts from a pre-defined Trackbook to obtain instance-level pseudo textual description, which is domain invariant to different tracking scenes.

Domain Generalization Multi-Object Tracking +1

Paper
Add Code

Implicit and Efficient Point Cloud Completion for 3D Single Object Tracking

no code implementations • 1 Sep 2022 • Pan Wang, Liangliang Ren, Shengkai Wu, Jinrong Yang, En Yu, Hangcheng Yu, Xiaoping Li

The point cloud based 3D single object tracking has drawn increasing attention.

3D Single Object Tracking Object Tracking +2

Paper
Add Code

Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking

no code implementations • 23 Aug 2022 • Jinrong Yang, En Yu, Zeming Li, Xiaoping Li, Wenbing Tao

Recent advanced works generally employ a series of object attributes, e. g., position, size, velocity, and appearance, to provide the clues for the association in 3D MOT.

3D Multi-Object Tracking 3D Object Detection +2

Paper
Add Code

Delving into the Pre-training Paradigm of Monocular 3D Object Detection

no code implementations • 8 Jun 2022 • Zhuoling Li, Chuanrui Zhang, En Yu, Haoqian Wang

(2) Combining depth estimation and 2D object detection is a promising M3OD pre-training baseline.

Depth Estimation Monocular 3D Object Detection +3

Paper
Add Code

Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking

no code implementations • CVPR 2022 • En Yu, Zhuoling Li, Shoudong Han

To this end, we propose a strategy, namely multi-view trajectory contrastive learning, in which each trajectory is represented as a center vector.

Contrastive Learning Multi-Object Tracking +1

Paper
Add Code

RelationTrack: Relation-aware Multiple Object Tracking with Decoupled Representation

no code implementations • 10 May 2021 • En Yu, Zhuoling Li, Shoudong Han, Hongwei Wang

Existing online multiple object tracking (MOT) algorithms often consist of two subtasks, detection and re-identification (ReID).

Multiple Object Tracking Object +1

Paper
Add Code

MAT: Motion-Aware Multi-Object Tracking

no code implementations • 10 Sep 2020 • Shoudong Han, Piao Huang, Hongwei Wang, En Yu, Donghaisheng Liu, Xiaofeng Pan, Jun Zhao

Modern multi-object tracking (MOT) systems usually model the trajectories by associating per-frame detections.

Multi-Object Tracking Object

Paper
Add Code

Refinements in Motion and Appearance for Online Multi-Object Tracking

no code implementations • 16 Mar 2020 • Piao Huang, Shoudong Han, Jun Zhao, Donghaisheng Liu, Hongwei Wang, En Yu, Alex ChiChung Kot

Modern multi-object tracking (MOT) system usually involves separated modules, such as motion model for location and appearance model for data association.

Blocking Multi-Object Tracking +1

Paper
Add Code

Fusion-supervised Deep Cross-modal Hashing

no code implementations • 25 Apr 2019 • Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang

Deep hashing has recently received attention in cross-modal retrieval for its impressive advantages.

Cross-Modal Retrieval Deep Hashing

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.