Search Results for author: Fan Jia

Found 15 papers, 5 papers with code

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

no code implementations • 28 Mar 2024 • Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang

Autonomous driving progress relies on large-scale annotated datasets.

Autonomous Driving

VJT: A Video Transformer on Joint Tasks of Deblurring, Low-light Enhancement and Denoising

no code implementations • 26 Jan 2024 • Yuxiang Hui, Yang Liu, Yaofang Liu, Fan Jia, Jinshan Pan, Raymond Chan, Tieyong Zeng

The video restoration task aims to recover high-quality videos from low-quality observations.

Deblurring Denoising +2

Stream Query Denoising for Vectorized HD Map Construction

no code implementations • 17 Jan 2024 • Shuo Wang, Fan Jia, Yingfei Liu, Yucheng Zhao, Zehui Chen, Tiancai Wang, Chi Zhang, Xiangyu Zhang, Feng Zhao

This paper introduces the Stream Query Denoising (SQD) strategy as a novel approach for temporal modeling in high-definition map (HD-map) construction.

Autonomous Driving Denoising

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

no code implementations • 28 Nov 2023 • Yuqing Wen, Yucheng Zhao, Yingfei Liu, Fan Jia, Yanhui Wang, Chong Luo, Chi Zhang, Tiancai Wang, Xiaoyan Sun, Xiangyu Zhang

This work notably propels the field of autonomous driving by effectively augmenting the training dataset used for advanced BEV perception techniques.

Autonomous Driving Video Generation

ADriver-I: A General World Model for Autonomous Driving

no code implementations • 22 Nov 2023 • Fan Jia, Weixin Mao, Yingfei Liu, Yucheng Zhao, Yuqing Wen, Chi Zhang, Xiangyu Zhang, Tiancai Wang

Based on the vision-action pairs, we construct a general world model based on MLLM and diffusion model for autonomous driving, termed ADriver-I.

Autonomous Driving

VLM-Eval: A General Evaluation on Video Large Language Models

no code implementations • 20 Nov 2023 • Shuailin Li, Yuang Zhang, Yucheng Zhao, Qiuyue Wang, Fan Jia, Yingfei Liu, Tiancai Wang

Despite the rapid development of video Large Language Models (LLMs), a comprehensive evaluation is still absent.

Action Recognition Retrieval

TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning

1 code implementation • 10 Oct 2023 • Dongming Wu, Jiahao Chang, Fan Jia, Yingfei Liu, Tiancai Wang, Jianbing Shen

Further, we propose TopoMLP, a simple yet high-performance pipeline for driving topology reasoning.

Ranked #3 on 3D Lane Detection on OpenLane-V2 val

3D Lane Detection Autonomous Driving

Far3D: Expanding the Horizon for Surround-view 3D Object Detection

1 code implementation • 18 Aug 2023 • Xiaohui Jiang, Shuailin Li, Yingfei Liu, Shihao Wang, Fan Jia, Tiancai Wang, Lijin Han, Xiangyu Zhang

Recently 3D object detection from surround-view images has made notable advancements with its low deployment cost.

Ranked #1 on 3D Object Detection on nuScenes Camera Only

3D Object Detection Denoising +1

The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge

1 code implementation • 16 Jun 2023 • Dongming Wu, Fan Jia, Jiahao Chang, Zhuoling Li, Jianjian Sun, Chunrui Han, Shuailin Li, Yingfei Liu, Zheng Ge, Tiancai Wang

We present the 1st-place solution of OpenLane Topology in Autonomous Driving Challenge.

Autonomous Driving

Cross Modal Transformer: Towards Fast and Robust 3D Object Detection

2 code implementations • ICCV 2023 • Junjie Yan, Yingfei Liu, Jianjian Sun, Fan Jia, Shuailin Li, Tiancai Wang, Xiangyu Zhang

In this paper, we propose a robust 3D detector, named Cross Modal Transformer (CMT), for end-to-end 3D multi-modal detection.

object-detection Object Tracking +1

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

1 code implementation • ICCV 2023 • Yingfei Liu, Junjie Yan, Fan Jia, Shuailin Li, Aqi Gao, Tiancai Wang, Xiangyu Zhang, Jian Sun

More specifically, we extend the 3D position embedding (3D PE) in PETR for temporal modeling.

Ranked #2 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU lane - 224x480 - 100x100 at 0.5 metric)

3D Lane Detection 3D Object Detection +6

Adaptive Agent Architecture for Real-time Human-Agent Teaming

no code implementations • 7 Mar 2021 • Tianwei Ni, Huao Li, Siddharth Agrawal, Suhas Raja, Fan Jia, Yikang Gui, Dana Hughes, Michael Lewis, Katia Sycara

Previous human-human team research has shown complementary policies in the TSF game and diversity in human players' skill, which encourages us to relax the assumptions on human policy.

Space Fortress

A Regularized Convolutional Neural Network for Semantic Image Segmentation

no code implementations • 28 Jun 2019 • Fan Jia, Jun Liu, Xue-Cheng Tai

That is, spatial regularity of the segmented objects is still a problem for CNNs.

Image Segmentation object-detection +3

Multi-label Learning with Missing Labels using Mixed Dependency Graphs

no code implementations • 31 Mar 2018 • Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem, Siwei Lyu

This work focuses on the problem of multi-label learning with missing labels (MLML), which aims to label each test instance with multiple class labels given training instances that have an incomplete/partial set of these labels.

Image Retrieval Missing Labels +2

Diverse Image Annotation

no code implementations • CVPR 2017 • Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem

To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly.

TAG
