Search Results for author: Hanxue Zhang

Found 2 papers, 1 papers with code

DriveLM: Driving with Graph Visual Question Answering

1 code implementation • 21 Dec 2023 • Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, Ping Luo, Andreas Geiger, Hongyang Li

The experiments demonstrate that Graph VQA provides a simple, principled framework for reasoning about a driving scene, and DriveLM-Data provides a challenging benchmark for this task.

Autonomous Driving Question Answering +1

645

Paper
Code

Improving Audio Caption Fluency with Automatic Error Correction

no code implementations • 16 Jun 2023 • Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Automated audio captioning (AAC) is an important cross-modality translation task, aiming at generating descriptions for audio clips.

Audio captioning Sentence

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.