Search Results for author: Zhiyin Ma

Found 4 papers, 3 papers with code

FlameGS: Reconstruct flame light field via Gaussian Splatting

no code implementations24 Dec 2024 Yunhao Shui, Fuhao Zhang, Can Gao, Hao Xue, Zhiyin Ma, Gang Xun, Xuesong Li

To address the time-consuming and computationally intensive issues of traditional ART algorithms for flame combustion diagnosis, inspired by flame simulation technology, we propose a novel representation method for flames.

Exploring the Capabilities of Large Multimodal Models on Dense Text

1 code implementation9 May 2024 Shuo Zhang, Biao Yang, Zhang Li, Zhiyin Ma, Yuliang Liu, Xiang Bai

To further explore the capabilities of LMM in complex text tasks, we propose the DT-VQA dataset, with 170k question-answer pairs.

Prompt Engineering Visual Question Answering (VQA)

Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models

1 code implementation CVPR 2024 Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai

Additionally, experiments on 18 datasets further demonstrate that Monkey surpasses existing LMMs in many tasks like Image Captioning and various Visual Question Answering formats.

Ranked #13 on MMR total on MRR-Benchmark (using extra training data)

Image Captioning MMR total +3

Cannot find the paper you are looking for? You can Submit a new open access paper.