Search Results for author: Yichao Ma

Found 2 papers, 0 papers with code

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

no code implementations15 Oct 2024 Zhiyuan Ma, Yuzhu Zhang, Guoli Jia, Liangliang Zhao, Yichao Ma, Mingjie Ma, Gaofeng Liu, Kaiyan Zhang, Jianjun Li, BoWen Zhou

As one of the most popular and sought-after generative models in the recent years, diffusion models have sparked the interests of many researchers and steadily shown excellent advantage in various generative tasks such as image synthesis, video generation, molecule design, 3D scene rendering and multimodal generation, relying on their dense theoretical principles and reliable application practices.

Image Generation multimodal generation +2

EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning

no code implementations22 Apr 2024 Mingjie Ma, zhihuan yu, Yichao Ma, GuoHui Li

First, by emulating the cognitive process of human reasoning, an Event-Aware Pretraining auxiliary task is introduced to better activate LLM's global comprehension of intricate scenarios.

Visual Commonsense Reasoning

Cannot find the paper you are looking for? You can Submit a new open access paper.