Search Results for author: Weifeng Chen

Found 20 papers, 9 papers with code

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

1 code implementation19 Dec 2024 Yatai Ji, Jiacheng Zhang, Jie Wu, Shilong Zhang, Shoufa Chen, Chongjian Ge, Peize Sun, Weifeng Chen, Wenqi Shao, Xuefeng Xiao, Weilin Huang, Ping Luo

Text-to-video models have made remarkable advancements through optimization on high-quality text-video pairs, where the textual prompts play a pivotal role in determining quality of output videos.

Video Generation

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

no code implementations19 Dec 2024 Jiacheng Zhang, Jie Wu, Weifeng Chen, Yatai Ji, Xuefeng Xiao, Weilin Huang, Kai Han

To tackle these issues, we introduce OnlineVPO, a more efficient preference learning approach tailored specifically for video diffusion models.

Video Quality Assessment Visual Question Answering (VQA)

Magic Clothing: Controllable Garment-Driven Image Synthesis

1 code implementation15 Apr 2024 Weifeng Chen, Tao Gu, Yuhao Xu, Chengcai Chen

We propose Magic Clothing, a latent diffusion model (LDM)-based network architecture for an unexplored garment-driven image synthesis task.

Image Generation

OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

2 code implementations4 Mar 2024 Yuhao Xu, Tao Gu, Weifeng Chen, Chengcai Chen

We present OOTDiffusion, a novel network architecture for realistic and controllable image-based virtual try-on (VTON).

Denoising Image Generation +1

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

1 code implementation9 Feb 2024 Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, QiXing Huang, Li Erran Li

By combining natural language understanding, generation capabilities, and breadth of knowledge of large language models with image perception, recent large vision language models (LVLMs) have shown unprecedented visual reasoning capabilities.

Hallucination Natural Language Understanding +2

DiffusionGPT: LLM-Driven Text-to-Image Generation System

no code implementations18 Jan 2024 Jie Qin, Jie Wu, Weifeng Chen, Yuxi Ren, Huixia Li, Hefeng Wu, Xuefeng Xiao, Rui Wang, Shilei Wen

Diffusion models have opened up new avenues for the field of image generation, resulting in the proliferation of high-quality models shared on open-source platforms.

Model Selection Text to Image Generation +1

MSRL: Distributed Reinforcement Learning with Dataflow Fragments

no code implementations3 Oct 2022 Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Yijie Chen, Liang Shi, Yaodong Yang, Peter Pietzuch, Lei Chen

Yet, current distributed RL systems tie the definition of RL algorithms to their distributed execution: they hard-code particular distribution strategies and only accelerate specific parts of the computation (e. g. policy network updates) on GPU workers.

reinforcement-learning Reinforcement Learning +1

HYPE-C: Evaluating Image Completion Models Through Standardized Crowdsourcing

no code implementations1 Jan 2021 Emily Walters, Weifeng Chen, Jia Deng

Recent work has proposed the use of human evaluation for image synthesis models, allowing for a reliable method to evaluate the visual quality of generated images.

Image Generation

OASIS: A Large-Scale Dataset for Single Image 3D in the Wild

no code implementations CVPR 2020 Weifeng Chen, Shengyi Qian, David Fan, Noriyuki Kojima, Max Hamilton, Jia Deng

Single-view 3D is the task of recovering 3D properties such as depth and surface normals from a single image.

3D geometry

Surface Normals in the Wild

no code implementations ICCV 2017 Weifeng Chen, Donglai Xiang, Jia Deng

We study the problem of single-image depth estimation for images in the wild.

Depth Estimation

Single-Image Depth Perception in the Wild

4 code implementations NeurIPS 2016 Weifeng Chen, Zhao Fu, Dawei Yang, Jia Deng

This paper studies single-image depth perception in the wild, i. e., recovering depth from a single image taken in unconstrained settings.

Depth Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.