Search Results for author: Fengyu Sun

Found 12 papers, 5 papers with code

PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation

1 code implementation18 Dec 2024 Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohammadreza Samadi, Jiao He, Fengyu Sun, Di Niu

Recent research explores the potential of Diffusion Models (DMs) for consistent object editing, which aims to modify object position, size, and composition, etc., while preserving the consistency of objects and background without changing their texture and attributes.

Object

Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric

no code implementations25 Nov 2024 Zhichao Zhang, Wei Sun, Xinyue Li, Yunhao Li, Qihang Ge, Jun Jia, ZiCheng Zhang, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai

To address this challenge, we conduct a pioneering study on human activity AGV quality assessment, focusing on visual quality evaluation and the identification of semantic distortions.

Video Generation Video Quality Assessment

LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models

no code implementations26 Aug 2024 Qihang Ge, Wei Sun, Yu Zhang, Yunhao Li, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai

Then, we design a spatiotemporal vision encoder to extract spatial and temporal features to represent the quality characteristics of videos, which are subsequently mapped into the language space by the spatiotemporal projector for modality alignment.

Large Language Model Video Quality Assessment +1

FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting

no code implementations21 Aug 2024 Liyao Jiang, Negar Hassanpour, Mohammad Salameh, Mohan Sai Singamsetti, Fengyu Sun, Wei Lu, Di Niu

Through extensive evaluations, we show FRAP generates images with significantly higher prompt-image alignment to prompts from complex datasets, while having a lower average latency compared to recent latent code optimization methods, e. g., 4 seconds faster than D&B on the COCO-Subject dataset.

Text-to-Image Generation

FunEditor: Achieving Complex Image Edits via Function Aggregation with Diffusion Models

no code implementations16 Aug 2024 Mohammadreza Samadi, Fred X. Han, Mohammad Salameh, Hao Wu, Fengyu Sun, Chunhua Zhou, Di Niu

This approach enables complex editing tasks, such as object movement, by aggregating multiple functions and applying them simultaneously to specific areas.

Image Quality Assessment Object

Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model

no code implementations31 Jul 2024 Zhichao Zhang, Wei Sun, Xinyue Li, Jun Jia, Xiongkuo Min, ZiCheng Zhang, Chunyi Li, Zijian Chen, Puyi Wang, Fengyu Sun, Shangling Jui, Guangtao Zhai

To bridge this gap, we propose the Unify Generated Video Quality assessment (UGVQ) model, designed to accurately evaluate the multi-dimensional quality of AIGC videos.

Benchmarking Large Language Model +4

Exploring the Naturalness of AI-Generated Images

1 code implementation9 Dec 2023 Zijian Chen, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images.

Ternary Singular Value Decomposition as a Better Parameterized Form in Linear Mapping

1 code implementation15 Aug 2023 BoYu Chen, Hanxuan Chen, Jiao He, Fengyu Sun, Shangling Jui

We present a simple yet novel parameterized form of linear mapping to achieves remarkable network compression performance: a pseudo SVD called Ternary SVD (TSVD).

Form Language Modeling +3

Cannot find the paper you are looking for? You can Submit a new open access paper.