Search Results for author: Jiwen Yu

Found 15 papers, 7 papers with code

GameFactory: Creating New Games with Generative Interactive Videos

no code implementations14 Jan 2025 Jiwen Yu, Yiran Qin, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu

In this paper, we present GameFactory, a framework focused on exploring scene generalization in game video generation.

Domain Generalization Minecraft +1

WorldSimBench: Towards Video Generation Models as World Simulators

no code implementations23 Oct 2024 Yiran Qin, Zhelun Shi, Jiwen Yu, Xijun Wang, Enshen Zhou, Lijun Li, Zhenfei Yin, Xihui Liu, Lu Sheng, Jing Shao, Lei Bai, Wanli Ouyang, Ruimao Zhang

WorldSimBench includes Explicit Perceptual Evaluation and Implicit Manipulative Evaluation, encompassing human preference assessments from the visual perspective and action-level evaluations in embodied tasks, covering three representative embodied scenarios: Open-Ended Embodied Environment, Autonomous, Driving, and Robot Manipulation.

Autonomous Driving Robot Manipulation +1

SkillMimic: Learning Reusable Basketball Skills from Demonstrations

no code implementations12 Aug 2024 Yinhuai Wang, Qihan Zhao, Runyi Yu, Ailing Zeng, Jing Lin, Zhengyi Luo, Hok Wai Tsui, Jiwen Yu, Xiu Li, Qifeng Chen, Jian Zhang, Lei Zhang, Ping Tan

SkillMimic employs a unified configuration to learn diverse skills from human-ball motion datasets, with skill diversity and generalization improving as the dataset grows.

Diffusion-Based Hierarchical Image Steganography

no code implementations19 May 2024 Youmin Xu, Xuanyu Zhang, Jiwen Yu, Chong Mou, Xiandong Meng, Jian Zhang

This paper introduces Hierarchical Image Steganography, a novel method that enhances the security and capacity of embedding multiple images into a single container using diffusion models.

Image Steganography

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

no code implementations25 Apr 2024 Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

Meanwhile, we introduce a sample-level audio localization method and a cross-modal copyright extraction mechanism to couple the information of audio and video frames.

Video Editing

Invertible Diffusion Models for Compressed Sensing

1 code implementation25 Mar 2024 Bin Chen, Zhenyu Zhang, Weiqi Li, Chen Zhao, Jiwen Yu, Shijie Zhao, Jie Chen, Jian Zhang

To enable such memory-intensive end-to-end fine-tuning, we propose a novel two-level invertible design to transform both (1) multi-step sampling process and (2) noise estimation U-Net in each step into invertible networks.

Image Compressed Sensing Image Reconstruction +1

Neural Video Fields Editing

no code implementations12 Dec 2023 Shuzhou Yang, Chong Mou, Jiwen Yu, YuHan Wang, Xiandong Meng, Jian Zhang

Specifically, we construct a neural video field, powered by tri-plane and sparse grid, to enable encoding long videos with hundreds of frames in a memory-efficient manner.

Video Editing

EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection

no code implementations CVPR 2024 Xuanyu Zhang, Runyi Li, Jiwen Yu, Youmin Xu, Weiqi Li, Jian Zhang

In the era where AI-generated content (AIGC) models can produce stunning and lifelike images, the lingering shadow of unauthorized reproductions and malicious tampering poses imminent threats to copyright integrity and information security.

Image Steganography

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators

1 code implementation6 Dec 2023 Jiwen Yu, Xiaodong Cun, Chenyang Qi, Yong Zhang, Xintao Wang, Ying Shan, Jian Zhang

For appearance control, we borrow intermediate latents and their features from the text-to-image (T2I) generation for ensuring the generated first frame is equal to the given generated image.

Image Animation Video Generation

DiffLLE: Diffusion-guided Domain Calibration for Unsupervised Low-light Image Enhancement

no code implementations18 Aug 2023 Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang

Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion model to improve generalization and effectiveness.

Denoising Low-Light Image Enhancement

CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography

1 code implementation NeurIPS 2023 Jiwen Yu, Xuanyu Zhang, Youmin Xu, Jian Zhang

Current image steganography techniques are mainly focused on cover-based methods, which commonly have the risk of leaking secret images and poor robustness against degraded container images.

Diversity Image Steganography

FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model

1 code implementation ICCV 2023 Jiwen Yu, Yinhuai Wang, Chen Zhao, Bernard Ghanem, Jian Zhang

In this work, we propose a training-Free conditional Diffusion Model (FreeDoM) used for various conditions.

Face Detection

Unlimited-Size Diffusion Restoration

1 code implementation1 Mar 2023 Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang

Our simple, parameter-free approaches can be used not only for image restoration but also for image generation of unlimited sizes, with the potential to be a general tool for diffusion models.

Image Generation Image Restoration

Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

4 code implementations1 Dec 2022 Yinhuai Wang, Jiwen Yu, Jian Zhang

Most existing Image Restoration (IR) models are task-specific, which can not be generalized to different degradation operators.

Colorization Deblurring +7

GAN Prior based Null-Space Learning for Consistent Super-Resolution

1 code implementation24 Nov 2022 Yinhuai Wang, Yujie Hu, Jiwen Yu, Jian Zhang

Consistency and realness have always been the two critical issues of image super-resolution.

Image Super-Resolution

Cannot find the paper you are looking for? You can Submit a new open access paper.