Search Results for author: Zhilong Zhang

Found 8 papers, 2 papers with code

Structure-Guided Adversarial Training of Diffusion Models

no code implementations • 27 Feb 2024 • Ling Yang, Haotian Qian, Zhilong Zhang, Jingwei Liu, Bin Cui

In this pioneering approach, we compel the model to learn manifold structures between samples in each training batch.

Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing

1 code implementation • 26 Feb 2024 • Ling Yang, Zhilong Zhang, Zhaochen Yu, Jingwei Liu, Minkai Xu, Stefano Ermon, Bin Cui

To address this issue, we propose a novel and general contextualized diffusion model (ContextDiff) by incorporating the cross-modal context encompassing interactions and alignments between text condition and visual sample into forward and reverse processes.

Text-to-Image Generation, Text-to-Video Editing, +1 more

Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving

no code implementations • 16 Jan 2024 • Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo, Changchuan Yin

Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image segmentation features of important objects are transmitted to reduce transmission redundancy.

Autonomous Driving, Image Segmentation, +2 more

Improving Diffusion-Based Image Synthesis with Context Prediction

no code implementations • NeurIPS 2023 • Ling Yang, Jingwei Liu, Shenda Hong, Zhilong Zhang, Zhilin Huang, Zheming Cai, Wentao Zhang, Bin Cui

In this way, each point can better reconstruct itself by preserving its semantic connections with neighborhood context.

Ranked #1 on Image Inpainting on CelebA (LPIPS metric)

Denoising, Image Inpainting, +2 more

Identifying Subgroups of ICU Patients Using End-to-End Multivariate Time-Series Clustering Algorithm Based on Real-World Vital Signs Data

no code implementations • 3 Jun 2023 • Tongyue Shi, Zhilong Zhang, Wentie Liu, Junhua Fang, Jianguo Hao, Shuai Jin, Huiying Zhao, Guilan Kong

This study used the MIMIC-IV database to investigate dynamic, high-frequency, multivariate time-series vital signs data (temperature, heart rate, mean blood pressure, respiratory rate, and SpO2) monitored during the first 8 hours of the ICU stay.

Clustering, ICU Mortality, +3 more

Diffusion Models: A Comprehensive Survey of Methods and Applications

2 code implementations • 2 Sep 2022 • Ling Yang, Zhilong Zhang, Yang Song, Shenda Hong, Runsheng Xu, Yue Zhao, Yingxia Shao, Wentao Zhang, Bin Cui, Ming-Hsuan Yang

This survey aims to provide a contextualized, in-depth look at the state of diffusion models, identifying the key areas of focus and pointing to potential areas for further exploration.

Image Super-Resolution, Text-to-Image Generation, +1 more

A view synthesis-based 360° VR caching system over MEC-Enabled C-RAN

no code implementations • 1 Oct 2020 • Jianmei Dai, Zhilong Zhang, Shiwen Mao, Danpu Liu

If the requested content of a specific view is cached in the BBU pool or RRHs, or can be synthesized with the aid of the cached adjacent views, it is unnecessary to request the content from the remote VR video source server.

Edge-computing

R3: A Reading Comprehension Benchmark Requiring Reasoning Processes

no code implementations • 2 Apr 2020 • Ran Wang, Kun Tao, Dingjie Song, Zhilong Zhang, Xiao Ma, Xi'ao Su, Xin-yu Dai

Existing question answering systems can only predict answers without explicit reasoning processes, which hinders their explainability and leads us to overestimate their ability to understand and reason over natural language.

Question Answering, Reading Comprehension

