Search Results for author: Zhilong Zhang

Found 8 papers, 2 papers with code

Structure-Guided Adversarial Training of Diffusion Models

no code implementations27 Feb 2024 Ling Yang, Haotian Qian, Zhilong Zhang, Jingwei Liu, Bin Cui

In this pioneering approach, we compel the model to learn manifold structures between samples in each training batch.

Conditional Image Generation Denoising

Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing

1 code implementation26 Feb 2024 Ling Yang, Zhilong Zhang, Zhaochen Yu, Jingwei Liu, Minkai Xu, Stefano Ermon, Bin Cui

To address this issue, we propose a novel and general contextualized diffusion model (ContextDiff) by incorporating the cross-modal context encompassing interactions and alignments between text condition and visual sample into forward and reverse processes.

Text-to-Image Generation Text-to-Video Editing +1

Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving

no code implementations16 Jan 2024 Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo, Changchuan Yin

Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image segmentation features of important objects are transmitted to reduce transmission redundancy.

Autonomous Driving Image Segmentation +2

Identifying Subgroups of ICU Patients Using End-to-End Multivariate Time-Series Clustering Algorithm Based on Real-World Vital Signs Data

no code implementations3 Jun 2023 Tongyue Shi, Zhilong Zhang, Wentie Liu, Junhua Fang, Jianguo Hao, Shuai Jin, Huiying Zhao, Guilan Kong

This study employed the MIMIC-IV database as data source to investigate the use of dynamic, high-frequency, multivariate time-series vital signs data, including temperature, heart rate, mean blood pressure, respiratory rate, and SpO2, monitored first 8 hours data in the ICU stay.

Clustering ICU Mortality +3

Diffusion Models: A Comprehensive Survey of Methods and Applications

2 code implementations2 Sep 2022 Ling Yang, Zhilong Zhang, Yang song, Shenda Hong, Runsheng Xu, Yue Zhao, Yingxia Shao, Wentao Zhang, Bin Cui, Ming-Hsuan Yang

This survey aims to provide a contextualized, in-depth look at the state of diffusion models, identifying the key areas of focus and pointing to potential areas for further exploration.

Image Super-Resolution Text-to-Image Generation +1

A view synthesis-based 360° VR caching system over MEC-Enabled C-RAN

no code implementations1 Oct 2020 Jianmei Dai, Zhilong Zhang, Shiwen Mao, Danpu Liu

If the requested content of a specific view is cached in the BBU pool or RRHs, or can be synthesized with the aid of the cached adjacent views, it is unnecessary to request the content from the remote VR video source server.

Edge-computing

R3: A Reading Comprehension Benchmark Requiring Reasoning Processes

no code implementations2 Apr 2020 Ran Wang, Kun Tao, Dingjie Song, Zhilong Zhang, Xiao Ma, Xi'ao Su, Xin-yu Dai

Existing question answering systems can only predict answers without explicit reasoning processes, which hinder their explainability and make us overestimate their ability of understanding and reasoning over natural language.

Question Answering Reading Comprehension

Cannot find the paper you are looking for? You can Submit a new open access paper.