Search Results for author: Jiayi Guo

Found 19 papers, 12 papers with code

Impact of Trip Distance Distribution Time Dependency and Aggregation Levels in Bathtub Models -- A Comparative Simulation Analysis

no code implementations14 Dec 2024 Jiayi Guo, Irene Martínez, Gonçalo Correia, Bart van Arem

The emergence of different bathtub models has raised the question of which model can provide more robust and accurate results under different demand scenarios and network properties.

ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis

1 code implementation11 Nov 2024 Zanlin Ni, Yulin Wang, Renping Zhou, Yizeng Han, Jiayi Guo, Zhiyuan Liu, Yuan YAO, Gao Huang

At the spatial level, we disentangle the computations of visible and mask tokens by encoding visible tokens independently, while decoding mask tokens conditioned on the fully encoded visible tokens.

Image Generation

Taming Rectified Flow for Inversion and Editing

1 code implementation7 Nov 2024 Jiangshan Wang, Junfu Pu, Zhongang Qi, Jiayi Guo, Yue Ma, Nisha Huang, Yuxin Chen, Xiu Li, Ying Shan

To address this issue, we propose RF-Solver, a novel training-free sampler that effectively enhances inversion precision by mitigating the errors in the ODE-solving process of rectified flow.

Text-to-Image Generation Video Editing +1

UniAutoML: A Human-Centered Framework for Unified Discriminative and Generative AutoML with Large Language Models

1 code implementation9 Oct 2024 Jiayi Guo, Zan Chen, Yingrui Ji, Liyun Zhang, Daqin Luo, Zhigang Li, Yiqin Shen

Additionally, these frameworks lack interpretability and user engagement during the training process, primarily due to the absence of human-centered design.

AutoML Model Selection

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

1 code implementation31 Aug 2024 Zanlin Ni, Yulin Wang, Renping Zhou, Rui Lu, Jiayi Guo, Jinyi Hu, Zhiyuan Liu, Yuan YAO, Gao Huang

As a representative work, non-autoregressive Transformers (NATs) are able to synthesize images with decent quality in a small number of steps.

Image Generation Scheduling

On the KL-Divergence-based Robust Satisficing Model

no code implementations17 Aug 2024 Haojie Yan, Minglong Zhou, Jiayi Guo

Empirical risk minimization, a cornerstone in machine learning, is often hindered by the Optimizer's Curse stemming from discrepancies between the empirical and true data-generating distributions. To address this challenge, the robust satisficing framework has emerged recently to mitigate ambiguity in the true distribution.

model

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

1 code implementation11 Aug 2024 Yifan Pu, Zhuofan Xia, Jiayi Guo, Dongchen Han, Qixiu Li, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang, Xiu Li

In response to this observation, we present a novel diffusion transformer framework incorporating an additional set of mediator tokens to engage with queries and keys separately.

Denoising

UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

1 code implementation29 Jul 2024 Chaoqun Du, Yulin Wang, Jiayi Guo, Yizeng Han, Jie zhou, Gao Huang

To this end, we propose a Unified Test-Time Adaptation (UniTTA) benchmark, which is comprehensive and widely applicable.

Test-time Adaptation

COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing

1 code implementation13 Jun 2024 Jiangshan Wang, Yue Ma, Jiayi Guo, Yicheng Xiao, Gao Huang, Xiu Li

Specifically, we propose an efficient sliding-window-based strategy to calculate the similarity among tokens in the diffusion features of source videos, identifying the tokens with high correspondence across frames.

Denoising Video Editing

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

1 code implementation CVPR 2024 Zanlin Ni, Yulin Wang, Renping Zhou, Jiayi Guo, Jinyi Hu, Zhiyuan Liu, Shiji Song, Yuan YAO, Gao Huang

In this paper, we aim to re-evaluate the full potential of NATs by revisiting the design of their training and inference strategies.

Image Generation

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

1 code implementation6 Jun 2024 Jiayi Guo, Junhao Zhao, Chaoqun Du, Yulin Wang, Chunjiang Ge, Zanlin Ni, Shiji Song, Humphrey Shi, Gao Huang

The recently proposed diffusion-driven TTA methods mitigate this by adapting model inputs instead of weights, where an unconditional diffusion model, trained on the source domain, transforms target-domain data into a synthetic domain that is expected to approximate the source domain.

Test-time Adaptation

GRA: Detecting Oriented Objects through Group-wise Rotating and Attention

no code implementations17 Mar 2024 Jiangshan Wang, Yifan Pu, Yizeng Han, Jiayi Guo, Yiru Wang, Xiu Li, Gao Huang

GRA can adaptively capture fine-grained features of objects with diverse orientations, comprising two key components: Group-wise Rotating and Group-wise Attention.

Object object-detection +2

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

1 code implementation CVPR 2024 Jiayi Guo, Xingqian Xu, Yifan Pu, Zanlin Ni, Chaofei Wang, Manushree Vasu, Shiji Song, Gao Huang, Humphrey Shi

Specifically, we introduce Step-wise Variation Regularization to enforce the proportion between the variations of an arbitrary input latent and that of the output image is a constant at any diffusion training step.

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models

1 code implementation CVPR 2024 Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi

Text-to-image (T2I) research has grown explosively in the past year, owing to the large-scale pre-trained diffusion models and many emerging personalization and editing approaches.

Conditional Text-to-Image Synthesis Image Generation +3

Zero-shot Generative Model Adaptation via Image-specific Prompt Learning

1 code implementation CVPR 2023 Jiayi Guo, Chaofei Wang, You Wu, Eric Zhang, Kai Wang, Xingqian Xu, Shiji Song, Humphrey Shi, Gao Huang

Recently, CLIP-guided image synthesis has shown appealing performance on adapting a pre-trained source-domain generator to an unseen target domain.

Diversity Image Generation

Assessing a Single Image in Reference-Guided Image Synthesis

no code implementations8 Dec 2021 Jiayi Guo, Chaoqun Du, Jiangshan Wang, Huijuan Huang, Pengfei Wan, Gao Huang

For Reference-guided Image Synthesis (RIS) tasks, i. e., rendering a source image in the style of another reference image, where assessing the quality of a single generated image is crucial, these metrics are not applicable.

Image Generation

Privacy-Preserving Representation Learning on Graphs: A Mutual Information Perspective

no code implementations3 Jul 2021 Binghui Wang, Jiayi Guo, Ang Li, Yiran Chen, Hai Li

Existing representation learning methods on graphs have achieved state-of-the-art performance on various graph-related tasks such as node classification, link prediction, etc.

Link Prediction Node Classification +2

Meta-Semi: A Meta-learning Approach for Semi-supervised Learning

no code implementations5 Jul 2020 Yulin Wang, Jiayi Guo, Shiji Song, Gao Huang

In this paper, we propose a novel meta-learning based SSL algorithm (Meta-Semi) that requires tuning only one additional hyper-parameter, compared with a standard supervised deep learning algorithm, to achieve competitive performance under various conditions of SSL.

Meta-Learning

DeepObfuscator: Obfuscating Intermediate Representations with Privacy-Preserving Adversarial Learning on Smartphones

no code implementations9 Sep 2019 Ang Li, Jiayi Guo, Huanrui Yang, Flora D. Salim, Yiran Chen

Our experiments on CelebA and LFW datasets show that the quality of the reconstructed images from the obfuscated features of the raw image is dramatically decreased from 0. 9458 to 0. 3175 in terms of multi-scale structural similarity.

General Classification Image Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.