Search Results for author: Haoyu Chen

Found 63 papers, 23 papers with code

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

no code implementations12 Jun 2025 Sixiang Chen, Jianyu Lai, Jialin Gao, Tian Ye, Haoyu Chen, Hengyu Shi, Shitong Shao, Yunlong Lin, Song Fei, Zhaohu Xing, Yeying Jin, Junfeng Luo, Xiaoming Wei, Lei Zhu

Generating aesthetic posters is more challenging than simple design images: it requires not only precise text rendering but also the seamless integration of abstract artistic content, striking layouts, and overall stylistic harmony.

PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production

no code implementations10 Jun 2025 Yu Guan, Zhiyu Yin, Haoyu Chen, Sheng Cheng, Chaojie Yang, Kun Qian, Tianyin Xu, Yang Zhang, Hanyu Zhao, Yong Li, Wei Lin, Dennis Cai, Ennan Zhai

In this paper, we present PerfTracker, the first online troubleshooting system utilizing fine-grained profiling, to diagnose performance issues of large-scale model training in production.

Diagnostic

Sounding Like a Winner? Prosodic Differences in Post-Match Interviews

no code implementations2 Jun 2025 Sofoklis Kakouros, Haoyu Chen

This study examines the prosodic characteristics associated with winning and losing in post-match tennis interviews.

Self-Supervised Learning

PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents

no code implementations29 May 2025 Haoyu Chen, Keda Tao, Yizao Wang, Xinlei Wang, Lei Zhu, Jinjin Gu

Photo retouching is integral to photographic art, extending far beyond simple technical fixes to heighten emotional expression and narrative depth.

Language Modeling Language Modelling +1

Automatically Generating Rules of Malicious Software Packages via Large Language Model

no code implementations24 Apr 2025 XiangRui Zhang, Haoyu Chen, Yongzhong He, Wenjia Niu, Qiang Li

Today's security tools predominantly rely on predefined rules crafted by experts, making them poorly adapted to the emergence of software supply chain attacks.

Language Modeling Language Modelling +1

Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

no code implementations20 Apr 2025 Jingjing Ren, Wenbo Li, Zhongdao Wang, Haoze Sun, Bangzhen Liu, Haoyu Chen, Jiaqi Xu, Aoxue Li, Shifeng Zhang, Bin Shao, Yong Guo, Lei Zhu

Compared to existing methods, Turbo2K is up to 20$\times$ faster for inference, making high-resolution video generation more scalable and practical for real-world applications.

2k Knowledge Distillation +2

Hiding Images in Diffusion Models by Editing Learned Score Functions

1 code implementation CVPR 2025 Haoyu Chen, Yunqiao Yang, Nan Zhong, Kede Ma

Hiding data using neural networks (i. e., neural steganography) has achieved remarkable success across both discriminative classifiers and generative adversarial networks.

Denoising parameter-efficient fine-tuning

POSTA: A Go-to Framework for Customized Artistic Poster Generation

no code implementations CVPR 2025 Haoyu Chen, Xiaojie Xu, Wenbo Li, Jingjing Ren, Tian Ye, Songhua Liu, Ying-Cong Chen, Lei Zhu, Xinchao Wang

To train our models, we develop the PosterArt dataset, comprising high-quality artistic posters annotated with layout, typography, and pixel-level stylized text segmentation.

Text Segmentation

From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification

no code implementations CVPR 2025 Yan Jiang, Hao Yu, Xu Cheng, Haoyu Chen, Zhaodong Sun, Guoying Zhao

The rationale of L2RW is that integrating decentralized training into VI-ReID can address privacy concerns in scenarios with limited data-sharing regulation.

Person Re-Identification

VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models

1 code implementation14 Jan 2025 Hui Kuurila-Zhang, Haoyu Chen, Guoying Zhao

Extensive experiments demonstrate that VENOM achieves superior ASR and image quality compared to prior methods, marking a significant advancement in adversarial example generation and providing insights into model vulnerabilities for improved defense development.

LCFed: An Efficient Clustered Federated Learning Framework for Heterogeneous Data

no code implementations3 Jan 2025 Yuxin Zhang, Haoyu Chen, Zheng Lin, Zhe Chen, Jin Zhao

By leveraging model partitioning and adopting distinct aggregation strategies for each sub-model, LCFed effectively incorporates global knowledge into intra-cluster co-training, achieving optimal training performance.

Clustering Computational Efficiency +1

Beyond Generation: A Diffusion-based Low-level Feature Extractor for Detecting AI-generated Images

no code implementations CVPR 2025 Nan Zhong, Haoyu Chen, Yiran Xu, Zhenxing Qian, Xinpeng Zhang

This image set comprises the original image as well as versions that have been subjected to varying levels of noise and subsequently denoised using a pre-trained diffusion model.

Denoising Image Generation +1

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

no code implementations10 Dec 2024 Wufei Ma, Haoyu Chen, Guofeng Zhang, Yu-Cheng Chou, Celso M de Melo, Alan Yuille

We benchmark a wide range of open-sourced and proprietary LMMs, uncovering their limitations in various aspects of 3D awareness, such as height, orientation, location, and multi-object reasoning, as well as their degraded performance on images with uncommon camera viewpoints.

Autonomous Navigation Spatial Reasoning +1

Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization

1 code implementation10 Oct 2024 Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero, Jingjing Ren, Sixiang Chen, Haoyu Chen, Lei Zhu

Specifically, we construct a real-world dataset with 85 snowy videos, and then present a Semi-supervised Video Desnowing Network (SemiVDN) equipped by a novel Distribution-driven Contrastive Regularization.

Ranked #2 on Snow Removal on RVSD (using extra training data)

Snow Removal

RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models

no code implementations25 Jul 2024 Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Sixiang Chen, Tian Ye, Renjing Pei, Kaiwen Zhou, Fenglong Song, Lei Zhu

RestoreAgent autonomously assesses the type and extent of degradation in input images and performs restoration through (1) determining the appropriate restoration tasks, (2) optimizing the task sequence, (3) selecting the most suitable models, and (4) executing the restoration.

Image Restoration Low-Light Image Enhancement

Learned HDR Image Compression for Perceptually Optimal Storage and Display

no code implementations18 Jul 2024 Peibei Cao, Haoyu Chen, Jingzhe Ma, Yu-Chieh Yuan, Zhiyong Xie, Xin Xie, Haiqing Bai, Kede Ma

High dynamic range (HDR) capture and display have seen significant growth in popularity driven by the advancements in technology and increasing consumer demand for superior image quality.

Image Compression Image Reconstruction

UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

no code implementations2 Jul 2024 Jingjing Ren, Wenbo Li, Haoyu Chen, Renjing Pei, Bin Shao, Yong Guo, Long Peng, Fenglong Song, Lei Zhu

Ultra-high-resolution image generation poses great challenges, such as increased semantic planning complexity and detail synthesis difficulties, alongside substantial training resource demands.

Computational Efficiency Denoising +1

NeRO: Neural Road Surface Reconstruction

1 code implementation17 May 2024 Ruibo Wang, Song Zhang, Ping Huang, Donghai Zhang, Haoyu Chen

Accurately reconstructing road surfaces is pivotal for various applications especially in autonomous driving.

Autonomous Driving Position +1

Towards Robust 3D Pose Transfer with Adversarial Learning

no code implementations CVPR 2024 Haoyu Chen, Hao Tang, Ehsan Adeli, Guoying Zhao

This work is driven by the intuition that the robustness of the model can be enhanced by introducing adversarial samples into the training, leading to a more invulnerable model to the noisy inputs, which even can be further extended to directly handling the real-world data like raw point clouds/scans without intermediate processing.

3D Generation Pose Transfer

FedAC: An Adaptive Clustered Federated Learning Framework for Heterogeneous Data

no code implementations25 Mar 2024 Yuxin Zhang, Haoyu Chen, Zheng Lin, Zhe Chen, Jin Zhao

Clustered federated learning (CFL) is proposed to mitigate the performance deterioration stemming from data heterogeneity in federated learning (FL) by grouping similar clients for cluster-wise model training.

Dimensionality Reduction Federated Learning

Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

no code implementations CVPR 2024 Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Haoze Sun, Xueyi Zou, Zhensong Zhang, Youliang Yan, Lei Zhu

Leveraging unseen LR images for self-supervised learning guides the model to adapt its modeling space to the target domain, facilitating fine-tuning of SR models without requiring paired high-resolution (HR) images.

Image Super-Resolution Self-Supervised Learning

Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis

no code implementations26 Dec 2023 Jingjing Ren, Cheng Xu, Haoyu Chen, Xinran Qin, Lei Zhu

Recent progress in multi-modal conditioned face synthesis has enabled the creation of visually striking and accurately aligned facial images.

Denoising Face Generation

CoSeR: Bridging Image and Language for Cognitive Super-Resolution

1 code implementation CVPR 2024 Haoze Sun, Wenbo Li, Jianzhuang Liu, Haoyu Chen, Renjing Pei, Xueyi Zou, Youliang Yan, Yujiu Yang

We achieve this by marrying image appearance and language understanding to generate a cognitive embedding, which not only activates prior information from large text-to-image diffusion models but also facilitates the generation of high-quality reference images to optimize the SR process.

Super-Resolution

Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution

no code implementations29 May 2023 Ruofan Zhang, Jinjin Gu, Haoyu Chen, Chao Dong, Yulun Zhang, Wenming Yang

In this work, we introduce a novel approach to craft training degradation distributions using a small set of reference images.

Super-Resolution

Learning a Deep Color Difference Metric for Photographic Images

1 code implementation CVPR 2023 Haoyu Chen, Zhihua Wang, Yang Yang, Qilin Sun, Kede Ma

Most well-established and widely used color difference (CD) metrics are handcrafted and subject-calibrated against uniformly colored patches, which do not generalize well to photographic images characterized by natural scene complexities.

Masked Image Training for Generalizable Deep Image Denoising

1 code implementation CVPR 2023 Haoyu Chen, Jinjin Gu, Yihao Liu, Salma Abdel Magid, Chao Dong, Qiong Wang, Hanspeter Pfister, Lei Zhu

To address this issue, we present a novel approach to enhance the generalization performance of denoising networks, known as masked training.

Deep Learning Image Denoising

Reliable Multimodality Eye Disease Screening via Mixture of Student's t Distributions

1 code implementation17 Mar 2023 Ke Zou, Tian Lin, Xuedong Yuan, Haoyu Chen, Xiaojing Shen, Meng Wang, Huazhu Fu

To address this issue, we introduce a novel multimodality evidential fusion pipeline for eye disease screening, EyeMoSt, which provides a measure of confidence for unimodality and elegantly integrates the multimodality information from a multi-distribution fusion perspective.

Decision Making

Prior Information based Decomposition and Reconstruction Learning for Micro-Expression Recognition

no code implementations3 Mar 2023 Jinsheng Wei, Haoyu Chen, Guanming Lu, Jingjie Yan, Yue Xie, Guoying Zhao

To solve this issue, driven by the prior information that the category of ME can be inferred by the relationship between the actions of facial different components, this work designs a novel model that can conform to this prior information and learn ME movement features in an interpretable way.

Graph Representation Learning Micro Expression Recognition +1

Uncertainty-Aware Distillation for Semi-Supervised Few-Shot Class-Incremental Learning

1 code implementation24 Jan 2023 Yawen Cui, Wanxia Deng, Haoyu Chen, Li Liu

Given a model well-trained with a large-scale base dataset, Few-Shot Class-Incremental Learning (FSCIL) aims at incrementally learning novel classes from a few labeled samples by avoiding overfitting, without catastrophically forgetting all encountered classes previously.

class-incremental learning Few-Shot Class-Incremental Learning +2

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning

no code implementations28 Nov 2022 Tu Trinh, Haoyu Chen, Daniel S. Brown

We evaluate our approach in simulation for both discrete and continuous state-space domains and illustrate the feasibility of developing a robotic system that can accurately evaluate demonstration sufficiency.

Active Learning reinforcement-learning +2

Hiding Images in Deep Probabilistic Models

no code implementations5 Oct 2022 Haoyu Chen, Linqi Song, Zhenxing Qian, Xinpeng Zhang, Kede Ma

As an instantiation, we adopt a SinGAN, a pyramid of generative adversarial networks (GANs), to learn the patch distribution of one cover image.

Golfer: Trajectory Prediction with Masked Goal Conditioning MnM Network

no code implementations2 Jul 2022 Xiaocheng Tang, Soheil Sadeghi Eshkevari, Haoyu Chen, Weidan Wu, Wei Qian, Xiaoming Wang

Transformers have enabled breakthroughs in NLP and computer vision, and have recently began to show promising performance in trajectory prediction for Autonomous Vehicle (AV).

motion prediction Prediction +1

On Learning and Testing of Counterfactual Fairness through Data Preprocessing

no code implementations25 Feb 2022 Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh

Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly.

BIG-bench Machine Learning counterfactual +2

Geometry-Contrastive Transformer for Generalized 3D Pose Transfer

1 code implementation14 Dec 2021 Haoyu Chen, Hao Tang, Zitong Yu, Nicu Sebe, Guoying Zhao

Specifically, we propose a novel geometry-contrastive Transformer that has an efficient 3D structured perceiving ability to the global geometric inconsistencies across the given meshes.

Pose Transfer

AniFormer: Data-driven 3D Animation with Transformer

1 code implementation20 Oct 2021 Haoyu Chen, Hao Tang, Nicu Sebe, Guoying Zhao

Instead, we introduce AniFormer, a novel Transformer-based architecture, that generates animated 3D sequences by directly taking the raw driving sequences and arbitrary same-type target meshes as inputs.

regression

iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis

1 code implementation CVPR 2021 Xin Liu, Henglin Shi, Haoyu Chen, Zitong Yu, Xiaobai Li, Guoying Zhaoz?

We introduce a new dataset for the emotional artificial intelligence research: identity-free video dataset for Micro-Gesture Understanding and Emotion analysis (iMiGUE).

Emotion Recognition

Attention in Attention Network for Image Super-Resolution

2 code implementations19 Apr 2021 Haoyu Chen, Jinjin Gu, Zhi Zhang

In this work, we attempt to quantify and visualize attention mechanisms in SISR and show that not all attention modules are equally beneficial.

Image Super-Resolution

Counterfactual Fairness through Data Preprocessing

no code implementations1 Jan 2021 Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh

Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly.

BIG-bench Machine Learning counterfactual +2

Image Quality Assessment for Perceptual Image Restoration: A New Dataset, Benchmark and Metric

no code implementations30 Nov 2020 Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy Ren, Chao Dong

To answer the questions and promote the development of IQA methods, we contribute a large-scale IQA dataset, called Perceptual Image Processing ALgorithms (PIPAL) dataset.

Image Quality Assessment Image Restoration

Statistical Inference for Online Decision Making via Stochastic Gradient Descent

1 code implementation14 Oct 2020 Haoyu Chen, Wenbin Lu, Rui Song

Focusing on the statistical inference of online decision making, we establish the asymptotic normality of the parameter estimator produced by our algorithm and the online inverse probability weighted value estimator we used to estimate the optimal value.

Decision Making

Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting

no code implementations14 Oct 2020 Haoyu Chen, Wenbin Lu, Rui Song

Based on the properties of the parameter estimators, we further show that the in-sample inverse propensity weighted value estimator is asymptotically normal.

Decision Making

2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework

no code implementations10 Aug 2020 Haoyu Chen, Zitong Yu, Xin Liu, Wei Peng, Yoon Lee, Guoying Zhao

To address the problem of training on small datasets for action recognition tasks, most prior works are either based on a large number of training samples or require pre-trained models transferred from other large datasets to tackle overfitting problems.

Action Recognition Optical Flow Estimation

PIPAL: a Large-Scale Image Quality Assessment Dataset for Perceptual Image Restoration

no code implementations ECCV 2020 Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy Ren, Chao Dong

To answer these questions and promote the development of IQA methods, we contribute a large-scale IQA dataset, called Perceptual Image Processing Algorithms (PIPAL) dataset.

Image Quality Assessment Image Restoration +1

Learning Graph Convolutional Network for Skeleton-based Human Action Recognition by Neural Searching

1 code implementation11 Nov 2019 Wei Peng, Xiaopeng Hong, Haoyu Chen, Guoying Zhao

Human action recognition from skeleton data, fueled by the Graph Convolutional Network (GCN), has attracted lots of attention, due to its powerful capability of modeling non-Euclidean structure data.

Action Recognition Neural Architecture Search +1

Super-Resolution Perception for Industrial Sensor Data

no code implementations6 Sep 2018 Jinjin Gu, Haoyu Chen, Guolong Liu, Gaoqi Liang, Xinlei Wang, Junhua Zhao

In this paper, we present the problem formulation and methodology framework of Super-Resolution Perception (SRP) on industrial sensor data.

Fault Detection Super-Resolution

An Efficient Minibatch Acceptance Test for Metropolis-Hastings

no code implementations19 Oct 2016 Daniel Seita, Xinlei Pan, Haoyu Chen, John Canny

We present a novel Metropolis-Hastings method for large datasets that uses small expected-size minibatches of data.

Fast Parallel SAME Gibbs Sampling on General Discrete Bayesian Networks

no code implementations19 Nov 2015 Daniel Seita, Haoyu Chen, John Canny

A fundamental task in machine learning and related fields is to perform inference on Bayesian networks.

Experiments on Parallel Training of Deep Neural Network using Model Averaging

1 code implementation5 Jul 2015 Hang Su, Haoyu Chen

Data is partitioned and distributed to different nodes for local model updates, and model averaging across nodes is done every few minibatches.

Cannot find the paper you are looking for? You can Submit a new open access paper.