no code implementations • 12 Jun 2025 • Sixiang Chen, Jianyu Lai, Jialin Gao, Tian Ye, Haoyu Chen, Hengyu Shi, Shitong Shao, Yunlong Lin, Song Fei, Zhaohu Xing, Yeying Jin, Junfeng Luo, Xiaoming Wei, Lei Zhu
Generating aesthetic posters is more challenging than simple design images: it requires not only precise text rendering but also the seamless integration of abstract artistic content, striking layouts, and overall stylistic harmony.
no code implementations • 10 Jun 2025 • Yu Guan, Zhiyu Yin, Haoyu Chen, Sheng Cheng, Chaojie Yang, Kun Qian, Tianyin Xu, Yang Zhang, Hanyu Zhao, Yong Li, Wei Lin, Dennis Cai, Ennan Zhai
In this paper, we present PerfTracker, the first online troubleshooting system utilizing fine-grained profiling, to diagnose performance issues of large-scale model training in production.
no code implementations • 2 Jun 2025 • Sofoklis Kakouros, Haoyu Chen
This study examines the prosodic characteristics associated with winning and losing in post-match tennis interviews.
no code implementations • 29 May 2025 • Haoyu Chen, Keda Tao, Yizao Wang, Xinlei Wang, Lei Zhu, Jinjin Gu
Photo retouching is integral to photographic art, extending far beyond simple technical fixes to heighten emotional expression and narrative depth.
no code implementations • 24 Apr 2025 • XiangRui Zhang, Haoyu Chen, Yongzhong He, Wenjia Niu, Qiang Li
Today's security tools predominantly rely on predefined rules crafted by experts, making them poorly adapted to the emergence of software supply chain attacks.
no code implementations • 22 Apr 2025 • Anirudhan Badrinath, Alex Yang, Kousik Rajesh, Prabhat Agarwal, Jaewon Yang, Haoyu Chen, Jiajing Xu, Charles Rosenberg
This paper presents OmniSage, a large-scale representation framework that learns universal representations for a variety of applications at Pinterest.
no code implementations • 20 Apr 2025 • Jingjing Ren, Wenbo Li, Zhongdao Wang, Haoze Sun, Bangzhen Liu, Haoyu Chen, Jiaqi Xu, Aoxue Li, Shifeng Zhang, Bin Shao, Yong Guo, Lei Zhu
Compared to existing methods, Turbo2K is up to 20$\times$ faster for inference, making high-resolution video generation more scalable and practical for real-world applications.
1 code implementation • CVPR 2025 • Haoyu Chen, Yunqiao Yang, Nan Zhong, Kede Ma
Hiding data using neural networks (i.e., neural steganography) has achieved remarkable success across both discriminative classifiers and generative adversarial networks.
no code implementations • CVPR 2025 • Haoyu Chen, Xiaojie Xu, Wenbo Li, Jingjing Ren, Tian Ye, Songhua Liu, Ying-Cong Chen, Lei Zhu, Xinchao Wang
To train our models, we develop the PosterArt dataset, comprising high-quality artistic posters annotated with layout, typography, and pixel-level stylized text segmentation.
no code implementations • CVPR 2025 • Yan Jiang, Hao Yu, Xu Cheng, Haoyu Chen, Zhaodong Sun, Guoying Zhao
The rationale of L2RW is that integrating decentralized training into VI-ReID can address privacy concerns in scenarios with limited data-sharing regulation.
no code implementations • 10 Feb 2025 • Qingshan Hou, Yukun Zhou, Jocelyn Hui Lin Goh, Ke Zou, Samantha Min Er Yew, Sahana Srinivasan, Meng Wang, Thaddaeus Lo, Xiaofeng Lei, Siegfried K. Wagner, Mark A. Chia, Dawei Yang, Hongyang Jiang, Anran Ran, Rui Santos, Gabor Mark Somfai, Juan Helen Zhou, Haoyu Chen, Qingyu Chen, Carol Yim-Lui Cheung, Pearse A. Keane, Yih Chung Tham
The DINOv2-large model outperformed RETFound in detecting diabetic retinopathy (AUROC=0.850-0.952 vs. 0.823-0.944, across three datasets, all P<=0.007) and multi-class eye diseases (AUROC=0.892 vs. 0.846, P<0.001).
1 code implementation • 14 Jan 2025 • Hui Kuurila-Zhang, Haoyu Chen, Guoying Zhao
Extensive experiments demonstrate that VENOM achieves superior ASR and image quality compared to prior methods, marking a significant advancement in adversarial example generation and providing insights into model vulnerabilities for improved defense development.
no code implementations • 3 Jan 2025 • Yuxin Zhang, Haoyu Chen, Zheng Lin, Zhe Chen, Jin Zhao
By leveraging model partitioning and adopting distinct aggregation strategies for each sub-model, LCFed effectively incorporates global knowledge into intra-cluster co-training, achieving optimal training performance.
no code implementations • CVPR 2025 • Nan Zhong, Haoyu Chen, Yiran Xu, Zhenxing Qian, Xinpeng Zhang
This image set comprises the original image as well as versions that have been subjected to varying levels of noise and subsequently denoised using a pre-trained diffusion model.
no code implementations • CVPR 2025 • Yunlong Lin, Zixu Lin, Haoyu Chen, Panwang Pan, Chenxin Li, Sixiang Chen, Kairun Wen, Yeying Jin, Wenbo Li, Xinghao Ding
Vision-centric perception systems often struggle with unpredictable and coupled weather degradations in the wild.
1 code implementation • CVPR 2025 • Yante Li, Hanwen Qi, Haoyu Chen, Xinlian Liang, Guoying Zhao
In environmental protection, tree monitoring plays an essential role in maintaining and improving ecosystem health.
no code implementations • 10 Dec 2024 • Wufei Ma, Haoyu Chen, Guofeng Zhang, Yu-Cheng Chou, Celso M de Melo, Alan Yuille
We benchmark a wide range of open-sourced and proprietary LMMs, uncovering their limitations in various aspects of 3D awareness, such as height, orientation, location, and multi-object reasoning, as well as their degraded performance on images with uncommon camera viewpoints.
1 code implementation • 10 Oct 2024 • Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero, Jingjing Ren, Sixiang Chen, Haoyu Chen, Lei Zhu
Specifically, we construct a real-world dataset with 85 snowy videos, and then present a Semi-supervised Video Desnowing Network (SemiVDN) equipped with a novel Distribution-driven Contrastive Regularization.
Ranked #2 on Snow Removal on RVSD (using extra training data)
no code implementations • Applied Energy 2024 • Haoyu Chen, Hai Huang, Yong Zheng, Bing Yang
To improve the granularity of load decomposition within an integrated energy system (IES), a novel multiple load forecasting approach is proposed.
no code implementations • 25 Jul 2024 • Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Sixiang Chen, Tian Ye, Renjing Pei, Kaiwen Zhou, Fenglong Song, Lei Zhu
RestoreAgent autonomously assesses the type and extent of degradation in input images and performs restoration through (1) determining the appropriate restoration tasks, (2) optimizing the task sequence, (3) selecting the most suitable models, and (4) executing the restoration.
no code implementations • 18 Jul 2024 • Peibei Cao, Haoyu Chen, Jingzhe Ma, Yu-Chieh Yuan, Zhiyong Xie, Xin Xie, Haiqing Bai, Kede Ma
High dynamic range (HDR) capture and display have seen significant growth in popularity driven by the advancements in technology and increasing consumer demand for superior image quality.
no code implementations • 2 Jul 2024 • Jingjing Ren, Wenbo Li, Haoyu Chen, Renjing Pei, Bin Shao, Yong Guo, Long Peng, Fenglong Song, Lei Zhu
Ultra-high-resolution image generation poses great challenges, such as increased semantic planning complexity and detail synthesis difficulties, alongside substantial training resource demands.
1 code implementation • 13 Jun 2024 • Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham, Dianbo Liu, Wendy Wong, Sahil Thakur, Beau Fenner, Danqi Fang, Siying Liu, Qingyun Liu, Yuqiang Huang, Hongqiang Zeng, Yanda Meng, Yukun Zhou, Zehua Jiang, Minghui Qiu, Changqing Zhang, Xinjian Chen, Sophia Y. Wang, Cecilia S. Lee, Lucia Sobrin, Carol Y Cheung, Chi Pui Pang, Pearse A. Keane, Ching-Yu Cheng, Haoyu Chen, Huazhu Fu
In zero-shot scenarios, RetiZero achieves Top-5 accuracies of 0.843 for 15 diseases and 0.756 for 52 diseases.
1 code implementation • 28 May 2024 • Ke Zou, Tian Lin, Zongbo Han, Meng Wang, Xuedong Yuan, Haoyu Chen, Changqing Zhang, Xiaojing Shen, Huazhu Fu
In this study, we propose a novel multi-modality evidential fusion pipeline for eye disease screening.
1 code implementation • 17 May 2024 • Ruibo Wang, Song Zhang, Ping Huang, Donghai Zhang, Haoyu Chen
Accurately reconstructing road surfaces is pivotal for various applications, especially in autonomous driving.
1 code implementation • 15 Apr 2024 • Alexander Vedernikov, Puneet Kumar, Haoyu Chen, Tapio Seppanen, Xiaobai Li
Engagement analysis finds various applications in healthcare, education, advertising, and services.
no code implementations • CVPR 2024 • Haoyu Chen, Hao Tang, Ehsan Adeli, Guoying Zhao
This work is driven by the intuition that the robustness of the model can be enhanced by introducing adversarial samples into the training, leading to a model that is more robust to noisy inputs and that can even be extended to directly handle real-world data such as raw point clouds/scans without intermediate processing.
no code implementations • 25 Mar 2024 • Yuxin Zhang, Haoyu Chen, Zheng Lin, Zhe Chen, Jin Zhao
Clustered federated learning (CFL) is proposed to mitigate the performance deterioration stemming from data heterogeneity in federated learning (FL) by grouping similar clients for cluster-wise model training.
no code implementations • CVPR 2024 • Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Haoze Sun, Xueyi Zou, Zhensong Zhang, Youliang Yan, Lei Zhu
Leveraging unseen LR images for self-supervised learning guides the model to adapt its modeling space to the target domain, facilitating fine-tuning of SR models without requiring paired high-resolution (HR) images.
no code implementations • 26 Dec 2023 • Jingjing Ren, Cheng Xu, Haoyu Chen, Xinran Qin, Lei Zhu
Recent progress in multi-modal conditioned face synthesis has enabled the creation of visually striking and accurately aligned facial images.
1 code implementation • CVPR 2024 • Haoze Sun, Wenbo Li, Jianzhuang Liu, Haoyu Chen, Renjing Pei, Xueyi Zou, Youliang Yan, Yujiu Yang
We achieve this by marrying image appearance and language understanding to generate a cognitive embedding, which not only activates prior information from large text-to-image diffusion models but also facilitates the generation of high-quality reference images to optimize the SR process.
no code implementations • 29 May 2023 • Ruofan Zhang, Jinjin Gu, Haoyu Chen, Chao Dong, Yulun Zhang, Wenming Yang
In this work, we introduce a novel approach to craft training degradation distributions using a small set of reference images.
no code implementations • 8 Apr 2023 • Meng Wang, Tian Lin, Lianyu Wang, Aidi Lin, Ke Zou, Xinxing Xu, Yi Zhou, Yuanyuan Peng, Qingquan Meng, Yiming Qian, Guoyao Deng, Zhiqun Wu, Junhong Chen, Jianhong Lin, Mingzhi Zhang, Weifang Zhu, Changqing Zhang, Daoqiang Zhang, Rick Siow Mong Goh, Yong Liu, Chi Pui Pang, Xinjian Chen, Haoyu Chen, Huazhu Fu
Failure to recognize samples from classes unseen during training is a major limitation of artificial intelligence in real-world recognition and classification of retinal anomalies.
1 code implementation • CVPR 2023 • Haoyu Chen, Zhihua Wang, Yang Yang, Qilin Sun, Kede Ma
Most well-established and widely used color difference (CD) metrics are handcrafted and subject-calibrated against uniformly colored patches, which do not generalize well to photographic images characterized by natural scene complexities.
1 code implementation • CVPR 2023 • Haoyu Chen, Jinjin Gu, Yihao Liu, Salma Abdel Magid, Chao Dong, Qiong Wang, Hanspeter Pfister, Lei Zhu
To address this issue, we present a novel approach to enhance the generalization performance of denoising networks, known as masked training.
1 code implementation • 17 Mar 2023 • Ke Zou, Tian Lin, Xuedong Yuan, Haoyu Chen, Xiaojing Shen, Meng Wang, Huazhu Fu
To address this issue, we introduce a novel multimodality evidential fusion pipeline for eye disease screening, EyeMoSt, which provides a measure of confidence for unimodality and elegantly integrates the multimodality information from a multi-distribution fusion perspective.
no code implementations • 3 Mar 2023 • Jinsheng Wei, Haoyu Chen, Guanming Lu, Jingjie Yan, Yue Xie, Guoying Zhao
To solve this issue, driven by the prior information that the category of an ME can be inferred from the relationship between the actions of different facial components, this work designs a novel model that conforms to this prior information and learns ME movement features in an interpretable way.
Graph Representation Learning
Micro Expression Recognition
1 code implementation • 24 Jan 2023 • Yawen Cui, Wanxia Deng, Haoyu Chen, Li Liu
Given a model well-trained with a large-scale base dataset, Few-Shot Class-Incremental Learning (FSCIL) aims at incrementally learning novel classes from a few labeled samples by avoiding overfitting, without catastrophically forgetting previously encountered classes.
class-incremental learning
Few-Shot Class-Incremental Learning
1 code implementation • ICCV 2023 • Haoyu Chen, Jingjing Ren, Jinjin Gu, Hongtao Wu, Xuequan Lu, Haoming Cai, Lei Zhu
We also develop a deep learning framework for video snow removal.
Ranked #4 on Snow Removal on RVSD
no code implementations • 28 Nov 2022 • Tu Trinh, Haoyu Chen, Daniel S. Brown
We evaluate our approach in simulation for both discrete and continuous state-space domains and illustrate the feasibility of developing a robotic system that can accurately evaluate demonstration sufficiency.
no code implementations • 5 Oct 2022 • Haoyu Chen, Linqi Song, Zhenxing Qian, Xinpeng Zhang, Kede Ma
As an instantiation, we adopt a SinGAN, a pyramid of generative adversarial networks (GANs), to learn the patch distribution of one cover image.
no code implementations • 2 Jul 2022 • Xiaocheng Tang, Soheil Sadeghi Eshkevari, Haoyu Chen, Weidan Wu, Wei Qian, Xiaoming Wang
Transformers have enabled breakthroughs in NLP and computer vision, and have recently begun to show promising performance in trajectory prediction for autonomous vehicles (AVs).
no code implementations • 24 May 2022 • Paul Baltescu, Haoyu Chen, Nikil Pancha, Andrew Zhai, Jure Leskovec, Charles Rosenberg
Learned embeddings for products are an important building block for web-scale e-commerce recommendation systems.
no code implementations • 10 May 2022 • Qiujing Lu, Weiqiao Han, Jeffrey Ling, Minfa Wang, Haoyu Chen, Balakrishnan Varadarajan, Paul Covington
Predicting future trajectories of road agents is a critical task for autonomous driving.
no code implementations • 25 Feb 2022 • Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh
Machine learning has become more important in real-life decision-making, but people are concerned about the ethical problems it may bring when used improperly.
1 code implementation • 14 Dec 2021 • Haoyu Chen, Hao Tang, Zitong Yu, Nicu Sebe, Guoying Zhao
Specifically, we propose a novel geometry-contrastive Transformer that efficiently perceives, in a 3D-structured manner, the global geometric inconsistencies across the given meshes.
1 code implementation • 20 Oct 2021 • Haoyu Chen, Hao Tang, Nicu Sebe, Guoying Zhao
Instead, we introduce AniFormer, a novel Transformer-based architecture that generates animated 3D sequences by directly taking the raw driving sequences and arbitrary same-type target meshes as inputs.
1 code implementation • ICCV 2021 • Haoyu Chen, Hao Tang, Henglin Shi, Wei Peng, Nicu Sebe, Guoying Zhao
With the strength of deep generative models, 3D pose transfer has regained intensive research interest in recent years.
1 code implementation • CVPR 2021 • Xin Liu, Henglin Shi, Haoyu Chen, Zitong Yu, Xiaobai Li, Guoying Zhao
We introduce a new dataset for emotional artificial intelligence research: the identity-free video dataset for Micro-Gesture Understanding and Emotion analysis (iMiGUE).
2 code implementations • 19 Apr 2021 • Haoyu Chen, Jinjin Gu, Zhi Zhang
In this work, we attempt to quantify and visualize attention mechanisms in SISR and show that not all attention modules are equally beneficial.
no code implementations • 1 Jan 2021 • Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh
Machine learning has become more important in real-life decision-making, but people are concerned about the ethical problems it may bring when used improperly.
no code implementations • 30 Nov 2020 • Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy Ren, Chao Dong
To answer the questions and promote the development of IQA methods, we contribute a large-scale IQA dataset, called Perceptual Image Processing ALgorithms (PIPAL) dataset.
1 code implementation • 14 Oct 2020 • Haoyu Chen, Wenbin Lu, Rui Song
Focusing on the statistical inference of online decision making, we establish the asymptotic normality of the parameter estimator produced by our algorithm and the online inverse probability weighted value estimator we used to estimate the optimal value.
no code implementations • 14 Oct 2020 • Haoyu Chen, Wenbin Lu, Rui Song
Based on the properties of the parameter estimators, we further show that the in-sample inverse propensity weighted value estimator is asymptotically normal.
no code implementations • 14 Sep 2020 • Dario Fuoli, Zhiwu Huang, Shuhang Gu, Radu Timofte, Arnau Raventos, Aryan Esfandiari, Salah Karout, Xuan Xu, Xin Li, Xin Xiong, Jinge Wang, Pablo Navarrete Michelini, Wen-Hao Zhang, Dongyang Zhang, Hanwei Zhu, Dan Xia, Haoyu Chen, Jinjin Gu, Zhi Zhang, Tongtong Zhao, Shanshan Zhao, Kazutoshi Akita, Norimichi Ukita, Hrishikesh P. S, Densen Puthussery, Jiji C. V
Missing information can be restored well in this region, especially in HR videos, where the high-frequency content mostly consists of texture details.
1 code implementation • 21 Aug 2020 • Zitong Yu, Benjia Zhou, Jun Wan, Pichao Wang, Haoyu Chen, Xin Liu, Stan Z. Li, Guoying Zhao
Gesture recognition has attracted considerable attention owing to its great potential in applications.
no code implementations • 10 Aug 2020 • Haoyu Chen, Zitong Yu, Xin Liu, Wei Peng, Yoon Lee, Guoying Zhao
To address the problem of training on small datasets for action recognition tasks, most prior works are either based on a large number of training samples or require pre-trained models transferred from other large datasets to tackle overfitting problems.
no code implementations • ECCV 2020 • Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy Ren, Chao Dong
To answer these questions and promote the development of IQA methods, we contribute a large-scale IQA dataset, called Perceptual Image Processing Algorithms (PIPAL) dataset.
1 code implementation • 11 Nov 2019 • Wei Peng, Xiaopeng Hong, Haoyu Chen, Guoying Zhao
Human action recognition from skeleton data, fueled by the Graph Convolutional Network (GCN), has attracted considerable attention due to its powerful capability of modeling non-Euclidean structured data.
no code implementations • 6 Sep 2018 • Jinjin Gu, Haoyu Chen, Guolong Liu, Gaoqi Liang, Xinlei Wang, Junhua Zhao
In this paper, we present the problem formulation and methodology framework of Super-Resolution Perception (SRP) on industrial sensor data.
no code implementations • 19 Oct 2016 • Daniel Seita, Xinlei Pan, Haoyu Chen, John Canny
We present a novel Metropolis-Hastings method for large datasets that uses small expected-size minibatches of data.
no code implementations • 19 Nov 2015 • Daniel Seita, Haoyu Chen, John Canny
A fundamental task in machine learning and related fields is to perform inference on Bayesian networks.
1 code implementation • 5 Jul 2015 • Hang Su, Haoyu Chen
Data is partitioned and distributed to different nodes for local model updates, and model averaging across nodes is done every few minibatches.