Search Results for author: Yeying Jin

Found 25 papers, 13 papers with code

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

no code implementations12 Jun 2025 Sixiang Chen, Jianyu Lai, Jialin Gao, Tian Ye, Haoyu Chen, Hengyu Shi, Shitong Shao, Yunlong Lin, Song Fei, Zhaohu Xing, Yeying Jin, Junfeng Luo, Xiaoming Wei, Lei Zhu

Generating aesthetic posters is more challenging than simple design images: it requires not only precise text rendering but also the seamless integration of abstract artistic content, striking layouts, and overall stylistic harmony.

Fast or Slow? Integrating Fast Intuition and Deliberate Thinking for Enhancing Visual Question Answering

no code implementations1 Jun 2025 Songtao Jiang, Chenyi Zhou, Yan Zhang, Yeying Jin, Zuozhu Liu

Multimodal large language models (MLLMs) still struggle with complex reasoning tasks in Visual Question Answering (VQA).

All MME +2

MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

1 code implementation29 May 2025 Siyuan Wang, Jiawei Liu, Wei Wang, Yeying Jin, Jinsong Du, Zhi Han

Specifically, we propose a Motion Mask-Guided Two-Stage Network (MMGT) that uses audio, as well as motion masks and motion features generated from the audio signal to jointly drive the generation of synchronized speech gesture videos.

Motion Generation Video Generation

UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache

no code implementations20 May 2025 Pu Wang, Pengwen Dai, Chen Wu, Yeying Jin, Dianjie Lu, Guijuan Zhang, Youshan Zhang, Zhuoran Zheng

In this paper, we propose an efficient visual transformer framework for ultra-high-definition (UHD) image dehazing that addresses the key challenges of slow training speed and high memory consumption for existing methods.

4k 8k +3

Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation

no code implementations17 May 2025 Zhiying Li, GuangGang Geng, Yeying Jin, Zhizhi Guo, Bruce Gu, Jidong Huo, Zhaoxin Fan, Wenjun Wu

These findings underscore the urgent need to address and mitigate security risks associated with digital human generation systems.

Pose Estimation

DSDNet: Raw Domain Demoiréing via Dual Color-Space Synergy

no code implementations22 Apr 2025 Qirui Yang, Fangpu Zhang, Yeying Jin, Qihua Cheng, PengTao Jiang, Huanjing Yue, Jingyu Yang

To address these limitations, we propose a single-stage raw domain demoir\'eing framework, Dual-Stream Demoir\'eing Network (DSDNet), which leverages the synergy of raw and YCbCr images to remove moir\'e while preserving luminance and color fidelity.

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

1 code implementation17 Apr 2025 Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, YuFei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, YuTing Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, YuBo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, XinCao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun, Subhajit Paul, Ni Tang, Junhao Huang, Zihan Cheng, Hongyun Zhu, Yuehan Wu, Kaixin Deng, Hang Ouyang, Tianxin Xiao, Fan Yang, Zhizun Luo, Zeyu Xiao, Zhuoyuan Li, Nguyen Pham Hoang Le, An Dinh Thien, Son T. Luu, Kiet Van Nguyen, Ronghua Xu, Xianmin Tian, Weijian Zhou, Jiacheng Zhang, Yuqian Chen, Yihang Duan, Yujie Wu, Suresh Raikwar, Arsh Garg, Kritika, Jianhua Zheng, Xiaoshan Ma, Ruolin Zhao, Yongyu Yang, Yongsheng Liang, Guiming Huang, Qiang Li, Hongbin Zhang, Xiangyu Zheng, A. N. Rajagopalan

This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images.

Raindrop Removal Rain Removal +1

Modality-Fair Preference Optimization for Trustworthy MLLM Alignment

no code implementations20 Oct 2024 Songtao Jiang, Yan Zhang, Ruizhe Chen, Yeying Jin, Zuozhu Liu

To address this, we propose Modality-Fair Preference Optimization (MFPO) to balance text and image preferences.

Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation

1 code implementation29 Sep 2024 Xiaofeng Cong, Jing Zhang, Yeying Jin, JunMing Hou, Yu Zhao, Jie Gui, James Tin-Yau Kwok, Yuan Yan Tang

ColorCode offers three key features: 1) color enhancement, producing an enhanced image with a fixed color; 2) color adaptation, enabling controllable adjustments of long-wavelength color components using guidance images; and 3) color interpolation, allowing for the smooth generation of multiple colors through continuous sampling of the color code.

Image Enhancement

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

no code implementations18 Jul 2024 Xin Li, Bingchen Li, Yeying Jin, Cuiling Lan, Hanxin Zhu, Yulin Ren, Zhibo Chen

Compressed Image Super-resolution (CSR) aims to simultaneously super-resolve the compressed images and tackle the challenging hybrid distortions caused by compression.

Compressed Image Super-resolution Image Super-Resolution +1

Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models

2 code implementations16 Apr 2024 Songtao Jiang, Tuo Zheng, Yan Zhang, Yeying Jin, Li Yuan, Zuozhu Liu

Recent advancements in general-purpose or domain-specific multimodal large language models (LLMs) have witnessed remarkable progress for medical decision-making.

image-classification Image Classification +3

Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models

1 code implementation6 Apr 2024 Songtao Jiang, Yan Zhang, Chenyi Zhou, Yeying Jin, Yang Feng, Jian Wu, Zuozhu Liu

In this paper, we present a novel approach, Joint Visual and Text Prompting (VTPrompt), that employs fine-grained visual information to enhance the capability of MLLMs in VQA, especially for object-oriented perception.

MME Object +2

How Powerful Potential of Attention on Image Restoration?

no code implementations15 Mar 2024 Cong Wang, Jinshan Pan, Yeying Jin, Liyan Wang, Wei Wang, Gang Fu, Wenqi Ren, Xiaochun Cao

Our designs provide a closer look at the attention mechanism and reveal that some simple operations can significantly affect the model performance.

Image Restoration

NightHaze: Nighttime Image Dehazing via Self-Prior Learning

no code implementations12 Mar 2024 Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Robby T. Tan

By increasing the noise values to approach as high as the pixel intensity values of the glow and light effect blended images, our augmentation becomes severe, resulting in stronger priors.

Image Dehazing Image Enhancement

SeD: Semantic-Aware Discriminator for Image Super-Resolution

1 code implementation CVPR 2024 Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, Zhibo Chen

In particular, one discriminator is utilized to enable the SR network to learn the distribution of real-world high-quality images in an adversarial training manner.

Image Super-Resolution

Polyp-DAM: Polyp segmentation via depth anything model

no code implementations3 Feb 2024 Zhuoran Zheng, Chen Wu, Wei Wang, Yeying Jin, Xiuyi Jia

In this paper, we unfold a new perspective on polyp segmentation modeling by leveraging the Depth Anything Model (DAM) to provide depth prior to polyp segmentation models.

Segmentation

NightRain: Nighttime Video Deraining via Adaptive-Rain-Removal and Adaptive-Correction

no code implementations1 Jan 2024 Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Shunli Zhang, Robby Tan

However, the intricacies of the real world, particularly with the presence of light effects and low-light regions affected by noise, create significant domain gaps, hampering synthetic-trained models in removing rain streaks properly and leading to over-saturation and color shifts.

Rain Removal Video deraining

Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution

1 code implementation3 Aug 2023 Yeying Jin, Beibei Lin, Wending Yan, Yuan Yuan, Wei Ye, Robby T. Tan

In this paper, we enhance the visibility from a single nighttime haze image by suppressing glow and enhancing low-light regions.

Image Dehazing

Estimating Reflectance Layer from A Single Image: Integrating Reflectance Guidance and Shadow/Specular Aware Learning

1 code implementation27 Nov 2022 Yeying Jin, Ruoteng Li, Wenhan Yang, Robby T. Tan

To further enforce the reflectance layer to be independent of shadows and specularities in the second-stage refinement, we introduce an S-Aware network that distinguishes the reflectance image from the input image.

highlight removal Intrinsic Image Decomposition +1

DeS3: Adaptive Attention-driven Self and Soft Shadow Removal using ViT Similarity

1 code implementation15 Nov 2022 Yeying Jin, Wei Ye, Wenhan Yang, Yuan Yuan, Robby T. Tan

Most existing methods rely on binary shadow masks, without considering the ambiguous boundaries of soft and self shadows.

Image Shadow Removal Shadow Removal

Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal

1 code implementation6 Oct 2022 Yeying Jin, Wending Yan, Wenhan Yang, Robby T. Tan

Few existing image defogging or dehazing methods consider dense and non-uniform particle distributions, which usually happen in smoke, dust and fog.

Image Dehazing Image Enhancement +3

Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression

1 code implementation21 Jul 2022 Yeying Jin, Wenhan Yang, Robby T. Tan

To address this problem, we need to suppress the light effects in bright regions while, at the same time, boosting the intensity of dark regions.

Hallucination Image Restoration +1

Cannot find the paper you are looking for? You can Submit a new open access paper.