Search Results for author: Zhineng Chen

Found 25 papers, 16 papers with code

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

1 code implementation12 Sep 2024 Yifu Chen, Jingwen Chen, Yingwei Pan, Yehao Li, Ting Yao, Zhineng Chen, Tao Mei

In this paper, we propose to decompose the typical single-stage object inpainting into two cascaded processes: 1) semantic pre-inpainting that infers the semantic features of desired objects in a multi-modal feature space; 2) high-fieldity object generation in diffusion latent space that pivots on such inpainted semantic features.

Denoising Object

FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process

no code implementations11 Sep 2024 Yang Luo, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Zhineng Chen, Yu-Gang Jiang, Tao Mei

Technically, FreeEnhance is a two-stage process that firstly adds random noise to the input image and then capitalizes on a pre-trained image diffusion model (i. e., Latent Diffusion Models) to denoise and enhance the image details.

Denoising Image Enhancement +1

Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models

1 code implementation11 Sep 2024 Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Chong-Wah Ngo, Tao Mei

Despite having tremendous progress in image-to-3D generation, existing methods still struggle to produce multi-view consistent images with high-resolution textures in detail, especially in the paradigm of 2D diffusion that lacks 3D awareness.

3D Generation 3D Reconstruction +3

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation

no code implementations11 Sep 2024 Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Tao Mei

In the fine stage, DreamMesh jointly manipulates the mesh and refines the texture map, leading to high-quality triangle meshes with high-fidelity textured materials.

3D Architecture 3D Generation +1

CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer

no code implementations11 Sep 2024 Feiyang Jia, Zhineng Chen, Ziying Song, Lin Liu, Caiyan Jia

Super-resolution (SR) aims to enhance the quality of low-resolution images and has been widely applied in medical imaging.

Super-Resolution

Decoder Pre-Training with only Text for Scene Text Recognition

1 code implementation11 Aug 2024 Shuai Zhao, Yongkun Du, Zhineng Chen, Yu-Gang Jiang

Extensive experiments across various STR decoders and language recognition tasks underscore the broad applicability and remarkable performance of DPTR, providing a novel insight for STR pre-training.

Decoder Scene Text Recognition

AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning

1 code implementation4 Aug 2024 Xin Wang, Kai Chen, Xingjun Ma, Zhineng Chen, Jingjing Chen, Yu-Gang Jiang

During this process, the queries made to the target model are intermediate adversarial examples crafted at the previous attack step, which share high similarities in the pixel space.

Learning to Rank Patches for Unbiased Image Redundancy Reduction

1 code implementation CVPR 2024 Yang Luo, Zhineng Chen, Peng Zhou, Zuxuan Wu, Xieping Gao, Yu-Gang Jiang

The results demonstrate that LTRP outperforms both supervised and other self-supervised methods due to the fair assessment of image content.

Image Reconstruction Inductive Bias +1

Instruction-Guided Scene Text Recognition

1 code implementation31 Jan 2024 Yongkun Du, Zhineng Chen, Yuchen Su, Caiyan Jia, Yu-Gang Jiang

We propose a novel instruction-guided scene text recognition (IGTR) paradigm that formulates STR as an instruction learning problem and understands text images by predicting character attributes, e. g., character frequency, position, etc.

Question Answering Scene Text Recognition

3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models

1 code implementation9 Nov 2023 Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Tao Mei

In this work, we propose a new 3DStyle-Diffusion model that triggers fine-grained stylization of 3D meshes with additional controllable appearance and geometric guidance from 2D Diffusion models.

Image Generation

Context Perception Parallel Decoder for Scene Text Recognition

1 code implementation23 Jul 2023 Yongkun Du, Zhineng Chen, Caiyan Jia, Xiaoting Yin, Chenxia Li, Yuning Du, Yu-Gang Jiang

We first present an empirical study of AR decoding in STR, and discover that the AR decoder not only models linguistic context, but also provides guidance on visual context perception.

 Ranked #1 on Scene Text Recognition on CUTE80 (using extra training data)

Decoder Language Modelling +1

MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition

1 code implementation ICCV 2023 Tianlun Zheng, Zhineng Chen, Bingchen Huang, Wei zhang, Yu-Gang Jiang

In this paper, we propose the Incremental MLTR (IMLTR) task in the context of incremental learning (IL), where different languages are introduced in batches.

Continual Learning Incremental Learning +2

TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition

1 code implementation9 May 2023 Tianlun Zheng, Zhineng Chen, Jinfeng Bai, Hongtao Xie, Yu-Gang Jiang

In this work, we introduce TPS++, an attention-enhanced TPS transformation that incorporates the attention mechanism to text rectification for the first time.

Optical Character Recognition (OCR) Scene Text Recognition

Bi-Directional Feature Fusion Generative Adversarial Network for Ultra-High Resolution Pathological Image Virtual Re-Staining

no code implementations CVPR 2023 Kexin Sun, Zhineng Chen, Gongwei Wang, Jun Liu, Xiongjun Ye, Yu-Gang Jiang

In order to eliminate the square effect, we design a bi-directional feature fusion generative adversarial network (BFF-GAN) with a global branch and a local branch.

Generative Adversarial Network

Prototypical Residual Networks for Anomaly Detection and Localization

no code implementations CVPR 2023 HUI ZHANG, Zuxuan Wu, Zheng Wang, Zhineng Chen, Yu-Gang Jiang

Anomaly detection and localization are widely used in industrial manufacturing for its efficiency and effectiveness.

Ranked #4 on Supervised Anomaly Detection on MVTec AD (using extra training data)

Supervised Anomaly Detection

SVTR: Scene Text Recognition with a Single Visual Model

3 code implementations30 Apr 2022 Yongkun Du, Zhineng Chen, Caiyan Jia, Xiaoting Yin, Tianlun Zheng, Chenxia Li, Yuning Du, Yu-Gang Jiang

Dominant scene text recognition models commonly contain two building blocks, a visual model for feature extraction and a sequence model for text transcription.

Scene Text Recognition

CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition

2 code implementations22 Nov 2021 Tianlun Zheng, Zhineng Chen, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang

In this paper, we propose a novel module called Multi-Domain Character Distance Perception (MDCDP) to establish a visually and semantically related position embedding.

Position Scene Text Recognition

ACE-Net: Biomedical Image Segmentation with Augmented Contracting and Expansive Paths

no code implementations23 Aug 2019 Yanhao Zhu, Zhineng Chen, Shuai Zhao, Hongtao Xie, Wenming Guo, Yongdong Zhang

Nowadays U-net-like FCNs predominate various biomedical image segmentation applications and attain promising performance, largely due to their elegant architectures, e. g., symmetric contracting and expansive paths as well as lateral skip-connections.

Image Segmentation Segmentation +1

NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition

3 code implementations4 Jun 2018 Fenfen Sheng, Zhineng Chen, Bo Xu

Considering scene image has large variation in text and background, we further design a modality-transform block to effectively transform 2D input images to 1D sequences, combined with the encoder to extract more discriminative features.

Decoder Optical Character Recognition (OCR) +1

Binarized Mode Seeking for Scalable Visual Pattern Discovery

no code implementations CVPR 2017 Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, Zhineng Chen

Second, we further extend bMS to a more general form, namely contrastive binary mean shift (cbMS), which maximizes the contrastive density in binary space, for finding informative patterns that are both frequent and discriminative for the dataset.

Cannot find the paper you are looking for? You can Submit a new open access paper.