Search Results for author: Jinheng Xie

Found 19 papers, 14 papers with code

Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt

1 code implementation17 Apr 2024 Zhanjie Zhang, Quanwei Zhang, Huaizhong Lin, Wei Xing, Juncheng Mo, Shuaicheng Huang, Jinheng Xie, Guangyuan Li, Junsheng Luan, Lei Zhao, Dalong Zhang, Lixia Chen

To address the above problems, we propose a novel pre-trained diffusion-based artistic style transfer method, called LSAST, which can generate highly realistic artistic stylized images while preserving the content structure of input content images well, without bringing obvious artifacts and disharmonious style patterns.

Generative Adversarial Network Style Transfer

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

1 code implementation3 Apr 2024 Wentian Zhang, Haozhe Liu, Jinheng Xie, Francesco Faccio, Mike Zheng Shou, Jürgen Schmidhuber

This study explores the role of cross-attention during inference in text-conditional diffusion models.

Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation

1 code implementation18 Jan 2024 Songhe Deng, Wei Zhuo, Jinheng Xie, Linlin Shen

Class Activation Map (CAM) has emerged as a popular tool for weakly supervised semantic segmentation (WSSS), allowing the localization of object regions in an image using only image-level labels.

Contrastive Learning Prompt Engineering +4

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

no code implementations29 Dec 2023 Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan

Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.

Object Object Discovery +2

TCSloT: Text Guided 3D Context and Slope Aware Triple Network for Dental Implant Position Prediction

no code implementations10 Aug 2023 Xinquan Yang, Jinheng Xie, Xuechen Li, Xuguang Li, Linlin Shen, Yongqiang Deng

In this paper, we design a Text Guided 3D Context and Slope Aware Triple Network (TCSloT) which enables the perception of contextual information from multiple adjacent slices and awareness of variation of implant slopes.

Position

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

2 code implementations ICCV 2023 Jinheng Xie, Yuexiang Li, Yawen Huang, Haozhe Liu, Wentian Zhang, Yefeng Zheng, Mike Zheng Shou

As such paired data is time-consuming and labor-intensive to acquire and restricted to a closed set, this potentially becomes the bottleneck for applications in an open world.

Conditional Text-to-Image Synthesis Denoising

TCEIP: Text Condition Embedded Regression Network for Dental Implant Position Prediction

no code implementations26 Jun 2023 Xinquan Yang, Jinheng Xie, Xuguang Li, Xuechen Li, Xin Li, Linlin Shen, Yongqiang Deng

When deep neural network has been proposed to assist the dentist in designing the location of dental implant, most of them are targeting simple cases where only one missing tooth is available.

Position Position regression +1

Dynamically Masked Discriminator for Generative Adversarial Networks

1 code implementation13 Jun 2023 Wentian Zhang, Haozhe Liu, Bing Li, Jinheng Xie, Yawen Huang, Yuexiang Li, Yefeng Zheng, Bernard Ghanem

By treating the generated data in training as a stream, we propose to detect whether the discriminator slows down the learning of new knowledge in generated data.

Continual Learning

VisorGPT: Learning Visual Prior via Generative Pre-Training

1 code implementation23 May 2023 Jinheng Xie, Kai Ye, Yudong Li, Yuexiang Li, Kevin Qinghong Lin, Yefeng Zheng, Linlin Shen, Mike Zheng Shou

Experimental results demonstrate that VisorGPT can effectively model the visual prior, which can be employed for many vision tasks, such as customizing accurate human pose for conditional image synthesis models like ControlNet.

Image Generation Language Modelling +1

Open-World Weakly-Supervised Object Localization

1 code implementation17 Apr 2023 Jinheng Xie, Zhaochuan Luo, Yuexiang Li, Haozhe Liu, Linlin Shen, Mike Zheng Shou

To handle such data, we propose a novel paradigm of contrastive representation co-learning using both labeled and unlabeled data to generate a complete G-CAM (Generalized Class Activation Map) for object localization, without the requirement of bounding box annotation.

Object Representation Learning +1

Decoupled Mixup for Generalized Visual Recognition

1 code implementation26 Oct 2022 Haozhe Liu, Wentian Zhang, Jinheng Xie, Haoqian Wu, Bing Li, Ziqi Zhang, Yuexiang Li, Yawen Huang, Bernard Ghanem, Yefeng Zheng

Since the observation is that noise-prone regions such as textural and clutter backgrounds are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair.

A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays

1 code implementation5 Sep 2022 Haoqin Ji, Haozhe Liu, Yuexiang Li, Jinheng Xie, Nanjun He, Yawen Huang, Dong Wei, Xinrong Chen, Linlin Shen, Yefeng Zheng

Such a point annotation setting can provide weakly instance-level information for abnormality localization with a marginal annotation cost.

Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation

2 code implementations25 Mar 2022 Jinheng Xie, Jianfeng Xiang, Junliang Chen, Xianxu Hou, Xiaodong Zhao, Linlin Shen

While class activation map (CAM) generated by image classification network has been widely used for weakly supervised object localization (WSOL) and semantic segmentation (WSSS), such classifiers usually focus on discriminative object regions.

Contrastive Learning Image Classification +3

Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity

1 code implementation CVPR 2022 Cheng Luo, Qinliang Lin, Weicheng Xie, Bizhu Wu, Jinheng Xie, Linlin Shen

Current adversarial attack research reveals the vulnerability of learning-based classifiers against carefully crafted perturbations.

Adversarial Attack Semantic Similarity +1

Cross Language Image Matching for Weakly Supervised Semantic Segmentation

2 code implementations5 Mar 2022 Jinheng Xie, Xianxu Hou, Kai Ye, Linlin Shen

As only a fixed set of image-level object labels are available to the WSSS (weakly supervised semantic segmentation) model, it could be very difficult to suppress those diverse background regions consisting of open set objects.

Object Weakly supervised Semantic Segmentation +1

C2AM: Contrastive Learning of Class-Agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation

1 code implementation CVPR 2022 Jinheng Xie, Jianfeng Xiang, Junliang Chen, Xianxu Hou, Xiaodong Zhao, Linlin Shen

While class activation map (CAM) generated by image classification network has been widely used for weakly supervised object localization (WSOL) and semantic segmentation (WSSS), such classifiers usually focus on discriminative object regions.

Contrastive Learning Image Classification +3

CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation

no code implementations CVPR 2022 Jinheng Xie, Xianxu Hou, Kai Ye, Linlin Shen

As only a fixed set of image-level object labels are available to the WSSS (weakly supervised semantic segmentation) model, it could be very difficult to suppress those diverse background regions consisting of open set objects.

Object Weakly supervised Semantic Segmentation +1

Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization

1 code implementation ICCV 2021 Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin Shen

In the first stage, an activation map generator produces activation maps based on the low-level feature maps in the classifier, such that rich contextual object information is included in an online manner.

Object Weakly-Supervised Object Localization

Think about boundary: Fusing multi-level boundary information for landmark heatmap regression

no code implementations25 Aug 2020 Jinheng Xie, Jun Wan, Linlin Shen, Zhihui Lai

Although current face alignment algorithms have obtained pretty good performances at predicting the location of facial landmarks, huge challenges remain for faces with severe occlusion and large pose variations, etc.

Face Alignment regression

Cannot find the paper you are looking for? You can Submit a new open access paper.