Search Results for author: Jinheng Xie

Found 19 papers, 14 papers with code

Towards Highly Realistic Artistic Style Transfer via Stable Diffusion with Step-aware and Layer-aware Prompt

1 code implementation • 17 Apr 2024 • Zhanjie Zhang, Quanwei Zhang, Huaizhong Lin, Wei Xing, Juncheng Mo, Shuaicheng Huang, Jinheng Xie, Guangyuan Li, Junsheng Luan, Lei Zhao, Dalong Zhang, Lixia Chen

To address the above problems, we propose a novel pre-trained diffusion-based artistic style transfer method, called LSAST, which can generate highly realistic artistic stylized images while preserving the content structure of input content images well, without bringing obvious artifacts and disharmonious style patterns.

Generative Adversarial Network Style Transfer

Paper
Code

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

1 code implementation • 3 Apr 2024 • Wentian Zhang, Haozhe Liu, Jinheng Xie, Francesco Faccio, Mike Zheng Shou, Jürgen Schmidhuber

This study explores the role of cross-attention during inference in text-conditional diffusion models.

206

Paper
Code

Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation

1 code implementation • 18 Jan 2024 • Songhe Deng, Wei Zhuo, Jinheng Xie, Linlin Shen

Class Activation Map (CAM) has emerged as a popular tool for weakly supervised semantic segmentation (WSSS), allowing the localization of object regions in an image using only image-level labels.

Ranked #7 on Weakly-Supervised Semantic Segmentation on PASCAL VOC 2012 test (using extra training data)

Contrastive Learning Prompt Engineering +4

Paper
Code

HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping

no code implementations • 29 Dec 2023 • Xin Zhang, Jinheng Xie, Yuan Yuan, Michael Bi Mi, Robby T. Tan

Further, to ensure the distinguishability among various regions, we introduce a region-level contrastive clustering loss to pull closer similar regions across images.

Object Object Discovery +2

Paper
Add Code

TCSloT: Text Guided 3D Context and Slope Aware Triple Network for Dental Implant Position Prediction

no code implementations • 10 Aug 2023 • Xinquan Yang, Jinheng Xie, Xuechen Li, Xuguang Li, Linlin Shen, Yongqiang Deng

In this paper, we design a Text Guided 3D Context and Slope Aware Triple Network (TCSloT) which enables the perception of contextual information from multiple adjacent slices and awareness of variation of implant slopes.

Position

Paper
Add Code

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

2 code implementations • ICCV 2023 • Jinheng Xie, Yuexiang Li, Yawen Huang, Haozhe Liu, Wentian Zhang, Yefeng Zheng, Mike Zheng Shou

As such paired data is time-consuming and labor-intensive to acquire and restricted to a closed set, this potentially becomes the bottleneck for applications in an open world.

Ranked #5 on Conditional Text-to-Image Synthesis on COCO-MIG

Conditional Text-to-Image Synthesis Denoising

206

Paper
Code

TCEIP: Text Condition Embedded Regression Network for Dental Implant Position Prediction

no code implementations • 26 Jun 2023 • Xinquan Yang, Jinheng Xie, Xuguang Li, Xuechen Li, Xin Li, Linlin Shen, Yongqiang Deng

When deep neural network has been proposed to assist the dentist in designing the location of dental implant, most of them are targeting simple cases where only one missing tooth is available.

Position Position regression +1

Paper
Add Code

Dynamically Masked Discriminator for Generative Adversarial Networks

1 code implementation • 13 Jun 2023 • Wentian Zhang, Haozhe Liu, Bing Li, Jinheng Xie, Yawen Huang, Yuexiang Li, Yefeng Zheng, Bernard Ghanem

By treating the generated data in training as a stream, we propose to detect whether the discriminator slows down the learning of new knowledge in generated data.

Continual Learning

Paper
Code

VisorGPT: Learning Visual Prior via Generative Pre-Training

1 code implementation • 23 May 2023 • Jinheng Xie, Kai Ye, Yudong Li, Yuexiang Li, Kevin Qinghong Lin, Yefeng Zheng, Linlin Shen, Mike Zheng Shou

Experimental results demonstrate that VisorGPT can effectively model the visual prior, which can be employed for many vision tasks, such as customizing accurate human pose for conditional image synthesis models like ControlNet.

Image Generation Language Modelling +1

125

Paper
Code

Open-World Weakly-Supervised Object Localization

1 code implementation • 17 Apr 2023 • Jinheng Xie, Zhaochuan Luo, Yuexiang Li, Haozhe Liu, Linlin Shen, Mike Zheng Shou

To handle such data, we propose a novel paradigm of contrastive representation co-learning using both labeled and unlabeled data to generate a complete G-CAM (Generalized Class Activation Map) for object localization, without the requirement of bounding box annotation.

Object Representation Learning +1

Paper
Code

Decoupled Mixup for Generalized Visual Recognition

1 code implementation • 26 Oct 2022 • Haozhe Liu, Wentian Zhang, Jinheng Xie, Haoqian Wu, Bing Li, Ziqi Zhang, Yuexiang Li, Yawen Huang, Bernard Ghanem, Yefeng Zheng

Since the observation is that noise-prone regions such as textural and clutter backgrounds are adverse to the generalization ability of CNN models during training, we enhance features from discriminative regions and suppress noise-prone ones when combining an image pair.

Paper
Code

A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays

1 code implementation • 5 Sep 2022 • Haoqin Ji, Haozhe Liu, Yuexiang Li, Jinheng Xie, Nanjun He, Yawen Huang, Dong Wei, Xinrong Chen, Linlin Shen, Yefeng Zheng

Such a point annotation setting can provide weakly instance-level information for abnormality localization with a marginal annotation cost.

Paper
Code

Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation

2 code implementations • 25 Mar 2022 • Jinheng Xie, Jianfeng Xiang, Junliang Chen, Xianxu Hou, Xiaodong Zhao, Linlin Shen

While class activation map (CAM) generated by image classification network has been widely used for weakly supervised object localization (WSOL) and semantic segmentation (WSSS), such classifiers usually focus on discriminative object regions.

Contrastive Learning Image Classification +3

177

Paper
Code

Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity

1 code implementation • CVPR 2022 • Cheng Luo, Qinliang Lin, Weicheng Xie, Bizhu Wu, Jinheng Xie, Linlin Shen

Current adversarial attack research reveals the vulnerability of learning-based classifiers against carefully crafted perturbations.

Adversarial Attack Semantic Similarity +1

Paper
Code

Cross Language Image Matching for Weakly Supervised Semantic Segmentation

2 code implementations • 5 Mar 2022 • Jinheng Xie, Xianxu Hou, Kai Ye, Linlin Shen

As only a fixed set of image-level object labels are available to the WSSS (weakly supervised semantic segmentation) model, it could be very difficult to suppress those diverse background regions consisting of open set objects.

Object Weakly supervised Semantic Segmentation +1

177

Paper
Code

C2AM: Contrastive Learning of Class-Agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation

1 code implementation • CVPR 2022 • Jinheng Xie, Jianfeng Xiang, Junliang Chen, Xianxu Hou, Xiaodong Zhao, Linlin Shen

Contrastive Learning Image Classification +3

177

Paper
Code

CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation

no code implementations • CVPR 2022 • Jinheng Xie, Xianxu Hou, Kai Ye, Linlin Shen

Object Weakly supervised Semantic Segmentation +1

Paper
Add Code

Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization

1 code implementation • ICCV 2021 • Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin Shen

In the first stage, an activation map generator produces activation maps based on the low-level feature maps in the classifier, such that rich contextual object information is included in an online manner.

Object Weakly-Supervised Object Localization

Paper
Code

Think about boundary: Fusing multi-level boundary information for landmark heatmap regression

no code implementations • 25 Aug 2020 • Jinheng Xie, Jun Wan, Linlin Shen, Zhihui Lai

Although current face alignment algorithms have obtained pretty good performances at predicting the location of facial landmarks, huge challenges remain for faces with severe occlusion and large pose variations, etc.

Face Alignment regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.