Search Results for author: Kang Zhang

Found 32 papers, 9 papers with code

Towards Understanding Dual BN In Hybrid Adversarial Training

no code implementations28 Mar 2024 Chenshuang Zhang, Chaoning Zhang, Kang Zhang, Axi Niu, Junmo Kim, In So Kweon

There is a growing concern about applying batch normalization (BN) in adversarial training (AT), especially when the model is trained on both adversarial samples and clean samples (termed Hybrid-AT).

BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout Detection

1 code implementation12 Feb 2024 Kang Zhang, Osamu Yoshie, Weiran Huang

To address these issues, we introduce BreakGPT, the first large language model for financial breakout detection.

Language Modelling Large Language Model

Human Aesthetic Preference-Based Large Text-to-Image Model Personalization: Kandinsky Generation as an Example

no code implementations9 Feb 2024 Aven-Le Zhou, Yu-Ao Wang, Wei Wu, Kang Zhang

This paper introduces a prompting-free generative approach that empowers users to automatically generate personalized painterly content that incorporates their aesthetic preferences in a customized artistic style.

Multi-modal vision-language model for generalizable annotation-free pathological lesions localization and clinical diagnosis

no code implementations4 Jan 2024 Hao Yang, Hong-Yu Zhou, Zhihuan Li, Yuanxu Gao, Cheng Li, Weijian Huang, Jiarun Liu, Hairong Zheng, Kang Zhang, Shanshan Wang

Defining pathologies automatically from medical images aids the understanding of the emergence and progression of diseases, and such an ability is crucial in clinical diagnostics.

Contrastive Learning Language Modelling

The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model

no code implementations4 Dec 2023 Yilin Ye, Qian Zhu, Shishi Xiao, Kang Zhang, Wei Zeng

Moreover, the intent expansion framework enables users to perform flexible contextualized interactions with the search results to further specify or adjust their detailed search intents iteratively.

Image Retrieval Interactive Segmentation +2

DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution

no code implementations30 Nov 2023 Axi Niu, Kang Zhang, Joshua Tian Jin Tee, Trung X. Pham, Jinqiu Sun, Chang D. Yoo, In So Kweon, Yanning Zhang

It is well known the adversarial optimization of GAN-based image super-resolution (SR) methods makes the preceding SR model generate unpleasant and undesirable artifacts, leading to large distortion.

Attribute Data Augmentation +1

Archiving Body Movements: Collective Generation of Chinese Calligraphy

no code implementations23 Nov 2023 Aven Le Zhou, Jiayi Ye, Tianchen Liu, Kang Zhang

As a communication channel, body movements have been widely explored in behavioral studies and kinesics.

ACDMSR: Accelerated Conditional Diffusion Models for Single Image Super-Resolution

no code implementations3 Jul 2023 Axi Niu, Pham Xuan Trung, Kang Zhang, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang

To speed up inference and further enhance the performance, our research revisits diffusion models in image super-resolution and proposes a straightforward yet significant diffusion model-based super-resolution method called ACDMSR (accelerated conditional diffusion model for image super-resolution).

Denoising Image Super-Resolution +1

A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics

1 code implementation1 Jun 2023 Hong-Yu Zhou, Yizhou Yu, Chengdi Wang, Shu Zhang, Yuanxu Gao, Jia Pan, Jun Shao, Guangming Lu, Kang Zhang, Weimin Li

During the diagnostic process, clinicians leverage multimodal information, such as chief complaints, medical images, and laboratory-test results.

Representation Learning

Learning from Multi-Perception Features for Real-Word Image Super-resolution

no code implementations26 May 2023 Axi Niu, Kang Zhang, Trung X. Pham, Pei Wang, Jinqiu Sun, In So Kweon, Yanning Zhang

Currently, there are two popular approaches for addressing real-world image super-resolution problems: degradation-estimation-based and blind-based methods.

Image Super-Resolution

Everyone Can Be Picasso? A Computational Framework into the Myth of Human versus AI Painting

1 code implementation17 Apr 2023 Yilin Ye, Rong Huang, Kang Zhang, Wei Zeng

The recent advances of AI technology, particularly in AI-Generated Content (AIGC), have enabled everyone to easily generate beautiful paintings with simple text description.

Semi-Supervised Video Inpainting with Cycle Consistency Constraints

no code implementations CVPR 2023 Zhiliang Wu, Hanyu Xuan, Changchang Sun, Kang Zhang, Yan Yan

Specifically, in this work, we propose an end-to-end trainable framework consisting of completion network and mask prediction network, which are designed to generate corrupted contents of the current frame using the known mask and decide the regions to be filled of the next frame, respectively.

Video Inpainting

On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning

no code implementations11 Aug 2022 Trung Pham, Chaoning Zhang, Axi Niu, Kang Zhang, Chang D. Yoo

Exponential Moving Average (EMA or momentum) is widely used in modern self-supervised learning (SSL) approaches, such as MoCo, for enhancing performance.

Representation Learning Self-Supervised Learning

A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond

no code implementations30 Jul 2022 Chaoning Zhang, Chenshuang Zhang, Junha Song, John Seon Keun Yi, Kang Zhang, In So Kweon

Masked autoencoders are scalable vision learners, as the title of MAE \cite{he2022masked}, which suggests that self-supervised learning (SSL) in vision might undertake a similar trajectory as in NLP.

Contrastive Learning Denoising +1

Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness

2 code implementations22 Jul 2022 Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Axi Niu, Jiu Feng, Chang D. Yoo, In So Kweon

Adversarial training (AT) for robust representation learning and self-supervised learning (SSL) for unsupervised representation learning are two active research fields.

Adversarial Robustness Contrastive Learning +3

Understanding and Improving Group Normalization

1 code implementation5 Jul 2022 Agus Gunawan, Xu Yin, Kang Zhang

Various normalization layers have been proposed to help the training of neural networks.

Image Classification

Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo

2 code implementations CVPR 2022 Chaoning Zhang, Kang Zhang, Trung X. Pham, Axi Niu, Zhinan Qiao, Chang D. Yoo, In So Kweon

Contrastive learning (CL) is widely known to require many negative samples, 65536 in MoCo for instance, for which the performance of a dictionary-free framework is often inferior because the negative sample size (NSS) is limited by its mini-batch size (MBS).

Contrastive Learning

Investigating Top-$k$ White-Box and Transferable Black-box Attack

no code implementations30 Mar 2022 Chaoning Zhang, Philipp Benz, Adil Karjauv, Jae Won Cho, Kang Zhang, In So Kweon

It is widely reported that stronger I-FGSM transfers worse than simple FGSM, leading to a popular belief that transferability is at odds with the white-box attack strength.

Fast Adversarial Training with Noise Augmentation: A Unified Perspective on RandStart and GradAlign

no code implementations11 Feb 2022 Axi Niu, Kang Zhang, Chaoning Zhang, Chenshuang Zhang, In So Kweon, Chang D. Yoo, Yanning Zhang

The former works only for a relatively small perturbation 8/255 with the l_\infty constraint, and GradAlign improves it by extending the perturbation size to 16/255 (with the l_\infty constraint) but at the cost of being 3 to 4 times slower.

Data Augmentation

Investigating Top-k White-Box and Transferable Black-Box Attack

no code implementations CVPR 2022 Chaoning Zhang, Philipp Benz, Adil Karjauv, Jae Won Cho, Kang Zhang, In So Kweon

It is widely reported that stronger I-FGSM transfers worse than simple FGSM, leading to a popular belief that transferability is at odds with the white-box attack strength.

Early Stop And Adversarial Training Yield Better surrogate Model: Very Non-Robust Features Harm Adversarial Transferability

no code implementations29 Sep 2021 Chaoning Zhang, Gyusang Cho, Philipp Benz, Kang Zhang, Chenshuang Zhang, Chan-Hyun Youn, In So Kweon

The transferability of adversarial examples (AE); known as adversarial transferability, has attracted significant attention because it can be exploited for TransferableBlack-box Attacks (TBA).

Attribute

Towards Personalized and Semantic Retrieval: An End-to-End Solution for E-commerce Search via Embedding Learning

no code implementations3 Jun 2020 Han Zhang, Songlin Wang, Kang Zhang, Zhiling Tang, Yunjiang Jiang, Yun Xiao, Weipeng Yan, Wen-Yun Yang

Two critical challenges stay in today's e-commerce search: how to retrieve items that are semantically relevant but not exact matching to query terms, and how to retrieve items that are more personalized to different users for the same search query.

Retrieval Semantic Retrieval

A Computational Model of Afterimages based on Simultaneous and Successive Contrasts

no code implementations13 Sep 2017 Jinhui Yu, Kailin Wu, Kang Zhang, Xianjun Sam Zheng

The colors of negative afterimages differ from the old stimulating colors in the original image when the color in the new area is either neutral or chromatic.

Bayesian regression and Bitcoin

15 code implementations6 Oct 2014 Devavrat Shah, Kang Zhang

In this paper, we discuss the method of Bayesian regression and its efficacy for predicting price variation of Bitcoin, a recently popularized virtual, cryptographic currency.

Bayesian Inference Binary Classification +2

Cross-Scale Cost Aggregation for Stereo Matching

1 code implementation CVPR 2014 Kang Zhang, Yuqiang Fang, Dongbo Min, Lifeng Sun, Shiqiang Yang. Shuicheng Yan, Qi Tian

We firstly reformulate cost aggregation from a unified optimization perspective and show that different cost aggregation methods essentially differ in the choices of similarity kernels.

Stereo Matching Stereo Matching Hand

Binary Stereo Matching

1 code implementation10 Feb 2014 Kang Zhang, Jiyang Li, Yijing Li, Weidong Hu, Lifeng Sun, Shiqiang Yang

In this paper, we propose a novel binary-based cost computation and aggregation approach for stereo matching problem.

Computational Efficiency Stereo Matching +1

Cannot find the paper you are looking for? You can Submit a new open access paper.