Search Results for author: Kang Zhang

Found 32 papers, 9 papers with code

Towards Understanding Dual BN In Hybrid Adversarial Training

no code implementations • 28 Mar 2024 • Chenshuang Zhang, Chaoning Zhang, Kang Zhang, Axi Niu, Junmo Kim, In So Kweon

There is a growing concern about applying batch normalization (BN) in adversarial training (AT), especially when the model is trained on both adversarial samples and clean samples (termed Hybrid-AT).

BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout Detection

1 code implementation • 12 Feb 2024 • Kang Zhang, Osamu Yoshie, Weiran Huang

To address these issues, we introduce BreakGPT, the first large language model for financial breakout detection.

Language Modelling Large Language Model

Human Aesthetic Preference-Based Large Text-to-Image Model Personalization: Kandinsky Generation as an Example

no code implementations • 9 Feb 2024 • Aven-Le Zhou, Yu-Ao Wang, Wei Wu, Kang Zhang

This paper introduces a prompting-free generative approach that empowers users to automatically generate personalized painterly content that incorporates their aesthetic preferences in a customized artistic style.

Multi-modal vision-language model for generalizable annotation-free pathological lesions localization and clinical diagnosis

no code implementations • 4 Jan 2024 • Hao Yang, Hong-Yu Zhou, Zhihuan Li, Yuanxu Gao, Cheng Li, Weijian Huang, Jiarun Liu, Hairong Zheng, Kang Zhang, Shanshan Wang

Defining pathologies automatically from medical images aids the understanding of the emergence and progression of diseases, and such an ability is crucial in clinical diagnostics.

Contrastive Learning Language Modelling

The Contemporary Art of Image Search: Iterative User Intent Expansion via Vision-Language Model

no code implementations • 4 Dec 2023 • Yilin Ye, Qian Zhu, Shishi Xiao, Kang Zhang, Wei Zeng

Moreover, the intent expansion framework enables users to perform flexible contextualized interactions with the search results to further specify or adjust their detailed search intents iteratively.

Image Retrieval Interactive Segmentation +2

DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution

no code implementations • 30 Nov 2023 • Axi Niu, Kang Zhang, Joshua Tian Jin Tee, Trung X. Pham, Jinqiu Sun, Chang D. Yoo, In So Kweon, Yanning Zhang

It is well known that the adversarial optimization of GAN-based image super-resolution (SR) methods makes the preceding SR model generate unpleasant and undesirable artifacts, leading to large distortion.

Attribute Data Augmentation +1

Archiving Body Movements: Collective Generation of Chinese Calligraphy

no code implementations • 23 Nov 2023 • Aven Le Zhou, Jiayi Ye, Tianchen Liu, Kang Zhang

As a communication channel, body movements have been widely explored in behavioral studies and kinesics.

ACDMSR: Accelerated Conditional Diffusion Models for Single Image Super-Resolution

no code implementations • 3 Jul 2023 • Axi Niu, Pham Xuan Trung, Kang Zhang, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang

To speed up inference and further enhance the performance, our research revisits diffusion models in image super-resolution and proposes a straightforward yet significant diffusion model-based super-resolution method called ACDMSR (accelerated conditional diffusion model for image super-resolution).

Denoising Image Super-Resolution +1

A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics

1 code implementation • 1 Jun 2023 • Hong-Yu Zhou, Yizhou Yu, Chengdi Wang, Shu Zhang, Yuanxu Gao, Jia Pan, Jun Shao, Guangming Lu, Kang Zhang, Weimin Li

During the diagnostic process, clinicians leverage multimodal information, such as chief complaints, medical images, and laboratory-test results.

Representation Learning

Learning from Multi-Perception Features for Real-Word Image Super-resolution

no code implementations • 26 May 2023 • Axi Niu, Kang Zhang, Trung X. Pham, Pei Wang, Jinqiu Sun, In So Kweon, Yanning Zhang

Currently, there are two popular approaches for addressing real-world image super-resolution problems: degradation-estimation-based and blind-based methods.

Image Super-Resolution

MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and Results

no code implementations • 27 Apr 2023 • Qingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling

To evaluate the performance of different depth completion methods, we organized an RGB+sparse ToF depth completion competition.

Depth Completion

Everyone Can Be Picasso? A Computational Framework into the Myth of Human versus AI Painting

1 code implementation • 17 Apr 2023 • Yilin Ye, Rong Huang, Kang Zhang, Wei Zeng

Recent advances in AI technology, particularly in AI-Generated Content (AIGC), have enabled everyone to easily generate beautiful paintings from a simple text description.

CDPMSR: Conditional Diffusion Probabilistic Models for Single Image Super-Resolution

no code implementations • 14 Feb 2023 • Axi Niu, Kang Zhang, Trung X. Pham, Jinqiu Sun, Yu Zhu, In So Kweon, Yanning Zhang

Diffusion probabilistic models (DPM) have been widely adopted in image-to-image translation to generate high-quality images.

Conditional Image Generation Denoising +2

Semi-Supervised Video Inpainting with Cycle Consistency Constraints

no code implementations • CVPR 2023 • Zhiliang Wu, Hanyu Xuan, Changchang Sun, Kang Zhang, Yan Yan

Specifically, in this work, we propose an end-to-end trainable framework consisting of a completion network and a mask prediction network, which are designed to generate corrupted contents of the current frame using the known mask and to decide the regions to be filled in the next frame, respectively.

Video Inpainting

On the Pros and Cons of Momentum Encoder in Self-Supervised Visual Representation Learning

no code implementations • 11 Aug 2022 • Trung Pham, Chaoning Zhang, Axi Niu, Kang Zhang, Chang D. Yoo

Exponential Moving Average (EMA or momentum) is widely used in modern self-supervised learning (SSL) approaches, such as MoCo, for enhancing performance.

Representation Learning Self-Supervised Learning

A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond

no code implementations • 30 Jul 2022 • Chaoning Zhang, Chenshuang Zhang, Junha Song, John Seon Keun Yi, Kang Zhang, In So Kweon

Masked autoencoders are scalable vision learners, as the title of the MAE paper (He et al., 2022) puts it, which suggests that self-supervised learning (SSL) in vision might follow a similar trajectory as in NLP.

Contrastive Learning Denoising +1

Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness

2 code implementations • 22 Jul 2022 • Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Axi Niu, Jiu Feng, Chang D. Yoo, In So Kweon

Adversarial training (AT) for robust representation learning and self-supervised learning (SSL) for unsupervised representation learning are two active research fields.

Adversarial Robustness Contrastive Learning +3

Understanding and Improving Group Normalization

1 code implementation • 5 Jul 2022 • Agus Gunawan, Xu Yin, Kang Zhang

Various normalization layers have been proposed to help the training of neural networks.

Image Classification

Dual Temperature Helps Contrastive Learning Without Many Negative Samples: Towards Understanding and Simplifying MoCo

2 code implementations • CVPR 2022 • Chaoning Zhang, Kang Zhang, Trung X. Pham, Axi Niu, Zhinan Qiao, Chang D. Yoo, In So Kweon

Contrastive learning (CL) is widely known to require many negative samples, 65536 in MoCo for instance, for which the performance of a dictionary-free framework is often inferior because the negative sample size (NSS) is limited by its mini-batch size (MBS).

Contrastive Learning

Investigating Top-$k$ White-Box and Transferable Black-box Attack

no code implementations • 30 Mar 2022 • Chaoning Zhang, Philipp Benz, Adil Karjauv, Jae Won Cho, Kang Zhang, In So Kweon

It is widely reported that stronger I-FGSM transfers worse than simple FGSM, leading to a popular belief that transferability is at odds with the white-box attack strength.

How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning

no code implementations • 30 Mar 2022 • Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Trung X. Pham, Chang D. Yoo, In So Kweon

This yields a unified perspective on how negative samples and SimSiam alleviate collapse.

Contrastive Learning Self-Supervised Learning

Fast Adversarial Training with Noise Augmentation: A Unified Perspective on RandStart and GradAlign

no code implementations • 11 Feb 2022 • Axi Niu, Kang Zhang, Chaoning Zhang, Chenshuang Zhang, In So Kweon, Chang D. Yoo, Yanning Zhang

The former works only for a relatively small perturbation of 8/255 under the $\ell_\infty$ constraint, and GradAlign improves on it by extending the perturbation size to 16/255 (also under the $\ell_\infty$ constraint), but at the cost of being 3 to 4 times slower.

Data Augmentation

Investigating Top-k White-Box and Transferable Black-Box Attack

no code implementations • CVPR 2022 • Chaoning Zhang, Philipp Benz, Adil Karjauv, Jae Won Cho, Kang Zhang, In So Kweon

It is widely reported that stronger I-FGSM transfers worse than simple FGSM, leading to a popular belief that transferability is at odds with the white-box attack strength.

How Does SimSiam Avoid Collapse Without Negative Samples? Towards a Unified Understanding of Progress in SSL

no code implementations • ICLR 2022 • Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Trung X. Pham, Chang D. Yoo, In So Kweon

Towards avoiding collapse in self-supervised learning (SSL), contrastive loss is widely used but often requires a large number of negative samples.

Self-Supervised Learning

Early Stop And Adversarial Training Yield Better Surrogate Model: Very Non-Robust Features Harm Adversarial Transferability

no code implementations • 29 Sep 2021 • Chaoning Zhang, Gyusang Cho, Philipp Benz, Kang Zhang, Chenshuang Zhang, Chan-Hyun Youn, In So Kweon

The transferability of adversarial examples (AE), known as adversarial transferability, has attracted significant attention because it can be exploited for Transferable Black-box Attacks (TBA).

Attribute

Query Rewriting via Cycle-Consistent Translation for E-Commerce Search

no code implementations • 1 Mar 2021 • Yiming Qiu, Kang Zhang, Han Zhang, Songlin Wang, Sulong Xu, Yun Xiao, Bo Long, Wen-Yun Yang

Online A/B experiments show that it improves core e-commerce business metrics significantly.

Machine Translation Translation

Towards Personalized and Semantic Retrieval: An End-to-End Solution for E-commerce Search via Embedding Learning

no code implementations • 3 Jun 2020 • Han Zhang, Songlin Wang, Kang Zhang, Zhiling Tang, Yunjiang Jiang, Yun Xiao, Weipeng Yan, Wen-Yun Yang

Two critical challenges remain in today's e-commerce search: how to retrieve items that are semantically relevant but not exact matches for the query terms, and how to retrieve items that are more personalized to different users for the same search query.

Retrieval Semantic Retrieval

A Computational Model of Afterimages based on Simultaneous and Successive Contrasts

no code implementations • 13 Sep 2017 • Jinhui Yu, Kailin Wu, Kang Zhang, Xianjun Sam Zheng

The colors of negative afterimages differ from the old stimulating colors in the original image when the color in the new area is either neutral or chromatic.

3D Fragment Reassembly Using Integrated Template Guidance and Fracture-Region Matching

no code implementations • ICCV 2015 • Kang Zhang, Wuyi Yu, Mary Manhein, Warren Waggenspack, Xin Li

This paper studies matching of fragmented objects to recompose their original geometry.

Bayesian regression and Bitcoin

15 code implementations • 6 Oct 2014 • Devavrat Shah, Kang Zhang

In this paper, we discuss the method of Bayesian regression and its efficacy for predicting price variation of Bitcoin, a recently popularized virtual, cryptographic currency.

Bayesian Inference Binary Classification +2

Cross-Scale Cost Aggregation for Stereo Matching

1 code implementation • CVPR 2014 • Kang Zhang, Yuqiang Fang, Dongbo Min, Lifeng Sun, Shiqiang Yang, Shuicheng Yan, Qi Tian

We first reformulate cost aggregation from a unified optimization perspective and show that different cost aggregation methods essentially differ in their choice of similarity kernel.

Stereo Matching Stereo Matching Hand

Binary Stereo Matching

1 code implementation • 10 Feb 2014 • Kang Zhang, Jiyang Li, Yijing Li, Weidong Hu, Lifeng Sun, Shiqiang Yang

In this paper, we propose a novel binary-based cost computation and aggregation approach for the stereo matching problem.

Computational Efficiency Stereo Matching +1
