Search Results for author: Kai Zhu

Found 29 papers, 9 papers with code

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

no code implementations3 Jul 2024 Zhantao Yang, Ruili Feng, Keyu Yan, Huangji Wang, Zhicai Wang, Shangwen Zhu, Han Zhang, Jie Xiao, Pingyu Wu, Kai Zhu, Jixuan Chen, Chen-Wei Xie, Chaojie Mao, Yue Yang, Hongyang Zhang, Yu Liu, Fan Cheng

This paper presents Bag-of-Concept Graph (BACON) to gift models with limited linguistic abilities to taste the privilege of Vision Language Models (VLMs) and boost downstream tasks such as detection, visual question answering (VQA), and image generation.

Image Generation Question Answering +1

Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation

no code implementations22 Mar 2024 Jiaheng Yu, Jing Li, Yue He, Kai Zhu, Shuyi Zhang, Wen Hu

Recent methods utilize graph contrastive Learning within graph-structured user-item interaction data for collaborative filtering and have demonstrated their efficacy in recommendation tasks.

Collaborative Filtering Contrastive Learning +1

Intention-driven Ego-to-Exo Video Generation

no code implementations14 Mar 2024 Hongchen Luo, Kai Zhu, Wei Zhai, Yang Cao

Finally, the inferred human movement and high-level action descriptions jointly guide the generation of exocentric motion and interaction content (i. e., corresponding optical flow and occlusion maps) in the backward process of the diffusion model, ultimately warping them into the corresponding exocentric video.

Optical Flow Estimation Stereo Matching +1

HomoFormer: Homogenized Transformer for Image Shadow Removal

1 code implementation CVPR 2024 Jie Xiao, Xueyang Fu, Yurui Zhu, Dong Li, Jie Huang, Kai Zhu, Zheng-Jun Zha

The spatial non-uniformity and diverse patterns of shadow degradation conflict with the weight sharing manner of dominant models which may lead to an unsatisfactory compromise.

Diversity Image Shadow Removal +1

CCM: Adding Conditional Controls to Text-to-Image Consistency Models

no code implementations12 Dec 2023 Jie Xiao, Kai Zhu, Han Zhang, Zhiheng Liu, Yujun Shen, Yu Liu, Xueyang Fu, Zheng-Jun Zha

Consistency Models (CMs) have showed a promise in creating visual content efficiently and with high quality.

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection

1 code implementation4 Dec 2023 Fan Lu, Kai Zhu, Kecheng Zheng, Wei Zhai, Yang Cao

Full-spectrum out-of-distribution (F-OOD) detection aims to accurately recognize in-distribution (ID) samples while encountering semantic and covariate shifts simultaneously.

Out-of-Distribution Detection

Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation

2 code implementations22 Sep 2023 Wei Zhai, Pingyu Wu, Kai Zhu, Yang Cao, Feng Wu, Zheng-Jun Zha

In addition, our method also achieves state-of-the-art weakly supervised semantic segmentation performance on the PASCAL VOC 2012 and MS COCO 2014 datasets.

Object Weakly-Supervised Object Localization +2

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models

no code implementations ICCV 2023 Kecheng Zheng, Wei Wu, Ruili Feng, Kai Zhu, Jiawei Liu, Deli Zhao, Zheng-Jun Zha, Wei Chen, Yujun Shen

To bring the useful knowledge back into light, we first identify a set of parameters that are important to a given downstream task, then attach a binary mask to each parameter, and finally optimize these masks on the downstream data with the parameters frozen.

Eliminating Lipschitz Singularities in Diffusion Models

no code implementations20 Jun 2023 Zhantao Yang, Ruili Feng, Han Zhang, Yujun Shen, Kai Zhu, Lianghua Huang, Yifei Zhang, Yu Liu, Deli Zhao, Jingren Zhou, Fan Cheng

Diffusion models, which employ stochastic differential equations to sample images through integrals, have emerged as a dominant class of generative models.

Cones 2: Customizable Image Synthesis with Multiple Subjects

1 code implementation30 May 2023 Zhiheng Liu, Yifei Zhang, Yujun Shen, Kecheng Zheng, Kai Zhu, Ruili Feng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao

Synthesizing images with user-specified subjects has received growing attention due to its practical applications.

Image Generation

Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection

1 code implementation CVPR 2023 Fan Lu, Kai Zhu, Wei Zhai, Kecheng Zheng, Yang Cao

Semantically coherent out-of-distribution (SCOOD) detection aims to discern outliers from the intended data distribution with access to unlabeled extra set.

Out-of-Distribution Detection

Cones: Concept Neurons in Diffusion Models for Customized Generation

1 code implementation9 Mar 2023 Zhiheng Liu, Ruili Feng, Kai Zhu, Yifei Zhang, Kecheng Zheng, Yu Liu, Deli Zhao, Jingren Zhou, Yang Cao

Concatenating multiple clusters of concept neurons can vividly generate all related concepts in a single image.

Neural Dependencies Emerging from Learning Massive Categories

no code implementations CVPR 2023 Ruili Feng, Kecheng Zheng, Kai Zhu, Yujun Shen, Jian Zhao, Yukun Huang, Deli Zhao, Jingren Zhou, Michael Jordan, Zheng-Jun Zha

Through investigating the properties of the problem solution, we confirm that neural dependency is guaranteed by a redundant logit covariance matrix, which condition is easily met given massive categories, and that neural dependency is highly sparse, implying that one category correlates to only a few others.

Image Classification

FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization

no code implementations24 Mar 2022 Kecheng Zheng, Yang Cao, Kai Zhu, Ruijing Zhao, Zheng-Jun Zha

However, its generalization performance to heterogeneous tasks is inferior to other architectures (e. g., CNNs and transformers) due to the extensive retention of domain information.

Domain Generalization

Self-Paced Imbalance Rectification for Class Incremental Learning

no code implementations8 Feb 2022 Zhiheng Liu, Kai Zhu, Yang Cao

Exemplar-based class-incremental learning is to recognize new classes while not forgetting old ones, whose samples can only be saved in limited memory.

Class Incremental Learning Incremental Learning +1

Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior

1 code implementation3 Dec 2021 Feng Zhang, Yuanjie Shao, Yishi Sun, Kai Zhu, Changxin Gao, Nong Sang

We introduce a Noise Disentanglement Module (NDM) to disentangle the noise and content in the reflectance maps with the reliable aid of unpaired clean images.

Disentanglement Image Restoration +1

Mitigating Greenhouse Gas Emissions Through Generative Adversarial Networks Based Wildfire Prediction

no code implementations20 Aug 2021 Sifat Chowdhury, Kai Zhu, Yu Zhang

Over the past decade, the number of wildfire has increased significantly around the world, especially in the State of California.

Data Augmentation Generative Adversarial Network

FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads

no code implementations23 Sep 2020 Zhen Zheng, Pengzhan Zhao, Guoping Long, Feiwen Zhu, Kai Zhu, Wenyi Zhao, Lansong Diao, Jun Yang, Wei. Lin

We show in this work that memory intensive computations can result in severe performance problems due to off-chip memory access and CPU-GPU context switch overheads in a wide range of deep learning models.

Code Generation

Self-Supervised Tuning for Few-Shot Segmentation

no code implementations12 Apr 2020 Kai Zhu, Wei Zhai, Zheng-Jun Zha, Yang Cao

Few-shot segmentation aims at assigning a category label to each image pixel with few annotated samples.

Meta-Learning Segmentation

One-Shot Texture Retrieval with Global Context Metric

no code implementations16 May 2019 Kai Zhu, Wei Zhai, Zheng-Jun Zha, Yang Cao

In this paper, we tackle one-shot texture retrieval: given an example of a new reference texture, detect and segment all the pixels of the same texture category within an arbitrary image.

Relation Relation Network +2

FusionStitching: Deep Fusion and Code Generation for Tensorflow Computations on GPUs

no code implementations13 Nov 2018 Guoping Long, Jun Yang, Kai Zhu, Wei. Lin

In recent years, there is a surge on machine learning applications in industry.

Distributed, Parallel, and Cluster Computing Mathematical Software

Recognize Complex Events From Static Images by Fusing Deep Channels

no code implementations CVPR 2015 Yuanjun Xiong, Kai Zhu, Dahua Lin, Xiaoou Tang

A considerable portion of web images capture events that occur in our personal lives or social activities.

Collaborative Filtering with Information-Rich and Information-Sparse Entities

no code implementations6 Mar 2014 Kai Zhu, Rui Wu, Lei Ying, R. Srikant

In particular, we consider both the clustering model, where only users (or items) are clustered, and the co-clustering model, where both users and items are clustered, and further, we assume that some users rate many items (information-rich users) and some users rate only a few items (information-sparse users).

Clustering Collaborative Filtering +1

Jointly Clustering Rows and Columns of Binary Matrices: Algorithms and Trade-offs

no code implementations1 Oct 2013 Jiaming Xu, Rui Wu, Kai Zhu, Bruce Hajek, R. Srikant, Lei Ying

In standard clustering problems, data points are represented by vectors, and by stacking them together, one forms a data matrix with row or column cluster structure.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.