1 code implementation • 24 May 2024 • Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, HuaWei Shen, Xueqi Cheng
Firstly, in the layer dimension, we propose non-local block key-value storage to replace local layer key-value storage, increasing the representation ability of key-value pairs and incorporating attention layer knowledge.
1 code implementation • CVPR 2024 • Zihao Wei, Zixuan Pan, Andrew Owens
We propose a simple strategy for masking image patches during visual-language contrastive learning that improves the quality of the learned representations and the training speed.
1 code implementation • 7 Apr 2024 • Zihao Wei, Jingcheng Deng, Liang Pang, Hanxing Ding, HuaWei Shen, Xueqi Cheng
We evaluate the multilingual knowledge editing generalization capabilities of existing methods on MLaKE.
1 code implementation • CVPR 2024 • Mude Hui, Zihao Wei, Hongru Zhu, Fei Xia, Yuyin Zhou
This strategy enriches the diffusion process with structured 3D information, enhancing detail and reducing noise in localized 2D images.
1 code implementation • 20 Feb 2024 • Zihao Wei, Liang Pang, Hanxing Ding, Jingcheng Deng, HuaWei Shen, Xueqi Cheng
The premise of localization results in an incomplete knowledge editing, whereas an isolated assumption may impair both other knowledge and general abilities.
no code implementations • 16 Feb 2024 • Hanxing Ding, Liang Pang, Zihao Wei, HuaWei Shen, Xueqi Cheng
A careful and balanced integration of the parametric knowledge within LLMs with external information is crucial to alleviate hallucinations.
1 code implementation • 24 Jul 2023 • YiQing Wang, Zihan Li, Jieru Mei, Zihao Wei, Li Liu, Chen Wang, Shengtian Sang, Alan Yuille, Cihang Xie, Yuyin Zhou
To address this limitation, we present Masked Multi-view with Swin Transformers (SwinMM), a novel multi-view pipeline for enabling accurate and data-efficient self-supervised medical image analysis.
1 code implementation • 22 May 2023 • Hanxing Ding, Liang Pang, Zihao Wei, HuaWei Shen, Xueqi Cheng, Tat-Seng Chua
Multi-aspect controllable text generation aims to generate fluent sentences that possess multiple desired attributes simultaneously.
1 code implementation • 28 Sep 2022 • Jiaqi Luo, Zihao Wei, Junkai Man, Shixin Xu
Gradient Boosting Machines (GBMs) have demonstrated remarkable success in solving diverse problems by utilizing Taylor expansions in functional space.
1 code implementation • 23 Apr 2022 • Zixuan Pan, Zihao Wei, Yidong Huang, Aditya Gupta
The aim of this paper is to demonstrate the efficacy of using Contrastive Random Walk as a curiosity method to achieve faster convergence to the optimal policy. Contrastive Random Walk defines the transition matrix of a random walk with the help of neural networks.
1 code implementation • 19 Dec 2021 • Zihao Wei, Yidong Huang, Yuang Chen, Chenhao Zheng, Jinnan Gao
In this paper, we present A-ESRGAN, a GAN model for blind SR tasks featuring an attention U-Net based, multi-scale discriminator that can be seamlessly integrated with other generators.