Search Results for author: Kai Zhang

Found 201 papers, 97 papers with code

基于异构用户知识融合的隐式情感分析研究(Research on Implicit Sentiment Analysis based on Heterogeneous User Knowledge Fusion)

no code implementations • CCL 2022 • Jian Liao, Kai Zhang, Suge Wang, Jia Lei, Yiyang Zhang

Sentiment Analysis

Paper
Add Code

Multi-Stage Pre-training for Automated Chinese Essay Scoring

no code implementations • EMNLP 2020 • Wei Song, Kai Zhang, Ruiji Fu, Lizhen Liu, Ting Liu, Miaomiao Cheng

This paper proposes a pre-training based automated Chinese essay scoring method.

Domain Adaptation

Paper
Add Code

Evaluation of Retrieval-Augmented Generation: A Survey

1 code implementation • 13 May 2024 • Hao Yu, Aoran Gan, Kai Zhang, Shiwei Tong, Qi Liu, Zhaofeng Liu

We consequently enhanced an extensive survey and proposed an analysis framework for benchmarks of RAG systems, RAGR (Retrieval, Generation, Additional Requirement), designed to systematically analyze RAG benchmarks by focusing on measurable outputs and established truths.

Paper
Code

Point Resampling and Ray Transformation Aid to Editable NeRF Models

no code implementations • 12 May 2024 • Zhenyang Li, Zilong Chen, Feifan Qu, Mingqing Wang, Yizhou Zhao, Kai Zhang, Yifan Peng

In NeRF-aided editing tasks, object movement presents difficulties in supervision generation due to the introduction of variability in object positions.

Paper
Add Code

GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting

no code implementations • 30 Apr 2024 • Kai Zhang, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu

We propose GS-LRM, a scalable large reconstruction model that can predict high-quality 3D Gaussian primitives from 2-4 posed sparse images in 0. 23 seconds on single A100 GPU.

3D Generation

Paper
Add Code

Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras

1 code implementation • 29 Apr 2024 • Jun Yu, Yutong Dai, Xiaokang Liu, Jin Huang, Yishan Shen, Ke Zhang, Rong Zhou, Eashan Adhikarla, Wenxuan Ye, Yixin Liu, Zhaoming Kong, Kai Zhang, Yilong Yin, Vinod Namboodiri, Brian D. Davison, Jason H. Moore, Yong Chen

Overall, we hope this survey provides the research community with a comprehensive overview of the advancements in MTL from its inception in 1997 to the present in 2023.

Multi-Task Learning Recommendation Systems

Paper
Code

Multi-centre normative brain mapping of intracranial EEG lifespan patterns in the human brain

no code implementations • 27 Apr 2024 • Heather Woodhouse, Gerard Hall, Callum Simpson, Csaba Kozma, Frances Turner, Gabrielle M. Schroeder, Beate Diehl, John S. Duncan, Jiajie Mo, Kai Zhang, Aswin Chari, Martin Tisdall, Friederike Moeller, Chris Petkov, Matthew A. Howard, George M. Ibrahim, Elizabeth Donner, Nebras M. Warsi, Raheel Ahmed, Peter N. Taylor, Yujiang Wang

Results: Recording site significantly impacted normative icEEG maps in all frequency bands, and age was a more influential predictor of band power than sex.

EEG

Paper
Add Code

MeshLRM: Large Reconstruction Model for High-Quality Mesh

no code implementations • 18 Apr 2024 • Xinyue Wei, Kai Zhang, Sai Bi, Hao Tan, Fujun Luan, Valentin Deschaintre, Kalyan Sunkavalli, Hao Su, Zexiang Xu

This allows for end-to-end mesh reconstruction by fine-tuning a pre-trained NeRF LRM with mesh rendering.

3D Generation Image to 3D +1

Paper
Add Code

NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

1 code implementation • 15 Apr 2024 • Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou, Hongyu An, Xinfeng Zhang, Zhiyuan Song, Ziyue Dong, Qing Zhao, Xiaogang Xu, Pengxu Wei, Zhi-chao Dou, Gui-ling Wang, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Cansu Korkmaz, A. Murat Tekalp, Yubin Wei, Xiaole Yan, Binren Li, Haonan Chen, Siqi Zhang, Sihan Chen, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi, Anjali Sarvaiya, Pooja Choksy, Jagrit Joshi, Shubh Kawa, Kishor Upla, Sushrut Patwardhan, Raghavendra Ramachandra, Sadat Hossain, Geongi Park, S. M. Nadim Uddin, Hao Xu, Yanhui Guo, Aman Urumbekov, Xingzhuo Yan, Wei Hao, Minghan Fu, Isaac Orais, Samuel Smith, Ying Liu, Wangwang Jia, Qisheng Xu, Kele Xu, Weijun Yuan, Zhan Li, Wenqin Kuang, Ruijin Guan, Ruting Deng, Zhao Zhang, Bo wang, Suiyi Zhao, Yan Luo, Yanyan Wei, Asif Hussain Khan, Christian Micheloni, Niki Martinel

This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained.

Image Super-Resolution valid

Paper
Code

DATENeRF: Depth-Aware Text-based Editing of NeRFs

no code implementations • 6 Apr 2024 • Sara Rojas, Julien Philip, Kai Zhang, Sai Bi, Fujun Luan, Bernard Ghanem, Kalyan Sunkavall

However, extending these techniques to edit scenes in Neural Radiance Fields (NeRF) is complex, as editing individual 2D frames can result in inconsistencies across multiple views.

Paper
Add Code

How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?

1 code implementation • 4 Apr 2024 • Siye Wu, Jian Xie, Jiangjie Chen, Tinghui Zhu, Kai Zhang, Yanghua Xiao

By leveraging the retrieval of information from external knowledge databases, Large Language Models (LLMs) exhibit enhanced capabilities for accomplishing many knowledge-intensive tasks.

Retrieval

Paper
Code

Personalized LLM Response Generation with Parameterized Memory Injection

no code implementations • 4 Apr 2024 • Kai Zhang, Lizhi Qing, Yangyang Kang, Xiaozhong Liu

Large Language Models (LLMs) have exhibited remarkable proficiency in comprehending and generating natural language.

Bayesian Optimisation Response Generation

Paper
Add Code

AddSR: Accelerating Diffusion-based Blind Super-Resolution with Adversarial Diffusion Distillation

1 code implementation • 2 Apr 2024 • Rui Xie, Ying Tai, Kai Zhang, Zhenyu Zhang, Jun Zhou, Jian Yang

Blind super-resolution methods based on stable diffusion showcase formidable generative capabilities in reconstructing clear high-resolution images with intricate details from low-resolution inputs.

Blind Super-Resolution Super-Resolution

Paper
Code

PURPLE: Making a Large Language Model a Better SQL Writer

no code implementations • 29 Mar 2024 • Tonghui Ren, Yuankai Fan, Zhenying He, Ren Huang, Jiaqi Dai, Can Huang, Yinan Jing, Kai Zhang, Yifan Yang, X. Sean Wang

LLMs can learn to organize operator compositions from the input demonstrations for the given task.

Language Modelling Large Language Model +2

Paper
Add Code

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

no code implementations • 28 Mar 2024 • Kai Zhang, Yi Luan, Hexiang Hu, Kenton Lee, Siyuan Qiao, Wenhu Chen, Yu Su, Ming-Wei Chang

Image retrieval, i. e., finding desired images given a reference image, inherently encompasses rich, multi-faceted search intents that are difficult to capture solely using image-based measures.

Image Retrieval Implicit Relations +2

Paper
Add Code

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

no code implementations • 20 Mar 2024 • Yibo Wang, Ruiyuan Gao, Kai Chen, Kaiqiang Zhou, Yingjie Cai, Lanqing Hong, Zhenguo Li, Lihui Jiang, Dit-yan Yeung, Qiang Xu, Kai Zhang

Furthermore, image syntheses from DetDiffusion can effectively augment training data, significantly enhancing downstream detection performance.

Attribute Data Augmentation +3

Paper
Add Code

Data-Enabled Predictive Repetitive Control

no code implementations • 18 Mar 2024 • Kai Zhang, Riccardo Zuliani, Efe C. Balta, John Lygeros

This work introduces the Data-Enabled Predictive Repetitive Control (DeePRC) algorithm, a direct data-driven approach for repetitive LTI systems.

Paper
Add Code

Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation

1 code implementation • 27 Feb 2024 • Yuankai Fan, Zhenying He, Tonghui Ren, Can Huang, Yinan Jing, Kai Zhang, X. Sean Wang

While these translation models have greatly improved the overall translation accuracy, surpassing 70% on NLIDB benchmarks, the use of auto-regressive decoding to generate single SQL queries may result in sub-optimal outputs, potentially leading to erroneous translations.

Learning-To-Rank Translation

Paper
Code

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

1 code implementation • 27 Feb 2024 • Yixin Liu, Kai Zhang, Yuan Li, Zhiling Yan, Chujie Gao, Ruoxi Chen, Zhengqing Yuan, Yue Huang, Hanchi Sun, Jianfeng Gao, Lifang He, Lichao Sun

Sora is a text-to-video generative AI model, released by OpenAI in February 2024.

Marketing Video Generation

457

Paper
Code

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models

no code implementations • 20 Feb 2024 • Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Zekun Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Stephen W. Huang, Chenghua Lin, Wenhu Chen, Jie Fu

The advancement of large language models (LLMs) has enhanced the ability to generalize across a wide range of unseen natural language processing (NLP) tasks through instruction-following.

Instruction Following

Paper
Add Code

A Neural-network Enhanced Video Coding Framework beyond ECM

no code implementations • 13 Feb 2024 • Yanchen Zhao, Wenxuan He, Chuanmin Jia, Qizhe Wang, Junru Li, Yue Li, Chaoyi Lin, Kai Zhang, Li Zhang, Siwei Ma

In this paper, a hybrid video compression framework is proposed that serves as a demonstrative showcase of deep learning-based approaches extending beyond the confines of traditional coding methodologies.

Video Compression

Paper
Add Code

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

1 code implementation • 2 Feb 2024 • Jian Xie, Kai Zhang, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su

Are these language agents capable of planning in more complex settings that are out of the reach of prior AI agents?

123

Paper
Code

LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression

no code implementations • 1 Feb 2024 • Wei Jiang, Junru Li, Kai Zhang, Li Zhang

To validate the effectiveness of our proposed LGMC, we integrate it with DCVC-TCM and obtain learned video compression with joint local and global motion compensation (LVC-LGMC).

Motion Compensation Video Compression

Paper
Add Code

Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning

1 code implementation • 31 Jan 2024 • Tinghui Zhu, Kai Zhang, Jian Xie, Yu Su

Recent advancements have significantly augmented the reasoning capabilities of Large Language Models (LLMs) through various methodologies, especially chain-of-thought (CoT) reasoning.

Paper
Code

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology

no code implementations • 29 Jan 2024 • Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Dan Wan, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang

To address this, we introduce PathMMU, the largest and highest-quality expert-validated pathology benchmark for Large Multimodal Models (LMMs).

Paper
Add Code

LKFormer: Large Kernel Transformer for Infrared Image Super-Resolution

1 code implementation • 22 Jan 2024 • Feiwei Qin, Kang Yan, Changmiao Wang, Ruiquan Ge, Yong Peng, Kai Zhang

Given the broad application of infrared technology across diverse fields, there is an increasing emphasis on investigating super-resolution techniques for infrared images within the realm of deep learning.

Image Super-Resolution Infrared image super-resolution

Paper
Code

Objects With Lighting: A Real-World Dataset for Evaluating Reconstruction and Rendering for Object Relighting

1 code implementation • 17 Jan 2024 • Benjamin Ummenhofer, Sanskar Agrawal, Rene Sepulveda, Yixing Lao, Kai Zhang, Tianhang Cheng, Stephan Richter, Shenlong Wang, German Ros

Reconstructing an object from photos and placing it virtually in a new environment goes beyond the standard novel view synthesis task as the appearance of the object has to not only adapt to the novel viewpoint but also to the new lighting conditions and yet evaluations of inverse rendering methods rely on novel view synthesis data or simplistic synthetic datasets for quantitative analysis.

Inverse Rendering Novel View Synthesis

Paper
Code

CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model

no code implementations • 10 Jan 2024 • Yinghui Xing, Litao Qu, Shizhou Zhang, Kai Zhang, Yanning Zhang

Fusion of a panchromatic (PAN) image and corresponding multispectral (MS) image is also known as pansharpening, which aims to combine abundant spatial details of PAN and spectral information of MS. Due to the absence of high-resolution MS images, available deep-learning-based methods usually follow the paradigm of training at reduced resolution and testing at both reduced and full resolution.

Pansharpening

Paper
Add Code

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

1 code implementation • 8 Jan 2024 • Tong Wu, Guandao Yang, Zhibing Li, Kai Zhang, Ziwei Liu, Leonidas Guibas, Dahua Lin, Gordon Wetzstein

These metrics lack the flexibility to generalize to different evaluation criteria and might not align well with human preferences.

3D Generation Text to 3D

185

Paper
Code

UMIE: Unified Multimodal Information Extraction with Instruction Tuning

1 code implementation • 5 Jan 2024 • Lin Sun, Kai Zhang, Qingyuan Li, Renze Lou

Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases.

Paper
Code

MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

no code implementations • 5 Dec 2023 • Renze Lou, Kai Zhang, Jian Xie, Yuxuan Sun, Janice Ahn, Hanzi Xu, Yu Su, Wenpeng Yin

In the realm of large language models (LLMs), enhancing instruction-following capability often involves curating expansive training data.

Instruction Following

Paper
Add Code

Robust Computer Vision in an Ever-Changing World: A Survey of Techniques for Tackling Distribution Shifts

no code implementations • 3 Dec 2023 • Eashan Adhikarla, Kai Zhang, Jun Yu, Lichao Sun, John Nicholson, Brian D. Davison

As a result, it raises concerns about the overall robustness of the machine learning techniques for computer vision applications that are deployed publicly for consumers.

Data Augmentation Transfer Learning

Paper
Add Code

Multi-scale Iterative Refinement towards Robust and Versatile Molecular Docking

no code implementations • 30 Nov 2023 • Jiaxian Yan, Zaixi Zhang, Kai Zhang, Qi Liu

This model is then paired with GPU-accelerated sampling algorithms.

Blind Docking

Paper
Add Code

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

2 code implementations • 27 Nov 2023 • Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning.

Complex Query Answering Logical Reasoning +1

7,236

Paper
Code

Deep Equilibrium Diffusion Restoration with Parallel Sampling

1 code implementation • 20 Nov 2023 • JieZhang Cao, Yue Shi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc van Gool

Due to the inherent property of diffusion models, most existing methods need long serial sampling chains to restore HQ images step-by-step, resulting in expensive sampling time and high computation costs.

Image Restoration

Paper
Code

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

no code implementations • 20 Nov 2023 • Peng Wang, Hao Tan, Sai Bi, Yinghao Xu, Fujun Luan, Kalyan Sunkavalli, Wenping Wang, Zexiang Xu, Kai Zhang

We propose a Pose-Free Large Reconstruction Model (PF-LRM) for reconstructing a 3D object from a few unposed images even with little visual overlap, while simultaneously estimating the relative camera poses in ~1. 3 seconds on a single A100 GPU.

3D Reconstruction Image to 3D +1

Paper
Add Code

MoVideo: Motion-Aware Video Generation with Diffusion Models

no code implementations • 19 Nov 2023 • Jingyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc van Gool, Rakesh Ranjan

While recent years have witnessed great progress on using diffusion models for video generation, most of them are simple extensions of image generation frameworks, which fail to explicitly consider one of the key differences between videos and images, i. e., motion.

Image Generation Image to Video Generation +1

Paper
Add Code

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model

no code implementations • 19 Nov 2023 • Zhenghao Pan, Haijin Zeng, JieZhang Cao, Kai Zhang, Yongyong Chen

Specifically, firstly, we employ a pre-trained diffusion model, which has been trained on a substantial corpus of RGB images, as the generative denoiser within the Plug-and-Play framework for the first time.

Denoising

Paper
Add Code

Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models

1 code implementation • 15 Nov 2023 • Weize Liu, Guocong Li, Kai Zhang, Bang Du, Qiyuan Chen, Xuming Hu, Hongxia Xu, Jintai Chen, Jian Wu

While techniques such as chain-of-thought (CoT) distillation have displayed promise in distilling LLMs into small language models (SLMs), there is a risk that distilled SLMs may still inherit flawed reasoning and hallucinations from LLMs.

Transfer Learning

Paper
Code

DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model

no code implementations • 15 Nov 2023 • Yinghao Xu, Hao Tan, Fujun Luan, Sai Bi, Peng Wang, Jiahao Li, Zifan Shi, Kalyan Sunkavalli, Gordon Wetzstein, Zexiang Xu, Kai Zhang

We propose \textbf{DMV3D}, a novel 3D generation approach that uses a transformer-based 3D large reconstruction model to denoise multi-view diffusion.

3D Generation Denoising +2

Paper
Add Code

Holistic Evaluation of GPT-4V for Biomedical Imaging

no code implementations • 10 Nov 2023 • Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, Jingyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang, Xinyu Wang, Xu Zhang, Lin Zhao, Yiheng Liu, Kai Zhang, Liheng Yan, Lichao Sun, Jun Liu, Ning Qiang, Bao Ge, Xiaoyan Cai, Shijie Zhao, Xintao Hu, Yixuan Yuan, Gang Li, Shu Zhang, Xin Zhang, Xi Jiang, Tuo Zhang, Dinggang Shen, Quanzheng Li, Wei Liu, Xiang Li, Dajiang Zhu, Tianming Liu

GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain.

Anatomy Image Captioning +1

Paper
Add Code

Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model

no code implementations • 10 Nov 2023 • Jiahao Li, Hao Tan, Kai Zhang, Zexiang Xu, Fujun Luan, Yinghao Xu, Yicong Hong, Kalyan Sunkavalli, Greg Shakhnarovich, Sai Bi

Text-to-3D with diffusion models has achieved remarkable progress in recent years.

Text to 3D

Paper
Add Code

LRM: Large Reconstruction Model for Single Image to 3D

1 code implementation • 8 Nov 2023 • Yicong Hong, Kai Zhang, Jiuxiang Gu, Sai Bi, Yang Zhou, Difan Liu, Feng Liu, Kalyan Sunkavalli, Trung Bui, Hao Tan

We propose the first Large Reconstruction Model (LRM) that predicts the 3D model of an object from a single input image within just 5 seconds.

Image to 3D

776

Paper
Code

CDR-Adapter: Learning Adapters to Dig Out More Transferring Ability for Cross-Domain Recommendation Models

no code implementations • 4 Nov 2023 • Yanyu Chen, Yao Yao, Wai Kin Victor Chan, Li Xiao, Kai Zhang, Liang Zhang, Yun Ye

In this paper, we present a scalable and efficient paradigm to address data sparsity and cold-start issues in CDR, named CDR-Adapter, by decoupling the original recommendation model from the mapping function, without requiring re-engineering the network structure.

Recommendation Systems Transfer Learning

Paper
Add Code

On the Opportunities of Green Computing: A Survey

no code implementations • 1 Nov 2023 • You Zhou, Xiujing Lin, Xiang Zhang, Maolin Wang, Gangwei Jiang, Huakang Lu, Yupeng Wu, Kai Zhang, Zhe Yang, Kehang Wang, Yongduo Sui, Fengwei Jia, Zuoli Tang, Yao Zhao, Hongxuan Zhang, Tiannuo Yang, Weibo Chen, Yunong Mao, Yi Li, De Bao, Yu Li, Hongrui Liao, Ting Liu, Jingwen Liu, Jinchi Guo, Xiangyu Zhao, Ying WEI, Hong Qian, Qi Liu, Xiang Wang, Wai Kin, Chan, Chenliang Li, Yusen Li, Shiyu Yang, Jining Yan, Chao Mou, Shuai Han, Wuxia Jin, Guannan Zhang, Xiaodong Zeng

To tackle the challenges of computing resources and environmental impact of AI, Green Computing has become a hot research topic.

Fairness Speech Synthesis +1

Paper
Add Code

Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V

1 code implementation • 29 Oct 2023 • Zhiling Yan, Kai Zhang, Rong Zhou, Lifang He, Xiang Li, Lichao Sun

In this paper, we critically evaluate the capabilities of the state-of-the-art multimodal large language model, i. e., GPT-4 with Vision (GPT-4V), on Visual Question Answering (VQA) task.

Language Modelling Large Language Model +2

Paper
Code

FERI: A Multitask-based Fairness Achieving Algorithm with Applications to Fair Organ Transplantation

no code implementations • 20 Oct 2023 • Can Li, Dejian Lai, Xiaoqian Jiang, Kai Zhang

Liver transplantation often faces fairness challenges across subgroups defined by sensitive attributes like age group, gender, and race/ethnicity.

Fairness

Paper
Add Code

AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking

1 code implementation • NeurIPS 2023 • Yang Yu, Qi Liu, Kai Zhang, Yuren Zhang, Chao Song, Min Hou, Yuqing Yuan, Zhihao Ye, Zaixi Zhang, Sanshi Lei Yu

Specifically, we adopt a multiple pairwise ranking loss which trains the user model to capture the similarity orders between the implicitly augmented view, the explicitly augmented view, and views from other users.

Contrastive Learning Data Augmentation

Paper
Code

Multimodal Question Answering for Unified Information Extraction

1 code implementation • 4 Oct 2023 • Yuxuan Sun, Kai Zhang, Yu Su

In addition, the effectiveness of our framework can successfully transfer to the few-shot setting, enhancing LMMs on a scale of 10B parameters to be competitive or outperform much larger language models such as ChatGPT and GPT-4.

Question Answering

Paper
Code

ImagenHub: Standardizing the evaluation of conditional image generation models

2 code implementations • 2 Oct 2023 • Max Ku, Tianle Li, Kai Zhang, Yujie Lu, Xingyu Fu, Wenwen Zhuang, Wenhu Chen

Recently, a myriad of conditional image generation and editing models have been developed to serve different downstream tasks, including text-to-image generation, text-guided image editing, subject-driven image generation, control-guided image generation, etc.

Conditional Image Generation text-guided-image-editing

117

Paper
Code

Subjective Face Transform using Human First Impressions

1 code implementation • 27 Sep 2023 • Chaitanya Roygaga, Joshua Krinsky, Kai Zhang, Kenny Kwok, Aparna Bharati

Humans tend to form quick subjective first impressions of non-physical attributes when seeing someone's face, such as perceived trustworthiness or attractiveness.

Attribute

Paper
Code

Survey on Deep Face Restoration: From Non-blind to Blind and Beyond

1 code implementation • 27 Sep 2023 • Wenjie Li, Mei Wang, Kai Zhang, Juncheng Li, Xiaoming Li, Yuhang Zhang, Guangwei Gao, Weihong Deng, Chia-Wen Lin

We also discuss notable benchmarks commonly utilized in the field.

Image Restoration

Paper
Code

LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination

1 code implementation • 21 Sep 2023 • Kai Zhang, Yangyang Kang, Fubang Zhao, Xiaozhong Liu

We contend that a mere memory module is inadequate and fully training an LLM can be excessively costly.

Paper
Code

Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language Modeling

1 code implementation • 19 Sep 2023 • Junzhe Jiang, Shang Qu, Mingyue Cheng, Qi Liu, Zhiding Liu, Hao Zhang, Rujiao Zhang, Kai Zhang, Rui Li, Jiatong Li, Min Gao

Recommender systems are indispensable in the realm of online applications, and sequential recommendation has enjoyed considerable prevalence due to its capacity to encapsulate the dynamic shifts in user interests.

Language Modelling Sequential Recommendation +1

Paper
Code

Designs and Implementations in Neural Network-based Video Coding

no code implementations • 11 Sep 2023 • Yue Li, Junru Li, Chaoyi Lin, Kai Zhang, Li Zhang, Franck Galpin, Thierry Dumas, Hongtao Wang, Muhammed Coban, Jacob Ström, Du Liu, Kenneth Andersson

The past decade has witnessed the huge success of deep learning in well-known artificial intelligence applications such as face recognition, autonomous driving, and large language model like ChatGPT.

Autonomous Driving Face Recognition +3

Paper
Add Code

Evaluating Large Language Models for Radiology Natural Language Processing

1 code implementation • 25 Jul 2023 • Zhengliang Liu, Tianyang Zhong, Yiwei Li, Yutong Zhang, Yi Pan, Zihao Zhao, Peixin Dong, Chao Cao, Yuxiao Liu, Peng Shu, Yaonai Wei, Zihao Wu, Chong Ma, Jiaqi Wang, Sheng Wang, Mengyue Zhou, Zuowei Jiang, Chunlin Li, Jason Holmes, Shaochen Xu, Lu Zhang, Haixing Dai, Kai Zhang, Lin Zhao, Yuanhao Chen, Xu Liu, Peilong Wang, Pingkun Yan, Jun Liu, Bao Ge, Lichao Sun, Dajiang Zhu, Xiang Li, Wei Liu, Xiaoyan Cai, Xintao Hu, Xi Jiang, Shu Zhang, Xin Zhang, Tuo Zhang, Shijie Zhao, Quanzheng Li, Hongtu Zhu, Dinggang Shen, Tianming Liu

The rise of large language models (LLMs) has marked a pivotal shift in the field of natural language processing (NLP).

811

Paper
Code

NTIRE 2023 Quality Assessment of Video Enhancement Challenge

no code implementations • 19 Jul 2023 • Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu, Yusheng Zhang, Rongyu Zhang, Hang Shi, Qihang Xu, Longan Xiao, Zhiliang Ma, Mirko Agarla, Luigi Celona, Claudio Rota, Raimondo Schettini, Zhiwei Huang, Yanan Li, Xiaotao Wang, Lei Lei, Hongye Liu, Wei Hong, Ironhead Chuang, Allen Lin, Drake Guan, Iris Chen, Kae Lou, Willy Huang, Yachun Tasi, Yvonne Kao, Haotian Fan, Fangyuan Kong, Shiqi Zhou, Hao liu, Yu Lai, Shanshan Chen, Wenqi Wang, HaoNing Wu, Chaofeng Chen, Chunzheng Zhu, Zekun Guo, Shiling Zhao, Haibing Yin, Hongkui Wang, Hanene Brachemi Meftah, Sid Ahmed Fezza, Wassim Hamidouche, Olivier Déforges, Tengfei Shi, Azadeh Mansouri, Hossein Motamednia, Amir Hossein Bakhtiari, Ahmad Mahmoudi Aznaveh

61 participating teams submitted their prediction results during the development phase, with a total of 3168 submissions.

Deblurring Image Restoration +3

Paper
Add Code

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

1 code implementation • NeurIPS 2023 • Kai Zhang, Lingbo Mo, Wenhu Chen, Huan Sun, Yu Su

To address this issue, we introduce MagicBrush (https://osu-nlp-group. github. io/MagicBrush/), the first large-scale, manually annotated dataset for instruction-guided real image editing that covers diverse scenarios: single-turn, multi-turn, mask-provided, and mask-free editing.

text-guided-image-editing

249

Paper
Code

Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning

no code implementations • 14 Jun 2023 • Xuechen Mu, Hankz Hankui Zhuo, Chen Chen, Kai Zhang, Chao Yu, Jianye Hao

Exploring sparse reward multi-agent reinforcement learning (MARL) environments with traps in a collaborative manner is a complex task.

Multi-agent Reinforcement Learning reinforcement-learning

Paper
Add Code

FedMLSecurity: A Benchmark for Attacks and Defenses in Federated Learning and Federated LLMs

1 code implementation • 8 Jun 2023 • Shanshan Han, Baturalp Buyukates, Zijian Hu, Han Jin, Weizhao Jin, Lichao Sun, Xiaoyang Wang, Wenxuan Wu, Chulin Xie, Yuhang Yao, Kai Zhang, Qifan Zhang, Yuhui Zhang, Carlee Joe-Wong, Salman Avestimehr, Chaoyang He

This paper introduces FedSecurity, an end-to-end benchmark designed to simulate adversarial attacks and corresponding defense mechanisms in Federated Learning (FL).

Federated Learning

4,073

Paper
Code

A Feature Reuse Framework with Texture-adaptive Aggregation for Reference-based Super-Resolution

1 code implementation • 2 Jun 2023 • Xiaoyong Mei, Yi Yang, Ming Li, Changqin Huang, Kai Zhang, Pietro Lió

In this study, we propose a feature reuse framework that guides the step-by-step texture reconstruction process through different stages, reducing the negative impacts of perceptual and adversarial loss.

Image Super-Resolution Reference-based Super-Resolution

Paper
Code

Decomposed Human Motion Prior for Video Pose Estimation via Adversarial Training

no code implementations • 30 May 2023 • Wenshuo Chen, Xiang Zhou, Zhengdi Yu, Weixi Gu, Kai Zhang

Estimating human pose from video is a task that receives considerable attention due to its applicability in numerous 3D fields.

Ranked #60 on 3D Human Pose Estimation on 3DPW (PA-MPJPE metric)

3D Human Pose Estimation

Paper
Add Code

BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks

1 code implementation • 26 May 2023 • Kai Zhang, Jun Yu, Eashan Adhikarla, Rong Zhou, Zhiling Yan, Yixin Liu, Zhengliang Liu, Lifang He, Brian Davison, Xiang Li, Hui Ren, Sunyang Fu, James Zou, Wei Liu, Jing Huang, Chen Chen, Yuyin Zhou, Tianming Liu, Xun Chen, Yong Chen, Quanzheng Li, Hongfang Liu, Lichao Sun

Conventional task- and modality-specific artificial intelligence (AI) models are inflexible in real-world deployment and maintenance for biomedicine.

Ranked #1 on Text Summarization on MeQSum

Image Captioning Medical Visual Question Answering +5

293

Paper
Code

PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology

1 code implementation • 24 May 2023 • Yuxuan Sun, Chenglu Zhu, Sunyi Zheng, Kai Zhang, Lin Sun, Zhongyi Shui, Yunlong Zhang, Honglin Li, Lin Yang

Secondly, by leveraging the collected data, we construct PathCLIP, a pathology-dedicated CLIP, to enhance PathAsst's capabilities in interpreting pathology images.

Instruction Following Language Modelling +1

Paper
Code

Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts

1 code implementation • 22 May 2023 • Jian Xie, Kai Zhang, Jiangjie Chen, Renze Lou, Yu Su

By providing external information to large language models (LLMs), tool augmentation (including retrieval augmentation) has emerged as a promising solution for addressing the limitations of LLMs' static parametric memory.

Retrieval

Paper
Code

Equivariant Multi-Modality Image Fusion

3 code implementations • 19 May 2023 • Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Kai Zhang, Shuang Xu, Dongdong Chen, Radu Timofte, Luc van Gool

These components enable the net training to follow the principles of the natural sensing-imaging process while satisfying the equivariant imaging prior.

Self-Supervised Learning

325

Paper
Code

Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors

1 code implementation • 18 May 2023 • Kai Zhang, Bernal Jiménez Gutiérrez, Yu Su

Recent work has shown that fine-tuning large language models (LLMs) on large-scale instruction-following datasets substantially improves their performance on a wide range of NLP tasks, especially in the zero-shot setting.

Ranked #1 on Relation Extraction on SemEval-2010 Task 8

Instruction Following Question Answering +2

Paper
Code

Denoising Diffusion Models for Plug-and-Play Image Restoration

2 code implementations • 15 May 2023 • Yuanzhi Zhu, Kai Zhang, Jingyun Liang, JieZhang Cao, Bihan Wen, Radu Timofte, Luc van Gool

Although diffusion models have shown impressive performance for high-quality image synthesis, their potential to serve as a generative denoiser prior to the plug-and-play IR methods remains to be further explored.

Deblurring Denoising +4

314

Paper
Code

Automatic Evaluation of Attribution by Large Language Models

1 code implementation • 10 May 2023 • Xiang Yue, Boshi Wang, Ziru Chen, Kai Zhang, Yu Su, Huan Sun

We manually curate a set of test examples covering 12 domains from a generative search engine, New Bing.

Fact Checking Language Modelling +3

Paper
Code

Sensitive Data Detection with High-Throughput Machine Learning Models in Electrical Health Records

no code implementations • 30 Apr 2023 • Kai Zhang, Xiaoqian Jiang

Based on this novel finding, we engineered over 30 features from the metadata of the original features and used machine learning to build classification models to automatically identify PHI fields in structured Electronic Health Record (EHR) data.

De-identification

Paper
Add Code

Ray Conditioning: Trading Photo-consistency for Photo-realism in Multi-view Image Generation

no code implementations • ICCV 2023 • Eric Ming Chen, Sidhanth Holalkere, Ruyu Yan, Kai Zhang, Abe Davis

Multi-view image generation attracts particular attention these days due to its promising 3D-related applications, e. g., image viewpoint editing.

Image Generation

Paper
Add Code

A Scalable Test Problem Generator for Sequential Transfer Optimization

2 code implementations • 17 Apr 2023 • Xiaoming Xue, Cuie Yang, Liang Feng, Kai Zhang, Linqi Song, Kay Chen Tan

Lastly, a benchmark suite with 12 STO problems featured by a variety of customized similarity relationships is developed using the proposed generator.

Paper
Code

Exploring Causes of Demographic Variations In Face Recognition Accuracy

no code implementations • 14 Apr 2023 • Gabriella Pangelinan, K. S. Krishnapriya, Vitor Albiero, Grace Bezold, Kai Zhang, Kushal Vangara, Michael C. King, Kevin W. Bowyer

In recent years, media reports have called out bias and racism in face recognition technology.

Face Recognition

Paper
Add Code

Predicting multiple sclerosis disease severity with multimodal deep neural networks

1 code implementation • 8 Apr 2023 • Kai Zhang, John A. Lincoln, Xiaoqian Jiang, Elmer V. Bernstam, Shayan Shams

Multiple Sclerosis (MS) is a chronic disease developed in human brain and spinal cord, which can cause permanent damage or deterioration of the nerves.

Disease Prediction

Paper
Code

A Transformer-Based Deep Learning Approach for Fairly Predicting Post-Liver Transplant Risk Factors

no code implementations • 5 Apr 2023 • Can Li, Xiaoqian Jiang, Kai Zhang

Specifically, we proposed a deep-learning model to predict multiple risk factors after a liver transplant.

Fairness Multi-Task Learning

Paper
Add Code

Multi-Task Learning for Post-transplant Cause of Death Analysis: A Case Study on Liver Transplant

no code implementations • 30 Mar 2023 • Sirui Ding, Qiaoyu Tan, Chia-Yuan Chang, Na Zou, Kai Zhang, Nathan R. Hoot, Xiaoqian Jiang, Xia Hu

Organ transplant is the essential treatment method for some end-stage diseases, such as liver failure.

Decision Making Multi-Task Learning

Paper
Add Code

Towards Fair Patient-Trial Matching via Patient-Criterion Level Fairness Constraint

no code implementations • 24 Mar 2023 • Chia-Yuan Chang, Jiayi Yuan, Sirui Ding, Qiaoyu Tan, Kai Zhang, Xiaoqian Jiang, Xia Hu, Na Zou

To tackle these challenges, deep learning frameworks have been created to match patients to trials.

Fairness

Paper
Add Code

ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge

1 code implementation • 24 Mar 2023 • Yunxiang Li, Zihan Li, Kai Zhang, Ruilong Dan, Steve Jiang, You Zhang

The primary aim of this research was to address the limitations observed in the medical knowledge of prevalent large language models (LLMs) such as ChatGPT, by creating a specialized language model with enhanced accuracy in medical advice.

Information Retrieval Language Modelling +3

3,377

Paper
Code

A Comprehensive Survey on Instruction Following

1 code implementation • 18 Mar 2023 • Renze Lou, Kai Zhang, Wenpeng Yin

This survey paper tries to summarize and provide insights to the current research on instruction following, particularly, by answering the following questions: (i) What is task instruction, and what instruction types exist?

Instruction Following

405

Paper
Code

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion

3 code implementations • ICCV 2023 • Zixiang Zhao, Haowen Bai, Yuanzhi Zhu, Jiangshe Zhang, Shuang Xu, Yulun Zhang, Kai Zhang, Deyu Meng, Radu Timofte, Luc van Gool

To leverage strong generative priors and address challenges such as unstable training and lack of interpretability for GAN-based generative methods, we propose a novel fusion algorithm based on the denoising diffusion probabilistic model (DDPM).

Denoising

325

Paper
Code

QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression

6 code implementations • 10 Mar 2023 • Kedeng Tong, Yaojun Wu, Yue Li, Kai Zhang, Li Zhang, Xin Jin

In this paper, we present a Quantization-error-aware Variable Rate Framework (QVRF) that utilizes a univariate quantization regulator a to achieve wide-range variable rates within a single model.

Image Compression Quantization

Paper
Code

Memory-adaptive Depth-wise Heterogenous Federated Learning

1 code implementation • 8 Mar 2023 • Kai Zhang, Yutong Dai, Hongyi Wang, Eric Xing, Xun Chen, Lichao Sun

Federated learning is a promising paradigm that allows multiple clients to collaboratively train a model without sharing the local data.

Federated Learning

Paper
Code

Adversarial Modality Alignment Network for Cross-Modal Molecule Retrieval

1 code implementation • IEEE Transactions on Artificial Intelligence 2023 • Wenyu Zhao, Dong Zhou, Buqing Cao, Kai Zhang, Jinjun Chen

Our method utilizes a SciBERT as a text encoder and a graph transformer network as a molecule encoder to generate multimodal representations.

Ranked #1 on Cross-Modal Retrieval on ChEBI-20

Contrastive Learning Cross-Modal Retrieval +1

Paper
Code

Securing Biomedical Images from Unauthorized Training with Anti-Learning Perturbation

no code implementations • 5 Mar 2023 • Yixin Liu, Haohui Ye, Kai Zhang, Lichao Sun

The volume of open-source biomedical data has been essential to the development of various spheres of the healthcare community since more `free' data can provide individual researchers more chances to contribute.

Paper
Add Code

Fairly Predicting Graft Failure in Liver Transplant for Organ Assigning

no code implementations • 18 Feb 2023 • Sirui Ding, Ruixiang Tang, Daochen Zha, Na Zou, Kai Zhang, Xiaoqian Jiang, Xia Hu

To tackle this problem, this work proposes a fair machine learning framework targeting graft failure prediction in liver transplant.

Fairness Knowledge Distillation

Paper
Add Code

A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT

no code implementations • 18 Feb 2023 • Ce Zhou, Qian Li, Chen Li, Jun Yu, Yixin Liu, Guangjing Wang, Kai Zhang, Cheng Ji, Qiben Yan, Lifang He, Hao Peng, JianXin Li, Jia Wu, Ziwei Liu, Pengtao Xie, Caiming Xiong, Jian Pei, Philip S. Yu, Lichao Sun

This study provides a comprehensive review of recent research advancements, challenges, and opportunities for PFMs in text, image, graph, as well as other data modalities.

Graph Learning Language Modelling +1

Paper
Add Code

Building Shortcuts between Distant Nodes with Biaffine Mapping for Graph Convolutional Networks

no code implementations • 17 Feb 2023 • Acong Zhang, Jincheng Huang, Ping Li, Kai Zhang

Multiple recent studies show a paradox in graph convolutional networks (GCNs), that is, shallow architectures limit the capability of learning information from high-order neighbors, while deep architectures suffer from over-smoothing or over-squashing.

Contrastive Learning Node Classification +1

Paper
Add Code

Event-Based Frame Interpolation with Ad-hoc Deblurring

no code implementations • CVPR 2023 • Lei Sun, Christos Sakaridis, Jingyun Liang, Peng Sun, JieZhang Cao, Kai Zhang, Qi Jiang, Kaiwei Wang, Luc van Gool

The performance of video frame interpolation is inherently correlated with the ability to handle motion in the input scene.

Deblurring Image Deblurring +1

Paper
Add Code

Joint Spatio-Temporal Modeling for the Semantic Change Detection in Remote Sensing Images

1 code implementation • 10 Dec 2022 • Lei Ding, Jing Zhang, Kai Zhang, Haitao Guo, Bing Liu, Lorenzo Bruzzone

Semantic Change Detection (SCD) refers to the task of simultaneously extracting the changed areas and the semantic categories (before and after the changes) in Remote Sensing Images (RSIs).

Ranked #1 on Change Detection on SECOND

Change Detection

Paper
Code

CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution

1 code implementation • CVPR 2023 • JieZhang Cao, Qin Wang, Yongqin Xian, Yawei Li, Bingbing Ni, Zhiming Pi, Kai Zhang, Yulun Zhang, Radu Timofte, Luc van Gool

We explicitly design an implicit attention network to learn the ensemble weights for the nearby local features.

Image Super-Resolution

103

Paper
Code

GreenEyes: An Air Quality Evaluating Model based on WaveNet

1 code implementation • 8 Dec 2022 • Kan Huang, Kai Zhang, Ming Liu

Accompanying rapid industrialization, humans are suffering from serious air pollution problems.

Paper
Code

SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization

1 code implementation • NIPS 2022 • Zheng Chuanyang, Zheyang Li, Kai Zhang, Zhi Yang, Wenming Tan, Jun Xiao, Ye Ren, ShiLiang Pu

In this paper, we introduce joint importance, which integrates essential structural-aware interactions between components for the first time, to perform collaborative pruning.

object-detection Object Detection

Paper
Code

Changes from Classical Statistics to Modern Statistics and Data Science

no code implementations • 30 Oct 2022 • Kai Zhang, Shan Liu, Momiao Xiong

We urgently need to shift the paradigm for data analysis from the classical Euclidean data analysis to both Euclidean and non Euclidean data analysis and develop more and more innovative methods for describing, estimating and inferring non Euclidean geometries of modern real datasets.

Paper
Add Code

BELIEF in Dependence: Leveraging Atomic Linearity in Data Bits for Rethinking Generalized Linear Models

no code implementations • 19 Oct 2022 • Benjamin Brown, Kai Zhang, Xiao-Li Meng

Two linearly uncorrelated binary variables must be also independent because non-linear dependence cannot manifest with only two possible states.

Paper
Add Code

DBT-DMAE: An Effective Multivariate Time Series Pre-Train Model under Missing Data

no code implementations • 16 Sep 2022 • Kai Zhang, Qinmin Yang, Chao Li

Multivariate time series(MTS) is a universal data type related to many practical applications.

Time Series Time Series Analysis

Paper
Add Code

LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval

1 code implementation • 29 Aug 2022 • Kai Zhang, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang

The alignment is achieved by weakened knowledge distillations to enlighten the retriever via two aspects -- 1) a lexicon-augmented contrastive objective to challenge the dense encoder and 2) a pair-wise rank-consistent regularization to make dense model's behavior incline to the other.

Representation Learning Retrieval

Paper
Code

Learning Task-Oriented Flows to Mutually Guide Feature Alignment in Synthesized and Real Video Denoising

no code implementations • 25 Aug 2022 • JieZhang Cao, Qin Wang, Jingyun Liang, Yulun Zhang, Kai Zhang, Radu Timofte, Luc van Gool

To this end, we propose a new multi-scale refined optical flow-guided video denoising method, which is more robust to different noise levels.

Ranked #1 on Video Denoising on VideoLQ

Denoising Optical Flow Estimation +1

Paper
Add Code

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

no code implementations • 24 Aug 2022 • Guangqi Xie, Xin Li, Shiqi Lin, Li Zhang, Kai Zhang, Yue Li, Zhibo Chen

In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC.

Hierarchical Reinforcement Learning reinforcement-learning +3

Paper
Add Code

Query-Response Interactions by Multi-tasks in Semantic Search for Chatbot Candidate Retrieval

no code implementations • 23 Aug 2022 • Libin Shi, Kai Zhang, Wenge Rong

Semantic search for candidate retrieval is an important yet neglected problem in retrieval-based Chatbots, which aims to select a bunch of candidate responses efficiently from a large pool.

Chatbot Retrieval

Paper
Add Code

CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations

no code implementations • COLING 2022 • Borun Chen, Hongyin Tang, Jiahao Bu, Kai Zhang, Jingang Wang, Qifan Wang, Hai-Tao Zheng, Wei Wu, Liqian Yu

However, most current models use Chinese characters as inputs and are not able to encode semantic information contained in Chinese words.

Contrastive Learning Language Modelling +1

Paper
Add Code

Enhancing the Robustness via Adversarial Learning and Joint Spatial-Temporal Embeddings in Traffic Forecasting

2 code implementations • 5 Aug 2022 • Juyong Jiang, Binqing Wu, Ling Chen, Kai Zhang, Sunghun Kim

On the one hand, our model simultaneously incorporates spatial (node-wise) embeddings and temporal (time-wise) embeddings to account for heterogeneous space-and-time convolutions; on the other hand, it uses GAN structure to systematically evaluate statistical consistencies between the real and the predicted time series in terms of both the temporal trending and the complex spatial-temporal dependencies.

Time Series Time Series Analysis

Paper
Code

Unified Normalization for Accelerating and Stabilizing Transformers

1 code implementation • 2 Aug 2022 • Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, ShiLiang Pu

To tackle these issues, we propose Unified Normalization (UN), which can speed up the inference by being fused with other linear operations and achieve comparable performance on par with LN.

Paper
Code

Reference-based Image Super-Resolution with Deformable Attention Transformer

1 code implementation • 25 Jul 2022 • JieZhang Cao, Jingyun Liang, Kai Zhang, Yawei Li, Yulun Zhang, Wenguan Wang, Luc van Gool

Reference-based image super-resolution (RefSR) aims to exploit auxiliary reference (Ref) images to super-resolve low-resolution (LR) images.

Ranked #1 on Reference-based Super-Resolution on CUFED5 - 4x upscaling

Image Super-Resolution Reference-based Super-Resolution

124

Paper
Code

Towards Interpretable Video Super-Resolution via Alternating Optimization

1 code implementation • 21 Jul 2022 • JieZhang Cao, Jingyun Liang, Kai Zhang, Wenguan Wang, Qin Wang, Yulun Zhang, Hao Tang, Luc van Gool

These issues can be alleviated by a cascade of three separate sub-tasks, including video deblurring, frame interpolation, and super-resolution, which, however, would fail to capture the spatial and temporal correlations among video sequences.

Deblurring Space-time Video Super-resolution +2

Paper
Code

Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss

1 code implementation • 21 Jun 2022 • Yikang Ding, Zhenyang Li, Dihe Huang, Zhiheng Li, Kai Zhang

Learning-based multi-view stereo (MVS) methods have made impressive progress and surpassed traditional methods in recent years.

Contrastive Learning

259

Paper
Code

ARF: Artistic Radiance Fields

1 code implementation • 13 Jun 2022 • Kai Zhang, Nick Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, Noah Snavely

We present a method for transferring the artistic features of an arbitrary style image to a 3D scene.

480

Paper
Code

Recurrent Video Restoration Transformer with Guided Deformable Attention

3 code implementations • 5 Jun 2022 • Jingyun Liang, Yuchen Fan, Xiaoyu Xiang, Rakesh Ranjan, Eddy Ilg, Simon Green, JieZhang Cao, Kai Zhang, Radu Timofte, Luc van Gool

Specifically, RVRT divides the video into multiple clips and uses the previously inferred clip feature to estimate the subsequent clip feature.

Ranked #1 on Video Super-Resolution on Vid4 - 4x upscaling - BD degradation

Analog Video Restoration Deblurring +3

332

Paper
Code

MORE: A Metric Learning Based Framework for Open-domain Relation Extraction

1 code implementation • 1 Jun 2022 • Yutong Wang, Renze Lou, Kai Zhang, MaoYan Chen, Yujiu Yang

To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-based Open Relation Extraction).

Clustering Metric Learning +2

Paper
Code

WT-MVSNet: Window-based Transformers for Multi-view Stereo

no code implementations • 28 May 2022 • Jinli Liao, Yikang Ding, Yoli Shavit, Dihe Huang, Shihao Ren, Jia Guo, Wensen Feng, Kai Zhang

In this work, we propose Window-based Transformers (WT) for local feature matching and global feature aggregation in multi-view stereo.

Paper
Add Code

UnifieR: A Unified Retriever for Large-Scale Retrieval

no code implementations • 23 May 2022 • Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Guodong Long, Kai Zhang, Daxin Jiang

Large-scale retrieval is to recall relevant documents from a huge collection given a query.

Passage Retrieval Representation Learning +1

Paper
Add Code

Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification

no code implementations • 18 May 2022 • Kai Zhang, Qi Liu, Zhenya Huang, Mingyue Cheng, Kun Zhang, Mengdi Zhang, Wei Wu, Enhong Chen

Existing studies in this task attach more attention to the sequence modeling of sentences while largely ignoring the rich domain-invariant semantics embedded in graph structures (i. e., the part-of-speech tags and dependency relations).

Classification Graph Attention +4

Paper
Add Code

AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation

1 code implementation • 18 May 2022 • Juyong Jiang, Peiyan Zhang, Yingtao Luo, Chaozhuo Li, Jae Boum Kim, Kai Zhang, Senzhang Wang, Xing Xie, Sunghun Kim

Sequential recommendation (SR) aims to model users dynamic preferences from a series of interactions.

Inductive Bias Sequential Recommendation

Paper
Code

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

117

Paper
Code

Adversarial Learning of Hard Positives for Place Recognition

no code implementations • 8 May 2022 • Wenxuan Fang, Kai Zhang, Yoli Shavit, Wensen Feng

Our method learns local and global augmentation policies which will increase the training loss, while the image retrieval network is forced to learn more powerful features for discriminating increasingly difficult examples.

Image Retrieval Retrieval

Paper
Add Code

Discussion of Multiscale Fisher's Independence Test for Multivariate Dependence

no code implementations • 26 Apr 2022 • Duyeol Lee, Helal El-Zaatari, Michael R. Kosorok, Xinyi Li, Kai Zhang

In this comment, we would like to discuss a general framework unifying the MULTIFIT and other tests and compare it with the binary expansion randomized ensemble test (BERET hereafter) proposed by Lee et al. (In press).

Paper
Add Code

ClusterGNN: Cluster-based Coarse-to-Fine Graph Neural Network for Efficient Feature Matching

no code implementations • CVPR 2022 • Yan Shi, Jun-Xiong Cai, Yoli Shavit, Tai-Jiang Mu, Wensen Feng, Kai Zhang

Graph Neural Networks (GNNs) with attention have been successfully applied for learning visual feature matching.

Paper
Add Code

Indoor simultaneous localization and mapping based on fringe projection profilometry

no code implementations • 23 Apr 2022 • Yang Zhao, Kai Zhang, Haotian Yu, Yi Zhang, Dongliang Zheng, Jing Han

Simultaneous Localization and Mapping (SLAM) plays an important role in outdoor and indoor applications ranging from autonomous driving to indoor robotics.

Autonomous Driving Simultaneous Localization and Mapping

Paper
Add Code

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

2 code implementations • 20 Apr 2022 • Ren Yang, Radu Timofte, Meisong Zheng, Qunliang Xing, Minglang Qiao, Mai Xu, Lai Jiang, Huaida Liu, Ying Chen, Youcheng Ben, Xiao Zhou, Chen Fu, Pei Cheng, Gang Yu, Junyi Li, Renlong Wu, Zhilu Zhang, Wei Shang, Zhengyao Lv, Yunjin Chen, Mingcai Zhou, Dongwei Ren, Kai Zhang, WangMeng Zuo, Pavel Ostyakov, Vyal Dmitry, Shakarim Soltanayev, Chervontsev Sergey, Zhussip Magauiya, Xueyi Zou, Youliang Yan, Pablo Navarrete Michelini, Yunhua Lu, Diankai Zhang, Shaoli Liu, Si Gao, Biao Wu, Chengjian Zheng, Xiaofeng Zhang, Kaidi Lu, Ning Wang, Thuong Nguyen Canh, Thong Bach, Qing Wang, Xiaopeng Sun, Haoyu Ma, Shijie Zhao, Junlin Li, Liangbin Xie, Shuwei Shi, Yujiu Yang, Xintao Wang, Jinjin Gu, Chao Dong, Xiaodi Shi, Chunmei Nian, Dong Jiang, Jucai Lin, Zhihuai Xie, Mao Ye, Dengyan Luo, Liuhan Peng, Shengjie Chen, Qian Wang, Xin Liu, Boyang Liang, Hang Dong, Yuhao Huang, Kai Chen, Xingbei Guo, Yujing Sun, Huilei Wu, Pengxu Wei, Yulin Huang, Junying Chen, Ik Hyun Lee, Sunder Ali Khowaja, Jiseok Yoon

This challenge includes three tracks.

Super-Resolution

Paper
Code

IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images

no code implementations • CVPR 2022 • Kai Zhang, Fujun Luan, Zhengqi Li, Noah Snavely

We propose a neural inverse rendering pipeline called IRON that operates on photometric images and outputs high-quality 3D content in the format of triangle meshes and material textures readily deployable in existing graphics pipelines.

Disentanglement Inverse Rendering

Paper
Add Code

Selecting task with optimal transport self-supervised learning for few-shot classification

no code implementations • 1 Apr 2022 • Renjie Xu, Xinghao Yang, BaoDi Liu, Kai Zhang, Weifeng Liu

Few-Shot classification aims at solving problems that only a few samples are available in the training process.

Data Augmentation Few-Shot Learning +1

Paper
Add Code

Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis

no code implementations • Findings (ACL) 2022 • Kai Zhang, Kun Zhang, Mengdi Zhang, Hongke Zhao, Qi Liu, Wei Wu, Enhong Chen

Aspect-based sentiment analysis (ABSA) predicts sentiment polarity towards a specific aspect in the given sentence.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Paper
Add Code

APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction

1 code implementation • 30 Mar 2022 • Bencheng Yan, Pengjie Wang, Kai Zhang, Feng Li, Hongbo Deng, Jian Xu, Bo Zheng

In many web applications, deep learning-based CTR prediction models (deep CTR models for short) are widely adopted.

Click-Through Rate Prediction

796

Paper
Code

Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis

2 code implementations • 24 Mar 2022 • Kai Zhang, Yawei Li, Jingyun Liang, JieZhang Cao, Yulun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, Luc van Gool

While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved.

Ranked #1 on Image Denoising on urban100 sigma15

Image Denoising Image-to-Image Translation

592

Paper
Code

Efficient Federated Learning on Knowledge Graphs via Privacy-preserving Relation Embedding Aggregation

1 code implementation • 17 Mar 2022 • Kai Zhang, Yu Wang, Hongyi Wang, Lifu Huang, Carl Yang, Xun Chen, Lichao Sun

Furthermore, we propose a Federated learning paradigm with privacy-preserving Relation embedding aggregation (FedR) to tackle the privacy issue in FedE.

Entity Embeddings Federated Learning +4

Paper
Code

Phased Flight Trajectory Prediction with Deep Learning

no code implementations • 17 Mar 2022 • Kai Zhang, Bowen Chen

The unprecedented increase of commercial airlines and private jets over the next ten years presents a challenge for air traffic control.

Decision Making Management +1

Paper
Add Code

Self-supervised Transparent Liquid Segmentation for Robotic Pouring

1 code implementation • 3 Mar 2022 • Gautham Narayan Narasimhan, Kai Zhang, Ben Eisner, Xingyu Lin, David Held

Liquid state estimation is important for robotics tasks such as pouring; however, estimating the state of transparent liquids is a challenging problem.

Segmentation

Paper
Code

VRT: A Video Restoration Transformer

1 code implementation • 28 Jan 2022 • Jingyun Liang, JieZhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc van Gool

Besides, parallel warping is used to further fuse information from neighboring frames by parallel feature warping.

Ranked #1 on Deblurring on BASED

Deblurring Denoising +7

1,258

Paper
Code

An Adaptive Neuro-Fuzzy System with Integrated Feature Selection and Rule Extraction for High-Dimensional Classification Problems

no code implementations • 10 Jan 2022 • Guangdong Xue, Qin Chang, Jian Wang, Kai Zhang, Nikhil R. Pal

The effectiveness of the FSRE-AdaTSK is demonstrated on 19 datasets of which five are in more than 2000 dimension including two with dimension greater than 7000.

feature selection

Paper
Add Code

Gendered Differences in Face Recognition Accuracy Explained by Hairstyles, Makeup, and Facial Morphology

no code implementations • 29 Dec 2021 • Vítor Albiero, Kai Zhang, Michael C. King, Kevin W. Bowyer

There is consensus in the research literature that face recognition accuracy is lower for females, who often have both a higher false match rate and a higher false non-match rate.

Face Recognition

Paper
Add Code

Improving Sequential Recommendations via Bidirectional Temporal Data Augmentation with Pre-training

1 code implementation • 13 Dec 2021 • Juyong Jiang, Peiyan Zhang, Yingtao Luo, Chaozhuo Li, Jaeboum Kim, Kai Zhang, Senzhang Wang, Sunghun Kim

Our approach leverages bidirectional temporal augmentation and knowledge-enhanced fine-tuning to synthesize authentic pseudo-prior items that \emph{retain user preferences and capture deeper item semantic correlations}, thus boosting the model's expressive power.

Data Augmentation Self-Knowledge Distillation +1

Paper
Code

Multiple Fusion Adaptation: A Strong Framework for Unsupervised Semantic Segmentation Adaptation

1 code implementation • 1 Dec 2021 • Kai Zhang, Yifan Sun, Rui Wang, Haichang Li, Xiaohui Hu

MFA basically considers three parallel information fusion strategies, i. e., the cross-model fusion, temporal fusion and a novel online-offline pseudo label fusion.

Ranked #22 on Synthetic-to-Real Translation on GTAV-to-Cityscapes Labels

Pseudo Label Segmentation +3

Paper
Code

Scalable Causal Structure Learning: Scoping Review of Traditional and Deep Learning Algorithms and New Opportunities in Biomedicine

no code implementations • 15 Oct 2021 • Pulakesh Upadhyaya, Kai Zhang, Can Li, Xiaoqian Jiang, Yejin Kim

Causal structure learning refers to a process of identifying causal structures from observational data, and it can have multiple applications in biomedicine and health care.

BIG-bench Machine Learning Causal Discovery +1

Paper
Add Code

Towards Flexible Blind JPEG Artifacts Removal

2 code implementations • ICCV 2021 • Jiaxi Jiang, Kai Zhang, Radu Timofte

Training a single deep blind model to handle different quality factors for JPEG image artifacts removal has been attracting considerable attention due to its convenience for practical usage.

Ranked #1 on JPEG Artifact Correction on LIVE1 (Quality 30 Color)

Image Compression Image Compression Artifact Reduction +5

394

Paper
Code

GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks

no code implementations • EMNLP 2021 • Weicheng Ma, Renze Lou, Kai Zhang, Lili Wang, Soroush Vosoughi

Compared to AUTOSEM, a strong baseline method, GradTS improves the performance of MT-DNN with a bert-base-cased backend model, from 0. 33% to 17. 93% on 8 natural language understanding (NLU) tasks in the GLUE benchmarks.

Multi-Task Learning Natural Language Understanding

Paper
Add Code

A Fast PC Algorithm with Reversed-order Pruning and A Parallelization Strategy

no code implementations • 10 Sep 2021 • Kai Zhang, Chao Tian, Kun Zhang, Todd Johnson, Xiaoqian Jiang

The PC algorithm is the state-of-the-art algorithm for causal structure discovery on observational data.

Paper
Add Code

Learning Effective and Efficient Embedding via an Adaptively-Masked Twins-based Layer

no code implementations • 24 Aug 2021 • Bencheng Yan, Pengjie Wang, Kai Zhang, Wei Lin, Kuang-Chih Lee, Jian Xu, Bo Zheng

Each feature value is mapped to an embedding vector via an embedding learning process.

Neural Architecture Search

Paper
Add Code

SwinIR: Image Restoration Using Swin Transformer

9 code implementations • 23 Aug 2021 • Jingyun Liang, JieZhang Cao, Guolei Sun, Kai Zhang, Luc van Gool, Radu Timofte

In particular, the deep feature extraction module is composed of several residual Swin Transformer blocks (RSTB), each of which has several Swin Transformer layers together with a residual connection.

Ranked #2 on Color Image Denoising on urban100 sigma15

Color Image Denoising Grayscale Image Denoising +6

6,233

Paper
Code

SIFN: A Sentiment-aware Interactive Fusion Network for Review-based Item Recommendation

no code implementations • 18 Aug 2021 • Kai Zhang, Hao Qian, Qi Liu, Zhiqiang Zhang, Jun Zhou, Jianhui Ma, Enhong Chen

Specifically, we first encode user/item reviews via BERT and propose a light-weighted sentiment learner to extract semantic features of each review.

Recommendation Systems

Paper
Add Code

Learning to Detect: A Data-driven Approach for Network Intrusion Detection

no code implementations • 18 Aug 2021 • Zachary Tauscher, Yushan Jiang, Kai Zhang, Jian Wang, Houbing Song

With massive data being generated daily and the ever-increasing interconnectivity of the world's Internet infrastructures, a machine learning based intrusion detection system (IDS) has become a vital component to protect our economic and national security.

Network Intrusion Detection Representation Learning

Paper
Add Code

Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks

no code implementations • ACL 2021 • Weicheng Ma, Kai Zhang, Renze Lou, Lili Wang, Soroush Vosoughi

Through extensive experiments, we show that (1) pruning a number of attention heads in a multi-lingual Transformer-based model has, in general, positive effects on its performance in cross-lingual and multi-lingual tasks and (2) the attention heads to be pruned can be ranked using gradients and identified with a few trial experiments.

XLM-R

Paper
Add Code

Federated Variational Learning for Anomaly Detection in Multivariate Time Series

no code implementations • 18 Aug 2021 • Kai Zhang, Yushan Jiang, Lee Seversky, Chengtao Xu, Dahai Liu, Houbing Song

Anomaly detection has been a challenging task given high-dimensional multivariate time series data generated by networked sensors and actuators in Cyber-Physical Systems (CPS).

Anomaly Detection Representation Learning +2

Paper
Add Code

Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling

1 code implementation • ICCV 2021 • Jingyun Liang, Andreas Lugmayr, Kai Zhang, Martin Danelljan, Luc van Gool, Radu Timofte

More specifically, HCFlow learns a bijective mapping between HR and LR image pairs by modelling the distribution of the LR image and the rest high-frequency component simultaneously.

Ranked #25 on Video Super-Resolution on MSU Video Super Resolution Benchmark: Detail Restoration

Image Super-Resolution Video Super-Resolution

186

Paper
Code

Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution

1 code implementation • ICCV 2021 • Jingyun Liang, Guolei Sun, Kai Zhang, Luc van Gool, Radu Timofte

Extensive experiments on synthetic and real images show that the proposed MANet not only performs favorably for both spatially variant and invariant kernel estimation, but also leads to state-of-the-art blind SR performance when combined with non-blind SR methods.

Image Super-Resolution

164

Paper
Code

Video Super-Resolution Transformer

1 code implementation • 12 Jun 2021 • JieZhang Cao, Yawei Li, Kai Zhang, Luc van Gool

Specifically, to tackle the first issue, we present a spatial-temporal convolutional self-attention layer with a theoretical understanding to exploit the locality information.

Optical Flow Estimation Video Super-Resolution

245

Paper
Code

Open Hierarchical Relation Extraction

1 code implementation • NAACL 2021 • Kai Zhang, Yuan YAO, Ruobing Xie, Xu Han, Zhiyuan Liu, Fen Lin, Leyu Lin, Maosong Sun

To establish the bidirectional connections between OpenRE and relation hierarchy, we propose the task of open hierarchical relation extraction and present a novel OHRE framework for the task.

Clustering Relation +1

Paper
Code

LocalViT: Bringing Locality to Vision Transformers

2 code implementations • 12 Apr 2021 • Yawei Li, Kai Zhang, JieZhang Cao, Radu Timofte, Luc van Gool

The importance of locality mechanisms is validated in two ways: 1) A wide range of design choices (activation function, layer placement, expansion ratio) are available for incorporating locality mechanisms and all proper choices can lead to a performance gain over the baseline, and 2) The same locality mechanism is successfully applied to 4 vision transformers, which shows the generalization of the locality concept.

Ranked #623 on Image Classification on ImageNet

Image Classification

108

Paper
Code

PQA: Perceptual Question Answering

1 code implementation • CVPR 2021 • Yonggang Qi, Kai Zhang, Aneeshan Sain, Yi-Zhe Song

Perceptual organization remains one of the very few established theories on the human visual system.

Question Answering

Paper
Code

PhySG: Inverse Rendering with Spherical Gaussians for Physics-based Material Editing and Relighting

no code implementations • CVPR 2021 • Kai Zhang, Fujun Luan, Qianqian Wang, Kavita Bala, Noah Snavely

We present PhySG, an end-to-end inverse rendering pipeline that includes a fully differentiable renderer and can reconstruct geometry, materials, and illumination from scratch from a set of RGB input images.

Ranked #5 on Surface Normals Estimation on Stanford-ORB

Depth Prediction Image Relighting +3

Paper
Add Code

Flow-based Kernel Prior with Application to Blind Super-Resolution

1 code implementation • CVPR 2021 • Jingyun Liang, Kai Zhang, Shuhang Gu, Luc van Gool, Radu Timofte

Kernel estimation is generally one of the key problems for blind image super-resolution (SR).

Blind Super-Resolution Image Super-Resolution

145

Paper
Code

Designing a Practical Degradation Model for Deep Blind Image Super-Resolution

3 code implementations • ICCV 2021 • Kai Zhang, Jingyun Liang, Luc van Gool, Radu Timofte

It is widely acknowledged that single image super-resolution (SISR) methods would not perform well if the assumed degradation model deviates from those in real images.

Ranked #1 on Video Super-Resolution on MSU Video Upscalers: Quality Enhancement

Image Super-Resolution Video Super-Resolution

1,141

Paper
Code

Spatio-Temporal Data Mining for Aviation Delay Prediction

no code implementations • 20 Mar 2021 • Kai Zhang, Yushan Jiang, Dahai Liu, Houbing Song

A key role of collaborative decision making for air traffic scheduling and airspace resource management is the accurate prediction of flight delay.

Decision Making Management +1

Paper
Add Code

BEAUTY Powered BEAST

no code implementations • 1 Mar 2021 • Kai Zhang, Zhigen Zhao, Wen Zhou

To approach this oracle power, we devise the BEAST through a regularized resampling approximation of the oracle test.

Paper
Add Code

Local Change Point Detection and Cleaning of EEMD Signals with Application to Acoustic Shockwaves

no code implementations • 1 Mar 2021 • Kentaro Hoffman, Jonathan M. Lees, Kai Zhang

Using this technique, we demonstrate improved signal cleaning performance for acoustic shockwave signal detection.

Change Point Detection

Paper
Add Code

RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER

1 code implementation • 5 Feb 2021 • Lin Sun, Jiquan Wang, Kai Zhang, Yindu Su, Fangsheng Weng

We integrate soft or hard gates to select visual clues and propose a multitask algorithm to train on the MNER datasets.

Ranked #6 on Multi-modal Named Entity Recognition on Twitter-15

Multi-modal Named Entity Recognition named-entity-recognition +3

Paper
Code

Real-time Prediction for Mechanical Ventilation in COVID-19 Patients using A Multi-task Gaussian Process Multi-objective Self-attention Network

no code implementations • 1 Feb 2021 • Kai Zhang, Siddharth Karanth, Bela Patel, Robert Murphy, Xiaoqian Jiang

We propose a novel in-time risk trajectory predictive model to handle the irregular sampling rate in the data, which follows the dynamics of risk of performing mechanical ventilation for individual patients.

Trajectory Prediction

Paper
Add Code

Convolutional neural networks for fluid flow analysis: toward effective metamodeling and low-dimensionalization

no code implementations • 7 Jan 2021 • Masaki Morimoto, Kai Fukami, Kai Zhang, Aditya G. Nair, Koji Fukagata

We then discuss the influence of various parameters and operations on the CNN performance, with the utilization of autoencoder (AE).

Dimensionality Reduction Fluid Dynamics Computational Physics

Paper
Add Code

Nanoscale spin detection of copper ions using double electron-electron resonance at room temperature

no code implementations • 7 Jan 2021 • Kai Zhang, Shreya Ghosh, Sunil Saxena, M. V. Gurudev Dutt

We report the nanoscale spin detection and electron paramagnetic resonance (EPR) spectrum of copper (Cu$^{2+}$) ions via double electron-electron resonance with single spins in diamond at room temperature and low magnetic fields.

Quantum Physics Mesoscale and Nanoscale Physics

Paper
Add Code

Kinetic Energy Distribution of Fragments for Thermal Neutron-Induced $^{235}$U and $^{239}$Pu Fission Reactions

no code implementations • 24 Dec 2020 • Xiaojun Sun, Haiyuan Peng, Liying Xie, Kai Zhang, Yan Liang, Yinlu Han, Nengchuan Su, Jie Yan, Jun Xiao, Junjie Sun

(2) Every complementary pair of the primary fission fragments is approximatively described as two ellipsoids with large deformation at scission moment.

Nuclear Theory

Paper
Add Code

SDSS-IV/MaNGA: Can impulsive gaseous inflows explain steep oxygen abundance profiles \& anomalously-low-metallicity regions?

no code implementations • 23 Dec 2020 • Zachary J. Pace, Christy Tremonti, Adam L. Schaefer, David V. Stark, Catherine A. Witherspoon, Karen L. Masters, Niv Drory, Kai Zhang

We reveal a mutual correlation between steep oxygen abundance profiles between $0. 25-1. 5 R_e$, increased variability of metallicity between $1. 25-1. 75 R_e$, and elevated HI content at fixed total galaxy stellar mass.

Astrophysics of Galaxies

Paper
Add Code

Multi-Interactive Attention Network for Fine-grained Feature Learning in CTR Prediction

no code implementations • 13 Dec 2020 • Kai Zhang, Hao Qian, Qing Cui, Qi Liu, Longfei Li, Jun Zhou, Jianhui Ma, Enhong Chen

In the Click-Through Rate (CTR) prediction scenario, user's sequential behaviors are well utilized to capture the user interest in the recent literature.

Click-Through Rate Prediction

Paper
Add Code

GMOT-40: A Benchmark for Generic Multiple Object Tracking

1 code implementation • CVPR 2021 • Hexin Bai, Wensheng Cheng, Peng Chu, Juehuan Liu, Kai Zhang, Haibin Ling

Multiple Object Tracking (MOT) has witnessed remarkable advances in recent years.

Multiple Object Tracking Object

Paper
Code

NeRF++: Analyzing and Improving Neural Radiance Fields

5 code implementations • 15 Oct 2020 • Kai Zhang, Gernot Riegler, Noah Snavely, Vladlen Koltun

Neural Radiance Fields (NeRF) achieve impressive view synthesis results for a variety of capture settings, including 360 capture of bounded scenes and forward-facing capture of bounded and unbounded scenes.

1,250

Paper
Code

Cascaded Semantic and Positional Self-Attention Network for Document Classification

no code implementations • Findings of the Association for Computational Linguistics 2020 • Juyong Jiang, Jie Zhang, Kai Zhang

In this work, we propose a new architecture to aggregate the two sources of information using cascaded semantic and positional self-attention network (CSPAN) in the context of document classification.

Classification Document Classification +2

Paper
Add Code

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

3 code implementations • 15 Sep 2020 • Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, Jiji C. V, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Jiangtao Zhang, Xiaotong Luo, Liang Chen, Yanyun Qu, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni

This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results.

Image Super-Resolution

2,739

Paper
Code

Plug-and-Play Image Restoration with Deep Denoiser Prior

4 code implementations • 31 Aug 2020 • Kai Zhang, Yawei Li, WangMeng Zuo, Lei Zhang, Luc van Gool, Radu Timofte

Recent works on plug-and-play image restoration have shown that a denoiser can implicitly serve as the image prior for model-based methods to solve many inverse problems.

Deblurring Demosaicking +1

601

Paper
Code

The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures

1 code implementation • CVPR 2021 • Yawei Li, Wen Li, Martin Danelljan, Kai Zhang, Shuhang Gu, Luc van Gool, Radu Timofte

Based on that, we articulate the heterogeneity hypothesis: with the same training protocol, there exists a layer-wise differentiated network architecture (LW-DNA) that can outperform the original network with regular channel configurations but with a lower level of model complexity.

Image Classification Image Restoration +1

Paper
Code

Structural Landmarking and Interaction Modelling: on Resolution Dilemmas in Graph Classification

no code implementations • 29 Jun 2020 • Kai Zhang, Yaokang Zhu, Jun Wang, Jie Zhang, Hongyuan Zha

Graph neural networks are promising architecture for learning and inference with graph-structured data.

General Classification Graph Classification +1

Paper
Add Code

Learning Context-Based Non-local Entropy Modeling for Image Compression

no code implementations • 10 May 2020 • Mu Li, Kai Zhang, WangMeng Zuo, Radu Timofte, David Zhang

To address this issue, we propose a non-local operation for context modeling by employing the global similarity within the context.

Image Compression

Paper
Add Code

NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results

no code implementations • 3 May 2020 • Kai Zhang, Shuhang Gu, Radu Timofte, Taizhang Shang, Qiuju Dai, Shengchen Zhu, Tong Yang, Yandong Guo, Younghyun Jo, Sejong Yang, Seon Joo Kim, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Jing Liu, Kwangjin Yoon, Taegyun Jeon, Kazutoshi Akita, Takeru Ooba, Norimichi Ukita, Zhipeng Luo, Yuehan Yao, Zhenyu Xu, Dongliang He, Wenhao Wu, Yukang Ding, Chao Li, Fu Li, Shilei Wen, Jianwei Li, Fuzhi Yang, Huan Yang, Jianlong Fu, Byung-Hoon Kim, JaeHyun Baek, Jong Chul Ye, Yuchen Fan, Thomas S. Huang, Junyeop Lee, Bokyeung Lee, Jungki Min, Gwantae Kim, Kanghyu Lee, Jaihyun Park, Mykola Mykhailych, Haoyu Zhong, Yukai Shi, Xiaojun Yang, Zhijing Yang, Liang Lin, Tongtong Zhao, Jinjia Peng, Huibing Wang, Zhi Jin, Jiahao Wu, Yifu Chen, Chenming Shang, Huanrong Zhang, Jeongki Min, Hrishikesh P. S, Densen Puthussery, Jiji C. V

This paper reviews the NTIRE 2020 challenge on perceptual extreme super-resolution with focus on proposed solutions and results.

Image Super-Resolution

Paper
Add Code

Adaptive Structural Fingerprints for Graph Attention Networks

no code implementations • ICLR 2020 • Kai Zhang, Yaokang Zhu, Jun Wang, Jie Zhang

Yet, how to fully exploit rich structural information in the attention mechanism remains a challenge.

Graph Attention

Paper
Add Code

A Method for Curation of Web-Scraped Face Image Datasets

2 code implementations • 7 Apr 2020 • Kai Zhang, Vítor Albiero, Kevin W. Bowyer

The numbers of subjects and images acquired in web-scraped datasets are usually very large, with number of images on the millions scale.

Face Recognition

Paper
Code

Depth Sensing Beyond LiDAR Range

no code implementations • CVPR 2020 • Kai Zhang, Jiaxin Xie, Noah Snavely, Qifeng Chen

Depth sensing is a critical component of autonomous driving technologies, but today's LiDAR- or stereo camera-based solutions have limited range.

Autonomous Driving

Paper
Add Code

Computational Performance of a Germline Variant Calling Pipeline for Next Generation Sequencing

no code implementations • 1 Apr 2020 • Jie Liu, Xiaotian Wu, Kai Zhang, Bing Liu, Renyi Bao, Xiao Chen, Yiran Cai, Yiming Shen, Xinjun He, Jun Yan, Weixing Ji

With the booming of next generation sequencing technology and its implementation in clinical practice and life science research, the need for faster and more efficient data analysis methods becomes pressing in the field of sequencing.

Paper
Add Code

DHP: Differentiable Meta Pruning via HyperNetworks

2 code implementations • ECCV 2020 • Yawei Li, Shuhang Gu, Kai Zhang, Luc van Gool, Radu Timofte

Passing the sparsified latent vectors through the hypernetworks, the corresponding slices of the generated weight parameters can be removed, achieving the effect of network pruning.

Denoising Image Classification +3

Paper
Code

Deep Unfolding Network for Image Super-Resolution

1 code implementation • CVPR 2020 • Kai Zhang, Luc van Gool, Radu Timofte

As a result, the proposed network inherits the flexibility of model-based methods to super-resolve blurry, noisy images for different scale factors via a single model, while maintaining the advantages of learning-based methods.

Image Super-Resolution

838

Paper
Code

How Does Gender Balance In Training Data Affect Face Recognition Accuracy?

1 code implementation • 7 Feb 2020 • Vítor Albiero, Kai Zhang, Kevin W. Bowyer

Deep learning methods have greatly increased the accuracy of face recognition, but an old problem still persists: accuracy is usually higher for men than women.

Face Recognition

Paper
Code

Analysis of Gender Inequality In Face Recognition Accuracy

no code implementations • 31 Jan 2020 • Vítor Albiero, Krishnapriya K. S., Kushal Vangara, Kai Zhang, Michael C. King, Kevin W. Bowyer

We show that the female genuine distribution improves when only female images without facial cosmetics are used, but that the female impostor distribution also degrades at the same time.

Face Recognition

Paper
Add Code

The Binary Expansion Randomized Ensemble Test (BERET)

no code implementations • 8 Dec 2019 • Duyeol Lee, Kai Zhang, Michael R. Kosorok

Recently, the binary expansion testing framework was introduced to test the independence of two continuous random variables by utilizing symmetry statistics that are complete sufficient statistics for dependence.

Paper
Add Code

AIM 2019 Challenge on Constrained Super-Resolution: Methods and Results

2 code implementations • 4 Nov 2019 • Kai Zhang, Shuhang Gu, Radu Timofte, Zheng Hui, Xiumei Wang, Xinbo Gao, Dongliang Xiong, Shuai Liu, Ruipeng Gang, Nan Nan, Chenghua Li, Xueyi Zou, Ning Kang, Zhan Wang, Hang Xu, Chaofeng Wang, Zheng Li, Lin-Lin Wang, Jun Shi, Wenyu Sun, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Yazhe Niu, Peijin Zhuo, Xiangzhen Kong, Long Sun, Wenhao Wang

The challenge had 3 tracks.

Image Super-Resolution

415

Paper
Code

Unsupervised Context Rewriting for Open Domain Conversation

no code implementations • IJCNLP 2019 • Kun Zhou, Kai Zhang, Yu Wu, Shujie Liu, Jingsong Yu

Context modeling has a pivotal role in open domain conversation.

Decoder Reinforcement Learning (RL) +2

Paper
Add Code

Exploring Overall Contextual Information for Image Captioning in Human-Like Cognitive Style

no code implementations • ICCV 2019 • Hongwei Ge, Zehang Yan, Kai Zhang, Mingde Zhao, Liang Sun

In the training process, the forward and backward LSTMs encode the succeeding and preceding words into their respective hidden states by simultaneously constructing the whole sentence in a complementary manner.

Decoder Image Captioning +1

Paper
Add Code

Leveraging Vision Reconstruction Pipelines for Satellite Imagery

no code implementations • 7 Oct 2019 • Kai Zhang, Jin Sun, Noah Snavely

Reconstructing 3D geometry from satellite imagery is an important topic of research.

3D Reconstruction

Paper
Add Code

Neural Blind Deconvolution Using Deep Priors

1 code implementation • CVPR 2020 • Dongwei Ren, Kai Zhang, Qilong Wang, QinGhua Hu, WangMeng Zuo

To connect MAP and deep models, we in this paper present two generative networks for respectively modeling the deep priors of clean image and blur kernel, and propose an unconstrained neural optimization solution to blind deconvolution.

Deblurring Self-Supervised Learning

330

Paper
Code

A Convolutional Forward and Back-Projection Model for Fan-Beam Geometry

no code implementations • 24 Jul 2019 • Kai Zhang, Alireza Entezari

Iterative methods for tomographic image reconstruction have great potential for enabling high quality imaging from low-dose projection data.

Image Reconstruction

Paper
Add Code

TOI-CNN: a Solution of Information Extraction on Chinese Insurance Policy

no code implementations • NAACL 2019 • Lin Sun, Kai Zhang, Fule Ji, Zhenhua Yang

The advantage of TOI pooling layer is that the nested elements from one sentence could share computation and context in the forward and backward passes.

Sentence

Paper
Add Code

A Reference Vector based Many-Objective Evolutionary Algorithm with Feasibility-aware Adaptation

no code implementations • 12 Apr 2019 • Mingde Zhao, Hongwei Ge, Kai Zhang, Yaqing Hou

The infeasible parts of the objective space in difficult many-objective optimization problems cause trouble for evolutionary algorithms.

Evolutionary Algorithms

Paper
Add Code

Deep Plug-and-Play Super-Resolution for Arbitrary Blur Kernels

1 code implementation • CVPR 2019 • Kai Zhang, WangMeng Zuo, Lei Zhang

In this paper, we propose a principled formulation and framework by extending bicubic degradation based deep SISR with the help of plug-and-play framework to handle LR images with arbitrary blur kernels.

Deblurring Image Restoration +1

834

Paper
Code

Generalizable Meta-Heuristic based on Temporal Estimation of Rewards for Large Scale Blackbox Optimization

no code implementations • 17 Dec 2018 • Mingde Zhao, Hongwei Ge, Yi Lian, Kai Zhang

The generalization abilities of heuristic optimizers may deteriorate with the increment of the search space dimensionality.

Multi-Armed Bandits

Paper
Add Code

Toward Convolutional Blind Denoising of Real Photographs

3 code implementations • CVPR 2019 • Shi Guo, Zifei Yan, Kai Zhang, WangMeng Zuo, Lei Zhang

While deep convolutional neural networks (CNNs) have achieved impressive success in image denoising with additive white Gaussian noise (AWGN), their performance remains limited on real-world noisy photographs.

Ranked #4 on Denoising on Darmstadt Noise Dataset

Image Denoising Noise Estimation

492

Paper
Code

Multi-level Wavelet-CNN for Image Restoration

5 code implementations • 18 May 2018 • Pengju Liu, Hongzhi Zhang, Kai Zhang, Liang Lin, WangMeng Zuo

With the modified U-Net architecture, wavelet transform is introduced to reduce the size of feature maps in the contracting subnetwork.

Ranked #2 on Grayscale Image Denoising on Set12 sigma25

Computational Efficiency Image Denoising +2

221

Paper
Code

Learning a Single Convolutional Super-Resolution Network for Multiple Degradations

1 code implementation • CVPR 2018 • Kai Zhang, WangMeng Zuo, Lei Zhang

Recent years have witnessed the unprecedented success of deep convolutional neural networks (CNNs) in single image super-resolution (SISR).

Ranked #27 on Video Super-Resolution on MSU Video Super Resolution Benchmark: Detail Restoration

Image Super-Resolution Video Super-Resolution

420

Paper
Code

FFDNet: Toward a Fast and Flexible Solution for CNN based Image Denoising

7 code implementations • 11 Oct 2017 • Kai Zhang, WangMeng Zuo, Lei Zhang

Due to the fast inference and good performance, discriminative learning methods have been widely studied in image denoising.

Ranked #1 on Grayscale Image Denoising on BSD68 sigma75

Color Image Denoising Image Denoising

442

Paper
Code

Learning Deep CNN Denoiser Prior for Image Restoration

2 code implementations • CVPR 2017 • Kai Zhang, WangMeng Zuo, Shuhang Gu, Lei Zhang

Recent works have revealed that, with the aid of variable splitting techniques, denoiser prior can be plugged in as a modular part of model-based optimization methods to solve other inverse problems (e. g., deblurring).

Ranked #1 on Color Image Denoising on BSD68 sigma5

Color Image Denoising Deblurring +2

580

Paper
Code

Single Image Super-resolution via a Lightweight Residual Convolutional Neural Network

no code implementations • 23 Mar 2017 • Yudong Liang, Ze Yang, Kai Zhang, Yihui He, Jinjun Wang, Nanning Zheng

To tackle with the second problem, a lightweight CNN architecture which has carefully designed width, depth and skip connections was proposed.

Image Super-Resolution SSIM

Paper
Add Code

BET on Independence

no code implementations • 17 Oct 2016 • Kai Zhang

To avoid such power loss, we approach the nonparametric test of independence through the new framework of binary expansion statistics (BEStat) and binary expansion testing (BET), which examine dependence through a novel binary expansion filtration approximation of the copula.

Paper
Add Code

Entity Embedding-based Anomaly Detection for Heterogeneous Categorical Events

no code implementations • 26 Aug 2016 • Ting Chen, Lu-An Tang, Yizhou Sun, Zhengzhang Chen, Kai Zhang

Anomaly detection plays an important role in modern data-driven security applications, such as detecting suspicious access to a socket from a process.

Anomaly Detection

Paper
Add Code

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising

21 code implementations • 13 Aug 2016 • Kai Zhang, WangMeng Zuo, Yunjin Chen, Deyu Meng, Lei Zhang

Discriminative model learning for image denoising has been recently attracting considerable attentions due to its favorable denoising performance.

Ranked #4 on JPEG Artifact Correction on LIVE1 (Quality 40 Grayscale)

Color Image Denoising Image Deblocking +3

1,393

Paper
Code

Seeing the Forest from the Trees in Two Looks: Matrix Sketching by Cascaded Bilateral Sampling

no code implementations • 25 Jul 2016 • Kai Zhang, Chuanren Liu, Jie Zhang, Hui Xiong, Eric Xing, Jieping Ye

Given a matrix A of size m by n, state-of-the-art randomized algorithms take O(m * n) time and space to obtain its low-rank decomposition.

Paper
Add Code

Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations

2 code implementations • ACM SIGKDD international conference on Knowledge discovery and data mining 2016 • Wei Cheng, Kai Zhang, Haifeng Chen, Guofei Jiang, Zhengzhang Chen, Wei Wang

Structures and evolutions of the invariance network, in particular the vanishing correlations, can shed important light on locating causal anomalies and performing diagnosis.

Management Root Cause Ranking

Paper
Code

Distributed Flexible Nonlinear Tensor Factorization

no code implementations • NeurIPS 2016 • Shandian Zhe, Kai Zhang, Pengyuan Wang, Kuang-Chih Lee, Zenglin Xu, Yuan Qi, Zoubin Ghahramani

Tensor factorization is a powerful tool to analyse multi-way data.

Click-Through Rate Prediction Computational Efficiency

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.