Search Results for author: Lei Zhu

Found 148 papers, 74 papers with code

MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction

no code implementations • 19 Apr 2024 • Zixuan Gong, Qi Zhang, Guangyin Bao, Lei Zhu, Ke Liu, Liang Hu, Duoqian Miao

Decoding natural visual scenes from brain activity has flourished, with extensive research on single-subject tasks but less on cross-subject tasks.

Dragtraffic: A Non-Expert Interactive and Point-Based Controllable Traffic Scene Generation Framework

no code implementations • 19 Apr 2024 • Sheng Wang, Ge Sun, Fulong Ma, Tianshuai Hu, Yongkang Song, Lei Zhu, Ming Liu

However, most existing scene generation methods lack controllability, accuracy, and versatility, resulting in unsatisfactory generation results.

Disentangled Cascaded Graph Convolution Networks for Multi-Behavior Recommendation

1 code implementation • 17 Apr 2024 • Zhiyong Cheng, Jianhua Dong, Fan Liu, Lei Zhu, Xun Yang, Meng Wang

Furthermore, these models overlook the personalized nature of user behavioral preferences by employing uniform transformation networks for all users and items.

Recommendation Systems

Dynamic Backtracking in GFlowNets: Enhancing Decision Steps with Reward-Dependent Adjustment Mechanisms

no code implementations • 8 Apr 2024 • Shuai Guo, Jielei Chu, Lei Zhu, Tianrui Li

Generative Flow Networks (GFlowNets) are probabilistic models predicated on Markov flows, employing specific amortization algorithms to learn stochastic policies that generate compositional substances including biomolecules, chemical materials, and more.

Decision Making

Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

no code implementations • 24 Mar 2024 • Haoyuan Wang, WenBo Hu, Lei Zhu, Rynson W. H. Lau

Our method has two stages: the geometry of the target object and the pre-filtered environmental radiance fields are reconstructed in the first stage, and materials of the target object are estimated in the second stage with the proposed NeP and material-aware cone sampling strategy.

Inverse Rendering Object

Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

no code implementations • 17 Mar 2024 • Zhihao Liang, Qi Zhang, WenBo Hu, Ying Feng, Lei Zhu, Kui Jia

This is because 3DGS treats each pixel as an isolated, single point rather than as an area, causing insensitivity to changes in the footprints of pixels.

Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal

no code implementations • 12 Mar 2024 • Yijun Yang, Hongtao Wu, Angelica I. Aviles-Rivero, Yulun Zhang, Jing Qin, Lei Zhu

Although ViWS-Net is proposed to remove adverse weather conditions in videos with a single set of pre-trained weights, it is seriously blinded by seen weather at train-time and degenerates when coming to unseen weather during test-time.

Test-time Adaptation

Beyond Text: Frozen Large Language Models in Visual Signal Comprehension

1 code implementation • 12 Mar 2024 • Lei Zhu, Fangyun Wei, Yanye Lu

To achieve this, we present the Vision-to-Language Tokenizer, abbreviated as V2T Tokenizer, which transforms an image into a "foreign language" with the combined aid of an encoder-decoder, the LLM vocabulary, and a CLIP model.

Deblurring Image Captioning +5

Agile Multi-Source-Free Domain Adaptation

1 code implementation • 8 Mar 2024 • Xinyao Li, Jingjing Li, Fengling Li, Lei Zhu, Ke Lu

Efficiently utilizing rich knowledge in pretrained models has become a critical topic in the era of large models.

Source-Free Domain Adaptation Specificity

Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation

no code implementations • 5 Mar 2024 • Zhekai Du, Xinyao Li, Fengling Li, Ke Lu, Lei Zhu, Jingjing Li

Specifically, the image contextual information is utilized to prompt the language branch in a domain-agnostic and instance-conditioned way.

Transfer Learning Unsupervised Domain Adaptation

Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

no code implementations • 5 Mar 2024 • Haoyu Chen, Wenbo Li, Jinjin Gu, Jingjing Ren, Haoze Sun, Xueyi Zou, Zhensong Zhang, Youliang Yan, Lei Zhu

Leveraging unseen LR images for self-supervised learning guides the model to adapt its modeling space to the target domain, facilitating fine-tuning of SR models without requiring paired high-resolution (HR) images.

Image Super-Resolution Self-Supervised Learning

Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label

1 code implementation • 27 Feb 2024 • Xinliang Zhang, Lei Zhu, Hangzhou He, Lujia Jin, Yanye Lu

In this study, we propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.

Segmentation Weakly supervised Semantic Segmentation +1

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

no code implementations • 23 Feb 2024 • Francis Engelmann, Ayca Takmaz, Jonas Schult, Elisabetta Fedele, Johanna Wald, Songyou Peng, Xi Wang, Or Litany, Siyu Tang, Federico Tombari, Marc Pollefeys, Leonidas Guibas, Hongbo Tian, Chunjie Wang, Xiaosheng Yan, Bingwen Wang, Xuanyang Zhang, Xiao Liu, Phuc Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham, Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby

This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023.

Scene Understanding

RelayAttention for Efficient Large Language Model Serving with Long System Prompts

1 code implementation • 22 Feb 2024 • Lei Zhu, Xinjiang Wang, Wayne Zhang, Rynson W. H. Lau

Practical large language model (LLM) services may involve a long system prompt, which specifies the instructions, examples, and knowledge documents of the task and is reused across numerous requests.

Language Modelling Large Language Model

Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

no code implementations • 29 Jan 2024 • Jiahao Huang, Yinzhe Wu, Fanwen Wang, Yingying Fang, Yang Nan, Cagan Alkan, Lei Xu, Zhifan Gao, Weiwen Wu, Lei Zhu, Zhaolin Chen, Peter Lally, Neal Bangerter, Kawin Setsompop, Yike Guo, Daniel Rueckert, Ge Wang, Guang Yang

Magnetic Resonance Imaging (MRI) is a pivotal clinical diagnostic tool, yet its extended scanning times often compromise patient comfort and image quality, especially in volumetric, temporal and quantitative scans.

Federated Learning MRI Reconstruction

An objective comparison of methods for augmented reality in laparoscopic liver resection by preoperative-to-intraoperative image fusion

no code implementations • 28 Jan 2024 • Sharib Ali, Yamid Espinel, Yueming Jin, Peng Liu, Bianca Güttner, Xukun Zhang, Lihua Zhang, Tom Dowrick, Matthew J. Clarkson, Shiting Xiao, Yifan Wu, Yijun Yang, Lei Zhu, Dai Sun, Lan Li, Micha Pfeiffer, Shahid Farid, Lena Maier-Hein, Emmanuel Buc, Adrien Bartoli

A total of 6 teams from 4 countries participated, whose proposed methods were evaluated on 16 images and two preoperative 3D models from two patients.

Vivim: a Video Vision Mamba for Medical Video Object Segmentation

1 code implementation • 25 Jan 2024 • Yijun Yang, Zhaohu Xing, Chunwang Huang, Lei Zhu

Traditional convolutional neural networks have a limited receptive field while transformer-based networks are mediocre in constructing long-term dependency from the perspective of computational complexity.

Lesion Segmentation Segmentation +3

SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

1 code implementation • 24 Jan 2024 • Zhaohu Xing, Tian Ye, Yijun Yang, Guang Liu, Lei Zhu

Our SegMamba, in contrast to Transformer-based methods, excels in whole volume feature modeling from a state space model standpoint, maintaining superior processing speed, even with volume features at a resolution of 64 × 64 × 64.

Image Segmentation Medical Image Segmentation +1

SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese

no code implementations • 22 Jan 2024 • Liang Xu, Hang Xue, Lei Zhu, Kangkang Zhao

We introduce SuperCLUE-Math6 (SC-Math6), a new benchmark dataset to evaluate the mathematical reasoning abilities of Chinese language models.

GSM8K Math +1

MCRPL: A Pretrain, Prompt & Fine-tune Paradigm for Non-overlapping Many-to-one Cross-domain Recommendation

no code implementations • 16 Jan 2024 • Hao Liu, Lei Guo, Lei Zhu, Yongqiang Jiang, Min Gao, Hongzhi Yin

To overcome the above challenges, we focus on NMCR, and devise MCRPL as our solution.

A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model

no code implementations • 5 Jan 2024 • Dongdi Zhao, Jianbo Ma, Lu Lu, Jinke Li, Xuan Ji, Lei Zhu, Fuming Fang, Ming Liu, Feijun Jiang

Far-field speech recognition is a challenging task that conventionally uses signal-processing beamforming to tackle noise and interference problems.

Speech Enhancement speech-recognition +1

EPA: Neural Collapse Inspired Robust Out-of-Distribution Detector

no code implementations • 3 Jan 2024 • Jiawei Zhang, Yufan Chen, Cheng Jin, Lei Zhu, Yuantao Gu

Out-of-distribution (OOD) detection plays a crucial role in ensuring the security of neural networks.

Out of Distribution (OOD) Detection

Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis

no code implementations • 26 Dec 2023 • Jingjing Ren, Cheng Xu, Haoyu Chen, Xinran Qin, Lei Zhu

Recent progress in multi-modal conditioned face synthesis has enabled the creation of visually striking and accurately aligned facial images.

Denoising Face Generation

Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

no code implementations • 21 Dec 2023 • Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Weiping Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, Pingyu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis, Francesco Prinzi, Gianluca Carlini, Lisa Cuneo, Abhirup Banerjee, Zhaohu Xing, Lei Zhu, Zacharia Mesbah, Dhruv Jain, Tsiry Mayet, Hongyu Yuan, Qing Lyu, Abdul Qayyum, Moona Mazher, Athol Wells, Simon LF Walsh, Guang Yang

The online validation set incorporated 52 HRCT scans from patients with fibrotic lung disease and the offline test set included 140 cases from fibrosis and COVID-19 patients.

Mortality Prediction

SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

1 code implementation • 15 Dec 2023 • Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, Jin Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein, Nchongmaje Ndipenoch, Alina Miron, Yongmin Li, Yimeng Zhang, Yu Chen, Lu Bai, Jinlong Huang, Chengyang An, Lisheng Wang, Kaiwen Huang, Yunqi Gu, Tao Zhou, Mu Zhou, Shichuan Zhang, Wenjun Liao, Guotai Wang, Shaoting Zhang

The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis.

Computed Tomography (CT) Image Segmentation +3

Lite-Mind: Towards Efficient and Robust Brain Representation Network

no code implementations • 6 Dec 2023 • Zixuan Gong, Qi Zhang, Guangyin Bao, Lei Zhu, Yu Zhang, Ke Liu, Liang Hu, Duoqian Miao

The limited data availability and the low signal-to-noise ratio of fMRI signals lead to the challenging task of fMRI-to-image retrieval.

Brain Decoding Image Retrieval +2

GBD-TS: Goal-based Pedestrian Trajectory Prediction with Diffusion using Tree Sampling Algorithm

no code implementations • 25 Nov 2023 • Ge Sun, Sheng Wang, Yang Xiao, Lei Zhu, Ming Liu

GBD combines goal prediction with the diffusion network.

Autonomous Driving Denoising +2

SC-Safety: A Multi-round Open-ended Question Adversarial Safety Benchmark for Large Language Models in Chinese

no code implementations • 9 Oct 2023 • Liang Xu, Kangkang Zhao, Lei Zhu, Hang Xue

To systematically assess the safety of Chinese LLMs, we introduce SuperCLUE-Safety (SC-Safety) - a multi-round adversarial benchmark with 4912 open-ended questions covering more than 20 safety sub-dimensions.

Model Selection Natural Language Understanding

Shifting More Attention to Breast Lesion Segmentation in Ultrasound Videos

1 code implementation • 3 Oct 2023 • Junhao Lin, Qian Dai, Lei Zhu, Huazhu Fu, Qiong Wang, Weibin Li, Wenhao Rao, Xiaoyang Huang, Liansheng Wang

We also devise a localization-based contrastive loss to reduce the lesion location distance between neighboring video frames within the same video and enlarge the location distances between frames from different ultrasound videos.

Lesion Segmentation Segmentation +1

Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation

1 code implementation • ICCV 2023 • Yijun Yang, Angelica I. Aviles-Rivero, Huazhu Fu, Ye Liu, Weiming Wang, Lei Zhu

In this work, we propose the first framework for restoring videos from all adverse weather conditions by developing a video adverse-weather-component suppression network (ViWS-Net).

Multi-level Asymmetric Contrastive Learning for Medical Image Segmentation Pre-training

no code implementations • 21 Sep 2023 • Shuang Zeng, Lei Zhu, Xinliang Zhang, Zifeng Tian, Qian Chen, Lujia Jin, Jiayi Wang, Yanye Lu

In this work, we propose a novel asymmetric contrastive learning framework named JCL for medical image segmentation with self-supervised pre-training.

Contrastive Learning Image Segmentation +3

Towards Self-Adaptive Pseudo-Label Filtering for Semi-Supervised Learning

no code implementations • 18 Sep 2023 • Lei Zhu, Zhanghan Ke, Rynson Lau

In this work, we observe that the distribution gap between the confidence values of correct and incorrect pseudo labels emerges at the very beginning of the training, which can be utilized to filter pseudo labels.

Pseudo Label Pseudo Label Filtering

Towards High-Quality Specular Highlight Removal by Leveraging Large-Scale Synthetic Data

1 code implementation • ICCV 2023 • Gang Fu, Qing Zhang, Lei Zhu, Chunxia Xiao, Ping Li

This paper aims to remove specular highlights from a single object-level image.

highlight removal Object

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

1 code implementation • 1 Sep 2023 • Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby

When integrated with powerful 2D open-world models such as ODISE and GroundingDINO, excellent results were observed on open-vocabulary instance segmentation.

Ranked #1 on 3D Open-Vocabulary Object Detection on ScanNet on unseen classes

3D Open-Vocabulary Instance Segmentation 3D Open-Vocabulary Object Detection +5

Cross-Modal Retrieval: A Systematic Review of Methods and Future Directions

1 code implementation • 28 Aug 2023 • Fengling Li, Lei Zhu, Tianshi Wang, Jingjing Li, Zheng Zhang, Heng Tao Shen

With the exponential surge in diverse multi-modal data, traditional uni-modal retrieval methods struggle to meet the needs of users demanding access to data from various modalities.

Cross-Modal Retrieval Retrieval

Sparse Sampling Transformer with Uncertainty-Driven Ranking for Unified Removal of Raindrops and Rain Streaks

1 code implementation • ICCV 2023 • Sixiang Chen, Tian Ye, Jinbin Bai, ErKang Chen, Jun Shi, Lei Zhu

In the real world, image degradations caused by rain often exhibit a combination of rain streaks and raindrops, thereby increasing the challenges of recovering the underlying clean image.

Rain Removal

Federated Pseudo Modality Generation for Incomplete Multi-Modal MRI Reconstruction

no code implementations • 20 Aug 2023 • Yunlu Yan, Chun-Mei Feng, Yuexiang Li, Rick Siow Mong Goh, Lei Zhu

In this paper, we propose a novel communication-efficient federated learning framework, namely Fed-PMG, to address the missing modality challenge in federated multi-modal MRI reconstruction.

Federated Learning MRI Reconstruction

Rethinking Client Drift in Federated Learning: A Logit Perspective

no code implementations • 20 Aug 2023 • Yunlu Yan, Chun-Mei Feng, Mang Ye, WangMeng Zuo, Ping Li, Rick Siow Mong Goh, Lei Zhu, C. L. Philip Chen

Concretely, FedCSD introduces a class prototype similarity distillation to align the local logits with the refined global logits that are weighted by the similarity between local logits and the global prototype.

Federated Learning

Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery

no code implementations • 18 Aug 2023 • Hongqiu Wang, Lei Zhu, Guang Yang, Yike Guo, Shichen Zhang, Bo Xu, Yueming Jin

Our method is verified on these datasets, and experimental results show that VIS-Net significantly outperforms existing state-of-the-art referring segmentation methods.

Robot Navigation Segmentation

Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

no code implementations • 9 Aug 2023 • Lei Zhu, Hangzhou He, Xinliang Zhang, Qian Chen, Shuang Zeng, Qiushi Ren, Yanye Lu

Existing methods adopt an online-trained classification branch to provide pseudo annotations for supervising the segmentation branch.

Classification Segmentation +3

SuperCLUE: A Comprehensive Chinese Large Language Model Benchmark

no code implementations • 27 Jul 2023 • Liang Xu, Anqi Li, Lei Zhu, Hang Xue, Changtai Zhu, Kangkang Zhao, Haonan He, Xuanwei Zhang, Qiyue Kang, Zhenzhong Lan

We fill this gap by proposing a comprehensive Chinese benchmark SuperCLUE, named after another popular Chinese LLM benchmark CLUE.

Language Modelling Large Language Model

A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

no code implementations • 14 Jun 2023 • Yunlu Yan, Lei Zhu

To achieve this goal, we propose FedRDN, a simple yet remarkably effective data augmentation method for feature distribution skewed FL, which randomly injects the statistics of the dataset from the entire federation into the client's data.

Data Augmentation Federated Learning

Cross-Modal Vertical Federated Learning for MRI Reconstruction

no code implementations • 5 Jun 2023 • Yunlu Yan, Hong Wang, Yawen Huang, Nanjun He, Lei Zhu, Yuexiang Li, Yong Xu, Yefeng Zheng

To this end, we formulate this practical-yet-challenging cross-modal vertical federated learning task, in which shape data from multiple hospitals have different modalities with a small amount of multi-modality data collected from the same individuals.

Disentanglement MRI Reconstruction +1

Dynamic Interactive Relation Capturing via Scene Graph Learning for Robotic Surgical Report Generation

no code implementations • 5 Jun 2023 • Hongqiu Wang, Yueming Jin, Lei Zhu

For robot-assisted surgery, an accurate surgical report reflects clinical operations during surgery and helps document entry tasks, post-operative analysis and follow-up treatment.

Graph Learning Relation

Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset

1 code implementation • 5 Jun 2023 • Junling Liu, Peilin Zhou, Yining Hua, Dading Chong, Zhongyu Tian, Andrew Liu, Helin Wang, Chenyu You, Zhenhua Guo, Lei Zhu, Michael Lingzhi Li

To the best of our knowledge, CMExam is the first Chinese medical exam dataset to provide comprehensive medical annotations.

Benchmarking Multiple-choice +1

Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification

no code implementations • 10 Apr 2023 • Zan Gao, Shenxun Wei, Weili Guan, Lei Zhu, Meng Wang, Shenyong Chen

Moreover, human semantic information and pedestrian identity information are not fully explored.

Automated Prompting for Non-overlapping Cross-domain Sequential Recommendation

no code implementations • 9 Apr 2023 • Lei Guo, Chunxiao Wang, Xinhua Wang, Lei Zhu, Hongzhi Yin

Cross-domain Recommendation (CR) has been extensively studied in recent years to alleviate the data sparsity issue in recommender systems by utilizing different domain information.

Sequential Recommendation

Multi-Behavior Recommendation with Cascading Graph Convolution Networks

1 code implementation • 28 Mar 2023 • Zhiyong Cheng, Sai Han, Fan Liu, Lei Zhu, Zan Gao, Yuxin Peng

Most existing multi-behavior models fail to capture such dependencies in a behavior chain for embedding learning.

Masked Image Training for Generalizable Deep Image Denoising

1 code implementation • CVPR 2023 • Haoyu Chen, Jinjin Gu, Yihao Liu, Salma Abdel Magid, Chao Dong, Qiong Wang, Hanspeter Pfister, Lei Zhu

To address this issue, we present a novel approach to enhance the generalization performance of denoising networks, known as masked training.

Image Denoising

Neural Preset for Color Style Transfer

1 code implementation • CVPR 2023 • Zhanghan Ke, Yuhao Liu, Lei Zhu, Nanxuan Zhao, Rynson W. H. Lau

In this paper, we present a Neural Preset technique to address the limitations of existing color style transfer methods, including visual artifacts, vast memory requirement, and slow style switching speed.

4k Color Normalization +4

Distribution Aligned Diffusion and Prototype-guided network for Unsupervised Domain Adaptive Segmentation

1 code implementation • 22 Mar 2023 • Haipeng Zhou, Lei Zhu, Yuyin Zhou

In order to explore its potential further, we have taken a step forward and considered a more complex scenario in the medical image domain, specifically, under an unsupervised adaptation condition.

DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

1 code implementation • 19 Mar 2023 • Yijun Yang, Huazhu Fu, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Lei Zhu

However, while a substantial amount of diffusion-based research has focused on generative tasks, few studies have applied diffusion models to general medical image classification.

Diabetic Retinopathy Grading Image Classification +3

HybridMIM: A Hybrid Masked Image Modeling Framework for 3D Medical Image Segmentation

1 code implementation • 18 Mar 2023 • Zhaohu Xing, Lei Zhu, Lequan Yu, Zhiheng Xing, Liang Wan

Masked image modeling (MIM) with transformer backbones has recently been exploited as a powerful self-supervised pre-training technique.

Contrastive Learning Image Segmentation +3

Diff-UNet: A Diffusion Embedded Network for Volumetric Segmentation

1 code implementation • 18 Mar 2023 • Zhaohu Xing, Liang Wan, Huazhu Fu, Guang Yang, Lei Zhu

Our experimental results also indicate the universality and effectiveness of the proposed model.

Denoising Segmentation

Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior

1 code implementation • CVPR 2023 • Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng

Video dehazing aims to recover haze-free frames with high visibility and contrast.

Learning Physical-Spatio-Temporal Features for Video Shadow Removal

no code implementations • 16 Mar 2023 • Zhihao Chen, Liang Wan, Yefan Xiao, Lei Zhu, Huazhu Fu

Then, we develop a progressive aggregation module to enhance the spatial and temporal characteristics of feature maps, and effectively integrate the three kinds of features.

Shadow Removal Video Restoration

BiFormer: Vision Transformer with Bi-Level Routing Attention

1 code implementation • CVPR 2023 • Lei Zhu, Xinjiang Wang, Zhanghan Ke, Wayne Zhang, Rynson Lau

As the core building block of vision transformers, attention is a powerful tool to capture long-range dependency.

Ranked #9 on Object Detection on COCO 2017 (mAP metric)

Computational Efficiency Image Classification +3

GeoSpark: Sparking up Point Cloud Segmentation with Geometry Clue

no code implementations • 14 Mar 2023 • Zhening Huang, Xiaoyang Wu, Hengshuang Zhao, Lei Zhu, Shujun Wang, Georgios Hadjidemetriou, Ioannis Brilakis

For feature aggregation, it improves feature modeling by allowing the network to learn from both local points and neighboring geometry partitions, resulting in an enlarged data-tailored receptive field.

Point Cloud Segmentation

A Comprehensive Survey on Source-free Domain Adaptation

no code implementations • 23 Feb 2023 • Zhiqi Yu, Jingjing Li, Zhekai Du, Lei Zhu, Heng Tao Shen

Over the past decade, domain adaptation has become a widely studied branch of transfer learning that aims to improve performance on target domains by leveraging knowledge from the source domain.

Source-Free Domain Adaptation Transfer Learning

One-Pot Multi-Frame Denoising

no code implementations • 18 Feb 2023 • Lujia Jin, Shi Zhao, Lei Zhu, Qian Chen, Yanye Lu

Therefore, it is necessary to avoid the restriction of clean labels and make full use of noisy data for model training.

Denoising

Learning to Control and Coordinate Mixed Traffic Through Robot Vehicles at Complex and Unsignalized Intersections

1 code implementation • 12 Jan 2023 • Dawei Wang, Weizi Li, Lei Zhu, Jia Pan

In contrast, without RVs, congestion starts to develop when the traffic demand reaches as low as 200 vehicles per hour.

Multi-agent Reinforcement Learning

Snow Removal in Video: A New Dataset and A Novel Method

no code implementations • ICCV 2023 • Haoyu Chen, Jingjing Ren, Jinjin Gu, Hongtao Wu, Xuequan Lu, Haoming Cai, Lei Zhu

We also develop a deep learning framework for video snow removal.

Contrastive Learning Snow Removal

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

no code implementations • 29 Nov 2022 • Haochuan Cui, Junjie Sheng, Bo Jin, Yiqiu Hu, Li Su, Lei Zhu, Wenli Zhou, Xiangfeng Wang

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences.

Cloud Computing Scheduling

Who is Gambling? Finding Cryptocurrency Gamblers Using Multi-modal Retrieval Methods

1 code implementation • 27 Nov 2022 • Zhengjie Huang, Zhenguang Liu, Jianhai Chen, Qinming He, Shuang Wu, Lei Zhu, Meng Wang

Meanwhile, decentralized applications have also attracted intense attention from the online gambling community, with more and more decentralized gambling platforms created through the help of smart contracts.

Retrieval

SCOTCH and SODA: A Transformer Video Shadow Detection Framework

no code implementations • CVPR 2023 • Lihao Liu, Jean Prost, Lei Zhu, Nicolas Papadakis, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

In this work, we argue that accounting for shadow deformation is essential when designing a video shadow detection method.

Contrastive Learning Shadow Detection

Dual Multi-scale Mean Teacher Network for Semi-supervised Infection Segmentation in Chest CT Volume for COVID-19

1 code implementation • 10 Nov 2022 • Liansheng Wang, Jiacheng Wang, Lei Zhu, Huazhu Fu, Ping Li, Gary Cheng, Zhipeng Feng, Shuo Li, Pheng-Ann Heng

Automated detection of lung infections from computed tomography (CT) data plays an important role in combating COVID-19.

Computed Tomography (CT) Segmentation

CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking with Camera-LiDAR Fusion

no code implementations • 6 Sep 2022 • Li Wang, Xinyu Zhang, Wenyuan Qin, Xiaoyu Li, Lei Yang, Zhiwei Li, Lei Zhu, Hong Wang, Jun Li, Huaping Liu

As such, we propose a novel camera-LiDAR fusion 3D MOT framework based on the Combined Appearance-Motion Optimization (CAMO-MOT), which uses both camera and LiDAR data and significantly reduces tracking failures caused by occlusion and false detection.

3D Multi-Object Tracking Autonomous Driving +2

Joint Prediction of Meningioma Grade and Brain Invasion via Task-Aware Contrastive Learning

1 code implementation • 4 Sep 2022 • Tianling Liu, Wennan Liu, Lequan Yu, Liang Wan, Tong Han, Lei Zhu

Preoperative and noninvasive prediction of the meningioma grade is important in clinical practice, as it directly influences the clinical decision making.

Contrastive Learning Decision Making +1

NestedFormer: Nested Modality-Aware Transformer for Brain Tumor Segmentation

1 code implementation • 31 Aug 2022 • Zhaohu Xing, Lequan Yu, Liang Wan, Tong Han, Lei Zhu

Multi-modal MR imaging is routinely used in clinical practice to diagnose and investigate brain tumors by providing rich complementary information.

Brain Tumor Segmentation MRI segmentation +2

Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization

1 code implementation • 16 Jul 2022 • Lei Zhu, Qian Chen, Lujia Jin, Yunfei You, Yanye Lu

Classification activation map (CAM), utilizing the classification structure to generate pixel-wise localization maps, is a crucial mechanism for weakly supervised object localization (WSOL).

Object Weakly-Supervised Object Localization

Harmonizer: Learning to Perform White-Box Image and Video Harmonization

1 code implementation • 4 Jul 2022 • Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W. H. Lau

Unlike prior methods that are based on black-box autoencoders, Harmonizer contains a neural network for filter argument prediction and several white-box filters (based on the predicted arguments) for image harmonization.

Ranked #7 on Image Harmonization on iHarmony4

Image Harmonization Video Harmonization

A New Dataset and A Baseline Model for Breast Lesion Detection in Ultrasound Videos

2 code implementations • 1 Jul 2022 • Zhi Lin, Junhao Lin, Lei Zhu, Huazhu Fu, Jing Qin, Liansheng Wang

Moreover, we learn video-level features to classify the breast lesions of the original video as benign or malignant lesions to further enhance the final breast lesion detection performance in ultrasound videos.

Lesion Classification Lesion Detection

Time Interval-enhanced Graph Neural Network for Shared-account Cross-domain Sequential Recommendation

1 code implementation • 16 Jun 2022 • Lei Guo, Jinyu Zhang, Li Tang, Tong Chen, Lei Zhu, Hongzhi Yin

The Shared-account Cross-domain Sequential Recommendation (SCSR) task aims to recommend the next item by leveraging the mixed user behaviors in multiple domains.

Representation Learning Sequential Recommendation +1

Copy Motion From One to Another: Fake Motion Video Generation

no code implementations • 3 May 2022 • Zhenguang Liu, Sifan Wu, Chejian Xu, Xiang Wang, Lei Zhu, Shuang Wu, Fuli Feng

3) To enhance texture details, we encode facial features with geometric guidance and employ local GANs to refine the face, feet, and hands.

Video Generation

Paper
Add Code

RSCFed: Random Sampling Consensus Federated Semi-supervised Learning

1 code implementation • CVPR 2022 • Xiaoxiao Liang, Yiqun Lin, Huazhu Fu, Lei Zhu, Xiaomeng Li

In this paper, we present a Random Sampling Consensus Federated learning framework, namely RSCFed, by considering the uneven reliability among models from fully-labeled clients, fully-unlabeled clients or partially labeled clients.

Federated Learning

Paper
Code

Multi-modal learning for predicting the genotype of glioma

no code implementations • 21 Mar 2022 • Yiran Wei, Xi Chen, Lei Zhu, Lipei Zhang, Carola-Bibiane Schönlieb, Stephen J. Price, Chao Li

In this study, we propose a multi-modal learning framework using three separate encoders to extract features of focal tumor image, tumor geometrics and global brain networks.

Clinical Knowledge

Paper
Add Code

BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation

1 code implementation • CVPR 2022 • Wenqiao Zhang, Lei Zhu, James Hallinan, Andrew Makmur, Shengyu Zhang, Qingpeng Cai, Beng Chin Ooi

In this paper, we propose a novel semi-supervised learning (SSL) framework named BoostMIS that combines adaptive pseudo labeling and informative active annotation to unleash the potential of medical image SSL models: (1) BoostMIS can adaptively leverage the cluster assumption and consistency regularization of the unlabeled data according to the current learning status.

Active Learning

Paper
Code

Weakly Supervised Object Localization as Domain Adaption

1 code implementation • CVPR 2022 • Lei Zhu, Qi She, Qian Chen, Yunfei You, Boyu Wang, Yanye Lu

To avoid this problem, this work provides a novel perspective that models WSOL as a domain adaption (DA) task, where the score estimator trained on the source/image domain is tested on the target/pixel domain to locate objects.

Classification Domain Adaptation +2

Paper
Code

Content-Noise Complementary Learning for Medical Image Denoising

2 code implementations • IEEE Transactions on Medical Imaging 2022 • Mufeng Geng, Xiangxi Meng, Jiangyuan Yu, Lei Zhu, Lujia Jin, Zhe Jiang, Bin Qiu, Hui Li, Hanjing Kong, Jianmin Yuan, Kun Yang, Hongming Shan, Hongbin Han, Zhi Yang, Qiushi Ren, Yanye Lu

In this study, we propose a simple yet effective strategy, the content-noise complementary learning (CNCL) strategy, in which two deep learning predictors are used to learn the respective content and noise of the image dataset complementarily.

Generative Adversarial Network Image Denoising +1

Paper
Code

Motion Prediction via Joint Dependency Modeling in Phase Space

no code implementations • 7 Jan 2022 • Pengxiang Su, Zhenguang Liu, Shuang Wu, Lei Zhu, Yifang Yin, Xuanjing Shen

In this paper, we introduce a novel convolutional neural model to effectively leverage explicit prior knowledge of motion anatomy, and simultaneously capture both spatial and temporal information of joint trajectory dynamics.

Anatomy motion prediction

Paper
Add Code

Distinguishing Unseen From Seen for Generalized Zero-Shot Learning

no code implementations • CVPR 2022 • Hongzu Su, Jingjing Li, Zhi Chen, Lei Zhu, Ke Lu

In this paper, we present a novel method which leverages both visual and semantic modalities to distinguish seen and unseen categories.

Generalized Zero-Shot Learning

Paper
Add Code

Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images

1 code implementation • 1 Jan 2022 • Xiaoqiang Wang, Lei Zhu, Siliang Tang, Huazhu Fu, Ping Li, Fei Wu, Yi Yang, Yueting Zhuang

The depth estimation branch is trained with RGB-D images and then used to estimate the pseudo depth maps for all unlabeled RGB images to form the paired data.

Depth Estimation object-detection +3

Paper
Code

Background-aware Classification Activation Map for Weakly Supervised Object Localization

1 code implementation • 29 Dec 2021 • Lei Zhu, Qi She, Qian Chen, Xiangxi Meng, Mufeng Geng, Lujia Jin, Zhe Jiang, Bin Qiu, Yunfei You, Yibao Zhang, Qiushi Ren, Yanye Lu

In our B-CAM, two image-level features, aggregated by pixel-level features of potential background and object locations, are used to purify the object feature from the object-related background and to represent the feature of the pure-background sample, respectively.

Classification Object +1

Paper
Code

VMAgent: Scheduling Simulator for Reinforcement Learning

2 code implementations • 9 Dec 2021 • Junjie Sheng, Shengliang Cai, Haochuan Cui, Wenhao Li, Yun Hua, Bo Jin, Wenli Zhou, Yiqiu Hu, Lei Zhu, Qian Peng, Hongyuan Zha, Xiangfeng Wang

A novel simulator called VMAgent is introduced to help RL researchers better explore new methods, especially for virtual machine scheduling.

Cloud Computing reinforcement-learning +2

Paper
Code

Network-wide Multi-step Traffic Volume Prediction using Graph Convolutional Gated Recurrent Neural Network

1 code implementation • 22 Nov 2021 • Lei Lin, Weizi Li, Lei Zhu

For instance, our model reduces MAE by 25.3%, RMSE by 29.2%, and MAPE by 20.2%, compared to the state-of-the-art Diffusion Convolutional Recurrent Neural Network (DCRNN) model using the hourly dataset.

Paper
Code

Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration Network

1 code implementation • 5 Nov 2021 • Ge-Peng Ji, Lei Zhu, Mingchen Zhuge, Keren Fu

Camouflaged Object Detection (COD) aims to detect objects with similar patterns (e.g., texture, intensity, colour, etc.) to their surroundings, and recently has attracted growing research interest.

Image Segmentation Medical Image Segmentation +3

Paper
Code

Domain Adaptive Semantic Segmentation without Source Data

1 code implementation • 13 Oct 2021 • Fuming You, Jingjing Li, Lei Zhu, Ke Lu, Zhi Chen, Zi Huang

To address these problems, we investigate domain adaptive semantic segmentation without source data, which assumes that the model is pre-trained on the source domain and then adapted to the target domain without accessing source data anymore.

Segmentation Semantic Segmentation

Paper
Code

Boundary-aware Transformers for Skin Lesion Segmentation

1 code implementation • 8 Oct 2021 • Jiacheng Wang, Lan Wei, Liansheng Wang, Qichao Zhou, Lei Zhu, Jing Qin

Skin lesion segmentation from dermoscopy images is of great importance for improving the quantitative analysis of skin cancer.

Ranked #5 on Lesion Segmentation on ISIC 2018

Inductive Bias Lesion Segmentation +2

110

Paper
Code

HCDG: A Hierarchical Consistency Framework for Domain Generalization on Medical Image Segmentation

1 code implementation • 13 Sep 2021 • Yijun Yang, Shujun Wang, Lei Zhu, Lequan Yu

Particularly, for the Extrinsic Consistency, we leverage the knowledge across multiple source domains to enforce data-level consistency.

Data Augmentation Domain Generalization +4

Paper
Code

Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal

no code implementations • 9 Sep 2021 • Lei Zhu, Zhaojing Luo, Wei Wang, Meihui Zhang, Gang Chen, Kaiping Zheng

In multimedia analysis, domain adaptation studies the problem of cross-domain knowledge transfer from a label-rich source domain to a label-scarce target domain, thus potentially alleviating the annotation requirement for deep learning models.

Domain Adaptation Transfer Learning

Paper
Add Code

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection

1 code implementation • ICCV 2021 • Yujun Zhang, Lei Zhu, Wei Feng, Huazhu Fu, Mingqian Wang, Qingxia Li, Cheng Li, Song Wang

Lane detection plays a key role in autonomous driving.

Autonomous Driving Lane Detection +3

Paper
Code

MT-ORL: Multi-Task Occlusion Relationship Learning

1 code implementation • ICCV 2021 • Panhe Feng, Qi She, Lei Zhu, Jiaxin Li, Lin Zhang, Zijian Feng, Changhu Wang, Chunpeng Li, Xuejing Kang, Anlong Ming

Retrieving occlusion relations among objects in a single image is challenging due to the sparsity of boundaries in the image.

Paper
Code

From Synthetic to Real: Image Dehazing Collaborating with Unlabeled Real Data

1 code implementation • 6 Aug 2021 • Ye Liu, Lei Zhu, Shunda Pei, Huazhu Fu, Jing Qin, Qing Zhang, Liang Wan, Wei Feng

Our DID-Net predicts the three component maps by progressively integrating features across scales, and refines each map by passing it through an independent refinement network.

Ranked #6 on Image Dehazing on Haze4k

Image Dehazing Single Image Dehazing

Paper
Code

Unifying Nonlocal Blocks for Neural Networks

1 code implementation • ICCV 2021 • Lei Zhu, Qi She, Duo Li, Yanye Lu, Xuejing Kang, Jie Hu, Changhu Wang

The nonlocal-based blocks are designed for capturing long-range spatial-temporal dependencies in computer vision tasks.

Action Recognition Image Classification +2

Paper
Code

Adversarial Energy Disaggregation for Non-intrusive Load Monitoring

no code implementations • 2 Aug 2021 • Zhekai Du, Jingjing Li, Lei Zhu, Ke Lu, Heng Tao Shen

Energy disaggregation, also known as non-intrusive load monitoring (NILM), addresses the problem of separating the whole-home electricity usage into appliance-specific individual consumptions, which is a typical application of data analysis.

Non-Intrusive Load Monitoring

Paper
Add Code

Bayesian Statistics Guided Label Refurbishment Mechanism: Mitigating Label Noise in Medical Image Classification

1 code implementation • 23 Jun 2021 • Mengdi Gao, Ximeng Feng, Mufeng Geng, Zhe Jiang, Lei Zhu, Xiangxi Meng, Chuanqing Zhou, Qiushi Ren, Yanye Lu

BLRM utilizes maximum a posteriori probability (MAP) in the Bayesian statistics and the exponentially time-weighted technique to selectively correct the labels of noisy images.

Image Classification Medical Image Classification

Paper
Code

A Multi-Task Network for Joint Specular Highlight Detection and Removal

1 code implementation • CVPR 2021 • Gang Fu, Qing Zhang, Lei Zhu, Ping Li, Chunxia Xiao

Specular highlight detection and removal are fundamental and challenging tasks.

16k Highlight Detection +1

Paper
Code

Smart Contract Vulnerability Detection: From Pure Neural Network to Interpretable Graph Feature and Expert Pattern Fusion

1 code implementation • 17 Jun 2021 • Zhenguang Liu, Peng Qian, Xiang Wang, Lei Zhu, Qinming He, Shouling Ji

In this paper, we explore combining deep learning with expert patterns in an explainable fashion.

Vulnerability Detection

Paper
Code

Cross-Domain Gradient Discrepancy Minimization for Unsupervised Domain Adaptation

1 code implementation • CVPR 2021 • Zhekai Du, Jingjing Li, Hongzu Su, Lei Zhu, Ke Lu

Previous bi-classifier adversarial learning methods only focus on the similarity between the outputs of two distinct classifiers.

Clustering Self-Supervised Learning +1

Paper
Code

UGRec: Modeling Directed and Undirected Relations for Recommendation

1 code implementation • 10 May 2021 • Xinxiao Zhao, Zhiyong Cheng, Lei Zhu, Jiecai Zheng, Xueqing Li

In particular, for a directed relation, we transform the head and tail entities into the corresponding relation space to model their relation; and for an undirected co-occurrence relation, we project head and tail entities into a unique hyperplane in the entity space to minimize their distance.

Attribute Collaborative Filtering +2

Paper
Code

DA-GCN: A Domain-aware Attentive Graph Convolution Network for Shared-account Cross-domain Sequential Recommendation

no code implementations • 7 May 2021 • Lei Guo, Li Tang, Tong Chen, Lei Zhu, Quoc Viet Hung Nguyen, Hongzhi Yin

Shared-account Cross-domain Sequential recommendation (SCSR) is the task of recommending the next item based on a sequence of recorded user behaviors, where multiple users share a single account, and their behaviours are available in multiple domains.

Sequential Recommendation Transfer Learning

Paper
Add Code

Global Guidance Network for Breast Lesion Segmentation in Ultrasound Images

no code implementations • 5 Apr 2021 • Cheng Xue, Lei Zhu, Huazhu Fu, Xiaowei Hu, Xiaomeng Li, Hai Zhang, Pheng Ann Heng

The BD modules learn an additional breast lesion boundary map to enhance the boundary quality of the refined segmentation result.

Boundary Detection Image Segmentation +3

Paper
Add Code

Learning the Superpixel in a Non-iterative and Lifelong Manner

1 code implementation • CVPR 2021 • Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu

Superpixel is generated by automatically clustering pixels in an image into hundreds of compact partitions, which is widely used to perceive the object contours for its excellent contour adherence.

Clustering

Paper
Code

Triple-cooperative Video Shadow Detection

1 code implementation • CVPR 2021 • Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin

The bottleneck is the lack of a well-established dataset with high-quality annotations for video shadow detection.

Saliency Detection Semantic Segmentation +3

Paper
Code

Involution: Inverting the Inherence of Convolution for Visual Recognition

13 code implementations • CVPR 2021 • Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen

Convolution has been the core ingredient of modern neural networks, triggering the surge of deep learning in vision.

Ranked #703 on Image Classification on ImageNet

Image Classification

5,251

Paper
Code

Feature-level Attentive ICF for Recommendation

1 code implementation • 22 Feb 2021 • Zhiyong Cheng, Fan Liu, Shenghan Mei, Yangyang Guo, Lei Zhu, Liqiang Nie

To demonstrate the effectiveness of our method, we design a light attention neural network to integrate both item-level and feature-level attention for neural ICF models.

Collaborative Filtering Recommendation Systems

Paper
Code

Interest-aware Message-Passing GCN for Recommendation

1 code implementation • 19 Feb 2021 • Fan Liu, Zhiyong Cheng, Lei Zhu, Zan Gao, Liqiang Nie

To form the subgraphs, we design an unsupervised subgraph generation module, which can effectively identify users with common interests by exploiting both user feature and graph structure.

Paper
Code

Deep Texture-Aware Features for Camouflaged Object Detection

no code implementations • 5 Feb 2021 • Jingjing Ren, Xiaowei Hu, Lei Zhu, Xuemiao Xu, Yangyang Xu, Weiming Wang, Zijun Deng, Pheng-Ann Heng

Camouflaged object detection is a challenging task that aims to identify objects having similar texture to the surroundings.

Object object-detection +1

Paper
Add Code

Mitigating Intensity Bias in Shadow Detection via Feature Decomposition and Reweighting

no code implementations • ICCV 2021 • Lei Zhu, Ke Xu, Zhanghan Ke, Rynson W. H. Lau

These two phenomena reveal that deep shadow detectors heavily depend on the intensity cue, which we refer to as intensity bias.

Shadow Detection

Paper
Add Code

A Unified Framework to Analyze and Design the Nonlocal Blocks for Neural Networks

no code implementations • 1 Jan 2021 • Lei Zhu, Qi She, Changhu Wang

When choosing the Chebyshev graph filter, a generalized formulation can be derived for explaining the existing nonlocal-based blocks (e.g., nonlocal block, nonlocal stage, double attention block) and used to analyze their irrationality.

Action Recognition Fine-Grained Image Classification

Paper
Add Code

MLCask: Efficient Management of Component Evolution in Collaborative Data Analytics Pipelines

no code implementations • 17 Oct 2020 • Zhaojing Luo, Sai Ho Yeung, Meihui Zhang, Kaiping Zheng, Lei Zhu, Gang Chen, Feiyi Fan, Qian Lin, Kee Yuan Ngiam, Beng Chin Ooi

In this paper, we identify two main challenges that arise during the deployment of machine learning pipelines, and address them with the design of versioning for an end-to-end analytics system MLCask.

BIG-bench Machine Learning Management

Paper
Add Code

Learning to Detect Specular Highlights from Real-world Images

1 code implementation • 10 Oct 2020 • Gang Fu, Qing Zhang, QiFeng Lin, Lei Zhu, Chunxia Xiao

Specular highlight detection is a challenging problem, and has many applications such as shiny object detection and light source estimation.

Highlight Detection object-detection +1

Paper
Code

Dual-level Semantic Transfer Deep Hashing for Efficient Social Image Retrieval

1 code implementation • 10 Jun 2020 • Lei Zhu, Hui Cui, Zhiyong Cheng, Jingjing Li, Zheng Zhang

Specifically, we design a complementary dual-level semantic transfer mechanism to efficiently discover the potential semantics of tags and seamlessly transfer them into binary hash codes.

Deep Hashing Image Retrieval +1

Paper
Code

Constrained Multi-shape Evolution for Overlapping Cytoplasm Segmentation

no code implementations • 8 Apr 2020 • Youyi Song, Lei Zhu, Baiying Lei, Bin Sheng, Qi Dou, Jing Qin, Kup-Sze Choi

In the shape evolution, we compensate intensity deficiency for the segmentation by introducing not only the modeled local shape priors but also global shape priors (clump-level) modeled by considering mutual shape constraints of cytoplasms in the clump.

Paper
Add Code

Task-adaptive Asymmetric Deep Cross-modal Hashing

no code implementations • 1 Apr 2020 • Fengling Li, Tong Wang, Lei Zhu, Zheng Zhang, Xinhua Wang

Unlike previous cross-modal hashing approaches, our learning framework jointly optimizes semantic preserving that transforms deep features of multimedia data into binary hash codes, and the semantic regression which directly regresses query modality representation to explicit label.

Cross-Modal Retrieval Retrieval

Paper
Add Code

Multi-Feature Discrete Collaborative Filtering for Fast Cold-start Recommendation

no code implementations • 24 Mar 2020 • Yang Xu, Lei Zhu, Zhiyong Cheng, Jingjing Li, Jiande Sun

Additionally, we develop a fast discrete optimization algorithm to directly compute the binary hash codes with simple operations.

Collaborative Filtering Quantization

Paper
Add Code

Robust Medical Instrument Segmentation Challenge 2019

no code implementations • 23 Mar 2020 • Tobias Ross, Annika Reinke, Peter M. Full, Martin Wagner, Hannes Kenngott, Martin Apitz, Hellena Hempe, Diana Mindroc Filimon, Patrick Scholz, Thuy Nuong Tran, Pierangela Bruno, Pablo Arbeláez, Gui-Bin Bian, Sebastian Bodenstedt, Jon Lindström Bolmgren, Laura Bravo-Sánchez, Hua-Bin Chen, Cristina González, Dong Guo, Pål Halvorsen, Pheng-Ann Heng, Enes Hosgor, Zeng-Guang Hou, Fabian Isensee, Debesh Jha, Tingting Jiang, Yueming Jin, Kadir Kirtac, Sabrina Kletz, Stefan Leger, Zhixuan Li, Klaus H. Maier-Hein, Zhen-Liang Ni, Michael A. Riegler, Klaus Schoeffmann, Ruohua Shi, Stefanie Speidel, Michael Stenzel, Isabell Twick, Gutai Wang, Jiacheng Wang, Liansheng Wang, Lu Wang, Yu-Jie Zhang, Yan-Jie Zhou, Lei Zhu, Manuel Wiesenfarth, Annette Kopp-Schneider, Beat P. Müller-Stich, Lena Maier-Hein

The validation of the competing methods for the three tasks (binary segmentation, multi-instance detection and multi-instance segmentation) was performed in three different stages with an increasing domain gap between the training and the test data.

Benchmarking Instance Segmentation +2

Paper
Add Code

A^2-GCN: An Attribute-aware Attentive GCN Model for Recommendation

no code implementations • 20 Mar 2020 • Fan Liu, Zhiyong Cheng, Lei Zhu, Chenghao Liu, Liqiang Nie

Considering the fact that for different users, the attributes of an item have different influence on their preference for this item, we design a novel attention mechanism to filter the message passed from an item to a target user by considering the attribute information.

Attribute Recommendation Systems

Paper
Add Code

Neural Networks Weights Quantization: Target None-retraining Ternary (TNT)

no code implementations • 18 Dec 2019 • Tianyu Zhang, Lei Zhu, Qian Zhao, Kilho Shin

Quantization of the weights of deep neural networks (DNNs) has proven to be an effective solution for implementing DNNs on edge devices such as mobiles, ASICs and FPGAs, because these devices lack sufficient resources to support computation involving millions of high-precision weights and multiply-accumulate operations.

Quantization

Paper
Add Code

DDNet: Dual-path Decoder Network for Occlusion Relationship Reasoning

no code implementations • 26 Nov 2019 • Panhe Feng, Xuejing Kang, Lizhu Ye, Lei Zhu, Chunpeng Li, Anlong Ming

Besides, considering the restriction that the occlusion orientation representation places on occlusion orientation learning, we design a new orthogonal representation for occlusion orientation and propose the Orthogonal Orientation Regression loss, which removes the mismatch between occlusion representation and learning and further promotes occlusion orientation learning.

regression

Paper
Add Code

CANet: Cross-disease Attention Network for Joint Diabetic Retinopathy and Diabetic Macular Edema Grading

1 code implementation • 4 Nov 2019 • Xiaomeng Li, Xiao-Wei Hu, Lequan Yu, Lei Zhu, Chi-Wing Fu, Pheng-Ann Heng

In this paper, we present a novel cross-disease attention network (CANet) to jointly grade DR and DME by exploring the internal relationship between the diseases with only image-level supervision.

Paper
Code

A Spectral Nonlocal Block for Neural Networks

no code implementations • 4 Nov 2019 • Lei Zhu, Qi She, Lidan Zhang, Ping Guo

The nonlocal-based blocks are designed for capturing long-range spatial-temporal dependencies in computer vision tasks.

Action Recognition Fine-Grained Image Classification +3

Paper
Add Code

Distribution Matching Prototypical Network for Unsupervised Domain Adaptation

no code implementations • 25 Sep 2019 • Lei Zhu, Wei Wang, Mei Hui Zhang, Beng Chin Ooi, Chang Yao

State-of-the-art Unsupervised Domain Adaptation (UDA) methods learn transferable features by minimizing the feature distribution discrepancy between the source and target domains.

Unsupervised Domain Adaptation

Paper
Add Code

Spectral Nonlocal Block for Neural Network

no code implementations • 25 Sep 2019 • Lei Zhu, Qi She, Lidan Zhang, Ping Guo

The nonlocal network is designed for capturing long-range spatial-temporal dependencies in several computer vision tasks.

Video Classification

Paper
Add Code

Cycle-consistent Conditional Adversarial Transfer Networks

1 code implementation • 17 Sep 2019 • Jingjing Li, Erpeng Chen, Zhengming Ding, Lei Zhu, Ke Lu, Zi Huang

Domain adaptation investigates the problem of cross-domain knowledge transfer where the labeled source domain and unlabeled target domain have distinctive data distributions.

Ranked #3 on Domain Adaptation on USPS-to-MNIST

Domain Adaptation Transfer Learning

Paper
Code

Alleviating Feature Confusion for Generative Zero-shot Learning

1 code implementation • 17 Sep 2019 • Jingjing Li, Mengmeng Jing, Ke Lu, Lei Zhu, Yang Yang, Zi Huang

An inevitable issue of such a paradigm is that the synthesized unseen features are prone to seen references and incapable of reflecting the novelty and diversity of real unseen instances.

Generalized Zero-Shot Learning

Paper
Code

Personalized Hashtag Recommendation for Micro-videos

1 code implementation • 27 Aug 2019 • Yinwei Wei, Zhiyong Cheng, Xuzheng Yu, Zhou Zhao, Lei Zhu, Liqiang Nie

The hashtags that a user provides for a post (e.g., a micro-video) are the ones that, in her mind, best describe the post content she is interested in.

Paper
Code

Enhancing Underexposed Photos using Perceptually Bidirectional Similarity

no code implementations • 25 Jul 2019 • Qing Zhang, Yongwei Nie, Lei Zhu, Chunxia Xiao, Wei-Shi Zheng

To obtain high-quality results free of these artifacts, we present a novel underexposed photo enhancement approach that is able to maintain the perceptual consistency.

Video Enhancement

Paper
Add Code

Deep Attentive Features for Prostate Segmentation in 3D Transrectal Ultrasound

1 code implementation • 3 Jul 2019 • Yi Wang, Haoran Dou, Xiao-Wei Hu, Lei Zhu, Xin Yang, Ming Xu, Jing Qin, Pheng-Ann Heng, Tianfu Wang, Dong Ni

Our attention module utilizes the attention mechanism to selectively leverage the multilevel features integrated from different layers to refine the features at each individual layer, suppressing the non-prostate noise at shallow layers of the CNN and increasing more prostate details into features at deep layers.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Probabilistic Multilayer Regularization Network for Unsupervised 3D Brain Image Registration

no code implementations • 3 Jul 2019 • Lihao Liu, Xiaowei Hu, Lei Zhu, Pheng-Ann Heng

This paper presents a novel framework for unsupervised 3D brain image registration by capturing the feature-level transformation relationships between the unaligned image and reference image.

Image Registration

Paper
Add Code

From Zero-Shot Learning to Cold-Start Recommendation

1 code implementation • 20 Jun 2019 • Jingjing Li, Mengmeng Jing, Ke Lu, Lei Zhu, Yang Yang, Zi Huang

This work, for the first time, formulates CSR as a ZSL problem, and a tailor-made ZSL method is proposed to handle CSR.

Recommendation Systems Zero-Shot Learning

Paper
Code

PAC-GAN: An Effective Pose Augmentation Scheme for Unsupervised Cross-View Person Re-identification

no code implementations • 5 Jun 2019 • Chengyuan Zhang, Lei Zhu, Shichao Zhang

In this paper, we introduce a novel unsupervised pose augmentation cross-view person Re-Id scheme called PAC-GAN to overcome these limitations.

Cross-Modal Person Re-Identification Generative Adversarial Network +2

Paper
Add Code

Adaptive Collaborative Similarity Learning for Unsupervised Multi-view Feature Selection

no code implementations • 25 Apr 2019 • Xiao Dong, Lei Zhu, Xuemeng Song, Jingjing Li, Zhiyong Cheng

We propose to dynamically learn the collaborative similarity structure, and further integrate it with the ultimate feature selection into a unified framework.

feature selection

Paper
Add Code

Exploring Auxiliary Context: Discrete Semantic Transfer Hashing for Scalable Image Retrieval

no code implementations • 25 Apr 2019 • Lei Zhu, Zi Huang, Zhihui Li, Liang Xie, Heng Tao Shen

To address the problem, in this paper, we propose a novel hashing approach, dubbed Discrete Semantic Transfer Hashing (DSTH).

Content-Based Image Retrieval Retrieval

Paper
Add Code

Discrete Optimal Graph Clustering

1 code implementation • 25 Apr 2019 • Yudong Han, Lei Zhu, Zhiyong Cheng, Jingjing Li, Xiaobai Liu

2) the relaxing process of cluster labels may cause significant information loss.

Clustering Graph Clustering +1

Paper
Code

Fusion-supervised Deep Cross-modal Hashing

no code implementations • 25 Apr 2019 • Li Wang, Lei Zhu, En Yu, Jiande Sun, Huaxiang Zhang

Deep hashing has recently received attention in cross-modal retrieval for its impressive advantages.

Cross-Modal Retrieval Deep Hashing

Paper
Add Code

Leveraging the Invariant Side of Generative Zero-Shot Learning

1 code implementation • CVPR 2019 • Jingjing Li, Mengmeng Jing, Ke Lu, Zhengming Ding, Lei Zhu, Zi Huang

In this paper, we take advantage of generative adversarial networks (GANs) and propose a novel method, named leveraging invariant side GAN (LisGAN), which can directly generate the unseen features from random noises which are conditioned by the semantic descriptions.

Ranked #4 on Generalized Zero-Shot Learning on SUN Attribute

Generalized Zero-Shot Learning

Paper
Code

SAC-Net: Spatial Attenuation Context for Salient Object Detection

no code implementations • 25 Mar 2019 • Xiaowei Hu, Chi-Wing Fu, Lei Zhu, Tianyu Wang, Pheng-Ann Heng

This paper presents a new deep neural network design for salient object detection by maximizing the integration of local and global image context within, around, and beyond the salient objects.

Object object-detection +2

Paper
Add Code

Explicit Interaction Model towards Text Classification

1 code implementation • 23 Nov 2018 • Cunxiao Du, Zhaozheng Chin, Fuli Feng, Lei Zhu, Tian Gan, Liqiang Nie

To address this problem, we introduce the interaction mechanism to incorporate word-level matching signals into the text classification task.

Ranked #4 on Text Classification on Yahoo! Answers

General Classification Multi Class Text Classification +3

Paper
Code

MMALFM: Explainable Recommendation by Leveraging Reviews and Images

no code implementations • 12 Nov 2018 • Zhiyong Cheng, Xiaojun Chang, Lei Zhu, Rose C. Kanjirathinkal, Mohan Kankanhalli

Then the aspect importance is integrated into a novel aspect-aware latent factor model (ALFM), which learns users' and items' latent factors based on ratings.

Explainable Recommendation

Paper
Add Code

Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection

1 code implementation • ECCV 2018 • Lei Zhu, Zijun Deng, Xiao-Wei Hu, Chi-Wing Fu, Xuemiao Xu, Jing Qin, Pheng-Ann Heng

Second, we develop a bidirectional feature pyramid network (BFPN) to aggregate shadow contexts spanned across different CNN layers by deploying two series of RAR modules in the network to iteratively combine and refine context features: one series to refine context features from deep to shallow layers, and another series from shallow to deep layers.

Ranked #3 on Shadow Detection on SBU

Shadow Detection

122

Paper
Code

Direction-aware Spatial Context Features for Shadow Detection and Removal

2 code implementations • 12 May 2018 • Xiaowei Hu, Chi-Wing Fu, Lei Zhu, Jing Qin, Pheng-Ann Heng

This paper presents a novel deep neural network design for shadow detection and removal by analyzing the spatial image context in a direction-aware manner.

Ranked #6 on Shadow Removal on ISTD

Shadow Detection And Removal Shadow Removal

140

Paper
Code

Direction-aware Spatial Context Features for Shadow Detection

2 code implementations • CVPR 2018 • Xiaowei Hu, Lei Zhu, Chi-Wing Fu, Jing Qin, Pheng-Ann Heng

To achieve this, we first formulate the direction-aware attention mechanism in a spatial recurrent neural network (RNN) by introducing attention weights when aggregating spatial context features in the RNN.

Ranked #2 on RGB Salient Object Detection on SBU

Detecting Shadows Shadow Detection

140

Paper
Code

Leveraging Weak Semantic Relevance for Complex Video Event Classification

no code implementations • ICCV 2017 • Chao Li, Jiewei Cao, Zi Huang, Lei Zhu, Heng Tao Shen

In this paper, we propose a novel approach to automatically maximize the utility of weak semantic annotations (formalized as the semantic relevance of video shots to the target event) to facilitate video event classification.

Classification General Classification

Paper
Add Code

Joint Bi-Layer Optimization for Single-Image Rain Streak Removal

no code implementations • ICCV 2017 • Lei Zhu, Chi-Wing Fu, Dani Lischinski, Pheng-Ann Heng

A third prior is defined on the rain-streak layer R, based on similarity of patches to the extracted rain patches.

Rain Removal

Paper
Add Code

Saliency Pattern Detection by Ranking Structured Trees

1 code implementation • ICCV 2017 • Lei Zhu, Haibin Ling, Jin Wu, Huiping Deng, Jin Liu

We show that the linear combination of structured labels can well model the saliency distribution in local regions.

object-detection RGB Salient Object Detection +2

Paper
Code

Discrete Multi-modal Hashing with Canonical Views for Robust Mobile Landmark Search

no code implementations • 13 Jul 2017 • Lei Zhu, Zi Huang, Xiaobai Liu, Xiangnan He, Jingkuan Song, Xiaofang Zhou

Finally, compact binary codes are learned on intermediate representation within a tailored discrete binary embedding model which preserves visual relations of images measured with canonical views and removes the involved noises.

Paper
Add Code

A Non-Local Low-Rank Framework for Ultrasound Speckle Reduction

no code implementations • CVPR 2017 • Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng

'Speckle' refers to the granular patterns that occur in ultrasound images due to wave interference.

Paper
Add Code
