Search Results for author: Li Liu

Found 237 papers, 107 papers with code

Region Graph Embedding Network for Zero-Shot Learning

no code implementations ECCV 2020 Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao

To exploit the progressive interactions among these regions, we represent them as a region graph, on which the parts relation reasoning is performed with graph convolutions, thus leading to our PRR branch.

Graph Embedding Relation +1

MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking

no code implementations24 Nov 2024 Chunhui Zhang, Li Liu, Hao Wen, Xi Zhou, Yanfeng Wang

Night unmanned aerial vehicle (UAV) tracking is impeded by the challenges of poor illumination, with previous daylight-optimized methods demonstrating suboptimal performance in low-light conditions, limiting the utility of UAV applications.

Image Enhancement Mamba

GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving

1 code implementation19 Nov 2024 Shaoqing Xu, Fang Li, Shengyin Jiang, Ziying Song, Li Liu, Zhi-Xin Yang

In this context, we are excited to introduce GaussianPretrain, a novel pre-training paradigm that achieves a holistic understanding of the scene by uniformly integrating geometric and texture representations.

3D Object Detection Autonomous Driving +3

Step-wise Distribution Alignment Guided Style Prompt Tuning for Source-free Cross-domain Few-shot Learning

1 code implementation15 Nov 2024 Huali Xu, Yongxiang Liu, Li Liu, Shuaifeng Zhi, Shuzhou Sun, Tianpeng Liu, MingMing Cheng

This paper addresses the source-free CDFSL (SF-CDFSL) problem, tackling few-shot learning (FSL) in the target domain using only pre-trained models and a few target samples without source data or strategies.

cross-domain few-shot learning

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting

no code implementations14 Nov 2024 Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang

In this paper, we introduce the Spatio-Temporal Unitized Model (STUM), a unified framework designed to capture both spatial and temporal dependencies while addressing spatio-temporal heterogeneity through techniques such as distribution alignment and feature fusion.

Computational Efficiency Hyperparameter Optimization

MaDiNet: Mamba Diffusion Network for SAR Target Detection

1 code implementation12 Nov 2024 Jie zhou, Chao Xiao, Bowen Peng, Tianpeng Liu, Zhen Liu, Yongxiang Liu, Li Liu

The fundamental challenge in SAR target detection lies in developing discriminative, efficient, and robust representations of target characteristics within intricate non-cooperative environments.

Mamba

Advances in Photoacoustic Imaging Reconstruction and Quantitative Analysis for Biomedical Applications

no code implementations5 Nov 2024 Lei Wang, Weiming Zeng, Kai Long, Rongfeng Lan, Li Liu, Wai Ting Siok, Nizhuan Wang

Photoacoustic imaging (PAI) represents an innovative biomedical imaging modality that harnesses the advantages of optical resolution and acoustic penetration depth while ensuring enhanced safety.

Image Reconstruction

Right this way: Can VLMs Guide Us to See More to Answer Questions?

1 code implementation1 Nov 2024 Li Liu, Diji Yang, Sijia Zhong, Kalyana Suma Sree Tholeti, Lei Ding, Yi Zhang, Leilani H. Gilpin

To investigate this gap, we identify a critical and challenging task in the Visual Question Answering (VQA) scenario: can VLMs indicate how to adjust an image when the visual information is insufficient to answer a question?

Question Answering Visual Question Answering

Artificial Intelligence of Things: A Survey

1 code implementation25 Oct 2024 Shakhrul Iman Siam, Hyunho Ahn, Li Liu, Samiul Alam, Hui Shen, Zhichao Cao, Ness Shroff, Bhaskar Krishnamachari, Mani Srivastava, Mi Zhang

We hope this survey will serve as a valuable resource for those engaged in AIoT research and act as a catalyst for future explorations to bridge gaps and drive advancements in this exciting field.

Survey

New Paradigm of Adversarial Training: Breaking Inherent Trade-Off between Accuracy and Robustness via Dummy Classes

1 code implementation16 Oct 2024 Yanyun Wang, Li Liu, Zi Liang, Qingqing Ye, Haibo Hu

Accordingly, to relax the tension between clean and robust learning derived from this overstrict assumption, we propose a new AT paradigm by introducing an additional dummy class for each original class, aiming to accommodate the hard adversarial samples with shifted distribution after perturbation.

Adversarial Robustness

S$^4$ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack

no code implementations13 Oct 2024 Yongxiang Liu, Bowen Peng, Li Liu, Xiang Li

Transferable targeted adversarial attacks (TTAs) against deep neural networks have been proven significantly more challenging than untargeted ones, yet they remain relatively underexplored.

Numerical Approximation Capacity of Neural Networks with Bounded Parameters: Do Limits Exist, and How Can They Be Measured?

no code implementations25 Sep 2024 Li Liu, Tengchao Yu, Heng Yong

This leads us to question: \textbf{Does the approximation capacity of a neural network remain universal, or does it have a limit when the parameters are practically bounded?

Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2

2 code implementations25 Sep 2024 Chunhui Zhang, Li Liu, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang

Over the past decade, significant progress has been made in visual object tracking, largely due to the availability of large-scale training datasets.

Object Video Segmentation +2

Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework

1 code implementation19 Sep 2024 Xinyi Ying, Li Liu, Zaipin Lin, Yangsi Shi, Yingqian Wang, Ruojing Li, Xu Cao, Boyang Li, Shilin Zhou

To address the aforementioned challenges, in this paper, we first build a large-scale dataset for MIRST detection in satellite videos (namely IRSatVideo-LEO), and then develop a recurrent feature refinement (RFR) framework as the baseline method.

Motion Compensation Video Generation

Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation

no code implementations4 Sep 2024 Li Liu, Ruijie Zhu, Jiacheng Deng, Ziyang Song, Wenfei Yang, Tianzhu Zhang

Specifically, in the proposed plane guided depth generator (PGDG), we design a set of plane queries as prototypes to softly model planes in the scene and predict per-pixel plane coefficients.

Depth Prediction Monocular Depth Estimation

Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion

no code implementations1 Sep 2024 Yan Rong, Li Liu

Face-based Voice Conversion (FVC) is a novel task that leverages facial images to generate the target speaker's voice style.

Contrastive Learning Disentanglement +2

Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation

no code implementations28 Aug 2024 Weilin Lin, Li Liu, Jianze Li, Hui Xiong

This method, based on our findings on neuron weight changes (NWCs) of random unlearning, uses optimal transport (OT)-based model fusion to combine the advantages of both pruned and backdoored models.

backdoor defense

Prior-free Balanced Replay: Uncertainty-guided Reservoir Sampling for Long-Tailed Continual Learning

no code implementations27 Aug 2024 Lei Liu, Li Liu, Yawen Cui

Even in the era of large models, one of the well-known issues in continual learning (CL) is catastrophic forgetting, which is significantly challenging when the continual data stream exhibits a long-tailed distribution, termed as Long-Tailed Continual Learning (LTCL).

class-incremental learning Class Incremental Learning +1

Segment Anything for Videos: A Systematic Survey

2 code implementations31 Jul 2024 Chunhui Zhang, Yawen Cui, Weilin Lin, Guanjie Huang, Yan Rong, Li Liu, Shiguang Shan

To address this gap, this work conducts a systematic review on SAM for videos in the era of foundation models.

Image Segmentation Semantic Segmentation +5

BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning

1 code implementation29 Jul 2024 Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen

1) We provide an integrated implementation of state-of-the-art (SOTA) backdoor learning algorithms (currently including 20 attack and 32 defense algorithms), based on an extensible modular-based codebase.

Backdoor Attack

Enhancing Transferability of Targeted Adversarial Examples: A Self-Universal Perspective

1 code implementation22 Jul 2024 Bowen Peng, Li Liu, Tianpeng Liu, Zhen Liu, Yongxiang Liu

We also contribute a surprising empirical insight that one of the most fundamental transformations, simple image scaling, is highly effective, scalable, sufficient, and necessary in enhancing targeted transferability.

A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights

1 code implementation11 Jul 2024 Wentao Lei, Jinting Wang, Fengji Ma, Guanjie Huang, Li Liu

The goal of this survey is to offer the research community a clear and holistic view of the advancements in human video generation, highlighting the milestones achieved and the challenges that lie ahead.

Motion Generation Survey +1

ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation

1 code implementation11 Jul 2024 Ruijie Zhu, Chuxin Wang, Ziyang Song, Li Liu, Tianzhu Zhang, Yongdong Zhang

Our method decomposes metric depth into scene scale and relative depth, and predicts them through a semantic-aware scale prediction (SASP) module and an adaptive relative depth estimation (ARDE) module, respectively.

Monocular Depth Estimation

Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher

no code implementations10 Jul 2024 Jiangming Chen, Li Liu, Wanxia Deng, Zhen Liu, Yu Liu, YingMei Wei, Yongxiang Liu

Cross domain object detection learns an object detector for an unlabeled target domain by transferring knowledge from an annotated source domain.

object-detection Object Detection +1

Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines

1 code implementation20 Jun 2024 Xinyi Ying, Chao Xiao, Ruojing Li, Xu He, Boyang Li, Zhaoxu Li, Yingqian Wang, Mingyuan Hu, Qingyu Xu, Zaiping Lin, Miao Li, Shilin Zhou, Wei An, Weidong Sheng, Li Liu

Based on the proposed RGBT-Tiny dataset and SAFit measure, extensive evaluations have been conducted, including 23 recent state-of-the-art algorithms that cover four different types (i. e., visible generic detection, visible SOD, thermal SOD and RGBT object detection).

Diversity object-detection +1

TwinS: Revisiting Non-Stationarity in Multivariate Time Series Forecasting

no code implementations6 Jun 2024 Jiaxi Hu, Qingsong Wen, Sijie Ruan, Li Liu, Yuxuan Liang

In this paper, we begin by validating this theory through wavelet analysis and propose the Transformer-based TwinS model, which consists of three modules to address the non-stationary periodic distributions: Wavelet Convolution, Period-Aware Attention, and Channel-Temporal Mixed MLP.

Multivariate Time Series Forecasting Time Series

WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark

1 code implementation30 May 2024 Chunhui Zhang, Li Liu, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang

Most existing trackers are tailored for open-air environments, leading to performance degradation when applied to UOT due to domain gaps.

Knowledge Distillation Object Tracking

Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness

no code implementations30 May 2024 Weilin Lin, Li Liu, Shaokui Wei, Jianze Li, Hui Xiong

Recently, without poisoned data, unlearning models with clean data and then learning a pruning mask have contributed to backdoor defense.

backdoor defense

TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability

no code implementations27 May 2024 Fengji Ma, Li Liu, Hei Victor Cheng

Simultaneously, fixed pre-trained image embeddings are used as cross-modal auxiliary supervision to maintain the similarity between the MHE-tuned and original text embeddings by the knowledge distillation, preserving semantic information between different classes.

Adversarial Robustness Knowledge Distillation +1

Awesome Multi-modal Object Tracking

4 code implementations23 May 2024 Chunhui Zhang, Li Liu, Hao Wen, Xi Zhou, Yanfeng Wang

To leverage more modalities, some recent efforts have been made to learn a unified visual object tracking model for any modality.

Autonomous Driving Knowledge Distillation +5

SARatrX: Towards Building A Foundation Model for SAR Target Recognition

2 code implementations15 May 2024 Weijie Li, Wei Yang, Yuenan Hou, Li Liu, Yongxiang Liu, Xiang Li

Despite the remarkable progress in synthetic aperture radar automatic target recognition (SAR ATR), recent efforts have concentrated on the detection or classification of a specific and coarse category, e. g., vehicles, ships, airplanes, or buildings.

Earth Observation Self-Supervised Learning

Explainable Survival Analysis with Uncertainty using Convolution-Involved Vision Transformer

no code implementations journal 2024 Zhihao Tang, Li Liu, Yifan Shen, Zongyi Chen, Guixiang Ma, Jiyan Dong, Xujie Sun, Xi Zhang, Chaozhuo Li, Qingfeng Zheng, Lin Yang

Highlights•Without patching WSIs, a novel ViT-based model is proposed for survival predictions.•We first introduce aleatoric uncertainty into the survival loss function.•We explain survival prediction using a post-hoc explainable method.•Our method outperforms baselines in accuracy, explainability, and reliability.

Survival Analysis Survival Prediction

From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives

no code implementations3 Apr 2024 Shuxian Fan, Adam Visokay, Kentaro Hoffman, Stephen Salerno, Li Liu, Jeffrey T. Leek, Tyler H. McCormick

In this paper, we develop a method for valid inference using outcomes (in our case COD) predicted from free-form text using state-of-the-art NLP techniques.

Decision Making Language Modelling +1

SceneTracker: Long-term Scene Flow Estimation Network

1 code implementation29 Mar 2024 Bo wang, Jian Li, Yang Yu, Li Liu, Zhenping Sun, Dewen Hu

Considering the complementarity of scene flow estimation in the spatial domain's focusing capability and 3D object tracking in the temporal domain's coherence, this study aims to address a comprehensive new task that can simultaneously capture fine-grained and long-term 3D motion in an online manner: long-term scene flow estimation (LSFE).

3D Object Tracking Object Tracking +1

Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review

no code implementations22 Mar 2024 Jinge Wang, Zien Cheng, Qiuming Yao, Li Liu, Dong Xu, Gangqing Hu

The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines.

Chatbot Drug Discovery +3

LSKNet: A Foundation Lightweight Backbone for Remote Sensing

2 code implementations18 Mar 2024 YuXuan Li, Xiang Li, Yimian Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang

While a considerable amount of research has been dedicated to remote sensing classification, object detection and semantic segmentation, most of these studies have overlooked the valuable prior knowledge embedded within remote sensing scenarios.

Change Detection object-detection +2

SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection

1 code implementation11 Mar 2024 YuXuan Li, Xiang Li, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang

To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.

Ranked #2 on 2D Object Detection on SARDet-100K (using extra training data)

2k Object +2

Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds

1 code implementation CVPR 2024 Tianrui Lou, Xiaojun Jia, Jindong Gu, Li Liu, Siyuan Liang, Bangyan He, Xiaochun Cao

We find that concealing deformation perturbations in areas insensitive to human eyes can achieve a better trade-off between imperceptibility and adversarial strength, specifically in parts of the object surface that are complex and exhibit drastic curvature changes.

3D Point Cloud Classification Adversarial Attack +1

A Survey on Human-AI Teaming with Large Pre-Trained Models

no code implementations7 Mar 2024 Vanshika Vats, Marzia Binta Nizam, Minghao Liu, Ziyuan Wang, Richard Ho, Mohnish Sai Prasad, Vincent Titterton, Sai Venkat Malreddy, Riya Aggarwal, Yanwen Xu, Lei Ding, Jay Mehta, Nathan Grinnell, Li Liu, Sijia Zhong, Devanathan Nallur Gandamani, Xinyi Tang, Rohan Ghosalkar, Celeste Shen, Rachel Shen, Nafisa Hussain, Kesav Ravichandran, James Davis

In the rapidly evolving landscape of artificial intelligence (AI), the collaboration between human intelligence and AI systems, known as Human-AI (HAI) Teaming, has emerged as a cornerstone for advancing problem-solving and decision-making processes.

Decision Making

Enhancing Information Maximization with Distance-Aware Contrastive Learning for Source-Free Cross-Domain Few-Shot Learning

1 code implementation4 Mar 2024 Huali Xu, Li Liu, Shuaifeng Zhi, Shaojing Fu, Zhuo Su, Ming-Ming Cheng, Yongxiang Liu

For this reason, this paper explores a Source-Free CDFSL (SF-CDFSL) problem, in which CDFSL is addressed through the use of existing pretrained models instead of training a model with source data, avoiding accessing source data.

Contrastive Learning cross-domain few-shot learning

Lightweight Pixel Difference Networks for Efficient Visual Representation Learning

1 code implementation1 Feb 2024 Zhuo Su, Jiehua Zhang, Longguang Wang, Hua Zhang, Zhen Liu, Matti Pietikäinen, Li Liu

With PDC and Bi-PDC, we further present two lightweight deep networks named \emph{Pixel Difference Networks (PiDiNet)} and \emph{Binary PiDiNet (Bi-PiDiNet)} respectively to learn highly efficient yet more accurate representations for visual tasks including edge detection and object recognition.

Edge Detection Object Recognition +1

Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition

no code implementations31 Jan 2024 Lei Liu, Li Liu, Haizhou Li

Cued Speech (CS) is a pure visual coding method used by hearing-impaired people that combines lip reading with several specific hand shapes to make the spoken language visible.

Lip Reading speech-recognition +1

Towards Assessing the Synthetic-to-Measured Adversarial Vulnerability of SAR ATR

1 code implementation30 Jan 2024 Bowen Peng, Bo Peng, Jingyuan Xia, Tianpeng Liu, Yongxiang Liu, Li Liu

Recently, there has been increasing concern about the vulnerability of deep neural network (DNN)-based synthetic aperture radar (SAR) automatic target recognition (ATR) to adversarial attacks, where a DNN could be easily deceived by clean input with imperceptible but aggressive perturbations.

BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning

no code implementations26 Jan 2024 Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen

We hope that our efforts could build a solid foundation of backdoor learning to facilitate researchers to investigate existing algorithms, develop more innovative algorithms, and explore the intrinsic mechanism of backdoor learning.

Backdoor Attack

Bilateral Reference for High-Resolution Dichotomous Image Segmentation

1 code implementation7 Jan 2024 Peng Zheng, Dehong Gao, Deng-Ping Fan, Li Liu, Jorma Laaksonen, Wanli Ouyang, Nicu Sebe

It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef).

Camouflaged Object Segmentation Dichotomous Image Segmentation +3

Defenses in Adversarial Machine Learning: A Survey

no code implementations13 Dec 2023 Baoyuan Wu, Shaokui Wei, Mingli Zhu, Meixi Zheng, Zihao Zhu, Mingda Zhang, Hongrui Chen, Danni Yuan, Li Liu, Qingshan Liu

Adversarial phenomenon has been widely observed in machine learning (ML) systems, especially in those using deep neural networks, describing that ML systems may produce inconsistent and incomprehensible predictions with humans at some particular cases.

Survey

Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture

2 code implementations26 Nov 2023 Weijie Li, Yang Wei, Tianpeng Liu, Yuenan Hou, YuXuan Li, Zhen Liu, Yongxiang Liu, Li Liu

The growing Synthetic Aperture Radar (SAR) data has the potential to build a foundation model through Self-Supervised Learning (SSL) methods, which can achieve various SAR Automatic Target Recognition (ATR) tasks with pre-training in large-scale unlabeled data and fine-tuning in small labeled samples.

Representation Learning Self-Supervised Learning

Enhancing Representations through Heterogeneous Self-Supervised Learning

no code implementations8 Oct 2023 Zhong-Yu Li, Bo-Wen Yin, Yongxiang Liu, Li Liu, Ming-Ming Cheng

Thus, we propose Heterogeneous Self-Supervised Learning (HSSL), which enforces a base model to learn from an auxiliary head whose architecture is heterogeneous from the base model.

Image Classification Instance Segmentation +5

Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior

no code implementations5 Oct 2023 Jinting Wang, Li Liu, Jun Wang, Hei Victor Cheng

To overcome this challenge, we introduce the concept of residuals by integrating a statistical face prior to the diffusion process.

Face Generation

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision

1 code implementation18 Sep 2023 Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Bing Wang, Hongwei Xie, Li Liu, Shanghang Zhang

3D occupancy prediction holds significant promise in the fields of robot perception and autonomous driving, which quantifies 3D scenes into grid cells with semantic labels.

Autonomous Driving

DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation

1 code implementation18 Sep 2023 Bowen Yin, Xuying Zhang, Zhongyu Li, Li Liu, Ming-Ming Cheng, Qibin Hou

We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks.

3D geometry Decoder +7

Grouping Boundary Proposals for Fast Interactive Image Segmentation

no code implementations8 Sep 2023 Li Liu, Da Chen, Minglei Shu, Laurent D. Cohen

These boundary proposals are then incorporated into the proposed image segmentation model, such that the target segmentation contours are made up of a set of selected boundary proposals and the corresponding geodesic paths linking them.

Image Segmentation Segmentation +1

A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation

1 code implementation17 Aug 2023 Li Liu, Lufei Gao, Wentao Lei, Fengji Ma, Xiaotian Lin, Jinting Wang

In summary, this survey paper provides a comprehensive understanding of deep multi-modal learning for various BL generations and recognitions for the first time.

Domain Adaptation Self-Supervised Learning +1

Few-shot Class-incremental Learning: A Survey

no code implementations13 Aug 2023 Jinghua Zhang, Li Liu, Olli Silvén, Matti Pietikäinen, Dewen Hu

In our in-depth examination, we delve into various facets of FSCIL, encompassing the problem definition, the discussion of the primary challenges of unreliable empirical risk minimization and the stability-plasticity dilemma, general schemes, and relevant problems of IL and Few-shot Learning (FSL).

class-incremental learning Class-Incremental Object Detection +6

Pareto Invariant Representation Learning for Multimedia Recommendation

no code implementations9 Aug 2023 Shanshan Huang, Haoxuan Li, Qingsong Li, Chunyuan Zheng, Li Liu

Multimedia recommendation involves personalized ranking tasks, where multimedia content is usually represented using a generic encoder.

Multimedia recommendation Representation Learning

SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation

1 code implementation24 Jul 2023 YiQing Wang, Zihan Li, Jieru Mei, Zihao Wei, Li Liu, Chen Wang, Shengtian Sang, Alan Yuille, Cihang Xie, Yuyin Zhou

To address this limitation, we present Masked Multi-view with Swin Transformers (SwinMM), a novel multi-view pipeline for enabling accurate and data-efficient self-supervised medical image analysis.

Contrastive Learning Image Reconstruction +5

ROFusion: Efficient Object Detection using Hybrid Point-wise Radar-Optical Fusion

1 code implementation17 Jul 2023 Liu Liu, Shuaifeng Zhi, Zhenhua Du, Li Liu, Xinyu Zhang, Kai Huo, Weidong Jiang

In this paper, we propose a hybrid point-wise Radar-Optical fusion approach for object detection in autonomous driving scenarios.

Autonomous Driving Object +3

Unbiased Scene Graph Generation via Two-stage Causal Modeling

no code implementations11 Jul 2023 Shuzhou Sun, Shuaifeng Zhi, Qing Liao, Janne Heikkilä, Li Liu

To remedy this, we propose Two-stage Causal Modeling (TsCM) for the SGG task, which takes the long-tailed distribution and semantic confusion as confounders to the Structural Causal Model (SCM) and then decouples the causal intervention into two stages.

Causal Inference Graph Generation +2

All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment

no code implementations7 Jul 2023 Chunhui Zhang, Xin Sun, Li Liu, Yiqian Yang, Qiong Liu, Xi Zhou, Yanfeng Wang

This approach achieves feature integration in a unified backbone, removing the need for carefully-designed fusion modules and resulting in a more effective and efficient VL tracking framework.

Revisiting Computer-Aided Tuberculosis Diagnosis

1 code implementation6 Jul 2023 Yun Liu, Yu-Huan Wu, Shi-Chen Zhang, Li Liu, Min Wu, Ming-Ming Cheng

This dataset enables the training of sophisticated detectors for high-quality CTD.

Image Classification

UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering

no code implementations15 Jun 2023 Mingjie Pan, Li Liu, Jiaming Liu, Peixiang Huang, Longlong Wang, Shanghang Zhang, Shaoqing Xu, Zhiyi Lai, Kuiyuan Yang

In this technical report, we present our solution, named UniOCC, for the Vision-Centric 3D occupancy prediction track in the nuScenes Open Dataset Challenge at CVPR 2023.

Prediction Of Occupancy Grid Maps

SplatFlow: Learning Multi-frame Optical Flow via Splatting

1 code implementation15 Jun 2023 Bo wang, Yifan Zhang, Jian Li, Yang Yu, Zhenping Sun, Li Liu, Dewen Hu

The occlusion problem remains a crucial challenge in optical flow estimation (OFE).

Optical Flow Estimation

Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks

no code implementations6 Jun 2023 Jianrong Wang, Yaxin Zhao, Li Liu, Tianyi Xu, Qi Li, Sen Li

Given an audio clip and a reference face image, the goal of the talking head generation is to generate a high-fidelity talking head video.

Talking Head Generation

A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus

1 code implementation5 Jun 2023 Lufei Gao, Shan Huang, Li Liu

Cued Speech (CS) is a multi-modal visual coding system combining lip reading with several hand cues at the phonetic level to make the spoken language visible to the hearing impaired.

Lip Reading

MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information

1 code implementation4 Jun 2023 Jianrong Wang, Yuchen Huo, Li Liu, Tianyi Xu, Qi Li, Sen Li

Audio-visual speech recognition (AVSR) gains increasing attention from researchers as an important part of human-computer interaction.

Audio-Visual Speech Recognition speech-recognition +1

Versatile Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers

no code implementations1 Jun 2023 Ruotong Wang, Hongrui Chen, Zihao Zhu, Li Liu, Baoyuan Wu

Deep neural networks (DNNs) can be manipulated to exhibit specific behaviors when exposed to specific trigger patterns, without affecting their performance on benign samples, dubbed \textit{backdoor attack}.

Backdoor Attack backdoor defense +1

X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models

1 code implementation18 May 2023 Yixiong Chen, Li Liu, Chris Ding

This paper introduces a novel explainable image quality evaluation approach called X-IQE, which leverages visual large language models (LLMs) to evaluate text-to-image generation methods by generating textual explanations.

Benchmarking Text-to-Image Generation

FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning

no code implementations15 May 2023 Penghui Wei, Hongjian Dou, Shaoguo Liu, Rongjun Tang, Li Liu, Liang Wang, Bo Zheng

We introduce FedAds, the first benchmark for CVR estimation with vFL, to facilitate standardized and systematical evaluations for vFL algorithms.

Privacy Preserving Vertical Federated Learning

A Comprehensive Survey on Segment Anything Model for Vision and Beyond

1 code implementation14 May 2023 Chunhui Zhang, Li Liu, Yawen Cui, Guanjie Huang, Weilin Lin, Yiqian Yang, Yuehong Hu

As the first to comprehensively review the progress of segmenting anything task for vision and beyond based on the foundation model of SAM, this work focuses on its applications to various tasks and data types by discussing its historical development, recent progress, and profound impact on broad applications.

Deep Intellectual Property Protection: A Survey

no code implementations28 Apr 2023 Yuchen Sun, Tianpeng Liu, Panhe Hu, Qing Liao, Shaojing Fu, Nenghai Yu, Deke Guo, Yongxiang Liu, Li Liu

Deep Neural Networks (DNNs), from AlexNet to ResNet to ChatGPT, have made revolutionary progress in recent years, and are widely used in various fields.

Survey

A Forward and Backward Compatible Framework for Few-shot Class-incremental Pill Recognition

1 code implementation24 Apr 2023 Jinghua Zhang, Li Liu, Kai Gao, Dewen Hu

In forward-compatible learning, we propose an innovative virtual class synthesis strategy and a Center-Triplet (CT) loss to enhance discriminative feature learning.

class-incremental learning Few-Shot Class-Incremental Learning +6

Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution

1 code implementation13 Apr 2023 Zhuo Su, Jiehua Zhang, Tianpeng Liu, Zhen Liu, Shuanghui Zhang, Matti Pietikäinen, Li Liu

This paper proposes a novel module called middle spectrum grouped convolution (MSGC) for efficient deep convolutional neural networks (DCNNs) with the mechanism of grouped convolution.

Image Classification object-detection +1

Hierarchical Disentanglement-Alignment Network for Robust SAR Vehicle Recognition

1 code implementation7 Apr 2023 Weijie Li, Wei Yang, Wenpeng Zhang, Tianpeng Liu, Yongxiang Liu, Li Liu

However, robustly recognizing vehicle targets is a challenging task in SAR due to the large intraclass variations and small interclass variations.

Data Augmentation Disentanglement

Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection with Single Point Supervision

1 code implementation CVPR 2023 Xinyi Ying, Li Liu, Yingqian Wang, Ruojing Li, Nuo Chen, Zaiping Lin, Weidong Sheng, Shilin Zhou

Interestingly, during the training phase supervised by point labels, we discover that CNNs first learn to segment a cluster of pixels near the targets, and then gradually converge to predict groundtruth point labels.

Learning Invariant Representation via Contrastive Feature Alignment for Clutter Robust SAR Target Recognition

no code implementations4 Apr 2023 Bowen Peng, Jianyue Xie, Bo Peng, Li Liu

The proposed method contributes a mixed clutter variants generation strategy and a new inference branch equipped with channel-weighted mean square error (CWMSE) loss for invariant representation learning.

Contrastive Learning Representation Learning

Discovering and Explaining the Non-Causality of Deep Learning in SAR ATR

2 code implementations3 Apr 2023 Weijie Li, Wei Yang, Li Liu, Wenpeng Zhang, Yongxiang Liu

Therefore, the degree of overfitting for clutter reflects the non-causality of deep learning in SAR ATR.

Deep Learning Selection bias

FER-former: Multi-modal Transformer for Facial Expression Recognition

no code implementations23 Mar 2023 Yande Li, Mingjie Wang, Minglun Gong, Yonggang Lu, Li Liu

The ever-increasing demands for intuitive interactions in Virtual Reality has triggered a boom in the realm of Facial Expression Recognition (FER).

Facial Expression Recognition Facial Expression Recognition (FER)

SRFormerV2: Taking a Closer Look at Permuted Self-Attention for Image Super-Resolution

1 code implementation ICCV 2023 Yupeng Zhou, Zhen Li, Chun-Le Guo, Li Liu, Ming-Ming Cheng, Qibin Hou

Without any bells and whistles, we show that our SRFormer achieves a 33. 86dB PSNR score on the Urban100 dataset, which is 0. 46dB higher than that of SwinIR but uses fewer parameters and computations.

Image Super-Resolution

Deep Learning for Cross-Domain Few-Shot Visual Recognition: A Survey

no code implementations15 Mar 2023 Huali Xu, Shuaifeng Zhi, Shuzhou Sun, Vishal M. Patel, Li Liu

To address this, Few-shot learning (FSL) enables models to perform the target tasks with very few labeled examples by leveraging prior knowledge from related tasks.

cross-domain few-shot learning

Spatio-Temporal Structure Consistency for Semi-supervised Medical Image Classification

no code implementations3 Mar 2023 Wentao Lei, Lei Liu, Li Liu

Experiments on two medical image datasets (i. e., ISIC 2018 challenge and ChestX-ray14) show that our method outperforms state-of-the-art SSL methods.

Image Classification Medical Diagnosis +1

Attacks in Adversarial Machine Learning: A Systematic Survey from the Life-cycle Perspective

1 code implementation19 Feb 2023 Baoyuan Wu, Zihao Zhu, Li Liu, Qingshan Liu, Zhaofeng He, Siwei Lyu

Adversarial machine learning (AML) studies the adversarial phenomenon of machine learning, which may make inconsistent or unexpected predictions with humans.

Backdoor Attack

Generalized Few-Shot Continual Learning with Contrastive Mixture of Adapters

1 code implementation12 Feb 2023 Yawen Cui, Zitong Yu, Rizhao Cai, Xun Wang, Alex C. Kot, Li Liu

The goal of Few-Shot Continual Learning (FSCL) is to incrementally learn novel tasks with limited labeled samples and preserve previous capabilities simultaneously, while current FSCL methods are all for the class-incremental purpose.

Continual Learning Contrastive Learning +2

Learning Symbolic Models for Graph-structured Physical Mechanism

no code implementations ICLR 2023 2023 Hongzhi Shi, Jingtao Ding, Yufan Cao, Quanming Yao, Li Liu, Yong Li

The essence of our method is to model the formula skeleton with a message-passing flow, which helps transform the discovery of the skeleton into the search for the message-passing flow.

regression scientific discovery +1

Uncertainty-Aware Distillation for Semi-Supervised Few-Shot Class-Incremental Learning

1 code implementation24 Jan 2023 Yawen Cui, Wanxia Deng, Haoyu Chen, Li Liu

Given a model well-trained with a large-scale base dataset, Few-Shot Class-Incremental Learning (FSCIL) aims at incrementally learning novel classes from a few labeled samples by avoiding overfitting, without catastrophically forgetting all encountered classes previously.

class-incremental learning Few-Shot Class-Incremental Learning +2

Enabling Augmented Segmentation and Registration in Ultrasound-Guided Spinal Surgery via Realistic Ultrasound Synthesis from Diagnostic CT Volume

no code implementations5 Jan 2023 Ang Li, Jiayi Han, Yongjian Zhao, Keyu Li, Li Liu

While the US is not a standard paradigm for spinal surgery, the scarcity of intra-operative clinical US data is an insurmountable bottleneck in training a neural network.

Segmentation

Global Balanced Experts for Federated Long-Tailed Learning

1 code implementation ICCV 2023 Yaopei Zeng, Lei Liu, Li Liu, Li Shen, Shaoguo Liu, Baoyuan Wu

In particular, a proxy is derived from the accumulated gradients uploaded by the clients after local training, and is shared by all clients as the class prior for re-balance training.

Federated Learning Privacy Preserving

WL-Align: Weisfeiler-Lehman Relabeling for Aligning Users across Networks via Regularized Representation Learning

1 code implementation29 Dec 2022 Li Liu, Penggang Chen, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang

Aligning users across networks using graph representation learning has been found effective where the alignment is accomplished in a low-dimensional embedding space.

Graph Representation Learning

TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning

1 code implementation28 Dec 2022 Peixiang Huang, Li Liu, Renrui Zhang, Song Zhang, Xinli Xu, Baichao Wang, Guoyi Liu

In this paper, we propose the learning scheme of Target Inner-Geometry from the LiDAR modality into camera-based BEV detectors for both dense depth and BEV features, termed as TiG-BEV.

3D Object Detection object-detection

AMDET: Attention based Multiple Dimensions EEG Transformer for Emotion Recognition

no code implementations23 Dec 2022 Yongling Xu, Yang Du, Jing Zou, Tianying Zhou, Lushan Xiao, Li Liu, Pengcheng

In this paper, we propose a deep model called Attention-based Multiple Dimensions EEG Transformer (AMDET), which can exploit the complementarity among the spectral-spatial-temporal features of EEG data by employing the multi-dimensional global attention mechanism.

Brain Computer Interface EEG +1

Generating and Weighting Semantically Consistent Sample Pairs for Ultrasound Contrastive Learning

1 code implementation8 Dec 2022 Yixiong Chen, Chunhui Zhang, Chris H. Q. Ding, Li Liu

In this work, we pre-train DNNs on ultrasound (US) domains instead of ImageNet to reduce the domain gap in medical US applications.

Cancer Classification Contrastive Learning +3

Rethinking Two Consensuses of the Transferability in Deep Learning

no code implementations1 Dec 2022 Yixiong Chen, Jingxian Li, Chris Ding, Li Liu

Deep transfer learning (DTL) has formed a long-term quest toward enabling deep neural networks (DNNs) to reuse historical experiences as efficiently as humans.

Deep Learning General Knowledge +3

TAOTF: A Two-stage Approximately Orthogonal Training Framework in Deep Neural Networks

no code implementations25 Nov 2022 Taoyong Cui, Jianze Li, Yuhan Dong, Li Liu

In the first stage, we propose a novel algorithm called polar decomposition-based orthogonal initialization (PDOI) to find a good initialization for the orthogonal optimization.

Boosting Binary Neural Networks via Dynamic Thresholds Learning

no code implementations4 Nov 2022 Jiehua Zhang, Xueyang Zhang, Zhuo Su, Zitong Yu, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

For ViTs, DyBinaryCCT presents the superiority of the convolutional embedding layer in fully binarized ViTs and achieves 56. 1% on the ImageNet dataset, which is nearly 9% higher than the baseline.

Binarization

HiCo: Hierarchical Contrastive Learning for Ultrasound Video Model Pretraining

1 code implementation10 Oct 2022 Chunhui Zhang, Yixiong Chen, Li Liu, Qiong Liu, Xi Zhou

This work proposes a hierarchical contrastive learning (HiCo) method to improve the transferability for the US video model pretraining.

Contrastive Learning

On Clustering Trend in Language Evolution Based on Dynamical Behaviors of Multi-Agent Model

no code implementations3 Oct 2022 Yu Zhang, Li Liu, Chen Diao, Ning Cai

Computer model has been extensively adopted to overcome the time limitation of language evolution by transforming language theory into physical modeling mechanism, which helps to explore the general laws of the evolution.

Clustering

SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud Representation

1 code implementation13 Sep 2022 Zhuo Su, Max Welling, Matti Pietikäinen, Li Liu

Precisely, the presence of scalar features makes the major part of the network binarizable, while vector features serve to retain rich structural information and ensure SO(3) equivariance.

Autonomous Driving Binarization +1

Scattering Model Guided Adversarial Examples for SAR Target Recognition: Attack and Defense

1 code implementation11 Sep 2022 Bowen Peng, Bo Peng, Jie zhou, Jianyue Xie, Li Liu

Toward building more robust DNN-based SAR ATR models, this article explores the domain knowledge of SAR imaging process and proposes a novel Scattering Model Guided Adversarial Attack (SMGAA) algorithm which can generate adversarial perturbations in the form of electromagnetic scattering response (called adversarial scatterers).

Adversarial Attack Adversarial Robustness

Bag of Tricks for FGSM Adversarial Training

no code implementations6 Sep 2022 Zichao Li, Li Liu, Zeyu Wang, Yuyin Zhou, Cihang Xie

Adversarial training (AT) with samples generated by Fast Gradient Sign Method (FGSM), also known as FGSM-AT, is a computationally simple method to train robust networks.

Cross-Domain Few-Shot Classification via Inter-Source Stylization

no code implementations17 Aug 2022 Huali Xu, Shuaifeng Zhi, Li Liu

The goal of Cross-Domain Few-Shot Classification (CDFSC) is to accurately classify a target dataset with limited labelled data by exploiting the knowledge of a richly labelled auxiliary dataset, despite the differences between the domains of the two datasets.

Classification Cross-Domain Few-Shot +2

KL-divergence Based Deep Learning for Discrete Time Model

no code implementations10 Aug 2022 Li Liu, Xiangeng Fang, Di Wang, Weijing Tang, Kevin He

Neural Network (Deep Learning) is a modern model in Artificial Intelligence and it has been exploited in Survival Analysis.

Deep Learning Survival Analysis +1

Data-free Backdoor Removal based on Channel Lipschitzness

1 code implementation5 Aug 2022 Runkai Zheng, Rongjun Tang, Jianze Li, Li Liu

Pruning these channels was then shown to be effective in mitigating the backdoor behaviors.

Graph Signal Processing for Heterogeneous Change Detection Part II: Spectral Domain Analysis

no code implementations3 Aug 2022 Yuli Sun, Lin Lei, Dongdong Guan, Gangyao Kuang, Li Liu

Then, we propose a regression model for the HCD, which decomposes the source signal into the regressed signal and changed signal, and requires the regressed signal have the same spectral property as the target signal on the same graph.

Change Detection regression

Advanced Conditional Variational Autoencoders (A-CVAE): Towards interpreting open-domain conversation generation via disentangling latent feature representation

no code implementations26 Jul 2022 Ye Wang, Jingbo Liao, Hong Yu, Guoyin Wang, Xiaoxia Zhang, Li Liu

Particularly, the model integrates the macro-level guided-category knowledge and micro-level open-domain dialogue data for the training, leveraging the priori knowledge into the latent space, which enables the model to disentangle the latent variables within the mesoscopic scale.

Disentanglement

Rethinking Few-Shot Class-Incremental Learning with Open-Set Hypothesis in Hyperbolic Geometry

no code implementations20 Jul 2022 Yawen Cui, Zitong Yu, Wei Peng, Li Liu

Few-Shot Class-Incremental Learning (FSCIL) aims at incrementally learning novel classes from a few labeled samples by avoiding the overfitting and catastrophic forgetting simultaneously.

class-incremental learning Few-Shot Class-Incremental Learning +3

HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints

no code implementations16 Jul 2022 Wei Wu, Junlin He, Yu Qiao, Guoheng Fu, Li Liu, Jin Yu

The in-memory approximate nearest neighbor search (ANNS) algorithms have achieved great success for fast high-recall query processing, but are extremely inefficient when handling hybrid queries with unstructured (i. e., feature vectors) and structured (i. e., related attributes) constraints.

Attribute

MetaLR: Meta-tuning of Learning Rates for Transfer Learning in Medical Imaging

1 code implementation3 Jun 2022 Yixiong Chen, Li Liu, Jingxian Li, Hua Jiang, Chris Ding, Zongwei Zhou

In this work, we propose a meta-learning-based LR tuner, named MetaLR, to make different layers automatically co-adapt to downstream tasks based on their transferabilities across domains.

Medical Image Analysis Meta-Learning +1

Median Pixel Difference Convolutional Network for Robust Face Recognition

no code implementations30 May 2022 Jiehua Zhang, Zhuo Su, Li Liu

Face recognition is one of the most active tasks in computer vision and has been widely used in the real world.

Face Recognition Robust Face Recognition

Deep Learning for Visual Speech Analysis: A Survey

no code implementations22 May 2022 Changchong Sheng, Gangyao Kuang, Liang Bai, Chenping Hou, Yulan Guo, Xin Xu, Matti Pietikäinen, Li Liu

Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment.

Deep Learning speech-recognition +2

Acoustic-to-articulatory Inversion based on Speech Decomposition and Auxiliary Feature

no code implementations2 Apr 2022 Jianrong Wang, Jinyu Liu, Longxuan Zhao, Shanyu Wang, Ruiguo Yu, Li Liu

Acoustic-to-articulatory inversion (AAI) is to obtain the movement of articulators from speech signals.

Residual-guided Personalized Speech Synthesis based on Face Image

no code implementations1 Apr 2022 Jianrong Wang, Zixuan Wang, Xiaosheng Hu, XueWei Li, Qiang Fang, Li Liu

Experimental results show that the speech synthesized by our model is comparable to the personalized speech synthesized by training a large amount of audio data in previous works.

Speech Synthesis

Exploring Inter-Channel Correlation for Diversity-preserved KnowledgeDistillation

1 code implementation8 Feb 2022 Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang

Extensive experiments on two vision tasks, includ-ing ImageNet classification and Pascal VOC segmentation, demonstrate the superiority of our ICKD, which consis-tently outperforms many existing methods, advancing thestate-of-the-art in the fields of Knowledge Distillation.

Diversity Knowledge Distillation

WebUAV-3M: A Benchmark for Unveiling the Power of Million-Scale Deep UAV Tracking

1 code implementation19 Jan 2022 Chunhui Zhang, Guanjie Huang, Li Liu, Shan Huang, Yinan Yang, Xiang Wan, Shiming Ge, DaCheng Tao

In this work, we propose WebUAV-3M, the largest public UAV tracking benchmark to date, to facilitate both the development and evaluation of deep UAV trackers.

Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery

no code implementations18 Jan 2022 Yan Zhao, Lingjun Zhao, Zhong Liu, Dewen Hu, Gangyao Kuang, Li Liu

Aircraft detection in Synthetic Aperture Radar (SAR) imagery is a challenging task in SAR Automatic Target Recognition (SAR ATR) areas due to aircraft's extremely discrete appearance, obvious intraclass variation, small size and serious background's interference.

Improved (Related-key) Differential-based Neural Distinguishers for SIMON and SIMECK Block Ciphers

1 code implementation11 Jan 2022 Jinyu Lu, Guoqiang Liu, Bing Sun, Chao Li, Li Liu

In CRYPTO 2019, Gohr made a pioneering attempt and successfully applied deep learning to the differential cryptanalysis against NSA block cipher SPECK32/64, achieving higher accuracy than the pure differential distinguishers.

Cryptanalysis Deep Learning

Decoupling Makes Weakly Supervised Local Feature Better

1 code implementation CVPR 2022 Kunhong Li, Longguang Wang, Li Liu, Qing Ran, Kai Xu, Yulan Guo

Weakly supervised learning can help local feature methods to overcome the obstacle of acquiring a large-scale dataset with densely labeled correspondences.

Camera Localization Image Matching +1

Local Motion and Contrast Priors Driven Deep Network for Infrared Small Target Super-Resolution

1 code implementation4 Jan 2022 Xinyi Ying, Yingqian Wang, Longguang Wang, Weidong Sheng, Li Liu, Zaiping Lin, Shilin Zhou

Specifically, motivated by the local motion prior in the spatio-temporal dimension, we propose a local spatio-temporal attention module to perform implicit frame alignment and incorporate the local spatio-temporal information to enhance the local features (especially for small targets).

Super-Resolution

Learnable Lookup Table for Neural Network Quantization

1 code implementation CVPR 2022 Longguang Wang, Xiaoyu Dong, Yingqian Wang, Li Liu, Wei An, Yulan Guo

Since a linear quantizer (i. e., round(*) function) cannot well fit the bell-shaped distributions of weights and activations, many existing methods use pre-defined functions (e. g., exponential function) with learnable parameters to build the quantizer for joint optimization.

Computational Efficiency Image Classification +3

Efficient Video Transformers with Spatial-Temporal Token Selection

1 code implementation23 Nov 2021 Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang

Video transformers have achieved impressive results on major video recognition benchmarks, which however suffer from high computational cost.

Video Recognition

Image-Guided Navigation of a Robotic Ultrasound Probe for Autonomous Spinal Sonography Using a Shadow-aware Dual-Agent Framework

no code implementations3 Nov 2021 Keyu Li, Yangxin Xu, Jian Wang, Dong Ni, Li Liu, Max Q. -H. Meng

Ultrasound (US) imaging is commonly used to assist in the diagnosis and interventions of spine diseases, while the standardized US acquisitions performed by manually operating the probe require substantial experience and training of sonographers.

Anatomy Decision Making +2

Transient Synchronization Stability Analysis of Wind Farms with MMC-HVDC Integration Under Offshore AC Grid Fault

no code implementations25 Oct 2021 Yu Zhang, Chen Zhang, Renxin Yang, Jing Lyu, Li Liu, Xu Cai

The MMC-HVDC connected offshore wind farms (OWFs) could suffer short circuit fault (SCF), whereas their transient stability is not well analysed.

Unsupervised cross-user adaptation in taste sensation recognition based on surface electromyography with conformal prediction and domain regularized component analysis

no code implementations20 Oct 2021 Hengyang Wang, Xianghao Zhan, Li Liu, Asif Ullah, Huiyan Li, Han Gao, You Wang, Guang Li

The results show that DRCA improved the classification accuracy on six subjects (p < 0. 05), compared with the baseline models trained only with the source domain data;, while CPSC did not guarantee the accuracy improvement.

Conformal Prediction Data Augmentation

Dynamic Binary Neural Network by learning channel-wise thresholds

no code implementations8 Oct 2021 Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu

The experimental results prove that our method is an effective and straightforward way to reduce information loss and enhance performance of BNNs.

Decoupling Long- and Short-Term Patterns in Spatiotemporal Inference

no code implementations16 Sep 2021 Junfeng Hu, Yuxuan Liang, Zhencheng Fan, Li Liu, Yifang Yin, Roger Zimmermann

Specifically, we introduce a joint spatiotemporal graph attention network to learn the relations across space and time for short-term patterns.

Graph Attention