no code implementations • ECCV 2020 • Jiaxin Chen, Jie Qin, Yuming Shen, Li Liu, Fan Zhu, Ling Shao
This paper proposes a novel method for 3D shape representation learning, namely Hyperbolic Embedded Attentive Representation (HEAR).
no code implementations • ECCV 2020 • Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao
To exploit the progressive interactions among these regions, we represent them as a region graph, on which the parts relation reasoning is performed with graph convolutions, thus leading to our PRR branch.
no code implementations • 24 Nov 2024 • Chunhui Zhang, Li Liu, Hao Wen, Xi Zhou, Yanfeng Wang
Night unmanned aerial vehicle (UAV) tracking is impeded by the challenges of poor illumination, with previous daylight-optimized methods demonstrating suboptimal performance in low-light conditions, limiting the utility of UAV applications.
1 code implementation • 19 Nov 2024 • Shaoqing Xu, Fang Li, Shengyin Jiang, Ziying Song, Li Liu, Zhi-Xin Yang
In this context, we are excited to introduce GaussianPretrain, a novel pre-training paradigm that achieves a holistic understanding of the scene by uniformly integrating geometric and texture representations.
1 code implementation • 15 Nov 2024 • Huali Xu, Yongxiang Liu, Li Liu, Shuaifeng Zhi, Shuzhou Sun, Tianpeng Liu, MingMing Cheng
This paper addresses the source-free CDFSL (SF-CDFSL) problem, tackling few-shot learning (FSL) in the target domain using only pre-trained models and a few target samples without source data or strategies.
no code implementations • 14 Nov 2024 • Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang
In this paper, we introduce the Spatio-Temporal Unitized Model (STUM), a unified framework designed to capture both spatial and temporal dependencies while addressing spatio-temporal heterogeneity through techniques such as distribution alignment and feature fusion.
1 code implementation • 12 Nov 2024 • Jie zhou, Chao Xiao, Bowen Peng, Tianpeng Liu, Zhen Liu, Yongxiang Liu, Li Liu
The fundamental challenge in SAR target detection lies in developing discriminative, efficient, and robust representations of target characteristics within intricate non-cooperative environments.
no code implementations • 7 Nov 2024 • Xinhua Jiang, Tianpeng Liu, Li Liu, Zhen Liu, Yongxiang Liu
Active Object Detection (AOD) offers an effective way to achieve this purpose.
no code implementations • 5 Nov 2024 • Lei Wang, Weiming Zeng, Kai Long, Rongfeng Lan, Li Liu, Wai Ting Siok, Nizhuan Wang
Photoacoustic imaging (PAI) represents an innovative biomedical imaging modality that harnesses the advantages of optical resolution and acoustic penetration depth while ensuring enhanced safety.
1 code implementation • 1 Nov 2024 • Li Liu, Diji Yang, Sijia Zhong, Kalyana Suma Sree Tholeti, Lei Ding, Yi Zhang, Leilani H. Gilpin
To investigate this gap, we identify a critical and challenging task in the Visual Question Answering (VQA) scenario: can VLMs indicate how to adjust an image when the visual information is insufficient to answer a question?
1 code implementation • 25 Oct 2024 • Shakhrul Iman Siam, Hyunho Ahn, Li Liu, Samiul Alam, Hui Shen, Zhichao Cao, Ness Shroff, Bhaskar Krishnamachari, Mani Srivastava, Mi Zhang
We hope this survey will serve as a valuable resource for those engaged in AIoT research and act as a catalyst for future explorations to bridge gaps and drive advancements in this exciting field.
1 code implementation • 16 Oct 2024 • Yanyun Wang, Li Liu, Zi Liang, Qingqing Ye, Haibo Hu
Accordingly, to relax the tension between clean and robust learning derived from this overstrict assumption, we propose a new AT paradigm by introducing an additional dummy class for each original class, aiming to accommodate the hard adversarial samples with shifted distribution after perturbation.
no code implementations • 13 Oct 2024 • Yongxiang Liu, Bowen Peng, Li Liu, Xiang Li
Transferable targeted adversarial attacks (TTAs) against deep neural networks have been proven significantly more challenging than untargeted ones, yet they remain relatively underexplored.
no code implementations • 25 Sep 2024 • Li Liu, Tengchao Yu, Heng Yong
This leads us to question: \textbf{Does the approximation capacity of a neural network remain universal, or does it have a limit when the parameters are practically bounded?
2 code implementations • 25 Sep 2024 • Chunhui Zhang, Li Liu, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang
Over the past decade, significant progress has been made in visual object tracking, largely due to the availability of large-scale training datasets.
1 code implementation • 19 Sep 2024 • Xinyi Ying, Li Liu, Zaipin Lin, Yangsi Shi, Yingqian Wang, Ruojing Li, Xu Cao, Boyang Li, Shilin Zhou
To address the aforementioned challenges, in this paper, we first build a large-scale dataset for MIRST detection in satellite videos (namely IRSatVideo-LEO), and then develop a recurrent feature refinement (RFR) framework as the baseline method.
no code implementations • 4 Sep 2024 • Li Liu, Ruijie Zhu, Jiacheng Deng, Ziyang Song, Wenfei Yang, Tianzhu Zhang
Specifically, in the proposed plane guided depth generator (PGDG), we design a set of plane queries as prototypes to softly model planes in the scene and predict per-pixel plane coefficients.
no code implementations • 1 Sep 2024 • Yan Rong, Li Liu
Face-based Voice Conversion (FVC) is a novel task that leverages facial images to generate the target speaker's voice style.
no code implementations • 28 Aug 2024 • Weilin Lin, Li Liu, Jianze Li, Hui Xiong
This method, based on our findings on neuron weight changes (NWCs) of random unlearning, uses optimal transport (OT)-based model fusion to combine the advantages of both pruned and backdoored models.
no code implementations • 27 Aug 2024 • Lei Liu, Li Liu, Yawen Cui
Even in the era of large models, one of the well-known issues in continual learning (CL) is catastrophic forgetting, which is significantly challenging when the continual data stream exhibits a long-tailed distribution, termed as Long-Tailed Continual Learning (LTCL).
2 code implementations • 31 Jul 2024 • Chunhui Zhang, Yawen Cui, Weilin Lin, Guanjie Huang, Yan Rong, Li Liu, Shiguang Shan
To address this gap, this work conducts a systematic review on SAM for videos in the era of foundation models.
1 code implementation • 29 Jul 2024 • Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen
1) We provide an integrated implementation of state-of-the-art (SOTA) backdoor learning algorithms (currently including 20 attack and 32 defense algorithms), based on an extensible modular-based codebase.
1 code implementation • 22 Jul 2024 • Bowen Peng, Li Liu, Tianpeng Liu, Zhen Liu, Yongxiang Liu
We also contribute a surprising empirical insight that one of the most fundamental transformations, simple image scaling, is highly effective, scalable, sufficient, and necessary in enhancing targeted transferability.
1 code implementation • 11 Jul 2024 • Wentao Lei, Jinting Wang, Fengji Ma, Guanjie Huang, Li Liu
The goal of this survey is to offer the research community a clear and holistic view of the advancements in human video generation, highlighting the milestones achieved and the challenges that lie ahead.
1 code implementation • 11 Jul 2024 • Ruijie Zhu, Chuxin Wang, Ziyang Song, Li Liu, Tianzhu Zhang, Yongdong Zhang
Our method decomposes metric depth into scene scale and relative depth, and predicts them through a semantic-aware scale prediction (SASP) module and an adaptive relative depth estimation (ARDE) module, respectively.
Ranked #1 on Monocular Depth Estimation on DIODE Outdoor
no code implementations • 10 Jul 2024 • Jiangming Chen, Li Liu, Wanxia Deng, Zhen Liu, Yu Liu, YingMei Wei, Yongxiang Liu
Cross domain object detection learns an object detector for an unlabeled target domain by transferring knowledge from an annotated source domain.
1 code implementation • 20 Jun 2024 • Xinyi Ying, Chao Xiao, Ruojing Li, Xu He, Boyang Li, Zhaoxu Li, Yingqian Wang, Mingyuan Hu, Qingyu Xu, Zaiping Lin, Miao Li, Shilin Zhou, Wei An, Weidong Sheng, Li Liu
Based on the proposed RGBT-Tiny dataset and SAFit measure, extensive evaluations have been conducted, including 23 recent state-of-the-art algorithms that cover four different types (i. e., visible generic detection, visible SOD, thermal SOD and RGBT object detection).
no code implementations • 6 Jun 2024 • Jiaxi Hu, Qingsong Wen, Sijie Ruan, Li Liu, Yuxuan Liang
In this paper, we begin by validating this theory through wavelet analysis and propose the Transformer-based TwinS model, which consists of three modules to address the non-stationary periodic distributions: Wavelet Convolution, Period-Aware Attention, and Channel-Temporal Mixed MLP.
1 code implementation • 30 May 2024 • Chunhui Zhang, Li Liu, Guanjie Huang, Hao Wen, Xi Zhou, Yanfeng Wang
Most existing trackers are tailored for open-air environments, leading to performance degradation when applied to UOT due to domain gaps.
no code implementations • 30 May 2024 • Weilin Lin, Li Liu, Shaokui Wei, Jianze Li, Hui Xiong
Recently, without poisoned data, unlearning models with clean data and then learning a pruning mask have contributed to backdoor defense.
no code implementations • 27 May 2024 • Fengji Ma, Li Liu, Hei Victor Cheng
Simultaneously, fixed pre-trained image embeddings are used as cross-modal auxiliary supervision to maintain the similarity between the MHE-tuned and original text embeddings by the knowledge distillation, preserving semantic information between different classes.
4 code implementations • 23 May 2024 • Chunhui Zhang, Li Liu, Hao Wen, Xi Zhou, Yanfeng Wang
To leverage more modalities, some recent efforts have been made to learn a unified visual object tracking model for any modality.
2 code implementations • 15 May 2024 • Weijie Li, Wei Yang, Yuenan Hou, Li Liu, Yongxiang Liu, Xiang Li
Despite the remarkable progress in synthetic aperture radar automatic target recognition (SAR ATR), recent efforts have concentrated on the detection or classification of a specific and coarse category, e. g., vehicles, ships, airplanes, or buildings.
no code implementations • 30 Apr 2024 • Wentao Lei, Li Liu, Jun Wang
Therefore, we propose a novel Gloss-prompted Diffusion-based CS Gesture generation framework (called GlossDiff).
no code implementations • journal 2024 • Zhihao Tang, Li Liu, Yifan Shen, Zongyi Chen, Guixiang Ma, Jiyan Dong, Xujie Sun, Xi Zhang, Chaozhuo Li, Qingfeng Zheng, Lin Yang
Highlights•Without patching WSIs, a novel ViT-based model is proposed for survival predictions.•We first introduce aleatoric uncertainty into the survival loss function.•We explain survival prediction using a post-hoc explainable method.•Our method outperforms baselines in accuracy, explainability, and reliability.
no code implementations • 3 Apr 2024 • Shuxian Fan, Adam Visokay, Kentaro Hoffman, Stephen Salerno, Li Liu, Jeffrey T. Leek, Tyler H. McCormick
In this paper, we develop a method for valid inference using outcomes (in our case COD) predicted from free-form text using state-of-the-art NLP techniques.
1 code implementation • 29 Mar 2024 • Bo wang, Jian Li, Yang Yu, Li Liu, Zhenping Sun, Dewen Hu
Considering the complementarity of scene flow estimation in the spatial domain's focusing capability and 3D object tracking in the temporal domain's coherence, this study aims to address a comprehensive new task that can simultaneously capture fine-grained and long-term 3D motion in an online manner: long-term scene flow estimation (LSFE).
no code implementations • 22 Mar 2024 • Jinge Wang, Zien Cheng, Qiuming Yao, Li Liu, Dong Xu, Gangqing Hu
The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines.
no code implementations • CVPR 2024 • Lizhe Liu, Bohua Wang, Hongwei Xie, Daqi Liu, Li Liu, Zhiqiang Tian, Kuiyuan Yang, Bing Wang
Vision-centric 3D environment understanding is both vital and challenging for autonomous driving systems.
2 code implementations • 18 Mar 2024 • YuXuan Li, Xiang Li, Yimian Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang
While a considerable amount of research has been dedicated to remote sensing classification, object detection and semantic segmentation, most of these studies have overlooked the valuable prior knowledge embedded within remote sensing scenarios.
Ranked #2 on Change Detection on S2Looking
1 code implementation • 11 Mar 2024 • YuXuan Li, Xiang Li, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
Ranked #2 on 2D Object Detection on SARDet-100K (using extra training data)
1 code implementation • CVPR 2024 • Tianrui Lou, Xiaojun Jia, Jindong Gu, Li Liu, Siyuan Liang, Bangyan He, Xiaochun Cao
We find that concealing deformation perturbations in areas insensitive to human eyes can achieve a better trade-off between imperceptibility and adversarial strength, specifically in parts of the object surface that are complex and exhibit drastic curvature changes.
no code implementations • 7 Mar 2024 • Vanshika Vats, Marzia Binta Nizam, Minghao Liu, Ziyuan Wang, Richard Ho, Mohnish Sai Prasad, Vincent Titterton, Sai Venkat Malreddy, Riya Aggarwal, Yanwen Xu, Lei Ding, Jay Mehta, Nathan Grinnell, Li Liu, Sijia Zhong, Devanathan Nallur Gandamani, Xinyi Tang, Rohan Ghosalkar, Celeste Shen, Rachel Shen, Nafisa Hussain, Kesav Ravichandran, James Davis
In the rapidly evolving landscape of artificial intelligence (AI), the collaboration between human intelligence and AI systems, known as Human-AI (HAI) Teaming, has emerged as a cornerstone for advancing problem-solving and decision-making processes.
1 code implementation • 4 Mar 2024 • Huali Xu, Li Liu, Shuaifeng Zhi, Shaojing Fu, Zhuo Su, Ming-Ming Cheng, Yongxiang Liu
For this reason, this paper explores a Source-Free CDFSL (SF-CDFSL) problem, in which CDFSL is addressed through the use of existing pretrained models instead of training a model with source data, avoiding accessing source data.
1 code implementation • 1 Feb 2024 • Zhuo Su, Jiehua Zhang, Longguang Wang, Hua Zhang, Zhen Liu, Matti Pietikäinen, Li Liu
With PDC and Bi-PDC, we further present two lightweight deep networks named \emph{Pixel Difference Networks (PiDiNet)} and \emph{Binary PiDiNet (Bi-PiDiNet)} respectively to learn highly efficient yet more accurate representations for visual tasks including edge detection and object recognition.
no code implementations • 31 Jan 2024 • Lei Liu, Li Liu, Haizhou Li
Cued Speech (CS) is a pure visual coding method used by hearing-impaired people that combines lip reading with several specific hand shapes to make the spoken language visible.
1 code implementation • 30 Jan 2024 • Bowen Peng, Bo Peng, Jingyuan Xia, Tianpeng Liu, Yongxiang Liu, Li Liu
Recently, there has been increasing concern about the vulnerability of deep neural network (DNN)-based synthetic aperture radar (SAR) automatic target recognition (ATR) to adversarial attacks, where a DNN could be easily deceived by clean input with imperceptible but aggressive perturbations.
no code implementations • 26 Jan 2024 • Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen
We hope that our efforts could build a solid foundation of backdoor learning to facilitate researchers to investigate existing algorithms, develop more innovative algorithms, and explore the intrinsic mechanism of backdoor learning.
1 code implementation • 7 Jan 2024 • Peng Zheng, Dehong Gao, Deng-Ping Fan, Li Liu, Jorma Laaksonen, Wanli Ouyang, Nicu Sebe
It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef).
Ranked #1 on Camouflaged Object Segmentation on COD
Camouflaged Object Segmentation Dichotomous Image Segmentation +3
no code implementations • 21 Dec 2023 • Peng Zhao, Jiehua Zhang, Bowen Peng, Longguang Wang, YingMei Wei, Yu Liu, Li Liu
2) BNNs consistently exhibit better adversarial robustness under black-box attacks.
1 code implementation • 14 Dec 2023 • Chubin Zhang, Juncheng Yan, Yi Wei, Jiaxin Li, Li Liu, Yansong Tang, Yueqi Duan, Jiwen Lu
Occupancy prediction reconstructs 3D structures of surrounding environments.
no code implementations • 13 Dec 2023 • Baoyuan Wu, Shaokui Wei, Mingli Zhu, Meixi Zheng, Zihao Zhu, Mingda Zhang, Hongrui Chen, Danni Yuan, Li Liu, Qingshan Liu
Adversarial phenomenon has been widely observed in machine learning (ML) systems, especially in those using deep neural networks, describing that ML systems may produce inconsistent and incomprehensible predictions with humans at some particular cases.
2 code implementations • 26 Nov 2023 • Weijie Li, Yang Wei, Tianpeng Liu, Yuenan Hou, YuXuan Li, Zhen Liu, Yongxiang Liu, Li Liu
The growing Synthetic Aperture Radar (SAR) data has the potential to build a foundation model through Self-Supervised Learning (SSL) methods, which can achieve various SAR Automatic Target Recognition (ATR) tasks with pre-training in large-scale unlabeled data and fine-tuning in small labeled samples.
no code implementations • 25 Oct 2023 • Yuan Li, Li Liu, Penggang Chen, Youmin Zhang, Guoyin Wang
Graph data widely exists in real life, with large amounts of data and complex structures.
no code implementations • 8 Oct 2023 • Zhong-Yu Li, Bo-Wen Yin, Yongxiang Liu, Li Liu, Ming-Ming Cheng
Thus, we propose Heterogeneous Self-Supervised Learning (HSSL), which enforces a base model to learn from an auxiliary head whose architecture is heterogeneous from the base model.
no code implementations • 5 Oct 2023 • Jinting Wang, Li Liu, Jun Wang, Hei Victor Cheng
To overcome this challenge, we introduce the concept of residuals by integrating a statistical face prior to the diffusion process.
1 code implementation • 18 Sep 2023 • Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Bing Wang, Hongwei Xie, Li Liu, Shanghang Zhang
3D occupancy prediction holds significant promise in the fields of robot perception and autonomous driving, which quantifies 3D scenes into grid cells with semantic labels.
1 code implementation • 18 Sep 2023 • Bowen Yin, Xuying Zhang, Zhongyu Li, Li Liu, Ming-Ming Cheng, Qibin Hou
We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks.
Ranked #1 on RGB-D Salient Object Detection on DES
1 code implementation • 12 Sep 2023 • Junjing Zheng, Xinyu Zhang, Yongxiang Liu, Weidong Jiang, Kai Huo, Li Liu
A standard convex SPCA-based model with PSD constraint for unsupervised feature selection is proposed.
no code implementations • 8 Sep 2023 • Li Liu, Da Chen, Minglei Shu, Laurent D. Cohen
These boundary proposals are then incorporated into the proposed image segmentation model, such that the target segmentation contours are made up of a set of selected boundary proposals and the corresponding geodesic paths linking them.
1 code implementation • 17 Aug 2023 • Li Liu, Lufei Gao, Wentao Lei, Fengji Ma, Xiaotian Lin, Jinting Wang
In summary, this survey paper provides a comprehensive understanding of deep multi-modal learning for various BL generations and recognitions for the first time.
no code implementations • 13 Aug 2023 • Jinghua Zhang, Li Liu, Olli Silvén, Matti Pietikäinen, Dewen Hu
In our in-depth examination, we delve into various facets of FSCIL, encompassing the problem definition, the discussion of the primary challenges of unreliable empirical risk minimization and the stability-plasticity dilemma, general schemes, and relevant problems of IL and Few-shot Learning (FSL).
class-incremental learning Class-Incremental Object Detection +6
no code implementations • 9 Aug 2023 • Shanshan Huang, Haoxuan Li, Qingsong Li, Chunyuan Zheng, Li Liu
Multimedia recommendation involves personalized ranking tasks, where multimedia content is usually represented using a generic encoder.
1 code implementation • 27 Jul 2023 • Lingdong Kong, Yaru Niu, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, Benoit R. Cottereau, Liangjun Zhang, Hesheng Wang, Wei Tsang Ooi, Ruijie Zhu, Ziyang Song, Li Liu, Tianzhu Zhang, Jun Yu, Mohan Jing, Pengwei Li, Xiaohua Qi, Cheng Jin, Yingfeng Chen, Jie Hou, Jie Zhang, Zhen Kan, Qiang Ling, Liang Peng, Minglei Li, Di Xu, Changpeng Yang, Yuanqi Yao, Gang Wu, Jian Kuai, Xianming Liu, Junjun Jiang, Jiamian Huang, Baojun Li, Jiale Chen, Shuang Zhang, Sun Ao, Zhenyu Li, Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu
In this paper, we summarize the winning solutions from the RoboDepth Challenge -- an academic competition designed to facilitate and advance robust OoD depth estimation.
1 code implementation • 24 Jul 2023 • YiQing Wang, Zihan Li, Jieru Mei, Zihao Wei, Li Liu, Chen Wang, Shengtian Sang, Alan Yuille, Cihang Xie, Yuyin Zhou
To address this limitation, we present Masked Multi-view with Swin Transformers (SwinMM), a novel multi-view pipeline for enabling accurate and data-efficient self-supervised medical image analysis.
1 code implementation • 17 Jul 2023 • Liu Liu, Shuaifeng Zhi, Zhenhua Du, Li Liu, Xinyu Zhang, Kai Huo, Weidong Jiang
In this paper, we propose a hybrid point-wise Radar-Optical fusion approach for object detection in autonomous driving scenarios.
no code implementations • 11 Jul 2023 • Shuzhou Sun, Shuaifeng Zhi, Qing Liao, Janne Heikkilä, Li Liu
To remedy this, we propose Two-stage Causal Modeling (TsCM) for the SGG task, which takes the long-tailed distribution and semantic confusion as confounders to the Structural Causal Model (SCM) and then decouples the causal intervention into two stages.
no code implementations • 7 Jul 2023 • Chunhui Zhang, Xin Sun, Li Liu, Yiqian Yang, Qiong Liu, Xi Zhou, Yanfeng Wang
This approach achieves feature integration in a unified backbone, removing the need for carefully-designed fusion modules and resulting in a more effective and efficient VL tracking framework.
1 code implementation • 6 Jul 2023 • Yun Liu, Yu-Huan Wu, Shi-Chen Zhang, Li Liu, Min Wu, Ming-Ming Cheng
This dataset enables the training of sophisticated detectors for high-quality CTD.
no code implementations • 15 Jun 2023 • Mingjie Pan, Li Liu, Jiaming Liu, Peixiang Huang, Longlong Wang, Shanghang Zhang, Shaoqing Xu, Zhiyi Lai, Kuiyuan Yang
In this technical report, we present our solution, named UniOCC, for the Vision-Centric 3D occupancy prediction track in the nuScenes Open Dataset Challenge at CVPR 2023.
Ranked #4 on Prediction Of Occupancy Grid Maps on Occ3D-nuScenes
1 code implementation • 15 Jun 2023 • Bo wang, Yifan Zhang, Jian Li, Yang Yu, Zhenping Sun, Li Liu, Dewen Hu
The occlusion problem remains a crucial challenge in optical flow estimation (OFE).
no code implementations • 6 Jun 2023 • Jianrong Wang, Yaxin Zhao, Li Liu, Tianyi Xu, Qi Li, Sen Li
Given an audio clip and a reference face image, the goal of the talking head generation is to generate a high-fidelity talking head video.
1 code implementation • 5 Jun 2023 • Lufei Gao, Shan Huang, Li Liu
Cued Speech (CS) is a multi-modal visual coding system combining lip reading with several hand cues at the phonetic level to make the spoken language visible to the hearing impaired.
1 code implementation • 4 Jun 2023 • Jianrong Wang, Yuchen Huo, Li Liu, Tianyi Xu, Qi Li, Sen Li
Audio-visual speech recognition (AVSR) gains increasing attention from researchers as an important part of human-computer interaction.
no code implementations • 1 Jun 2023 • Ruotong Wang, Hongrui Chen, Zihao Zhu, Li Liu, Baoyuan Wu
Deep neural networks (DNNs) can be manipulated to exhibit specific behaviors when exposed to specific trigger patterns, without affecting their performance on benign samples, dubbed \textit{backdoor attack}.
1 code implementation • 18 May 2023 • Yixiong Chen, Li Liu, Chris Ding
This paper introduces a novel explainable image quality evaluation approach called X-IQE, which leverages visual large language models (LLMs) to evaluate text-to-image generation methods by generating textual explanations.
no code implementations • 15 May 2023 • Penghui Wei, Hongjian Dou, Shaoguo Liu, Rongjun Tang, Li Liu, Liang Wang, Bo Zheng
We introduce FedAds, the first benchmark for CVR estimation with vFL, to facilitate standardized and systematical evaluations for vFL algorithms.
1 code implementation • 14 May 2023 • Chunhui Zhang, Li Liu, Yawen Cui, Guanjie Huang, Weilin Lin, Yiqian Yang, Yuehong Hu
As the first to comprehensively review the progress of segmenting anything task for vision and beyond based on the foundation model of SAM, this work focuses on its applications to various tasks and data types by discussing its historical development, recent progress, and profound impact on broad applications.
no code implementations • 28 Apr 2023 • Yuchen Sun, Tianpeng Liu, Panhe Hu, Qing Liao, Shaojing Fu, Nenghai Yu, Deke Guo, Yongxiang Liu, Li Liu
Deep Neural Networks (DNNs), from AlexNet to ResNet to ChatGPT, have made revolutionary progress in recent years, and are widely used in various fields.
1 code implementation • 24 Apr 2023 • Jinghua Zhang, Li Liu, Kai Gao, Dewen Hu
In forward-compatible learning, we propose an innovative virtual class synthesis strategy and a Center-Triplet (CT) loss to enhance discriminative feature learning.
class-incremental learning Few-Shot Class-Incremental Learning +6
1 code implementation • 13 Apr 2023 • Zhuo Su, Jiehua Zhang, Tianpeng Liu, Zhen Liu, Shuanghui Zhang, Matti Pietikäinen, Li Liu
This paper proposes a novel module called middle spectrum grouped convolution (MSGC) for efficient deep convolutional neural networks (DCNNs) with the mechanism of grouped convolution.
1 code implementation • 7 Apr 2023 • Weijie Li, Wei Yang, Wenpeng Zhang, Tianpeng Liu, Yongxiang Liu, Li Liu
However, robustly recognizing vehicle targets is a challenging task in SAR due to the large intraclass variations and small interclass variations.
1 code implementation • CVPR 2023 • Xinyi Ying, Li Liu, Yingqian Wang, Ruojing Li, Nuo Chen, Zaiping Lin, Weidong Sheng, Shilin Zhou
Interestingly, during the training phase supervised by point labels, we discover that CNNs first learn to segment a cluster of pixels near the targets, and then gradually converge to predict groundtruth point labels.
no code implementations • 4 Apr 2023 • Bowen Peng, Jianyue Xie, Bo Peng, Li Liu
The proposed method contributes a mixed clutter variants generation strategy and a new inference branch equipped with channel-weighted mean square error (CWMSE) loss for invariant representation learning.
2 code implementations • 3 Apr 2023 • Weijie Li, Wei Yang, Li Liu, Wenpeng Zhang, Yongxiang Liu
Therefore, the degree of overfitting for clutter reflects the non-causality of deep learning in SAR ATR.
no code implementations • 23 Mar 2023 • Yande Li, Mingjie Wang, Minglun Gong, Yonggang Lu, Li Liu
The ever-increasing demands for intuitive interactions in Virtual Reality has triggered a boom in the realm of Facial Expression Recognition (FER).
Facial Expression Recognition Facial Expression Recognition (FER)
1 code implementation • ICCV 2023 • Yupeng Zhou, Zhen Li, Chun-Le Guo, Li Liu, Ming-Ming Cheng, Qibin Hou
Without any bells and whistles, we show that our SRFormer achieves a 33. 86dB PSNR score on the Urban100 dataset, which is 0. 46dB higher than that of SwinIR but uses fewer parameters and computations.
no code implementations • 15 Mar 2023 • Zhuo Su, Matti Pietikäinen, Li Liu
LBP is a successful hand-crafted feature descriptor in computer vision.
no code implementations • 15 Mar 2023 • Huali Xu, Shuaifeng Zhi, Shuzhou Sun, Vishal M. Patel, Li Liu
To address this, Few-shot learning (FSL) enables models to perform the target tasks with very few labeled examples by leveraging prior knowledge from related tasks.
no code implementations • 3 Mar 2023 • Wentao Lei, Lei Liu, Li Liu
Experiments on two medical image datasets (i. e., ISIC 2018 challenge and ChestX-ray14) show that our method outperforms state-of-the-art SSL methods.
1 code implementation • 19 Feb 2023 • Baoyuan Wu, Zihao Zhu, Li Liu, Qingshan Liu, Zhaofeng He, Siwei Lyu
Adversarial machine learning (AML) studies the adversarial phenomenon of machine learning, which may make inconsistent or unexpected predictions with humans.
1 code implementation • 12 Feb 2023 • Yawen Cui, Zitong Yu, Rizhao Cai, Xun Wang, Alex C. Kot, Li Liu
The goal of Few-Shot Continual Learning (FSCL) is to incrementally learn novel tasks with limited labeled samples and preserve previous capabilities simultaneously, while current FSCL methods are all for the class-incremental purpose.
no code implementations • ICLR 2023 2023 • Hongzhi Shi, Jingtao Ding, Yufan Cao, Quanming Yao, Li Liu, Yong Li
The essence of our method is to model the formula skeleton with a message-passing flow, which helps transform the discovery of the skeleton into the search for the message-passing flow.
1 code implementation • 24 Jan 2023 • Yawen Cui, Wanxia Deng, Haoyu Chen, Li Liu
Given a model well-trained with a large-scale base dataset, Few-Shot Class-Incremental Learning (FSCIL) aims at incrementally learning novel classes from a few labeled samples by avoiding overfitting, without catastrophically forgetting all encountered classes previously.
class-incremental learning Few-Shot Class-Incremental Learning +2
no code implementations • 5 Jan 2023 • Ang Li, Jiayi Han, Yongjian Zhao, Keyu Li, Li Liu
While the US is not a standard paradigm for spinal surgery, the scarcity of intra-operative clinical US data is an insurmountable bottleneck in training a neural network.
no code implementations • 3 Jan 2023 • Janne Mustaniemi, Juho Kannala, Esa Rahtu, Li Liu, Janne Heikkilä
Various datasets have been proposed for simultaneous localization and mapping (SLAM) and related problems.
1 code implementation • ICCV 2023 • Yaopei Zeng, Lei Liu, Li Liu, Li Shen, Shaoguo Liu, Baoyuan Wu
In particular, a proxy is derived from the accumulated gradients uploaded by the clients after local training, and is shared by all clients as the class prior for re-balance training.
1 code implementation • 29 Dec 2022 • Li Liu, Penggang Chen, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang
Aligning users across networks using graph representation learning has been found effective where the alignment is accomplished in a low-dimensional embedding space.
1 code implementation • 28 Dec 2022 • Peixiang Huang, Li Liu, Renrui Zhang, Song Zhang, Xinli Xu, Baichao Wang, Guoyi Liu
In this paper, we propose the learning scheme of Target Inner-Geometry from the LiDAR modality into camera-based BEV detectors for both dense depth and BEV features, termed as TiG-BEV.
no code implementations • 23 Dec 2022 • Yongling Xu, Yang Du, Jing Zou, Tianying Zhou, Lushan Xiao, Li Liu, Pengcheng
In this paper, we propose a deep model called Attention-based Multiple Dimensions EEG Transformer (AMDET), which can exploit the complementarity among the spectral-spatial-temporal features of EEG data by employing the multi-dimensional global attention mechanism.
1 code implementation • 8 Dec 2022 • Yixiong Chen, Chunhui Zhang, Chris H. Q. Ding, Li Liu
In this work, we pre-train DNNs on ultrasound (US) domains instead of ImageNet to reduce the domain gap in medical US applications.
no code implementations • 2 Dec 2022 • Lei Liu, Li Liu
To our knowledge, this is the first work on ACSR for Mandarin Chinese.
no code implementations • 1 Dec 2022 • Yixiong Chen, Jingxian Li, Chris Ding, Li Liu
Deep transfer learning (DTL) has formed a long-term quest toward enabling deep neural networks (DNNs) to reuse historical experiences as efficiently as humans.
no code implementations • 25 Nov 2022 • Taoyong Cui, Jianze Li, Yuhan Dong, Li Liu
In the first stage, we propose a novel algorithm called polar decomposition-based orthogonal initialization (PDOI) to find a good initialization for the orthogonal optimization.
no code implementations • 7 Nov 2022 • Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang, Xiao Sun, HaoDong Wu, Xuncheng Liu, Weizhan Zhang, Caixia Yan, Haipeng Du, Qinghua Zheng, Qi Wang, Wangdu Chen, Ran Duan, Mengdi Sun, Dan Zhu, Guannan Chen, Hojin Cho, Steve Kim, Shijie Yue, Chenghua Li, Zhengyang Zhuge, Wei Chen, Wenxu Wang, Yufeng Zhou, Xiaochen Cai, Hengxing Cai, Kele Xu, Li Liu, Zehua Cheng, Wenyi Lian, Wenjing Lian
While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices.
no code implementations • 4 Nov 2022 • Jiehua Zhang, Xueyang Zhang, Zhuo Su, Zitong Yu, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu
For ViTs, DyBinaryCCT presents the superiority of the convolutional embedding layer in fully binarized ViTs and achieves 56. 1% on the ImageNet dataset, which is nearly 9% higher than the baseline.
no code implementations • 28 Oct 2022 • Xuefeng Yang, Li Liu, Wenju Zhou, Jing Shi, Yinggang Zhang, Xin Hu, Huiyu Zhou
Moreover, the privacy of the system is analyzed to ensure the security of the real data.
1 code implementation • 10 Oct 2022 • Chunhui Zhang, Yixiong Chen, Li Liu, Qiong Liu, Xi Zhou
This work proposes a hierarchical contrastive learning (HiCo) method to improve the transferability for the US video model pretraining.
no code implementations • 3 Oct 2022 • Yu Zhang, Li Liu, Chen Diao, Ning Cai
Computer model has been extensively adopted to overcome the time limitation of language evolution by transforming language theory into physical modeling mechanism, which helps to explore the general laws of the evolution.
1 code implementation • 13 Sep 2022 • Zhuo Su, Max Welling, Matti Pietikäinen, Li Liu
Precisely, the presence of scalar features makes the major part of the network binarizable, while vector features serve to retain rich structural information and ensure SO(3) equivariance.
1 code implementation • 11 Sep 2022 • Bowen Peng, Bo Peng, Jie zhou, Jianyue Xie, Li Liu
Toward building more robust DNN-based SAR ATR models, this article explores the domain knowledge of SAR imaging process and proposes a novel Scattering Model Guided Adversarial Attack (SMGAA) algorithm which can generate adversarial perturbations in the form of electromagnetic scattering response (called adversarial scatterers).
1 code implementation • ACM Transactions on Multimedia Computing, Communications and Applications 2022 • Ruoyu Chen, Jingzhi Li, Hua Zhang, Changchong Sheng, Li Liu, Xiaochun Cao
Different from existing models, in this paper, we propose a new interpretation method that explains the image similarity models by salience maps and attribute words.
no code implementations • 6 Sep 2022 • Zichao Li, Li Liu, Zeyu Wang, Yuyin Zhou, Cihang Xie
Adversarial training (AT) with samples generated by Fast Gradient Sign Method (FGSM), also known as FGSM-AT, is a computationally simple method to train robust networks.
no code implementations • 17 Aug 2022 • Huali Xu, Shuaifeng Zhi, Li Liu
The goal of Cross-Domain Few-Shot Classification (CDFSC) is to accurately classify a target dataset with limited labelled data by exploiting the knowledge of a richly labelled auxiliary dataset, despite the differences between the domains of the two datasets.
no code implementations • 10 Aug 2022 • Li Liu, Xiangeng Fang, Di Wang, Weijing Tang, Kevin He
Neural Network (Deep Learning) is a modern model in Artificial Intelligence and it has been exploited in Survival Analysis.
1 code implementation • 5 Aug 2022 • Runkai Zheng, Rongjun Tang, Jianze Li, Li Liu
Pruning these channels was then shown to be effective in mitigating the backdoor behaviors.
no code implementations • 3 Aug 2022 • Yuli Sun, Lin Lei, Dongdong Guan, Gangyao Kuang, Li Liu
Then, we propose a regression model for the HCD, which decomposes the source signal into the regressed signal and changed signal, and requires the regressed signal have the same spectral property as the target signal on the same graph.
no code implementations • 3 Aug 2022 • Yuli Sun, Lin Lei, Dongdong Guan, Gangyao Kuang, Li Liu
In this first part, we analyze the HCD with GSP from the vertex domain.
no code implementations • 26 Jul 2022 • Ye Wang, Jingbo Liao, Hong Yu, Guoyin Wang, Xiaoxia Zhang, Li Liu
Particularly, the model integrates the macro-level guided-category knowledge and micro-level open-domain dialogue data for the training, leveraging the priori knowledge into the latent space, which enables the model to disentangle the latent variables within the mesoscopic scale.
no code implementations • 20 Jul 2022 • Yawen Cui, Zitong Yu, Wei Peng, Li Liu
Few-Shot Class-Incremental Learning (FSCIL) aims at incrementally learning novel classes from a few labeled samples by avoiding the overfitting and catastrophic forgetting simultaneously.
class-incremental learning Few-Shot Class-Incremental Learning +3
no code implementations • 16 Jul 2022 • Wei Wu, Junlin He, Yu Qiao, Guoheng Fu, Li Liu, Jin Yu
The in-memory approximate nearest neighbor search (ANNS) algorithms have achieved great success for fast high-recall query processing, but are extremely inefficient when handling hybrid queries with unstructured (i. e., feature vectors) and structured (i. e., related attributes) constraints.
1 code implementation • 3 Jun 2022 • Yixiong Chen, Li Liu, Jingxian Li, Hua Jiang, Chris Ding, Zongwei Zhou
In this work, we propose a meta-learning-based LR tuner, named MetaLR, to make different layers automatically co-adapt to downstream tasks based on their transferabilities across domains.
no code implementations • 30 May 2022 • Jiehua Zhang, Zhuo Su, Li Liu
Face recognition is one of the most active tasks in computer vision and has been widely used in the real world.
no code implementations • 22 May 2022 • Changchong Sheng, Gangyao Kuang, Liang Bai, Chenping Hou, Yulan Guo, Xin Xu, Matti Pietikäinen, Li Liu
Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment.
no code implementations • 2 Apr 2022 • Jianrong Wang, Jinyu Liu, Longxuan Zhao, Shanyu Wang, Ruiguo Yu, Li Liu
Acoustic-to-articulatory inversion (AAI) is to obtain the movement of articulators from speech signals.
no code implementations • 1 Apr 2022 • Jianrong Wang, Zixuan Wang, Xiaosheng Hu, XueWei Li, Qiang Fang, Li Liu
Experimental results show that the speech synthesized by our model is comparable to the personalized speech synthesized by training a large amount of audio data in previous works.
1 code implementation • 8 Feb 2022 • Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang
Extensive experiments on two vision tasks, includ-ing ImageNet classification and Pascal VOC segmentation, demonstrate the superiority of our ICKD, which consis-tently outperforms many existing methods, advancing thestate-of-the-art in the fields of Knowledge Distillation.
1 code implementation • 19 Jan 2022 • Chunhui Zhang, Guanjie Huang, Li Liu, Shan Huang, Yinan Yang, Xiang Wan, Shiming Ge, DaCheng Tao
In this work, we propose WebUAV-3M, the largest public UAV tracking benchmark to date, to facilitate both the development and evaluation of deep UAV trackers.
no code implementations • 18 Jan 2022 • Yan Zhao, Lingjun Zhao, Zhong Liu, Dewen Hu, Gangyao Kuang, Li Liu
Aircraft detection in Synthetic Aperture Radar (SAR) imagery is a challenging task in SAR Automatic Target Recognition (SAR ATR) areas due to aircraft's extremely discrete appearance, obvious intraclass variation, small size and serious background's interference.
1 code implementation • 11 Jan 2022 • Jinyu Lu, Guoqiang Liu, Bing Sun, Chao Li, Li Liu
In CRYPTO 2019, Gohr made a pioneering attempt and successfully applied deep learning to the differential cryptanalysis against NSA block cipher SPECK32/64, achieving higher accuracy than the pure differential distinguishers.
1 code implementation • CVPR 2022 • Kunhong Li, Longguang Wang, Li Liu, Qing Ran, Kai Xu, Yulan Guo
Weakly supervised learning can help local feature methods to overcome the obstacle of acquiring a large-scale dataset with densely labeled correspondences.
Ranked #1 on Camera Localization on Aachen Day-Night benchmark
1 code implementation • 4 Jan 2022 • Xinyi Ying, Yingqian Wang, Longguang Wang, Weidong Sheng, Li Liu, Zaiping Lin, Shilin Zhou
Specifically, motivated by the local motion prior in the spatio-temporal dimension, we propose a local spatio-temporal attention module to perform implicit frame alignment and incorporate the local spatio-temporal information to enhance the local features (especially for small targets).
1 code implementation • CVPR 2022 • Longguang Wang, Xiaoyu Dong, Yingqian Wang, Li Liu, Wei An, Yulan Guo
Since a linear quantizer (i. e., round(*) function) cannot well fit the bell-shaped distributions of weights and activations, many existing methods use pre-defined functions (e. g., exponential function) with learnable parameters to build the quantizer for joint optimization.
1 code implementation • CVPR 2022 • Siwei Wang, Xinwang Liu, Li Liu, Wenxuan Tu, Xinzhong Zhu, Jiyuan Liu, Sihang Zhou, En Zhu
Multi-view clustering has received increasing attention due to its effectiveness in fusing complementary information without manual annotations.
1 code implementation • 23 Nov 2021 • Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang
Video transformers have achieved impressive results on major video recognition benchmarks, which however suffer from high computational cost.
1 code implementation • 22 Nov 2021 • Zihan Yan, Li Liu, Xin Li, William K. Cheung, Youmin Zhang, Qun Liu, Guoyin Wang
Social network alignment aims at aligning person identities across social networks.
no code implementations • 3 Nov 2021 • Keyu Li, Yangxin Xu, Jian Wang, Dong Ni, Li Liu, Max Q. -H. Meng
Ultrasound (US) imaging is commonly used to assist in the diagnosis and interventions of spine diseases, while the standardized US acquisitions performed by manually operating the probe require substantial experience and training of sonographers.
no code implementations • 25 Oct 2021 • Yu Zhang, Chen Zhang, Renxin Yang, Jing Lyu, Li Liu, Xu Cai
The MMC-HVDC connected offshore wind farms (OWFs) could suffer short circuit fault (SCF), whereas their transient stability is not well analysed.
no code implementations • 20 Oct 2021 • Hengyang Wang, Xianghao Zhan, Li Liu, Asif Ullah, Huiyan Li, Han Gao, You Wang, Guang Li
The results show that DRCA improved the classification accuracy on six subjects (p < 0. 05), compared with the baseline models trained only with the source domain data;, while CPSC did not guarantee the accuracy improvement.
no code implementations • 8 Oct 2021 • Jiehua Zhang, Zhuo Su, Yanghe Feng, Xin Lu, Matti Pietikäinen, Li Liu
The experimental results prove that our method is an effective and straightforward way to reduce information loss and enhance performance of BNNs.
no code implementations • 16 Sep 2021 • Junfeng Hu, Yuxuan Liang, Zhencheng Fan, Li Liu, Yifang Yin, Roger Zimmermann
Specifically, we introduce a joint spatiotemporal graph attention network to learn the relations across space and time for short-term patterns.