Search Results for author: Zhe Li

Found 91 papers, 35 papers with code

Masked Generative Distillation

3 code implementations • 3 May 2022 • Zhendong Yang, Zhe Li, Mingqi Shao, Dachuan Shi, Zehuan Yuan, Chun Yuan

The current distillation algorithm usually improves students' performance by imitating the output of the teacher.

Image Classification Instance Segmentation +5

12,066


Track Anything: Segment Anything Meets Videos

1 code implementation • 24 Apr 2023 • Jinyu Yang, Mingqi Gao, Zhe Li, Shang Gao, Fangjing Wang, Feng Zheng

Therefore, in this report, we propose Track Anything Model (TAM), which achieves high-performance interactive tracking and segmentation in videos.

Image Segmentation Segmentation +2

6,097


Caption Anything: Interactive Image Description with Diverse Multimodal Controls

1 code implementation • 4 May 2023 • Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao

Controllable image captioning is an emerging multimodal topic that aims to describe the image with natural language following human purpose, e.g., looking at the specified regions or telling in a particular text style.

controllable image captioning Instruction Following

1,600


Evaluate the Malignancy of Pulmonary Nodules Using the 3D Deep Leaky Noisy-or Network

11 code implementations • 22 Nov 2017 • Fangzhou Liao, Ming Liang, Zhe Li, Xiaolin Hu, Sen Song

The model consists of two modules.

Computed Tomography (CT) Region Proposal

1,227


CARLS: Cross-platform Asynchronous Representation Learning System

1 code implementation • 26 May 2021 • Chun-Ta Lu, Yun Zeng, Da-Cheng Juan, Yicheng Fan, Zhe Li, Jan Dlabal, Yi-Ting Chen, Arjun Gopalan, Allan Heydon, Chun-Sung Ferng, Reah Miyara, Ariel Fuxman, Futang Peng, Zhen Li, Tom Duerig, Andrew Tomkins

In this work, we propose CARLS, a novel framework for augmenting the capacity of existing deep learning frameworks by enabling multiple components -- model trainers, knowledge makers and knowledge banks -- to concertedly work together in an asynchronous fashion across hardware platforms.

Representation Learning

975


Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling

1 code implementation • 27 Nov 2023 • Zhe Li, Zerong Zheng, Lizhen Wang, Yebin Liu

Overall, our method can create lifelike avatars with dynamic, realistic and generalized appearances.

729


Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

1 code implementation • 5 Dec 2023 • Yuelang Xu, Benwang Chen, Zhe Li, Hongwen Zhang, Lizhen Wang, Zerong Zheng, Yebin Liu

Creating high-fidelity 3D head avatars has always been a research hotspot, but there remains a great challenge under lightweight sparse view setups.

623


Changer: Feature Interaction is What You Need for Change Detection

1 code implementation • 17 Sep 2022 • Sheng Fang, Kaiyu Li, Zhe Li

To verify the effectiveness of MetaChanger, we propose two derived models, ChangerAD and ChangerEx with simple interaction strategies: Aggregation-Distribution (AD) and "exchange".

Ranked #3 on Change Detection on LEVIR-CD

Building change detection for remote sensing images Change Detection +1

379


MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

1 code implementation • 23 Feb 2024 • Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia, Jianxi Ye, Xin Jin, Xin Liu

Training LLMs at this scale brings unprecedented challenges to training efficiency and stability.

Language Modelling Large Language Model

343


Focal and Global Knowledge Distillation for Detectors

1 code implementation • CVPR 2022 • Zhendong Yang, Zhe Li, Xiaohu Jiang, Yuan Gong, Zehuan Yuan, Danpei Zhao, Chun Yuan

Global distillation rebuilds the relation between different pixels and transfers it from teachers to students, compensating for missing global information in focal distillation.

Ranked #1 on Knowledge Distillation on MS COCO

Image Classification Knowledge Distillation +2

334


Siamese NestedUNet Networks for Change Detection of High Resolution Satellite Image

1 code implementation • 27 Oct 2020 • Kaiyu Li, Zhe Li, Sheng Fang

In this paper, we improve the semantic segmentation network UNet++ and propose a fully convolutional siamese network (Siam-NestedUNet) for change detection.

Ranked #23 on Change detection for remote sensing images on CDD Dataset (season-varying)

Change Detection Change detection for remote sensing images +2

242


Rethinking Knowledge Distillation via Cross-Entropy

1 code implementation • 22 Aug 2022 • Zhendong Yang, Zhe Li, Yuan Gong, Tianke Zhang, Shanshan Lao, Chun Yuan, Yu Li

Furthermore, we smooth students' target output to treat it as the soft target for training without teachers and propose a teacher-free new KD loss (tf-NKD).

Knowledge Distillation

188


ViTKD: Practical Guidelines for ViT feature knowledge distillation

1 code implementation • 6 Sep 2022 • Zhendong Yang, Zhe Li, Ailing Zeng, Zexian Li, Chun Yuan, Yu Li

In this paper, we explore the way of feature-based distillation for ViT.

Image Classification Knowledge Distillation

188


From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels

1 code implementation • ICCV 2023 • Zhendong Yang, Ailing Zeng, Zhe Li, Tianke Zhang, Chun Yuan, Yu Li

We decompose the KD loss and find the non-target loss from it forces the student's non-target logits to match the teacher's, but the sum of the two non-target logits is different, preventing them from being identical.

Self-Knowledge Distillation

188


AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture

1 code implementation • 5 Jul 2022 • Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu

Then given a monocular RGB video of this subject, our method integrates information from both the image observation and the avatar prior, and accordingly reconstructs high-fidelity 3D textured models with dynamic details regardless of the visibility.

178


MTS-Mixers: Multivariate Time Series Forecasting via Factorized Temporal and Channel Mixing

1 code implementation • 9 Feb 2023 • Zhe Li, Zhongwen Rao, Lujia Pan, Zenglin Xu

Specifically, we find that (1) attention is not necessary for capturing temporal dependencies, (2) the entanglement and redundancy in the capture of temporal and channel interaction affect the forecasting performance, and (3) it is important to model the mapping between the input and the prediction sequence.

Multivariate Time Series Forecasting Time Series

160


PoseVocab: Learning Joint-structured Pose Embeddings for Human Avatar Modeling

1 code implementation • 25 Apr 2023 • Zhe Li, Zerong Zheng, Yuxiao Liu, Boyao Zhou, Yebin Liu

To this end, we present PoseVocab, a novel pose encoding method that encourages the network to discover the optimal pose embeddings for learning the dynamic human appearance.

153


Generative Adversarial Active Learning for Unsupervised Outlier Detection

2 code implementations • 28 Sep 2018 • Yezheng Liu, Zhe Li, Chong Zhou, Yuanchun Jiang, Jianshan Sun, Meng Wang, Xiangnan He

In this paper, we approach outlier detection as a binary-classification issue by sampling potential outliers from a uniform reference distribution.

Active Learning Binary Classification +1

130


ConTNet: Why not use convolution and transformer at the same time?

2 code implementations • 27 Apr 2021 • Haotian Yan, Zhe Li, Weijian Li, Changhu Wang, Ming Wu, Chuang Zhang

It is also worth pointing out that, given identical strong data augmentations, the performance improvement of ConTNet is more remarkable than that of ResNet.

Image Classification object-detection +1


Class-Incremental Learning with Generative Classifiers

1 code implementation • 20 Apr 2021 • Gido M. van de Ven, Zhe Li, Andreas S. Tolias

As a proof-of-principle, here we implement this strategy by training a variational autoencoder for each class to be learned and by using importance sampling to estimate the likelihoods p(x|y).

Class Incremental Learning Incremental Learning


Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter

1 code implementation • CVPR 2021 • Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo

Specifically, we integrate IFA into the two most prevailing text recognition streams (attention-based and CTC-based) and propose attention-guided dense prediction (ADP) and Extended CTC (ExCTC).

Optical Character Recognition Optical Character Recognition (OCR) +1


Forward Laplacian: A New Computational Framework for Neural Network-based Variational Monte Carlo

2 code implementations • 17 Jul 2023 • Ruichen Li, Haotian Ye, Du Jiang, Xuelan Wen, Chuwei Wang, Zhe Li, Xiang Li, Di He, Ji Chen, Weiluo Ren, LiWei Wang

Neural network-based variational Monte Carlo (NN-VMC) has emerged as a promising cutting-edge technique of ab initio quantum chemistry.

Efficient Neural Network Variational Monte Carlo


RGBD Object Tracking: An In-depth Review

1 code implementation • 26 Mar 2022 • Jinyu Yang, Zhe Li, Song Yan, Feng Zheng, Aleš Leonardis, Joni-Kristian Kämäräinen, Ling Shao

Particularly, we are the first to provide depth quality evaluation and analysis of tracking results in depth-friendly scenarios in RGBD tracking.

Object Object Tracking


Revisiting Long-term Time Series Forecasting: An Investigation on Linear Mapping

1 code implementation • 18 May 2023 • Zhe Li, Shiyi Qi, Yiduo Li, Zenglin Xu

In this paper, we thoroughly investigate the intrinsic effectiveness of recent approaches and make three key observations: 1) linear mapping is critical to prior long-term time series forecasting efforts; 2) RevIN (reversible normalization) and CI (Channel Independent) play a vital role in improving overall forecasting performance; and 3) linear mapping can effectively capture periodic features in time series and has robustness for different periods across channels when increasing the input horizon.

Time Series Time Series Forecasting


Temporal Action Segmentation from Timestamp Supervision

1 code implementation • CVPR 2021 • Zhe Li, Yazan Abu Farha, Juergen Gall

To demonstrate the effectiveness of timestamp supervision, we propose an approach to train a segmentation model using only timestamp annotations.

Ranked #4 on Weakly Supervised Action Localization on GTEA

Action Segmentation Segmentation +1


Salient Positions based Attention Network for Image Classification

1 code implementation • 9 Jun 2021 • Sheng Fang, Kaiyu Li, Zhe Li

Aimed at both questions, this paper proposes the salient positions-based attention scheme SPANet, which is inspired by some interesting observations on the attention maps and affinity matrices generated in the self-attention scheme.

Classification Image Classification


Thoracic Disease Identification and Localization with Limited Supervision

1 code implementation • CVPR 2018 • Zhe Li, Chong Wang, Mei Han, Yuan Xue, Wei Wei, Li-Jia Li, Li Fei-Fei

Accurate identification and localization of abnormalities from radiology images play an integral part in clinical diagnosis and treatment planning.

General Classification


RFBFN: A Relation-First Blank Filling Network for Joint Relational Triple Extraction

1 code implementation • ACL 2022 • Zhe Li, Luoyi Fu, Xinbing Wang, Haisong Zhang, Chenghu Zhou

However, most existing works either ignore the semantic information of relations or predict subjects and objects sequentially.

Relation


Speaker Representation Learning via Contrastive Loss with Maximal Speaker Separability

1 code implementation • 29 Oct 2022 • Zhe Li, Man-Wai Mak

A great challenge in speaker representation learning using deep models is to design learning objectives that can enhance the discrimination of unseen speakers under unseen domains.

Contrastive Learning Data Augmentation +1


Resource-Efficient RGBD Aerial Tracking

1 code implementation • CVPR 2023 • Jinyu Yang, Shang Gao, Zhe Li, Feng Zheng, Aleš Leonardis

However, current research on aerial perception has mainly focused on limited categories, such as pedestrian or vehicle, and most scenes are captured in urban environments from a bird's-eye view.

Object Tracking


Causality Analysis for Evaluating the Security of Large Language Models

1 code implementation • 13 Dec 2023 • Wei Zhao, Zhe Li, Jun Sun

Based on a layer-level causality analysis, we show that RLHF has the effect of overfitting a model to harmful prompts.


Ti-MAE: Self-Supervised Masked Time Series Autoencoders

1 code implementation • 21 Jan 2023 • Zhe Li, Zhongwen Rao, Lujia Pan, Pengyun Wang, Zenglin Xu

Multivariate Time Series forecasting has been an increasingly popular topic in various applications and scenarios.

Contrastive Learning Multivariate Time Series Forecasting +2


MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion

1 code implementation • 12 Apr 2024 • Zhe Li, Haiwei Pan, Kejia Zhang, Yuhua Wang, Fengming Yu

Multi-modality image fusion (MMIF) aims to integrate complementary information from different modalities into a single fused image to represent the imaging scene and facilitate downstream visual tasks comprehensively.

Image Reconstruction object-detection +1


Inferring Inference

1 code implementation • 4 Oct 2023 • Rajkumar Vasudeva Raju, Zhe Li, Scott Linderman, Xaq Pitkow

Given a time series of neural activity during a perceptual inference task, our framework finds (i) the neural representation of relevant latent variables, (ii) interactions between these variables that define the brain's internal model of the world, and (iii) message-functions specifying the inference algorithm.

Experimental Design


EIGEN: Ecologically-Inspired GENetic Approach for Neural Network Structure Searching from Scratch

no code implementations • CVPR 2019 • Jian Ren, Zhe Li, Jianchao Yang, Ning Xu, Tianbao Yang, David J. Foran

In this paper, we propose an Ecologically-Inspired GENetic (EIGEN) approach that uses the concept of succession, extinction, mimicry, and gene duplication to search neural network structure from scratch with poorly initialized simple network and few constraints forced during the evolution, as we assume no prior knowledge about the task domain.


An Aggressive Genetic Programming Approach for Searching Neural Network Structure Under Computational Constraints

no code implementations • 3 Jun 2018 • Zhe Li, Xuehan Xiong, Zhou Ren, Ning Zhang, Xiaoyu Wang, Tianbao Yang

In this paper, we study how to design a genetic programming approach for optimizing the structure of a CNN for a given task under limited computational resources yet without imposing strong restrictions on the search space.

Evolutionary Algorithms


Towards Budget-Driven Hardware Optimization for Deep Convolutional Neural Networks using Stochastic Computing

no code implementations • 10 May 2018 • Zhe Li, Ji Li, Ao Ren, Caiwen Ding, Jeffrey Draper, Qinru Qiu, Bo Yuan, Yanzhi Wang

Recently, Deep Convolutional Neural Network (DCNN) has achieved tremendous success in many machine learning applications.


Learning Topics using Semantic Locality

no code implementations • 11 Apr 2018 • Ziyi Zhao, Krittaphat Pugdeethosapol, Sheng Lin, Zhe Li, Caiwen Ding, Yanzhi Wang, Qinru Qiu

The topic modeling discovers the latent topic probability of the given text documents.

Topic Models


Efficient Recurrent Neural Networks using Structured Matrices in FPGAs

no code implementations • 20 Mar 2018 • Zhe Li, Shuo Wang, Caiwen Ding, Qinru Qiu, Yanzhi Wang, Yun Liang

Recurrent Neural Networks (RNNs) are becoming increasingly important for time series-related applications which require efficient and real-time implementations.

Model Compression Time Series +1


Image Dataset for Visual Objects Classification in 3D Printing

no code implementations • 15 Feb 2018 • Hongjia Li, Xiaolong Ma, Aditya Singh Rathore, Zhe Li, Qiyuan An, Chen Song, Wenyao Xu, Yanzhi Wang

The rapid development in additive manufacturing (AM), also known as 3D printing, has brought about potential risk and security issues along with significant benefits.

Classification General Classification


C3PO: Database and Benchmark for Early-stage Malicious Activity Detection in 3D Printing

no code implementations • 20 Mar 2018 • Zhe Li, Xiaolong Ma, Hongjia Li, Qiyuan An, Aditya Singh Rathore, Qinru Qiu, Wenyao Xu, Yanzhi Wang

It is of vital importance to enable 3D printers to identify the objects to be printed, so that the manufacturing procedure of an illegal weapon can be terminated at the early stage.

Action Detection Activity Detection +1


C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs

no code implementations • 14 Mar 2018 • Shuo Wang, Zhe Li, Caiwen Ding, Bo Yuan, Yanzhi Wang, Qinru Qiu, Yun Liang

The previous work proposes to use a pruning-based compression technique to reduce the model size and thus speed up inference on FPGAs.


A Framework in CRM Customer Lifecycle: Identify Downward Trend and Potential Issues Detection

no code implementations • 25 Feb 2018 • Kun Hu, Zhe Li, Ying Liu, Luyin Cheng, Qi Yang, Yan Li

In the first prediction part, we focus on predicting the downward trend, which is an earlier stage of the customer lifecycle compared to churn.

Causal Inference Management +1


Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework

no code implementations • 18 Feb 2018 • Yanzhi Wang, Caiwen Ding, Zhe Li, Geng Yuan, Siyu Liao, Xiaolong Ma, Bo Yuan, Xuehai Qian, Jian Tang, Qinru Qiu, Xue Lin

Hardware accelerations of deep learning systems have been extensively investigated in industry and academia.


An Area and Energy Efficient Design of Domain-Wall Memory-Based Deep Convolutional Neural Networks using Stochastic Computing

no code implementations • 3 Feb 2018 • Xiaolong Ma, Yi-Peng Zhang, Geng Yuan, Ao Ren, Zhe Li, Jie Han, Jingtong Hu, Yanzhi Wang

However, in these works, the memory design optimization is neglected for weight storage, which will inevitably result in large hardware cost.


Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank

no code implementations • ICML 2017 • Liang Zhao, Siyu Liao, Yanzhi Wang, Zhe Li, Jian Tang, Victor Pan, Bo Yuan

Recently low displacement rank (LDR) matrices, or so-called structured matrices, have been proposed to compress large-scale neural networks.


A Simple Analysis for Exp-concave Empirical Minimization with Arbitrary Convex Regularizer

no code implementations • 9 Sep 2017 • Tianbao Yang, Zhe Li, Lijun Zhang

In this paper, we present a simple analysis of fast rates with high probability of empirical minimization for stochastic composite optimization over a finite-dimensional bounded convex set with exponential concave loss functions and an arbitrary convex regularization.


CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices

no code implementations • 29 Aug 2017 • Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yi-Peng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan

As the size of DNNs continues to grow, it is critical to improve the energy efficiency and performance while maintaining accuracy.


A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning

no code implementations • 13 Mar 2017 • Ning Liu, Zhe Li, Zhiyuan Xu, Jielong Xu, Sheng Lin, Qinru Qiu, Jian Tang, Yanzhi Wang

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in the cloud computing system.

Cloud Computing Decision Making +3


SEP-Nets: Small and Effective Pattern Networks

no code implementations • 13 Jun 2017 • Zhe Li, Xiaoyu Wang, Xutao Lv, Tianbao Yang

By doing this, we show that previous deep CNNs such as GoogLeNet and Inception-type Nets can be compressed dramatically with marginal drop in performance.

Binarization Quantization


Hardware-Driven Nonlinear Activation for Stochastic Computing Based Deep Convolutional Neural Networks

no code implementations • 12 Mar 2017 • Ji Li, Zihao Yuan, Zhe Li, Caiwen Ding, Ao Ren, Qinru Qiu, Jeffrey Draper, Yanzhi Wang

Recently, Deep Convolutional Neural Networks (DCNNs) have made unprecedented progress, achieving the accuracy close to, or even better than human-level perception in various tasks.


SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

no code implementations • 18 Nov 2016 • Ao Ren, Ji Li, Zhe Li, Caiwen Ding, Xuehai Qian, Qinru Qiu, Bo Yuan, Yanzhi Wang

Stochastic Computing (SC), which uses bit-stream to represent a number within [-1, 1] by counting the number of ones in the bit-stream, has a high potential for implementing DCNNs with high scalability and ultra-low hardware footprint.


Improved Dropout for Shallow and Deep Learning

no code implementations • NeurIPS 2016 • Zhe Li, Boqing Gong, Tianbao Yang

To exhibit the optimal dropout probabilities, we analyze the shallow learning with multinomial dropout and establish the risk bound for stochastic optimization.

Stochastic Optimization


Unified Convergence Analysis of Stochastic Momentum Methods for Convex and Non-convex Optimization

no code implementations • 12 Apr 2016 • Tianbao Yang, Qihang Lin, Zhe Li

This paper fills the gap between practice and theory by developing a basic convergence analysis of two stochastic momentum methods, namely stochastic heavy-ball method and the stochastic variant of Nesterov's accelerated gradient method.


A Unified Analysis of Stochastic Momentum Methods for Deep Learning

no code implementations • 30 Aug 2018 • Yan Yan, Tianbao Yang, Zhe Li, Qihang Lin, Yi Yang

However, their theoretical analysis of convergence of the training objective and the generalization error for prediction is still under-explored.


Adaptive Negative Curvature Descent with Applications in Non-convex Optimization

no code implementations • NeurIPS 2018 • Mingrui Liu, Zhe Li, Xiaoyu Wang, Jin-Feng Yi, Tianbao Yang

Negative curvature descent (NCD) method has been utilized to design deterministic or stochastic algorithms for non-convex optimization aiming at finding second-order stationary points or local minima.


CircConv: A Structured Convolution with Low Complexity

no code implementations • 28 Feb 2019 • Siyu Liao, Zhe Li, Liang Zhao, Qinru Qiu, Yanzhi Wang, Bo Yuan

Deep neural networks (DNNs), especially deep convolutional neural networks (CNNs), have emerged as a powerful technique in various machine learning applications.


Prior-aware Neural Network for Partially-Supervised Multi-Organ Segmentation

no code implementations • ICCV 2019 • Yuyin Zhou, Zhe Li, Song Bai, Chong Wang, Xinlei Chen, Mei Han, Elliot Fishman, Alan Yuille

Accurate multi-organ abdominal CT segmentation is essential to many clinical applications such as computer-aided intervention.

Medical Image Segmentation Organ Segmentation +2


Learning From Brains How to Regularize Machines

no code implementations • NeurIPS 2019 • Zhe Li, Wieland Brendel, Edgar Y. Walker, Erick Cobos, Taliah Muhammad, Jacob Reimer, Matthias Bethge, Fabian H. Sinz, Xaq Pitkow, Andreas S. Tolias

We propose to regularize CNNs using large-scale neuroscience data to learn more robust neural features in terms of representational similarity.

Image Classification Inductive Bias


Long Short-Term Sample Distillation

no code implementations • 2 Mar 2020 • Liang Jiang, Zujie Wen, Zhongping Liang, Yafang Wang, Gerard de Melo, Zhe Li, Liangzhuang Ma, Jiaxing Zhang, Xiaolong Li, Yuan Qi

The long-term teacher draws on snapshots from several epochs ago in order to provide steadfast guidance and to guarantee teacher-student differences, while the short-term one yields more up-to-date cues with the goal of enabling higher-quality updates.


RCC-Dual-GAN: An Efficient Approach for Outlier Detection with Few Identified Anomalies

no code implementations • 7 Mar 2020 • Zhe Li, Chunhua Sun, Chunli Liu, Xiayu Chen, Meng Wang, Yezheng Liu

To address these issues, we focus on semi-supervised outlier detection with few identified anomalies, in the hope of using limited labels to achieve high detection accuracy.

Outlier Detection


Robust 3D Self-portraits in Seconds

no code implementations • CVPR 2020 • Zhe Li, Tao Yu, Chuanyu Pan, Zerong Zheng, Yebin Liu

In this paper, we propose an efficient method for robust 3D self-portraits using a single RGBD camera.


E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs

no code implementations • 12 Dec 2018 • Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang

It is a challenging task to have real-time, efficient, and accurate hardware RNN implementations because of the high sensitivity to imprecision accumulation and the requirement of special activation function implementations.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3


Joint Multi-Dimension Pruning via Numerical Gradient Update

no code implementations • 18 May 2020 • Zechun Liu, Xiangyu Zhang, Zhiqiang Shen, Zhe Li, Yichen Wei, Kwang-Ting Cheng, Jian Sun

To tackle these three naturally different dimensions, we propose a general framework by defining pruning as seeking the best pruning vector (i.e., the numerical value of layer-wise channel number, spatial size, depth) and constructing a unique mapping from the pruning vector to the pruned network structures.


Improving Attention-Based Handwritten Mathematical Expression Recognition with Scale Augmentation and Drop Attention

no code implementations • 20 Jul 2020 • Zhe Li, Lianwen Jin, Songxuan Lai, Yecheng Zhu

Handwritten mathematical expression recognition (HMER) is an important research direction in handwriting recognition.

Handwriting Recognition


POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture

no code implementations • CVPR 2021 • Zhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu

By contributing a novel reconstruction framework which contains pose-guided keyframe selection and robust implicit surface fusion, our method fully utilizes the advantages of both tracking-based methods and tracking-free inference methods, and finally enables the high-fidelity reconstruction of dynamic surface details even in the invisible regions.

3D Reconstruction


Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras

no code implementations • ICCV 2021 • Yuxiang Zhang, Zhe Li, Liang An, Mengcheng Li, Tao Yu, Yebin Liu

Overall, we propose the first light-weight total capture system and achieve fast, robust and accurate multi-person total motion capture performance.

Ranked #2 on 3D Multi-Person Pose Estimation on Shelf

3D Multi-Person Pose Estimation


The emergence of cooperation from shared goals in the Systemic Sustainability Game of common pool resources

no code implementations • 1 Oct 2021 • Chengyi Tu, Paolo D'Odorico, Zhe Li, Samir Suweis

The sustainable use of common-pool resources (CPRs) is a major environmental governance challenge because of their possible over-exploitation.


Low-Resource Text Classification via Cross-lingual Language Model Fine-tuning

no code implementations • CCL 2020 • Xiuhong Li, Zhe Li, Jiabao Sheng, Wushour Slamu

There are major challenges in low-resource agglutinative text classification: the lack of labeled data in a target domain and the morphological diversity of derivations in language structures.

Language Modelling Morphological Analysis +2


VRConvMF: Visual Recurrent Convolutional Matrix Factorization for Movie Recommendation

no code implementations • 16 Feb 2022 • Zhu Wang, Honglong Chen, Zhe Li, Kai Lin, Nan Jiang, Feng Xia

Fortunately, context-aware recommender systems can alleviate the sparsity problem by making use of some auxiliary information, such as the information of both the users and items.

Descriptive Movie Recommendation +1


Edge Data Based Trailer Inception Probabilistic Matrix Factorization for Context-Aware Movie Recommendation

no code implementations • 16 Feb 2022 • Honglong Chen, Zhe Li, Zhu Wang, Zhichen Ni, Junjian Li, Ge Xu, Abdul Aziz, Feng Xia

As an effective way to alleviate information overload, recommender system can improve the quality of various services by adding application data generated by users on edge devices, such as visual and textual information, on the basis of sparse rating data.

Movie Recommendation Recommendation Systems


Learning Dynamics and Structure of Complex Systems Using Graph Neural Networks

no code implementations • 22 Feb 2022 • Zhe Li, Andreas S. Tolias, Xaq Pitkow

In this work we trained graph neural networks to fit time series from an example nonlinear dynamical system, the belief propagation algorithm.

Inductive Bias Time Series +1


SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text

no code implementations • 23 Feb 2022 • Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Zhe Li, Dezhi Peng

Specifically, we propose a style bank to parameterize the specific handwriting styles as latent vectors, which are input to a generator as style priors to achieve the corresponding handwritten styles.

Attribute Generative Adversarial Network


MSCET: A Multi-Scenario Offloading Schedule for Biomedical Data Processing and Analysis in Cloud-Edge-Terminal Collaborative Vehicular Networks

no code implementations • 16 Feb 2022 • Zhichen Ni, Honglong Chen, Zhe Li, Xiaomeng Wang, Na Yan, Weifeng Liu, Feng Xia

The vehicles can offload computation-intensive tasks to the cloud to save edge resources.

Edge-computing


Prompting for Multi-Modal Tracking

no code implementations • 29 Jul 2022 • Jinyu Yang, Zhe Li, Feng Zheng, Aleš Leonardis, Jingkuan Song

Multi-modal tracking gains attention due to its ability to be more accurate and robust in complex scenarios compared to traditional RGB-based tracking.

Ranked #12 on RGB-T Tracking on LasHeR

RGB-T Tracking


Multi-view Contrastive Learning with Additive Margin for Adaptive Nasopharyngeal Carcinoma Radiotherapy Prediction

no code implementations • 27 Oct 2022 • Jiabao Sheng, Yuanpeng Zhang, Jing Cai, Sai-Kit Lam, Zhe Li, Jiang Zhang, Xinzhi Teng

To improve the discriminative ability of the loss function, we incorporate a margin into the contrastive learning.

Contrastive Learning


Discriminative Speaker Representation via Contrastive Learning with Class-Aware Attention in Angular Space

no code implementations • 29 Oct 2022 • Zhe Li, Man-Wai Mak, Helen Mei-Ling Meng

The challenges in applying contrastive learning to speaker verification (SV) are that the softmax-based contrastive loss lacks discriminative power and that the hard negative pairs can easily influence learning.

Contrastive Learning Speaker Verification


Learning Dual-Fused Modality-Aware Representations for RGBD Tracking

no code implementations • 6 Nov 2022 • Shang Gao, Jinyu Yang, Zhe Li, Feng Zheng, Aleš Leonardis, Jingkuan Song

However, some existing RGBD trackers use the two modalities separately and thus some particularly useful shared information between them is ignored.

Object Tracking


Balancing Privacy Protection and Interpretability in Federated Learning

no code implementations • 16 Feb 2023 • Zhe Li, Honglong Chen, Zhichen Ni, Huajie Shao

Federated learning (FL) aims to collaboratively train the global model in a distributed manner by sharing the model parameters from local clients to a central server, thereby potentially protecting users' private information.

Federated Learning


A Graph Reconstruction by Dynamic Signal Coefficient for Fault Classification

no code implementations • 30 May 2023 • Wenbin He, Jianxu Mao, Yaonan Wang, Zhe Li, Qiu Fang, Haotian Wu

To improve fault identification for rotating machinery under strong noise, this paper presents a dynamic feature reconstruction signal graph method, which plays a key role in the proposed end-to-end fault diagnosis model.

feature selection Graph Reconstruction


Whole Slide Multiple Instance Learning for Predicting Axillary Lymph Node Metastasis

1 code implementation • 6 Oct 2023 • Glejdis Shkëmbi, Johanna P. Müller, Zhe Li, Katharina Breininger, Peter Schüffler, Bernhard Kainz

Breast cancer is a major concern for women's health globally, with axillary lymph node (ALN) metastasis identification being critical for prognosis evaluation and treatment guidance.

Data Augmentation Multiple Instance Learning +1


General Point Model with Autoencoding and Autoregressive

no code implementations • 25 Oct 2023 • Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang

This model is versatile, allowing fine-tuning for downstream point cloud representation tasks, as well as unconditional and conditional generation tasks.

Language Modelling Large Language Model +2


A Wi-Fi Signal-Based Human Activity Recognition Using High-Dimensional Factor Models

no code implementations • 10 Nov 2023 • Junshuo Liu, Fuhai Wang, Zhe Li, Rujing Xiong, Tiebin Mi, Robert Caiming Qiu

As a consequence, the accuracy of human activity recognition based on Wi-Fi signals is compromised.

Human Activity Recognition


Robust Learning Based Condition Diagnosis Method for Distribution Network Switchgear

no code implementations • 14 Nov 2023 • Wenxi Zhang, Zhe Li, Weixi Li, Weisi Ma, Xinyi Chen, Sizhe Li

This paper introduces a robust, learning-based method for diagnosing the state of distribution network switchgear, which is crucial for maintaining the power quality for end users.

Position


Widely Linear Matched Filter: A Lynchpin towards the Interpretability of Complex-valued CNNs

no code implementations • 30 Jan 2024 • Qingchen Wang, Zhe Li, Zdenka Babic, Wei Deng, Ljubiša Stanković, Danilo P. Mandic

However, applying this paradigm to illuminate the interpretability of complex-valued CNNs meets a formidable obstacle: the extension of matched filtering to a general class of noncircular complex-valued data, referred to here as the widely linear matched filter (WLMF), has been only implicit in the literature.


DCS-Net: Pioneering Leakage-Free Point Cloud Pretraining Framework with Global Insights

no code implementations • 3 Feb 2024 • Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang

Experimental results demonstrate that our method enhances the expressive capacity of existing point cloud models and effectively addresses the issue of information leakage.


MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning

no code implementations • 3 Feb 2024 • Zhe Li, Laurence T. Yang, Bocheng Ren, Xin Nie, Zhangyang Gao, Cheng Tan, Stan Z. Li

The scarcity of annotated data has sparked significant interest in unsupervised pre-training methods that leverage medical reports as auxiliary signals for medical visual representation learning.

Contrastive Learning Image Classification +5


RISAR: RIS-assisted Human Activity Recognition with Commercial Wi-Fi Devices

no code implementations • 27 Feb 2024 • Junshuo Liu, Yunlong Huang, Wei Yang, Zhe Li, Rujing Xiong, Tiebin Mi, Xin Shi, Robert C. Qiu

Human activity recognition (HAR) holds significant importance in smart homes, security, and healthcare.

Denoising Human Activity Recognition


Enhancing Multivariate Time Series Forecasting with Mutual Information-driven Cross-Variable and Temporal Modeling

no code implementations • 1 Mar 2024 • Shiyi Qi, Liangjian Wen, Yiduo Li, Yuanhang Yang, Zhe Li, Zhongwen Rao, Lujia Pan, Zenglin Xu

To substantiate this claim, we introduce the Cross-variable Decorrelation Aware feature Modeling (CDAM) for Channel-mixing approaches, aiming to refine Channel-mixing by minimizing redundant information between channels while enhancing relevant mutual information.

Multivariate Time Series Forecasting Time Series


TexVocab: Texture Vocabulary-conditioned Human Avatars

no code implementations • 31 Mar 2024 • Yuxiao Liu, Zhe Li, Yebin Liu, Haoqian Wang

To adequately utilize the available image evidence in multi-view video-based avatar modeling, we propose TexVocab, a novel avatar representation that constructs a texture vocabulary and associates body poses with texture maps for animation.

Human Dynamics


Automatic Knowledge Graph Construction for Judicial Cases

no code implementations • 15 Apr 2024 • Jie Zhou, Xin Chen, Hang Zhang, Zhe Li

Building on these results, we detail the automatic construction process of case knowledge graphs for judicial cases, enabling the assembly of knowledge graphs for hundreds of thousands of judgments.

graph construction Knowledge Graphs

