Search Results for author: Jie Li

Found 202 papers, 60 papers with code

MindSemantix: Deciphering Brain Visual Experiences with a Brain-Language Model

no code implementations 29 May 2024 Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao

MindSemantix generates high-quality captions that are deeply rooted in the visual and semantic information derived from brain activity.

Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

no code implementations 23 May 2024 Zhusi Zhong, Jie Li, John Sollee, Scott Collins, Harrison Bai, Paul Zhang, Terrence Healey, Michael Atalay, Xinbo Gao, Zhicheng Jiao

In response to the worldwide COVID-19 pandemic, advanced automated technologies have emerged as valuable tools to aid healthcare professionals in managing an increased workload by improving radiology report generation and prognostic analysis.

Sentence Survival Prediction

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

no code implementations 10 May 2024 Trevor J. Chan, Aarush Sahni, Jie Li, Alisha Luthra, Amy Fang, Alison Pouch, Chamith S. Rajapakse

We introduce SAM3D, a new approach to semi-automatic zero-shot segmentation of 3D images building on the existing Segment Anything Model.

Zero Shot Segmentation

Region-specific Risk Quantification for Interpretable Prognosis of COVID-19

no code implementations 5 May 2024 Zhusi Zhong, Jie Li, Zhuoqi Ma, Scott Collins, Harrison Bai, Paul Zhang, Terrance Healey, Xinbo Gao, Michael K. Atalay, Zhicheng Jiao

The COVID-19 pandemic has strained global public health, necessitating accurate diagnosis and intervention to control disease spread and reduce mortality rates.

COVID-19 Diagnosis Decision Making +2

Towards Balanced RGB-TSDF Fusion for Consistent Semantic Scene Completion by 3D RGB Feature Completion and a Classwise Entropy Loss Function

no code implementations 25 Mar 2024 Laiyan Ding, Panwen Hu, Jie Li, Rui Huang

To address this RGB-TSDF distribution difference, we propose a two-stage network with a 3D RGB feature completion module that completes RGB features with meaningful values for occluded areas.

TBI Image/Text (TBI-IT): Comprehensive Text and Image Datasets for Traumatic Brain Injury Research

no code implementations 14 Mar 2024 Jie Li, Jiaying Wen, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang

In this paper, we introduce a new dataset in the medical field of Traumatic Brain Injury (TBI), called TBI-IT, which includes both electronic medical records (EMRs) and head CT images.

Image Segmentation named-entity-recognition +2

LocalGCL: Local-aware Contrastive Learning for Graphs

no code implementations 27 Feb 2024 Haojun Jiang, Jiawei Sun, Jie Li, Chentao Wu

Graph representation learning (GRL), which encodes graphs with topological structures into low-dimensional embeddings, has made considerable progress recently.

Contrastive Learning Graph Representation Learning +1

Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment

no code implementations 22 Feb 2024 Zhaoyang Wang, Bo Hu, Mingyang Zhang, Jie Li, Leida Li, Maoguo Gong, Xinbo Gao

Firstly, we devise a new diffusion restoration network that leverages the produced enhanced image and noise-containing images, incorporating nonlinear features obtained during the denoising process of the diffusion model, as high-level visual information.

Denoising No-Reference Image Quality Assessment +1

Advancing GenAI Assisted Programming--A Comparative Study on Prompt Efficiency and Code Quality Between GPT-4 and GLM-4

no code implementations 20 Feb 2024 Angus Yang, Zehan Li, Jie Li

Our GenAI Coding Workshop highlights the effectiveness and accessibility of the prompting methodology developed in this study.

Code Generation

Seeing is not always believing: The Space of Harmless Perturbations

no code implementations 3 Feb 2024 Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo

However, in this work, we reveal the existence of a harmless perturbation space, in which perturbations drawn from this space, regardless of their magnitudes, leave the network output unchanged when applied to inputs.

Privacy Preserving

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models

no code implementations 30 Jan 2024 Jie Li, Yi Liu, Chongyang Liu, Ling Shi, Xiaoning Ren, Yaowen Zheng, Yang Liu, Yinxing Xue

To address this research gap, we conducted an extensive empirical study on Multilingual Jailbreak attacks.

Text Generation

Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human

no code implementations 5 Jan 2024 Song Bai, Jie Li

Since 2023, a large number of research papers have emerged in the domain of 3D generation.

3D Generation

DeLR: Active Learning for Detection with Decoupled Localization and Recognition Query

no code implementations 28 Dec 2023 Yuhang Zhang, Yuang Deng, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Qi Tian

In DeLR, 1) the query is made at the region level, and we only annotate the object region that is queried; 2) instead of directly providing both localization and recognition annotations, we query the two components separately, and thus reduce the recognition budget with the pseudo class labels provided by the model.

Active Learning Object +2

A Dual Domain Multi-exposure Image Fusion Network based on the Spatial-Frequency Integration

1 code implementation 17 Dec 2023 Guang Yang, Jie Li, Xinbo Gao

Specifically, we introduce a Spatial-Frequency Fusion Block to facilitate efficient interaction between dual domains and capture complementary information from input images with different exposures.

Multi-Exposure Image Fusion

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

no code implementations 14 Dec 2023 Yan Gao, Haojun Xu, Nannan Wang, Jie Li, Xinbo Gao

In addition to the previous method of treating objects as nodes, the network innovatively treats object trajectories as nodes for information interaction, improving the graph neural network's feature representation capability.

Multi-Object Tracking Multiple Object Tracking +1

A Multi-scale Information Integration Framework for Infrared and Visible Image Fusion

1 code implementation 7 Dec 2023 Guang Yang, Jie Li, Hanxiao Lei, Xinbo Gao

In this study, we propose a multi-scale dual attention (MDA) framework for infrared and visible image fusion, which is designed to measure and integrate complementary information in both structure and loss function at the image and patch level.

Infrared And Visible Image Fusion

EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model

no code implementations 5 Dec 2023 Guozhang Li, Xinpeng Ding, De Cheng, Jie Li, Nannan Wang, Xinbo Gao

To further clarify the noise of expanded boundaries, we combine mutual learning with a tailored proposal-level contrastive objective to use a learnable approach to harmonize a balance between incomplete yet clean (initial) and comprehensive yet noisy (expanded) boundaries for more precise ones.

Boundary Detection Language Modelling +2

Joint Design of ISAC Waveform under PAPR Constraints

no code implementations 20 Nov 2023 Yating Chen, Cai Wen, Yan Huang, Le Liang, Jie Li, Hui Zhang, Wei Hong

In this paper, we formulate the precoding problem of integrated sensing and communication (ISAC) waveform as a non-convex quadratically constrained quadratic program (QCQP), in which the weighted sum of communication multi-user interference (MUI) and the gap between dual-use waveform and ideal radar waveform is minimized with peak-to-average power ratio (PAPR) constraints.

Audio-visual Saliency for Omnidirectional Videos

no code implementations 9 Nov 2023 Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai

Visual saliency prediction for omnidirectional videos (ODVs) is of great significance for ODV coding, ODV transmission, ODV rendering, etc.

Saliency Prediction

Robust and Communication-Efficient Federated Domain Adaptation via Random Features

1 code implementation 8 Nov 2023 Zhanbo Feng, Yuanjie Wang, Jie Li, Fan Yang, Jiong Lou, Tiebin Mi, Robert C. Qiu, Zhenyu Liao

As a result, there is a growing trend to leverage federated learning (FL) techniques to train large ML models in a distributed and collaborative manner.

Domain Adaptation Federated Learning

Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

1 code implementation 7 Nov 2023 Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

We propose a novel framework to integrate SD maps into online map prediction and propose a Transformer-based encoder, SD Map Encoder Representations from transFormers, to leverage priors in SD maps for the lane-topology prediction task.

Autonomous Driving Lane Detection

Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback

no code implementations 29 Oct 2023 Jingliang Duan, Jie Li, Xuyang Chen, Kai Zhao, Shengbo Eben Li, Lin Zhao

Despite the absence of convexity, we leverage these properties to derive novel findings regarding convergence (and nearly dimension-free rate) to stationary points for three policy gradient methods, including the vanilla policy gradient method, the natural policy gradient method, and the Gauss-Newton method.

Policy Gradient Methods

Bridging the Gap between Newton-Raphson Method and Regularized Policy Iteration

no code implementations 11 Oct 2023 Zeyang Li, Chuxiong Hu, Yunan Wang, Guojian Zhan, Jie Li, Shengbo Eben Li

We also show that a modified version of regularized policy iteration, i.e., with finite-step policy evaluation, is equivalent to an inexact Newton method in which the Newton iteration formula is solved with truncated iterations.

User Experience Design Professionals' Perceptions of Generative Artificial Intelligence

no code implementations 26 Sep 2023 Jie Li, Hancheng Cao, Laura Lin, Youyang Hou, Ruihao Zhu, Abdallah El Ali

They emphasized the unique human factors of "enjoyment" and "agency", where humans remain the arbiters of "AI alignment".

Learning Optimal Robust Control of Connected Vehicles in Mixed Traffic Flow

no code implementations 18 Sep 2023 Jie Li, Jiawei Wang, Shengbo Eben Li, Keqiang Li

Connected and automated vehicle (CAV) technologies promise to attenuate undesired traffic disturbances.

Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

no code implementations 30 Aug 2023 Zhanbo Feng, Zenan Ling, Ci Gong, Feng Zhou, Jie Li, Robert C. Qiu

Existing works tend to use either image-guided methods, which provide a visual reference but lack control over semantic coherence, or text-guided methods, which ensure faithfulness to text guidance but lack visual quality.

Attribute Denoising

HODN: Disentangling Human-Object Feature for HOI Detection

no code implementations 20 Aug 2023 Shuman Fang, Zhiwen Lin, Ke Yan, Jie Li, Xianming Lin, Rongrong Ji

However, these methods ignore the relationship among humans, objects, and interactions: 1) human features are more contributive than object ones to interaction prediction; 2) interactive information disturbs the detection of objects but helps human detection.

Decoder Human Detection +4

ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023

no code implementations 16 Aug 2023 Mengjie Du, Xiang Fang, Jie Li

This technical report describes the ChinaTelecom system for Track 1 (closed) of the VoxCeleb2023 Speaker Recognition Challenge (VoxSRC 2023).

Speaker Recognition

Improving Human-Object Interaction Detection via Virtual Image Learning

no code implementations 4 Aug 2023 Shuman Fang, Shuai Liu, Jie Li, Guannan Jiang, Xianming Lin, Rongrong Ji

Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects, which plays a crucial role in high-level semantic understanding tasks.

Human-Object Interaction Detection Object

UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming

no code implementations 31 Jul 2023 Hao Lin, Ke Wu, Jie Li, Jun Li, Wu-Jun Li

To the best of our knowledge, UniAP is the first parallel method that can jointly optimize the two categories of parallel strategies to find an optimal solution.

A novel integrated method of detection-grasping for specific object based on the box coordinate matching

no code implementations 20 Jul 2023 Zongmin Liu, Jirui Wang, Jie Li, Zufeng Li, Kai Ren, Peng Shi

Furthermore, a detection-grasping integrated algorithm based on box coordinate matching (DG-BCM) is proposed to obtain the fusion model of object detection and grasp estimation.

Instance Segmentation Object +3

MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection

no code implementations 6 Jul 2023 Ruiyang Xia, Decheng Liu, Jie Li, Lin Yuan, Nannan Wang, Xinbo Gao

Advanced manipulation techniques have provided criminals with opportunities to cause social panic or gain illicit profits through the generation of deceptive media, such as forged face images.

DeepFake Detection Face Swapping

Quantum-Enhanced Diamond Molecular Tension Microscopy for Quantifying Cellular Forces

no code implementations 28 Jun 2023 Feng Xu, Shuxiang Zhang, Linjie Ma, Yong Hou, Jie Li, Andrej Denisenko, Zifu Li, Joachim Spatz, Jörg Wrachtrup, Qiang Wei, Zhiqin Chu

The constant interplay and information exchange between cells and their micro-environment are essential to their survival and ability to execute biological functions.

Temporal Gradient Inversion Attacks with Robust Optimization

no code implementations 13 Jun 2023 Bowen Li, Hanlin Gu, Ruoxin Chen, Jie Li, Chentao Wu, Na Ruan, Xueming Si, Lixin Fan

We investigate a Temporal Gradient Inversion Attack with a Robust Optimization framework, called TGIAs-RO, which recovers private data without any prior knowledge by leveraging multiple temporal gradients.

Federated Learning Privacy Preserving

Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition

1 code implementation 21 May 2023 Haojun Xu, Yan Gao, Zheng Hui, Jie Li, Xinbo Gao

Also, humans have brain regions dedicated to understanding the minds of others and analyzing their intentions, such as the medial prefrontal cortex of the temporal lobe.

Ranked #2 on Skeleton Based Action Recognition on NTU RGB+D 120 (using extra training data)

Action Recognition GPR +2

Tracking through Containers and Occluders in the Wild

1 code implementation CVPR 2023 Basile Van Hoorick, Pavel Tokmakov, Simon Stent, Jie Li, Carl Vondrick

Tracking objects with persistence in cluttered and dynamic environments remains a difficult challenge for computer vision systems.

Visual Tracking

Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint

1 code implementation 25 Apr 2023 Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Jie Li, Xinbo Gao

The proposed Bi-SCC first adopts a temporal context augmentation to generate an augmented video that breaks the correlation between positive actions and their co-scene actions in the inter-video; then, a semantic consistency constraint (SCC) is used to enforce the predictions of the original video and augmented video to be consistent, hence suppressing the co-scene actions.

Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization

BS-GAT Behavior Similarity Based Graph Attention Network for Network Intrusion Detection

no code implementations 7 Apr 2023 Yalu Wang, Zhijie Han, Jie Li, Xin He

To address the above issue, this paper proposes a graph neural network algorithm based on behavior similarity (BS-GAT) using a graph attention network.

Graph Attention graph construction +1

What's in a Name? Beyond Class Indices for Image Recognition

no code implementations 5 Apr 2023 Kai Han, Yandong Li, Sagar Vaze, Jie Li, Xuhui Jia

In this paper, we reconsider the recognition problem and task a vision-language model to assign class names to images given only a large and essentially unconstrained vocabulary of categories as prior information.

Language Modelling Object Recognition

MRCN: A Novel Modality Restitution and Compensation Network for Visible-Infrared Person Re-identification

no code implementations 26 Mar 2023 Yukang Zhang, Yan Yan, Jie Li, Hanzi Wang

Furthermore, to better disentangle the modality-relevant features and the modality-irrelevant features, we propose a novel Center-Quadruplet Causal (CQC) loss to encourage the network to effectively learn the modality-relevant features and the modality-irrelevant features.

Person Re-Identification

Viewpoint Equivariance for Multi-View 3D Object Detection

1 code implementation CVPR 2023 Dian Chen, Jie Li, Vitor Guizilini, Rares Ambrus, Adrien Gaidon

We design view-conditioned queries at the output level, which enables the generation of multiple virtual frames during training to learn viewpoint equivariance by enforcing multi-view consistency.

3D Object Detection Object +2

Adaptive incentive for cross-silo federated learning: A multi-agent reinforcement learning approach

no code implementations 15 Feb 2023 Shijing Yuan, Hongze Liu, Hongtao Lv, Zhanbo Feng, Jie Li, Hongyang Chen, Chentao Wu

To overcome these limitations, we propose a novel adaptive mechanism for cross-silo FL, towards incentivizing organizations to contribute data to maximize their long-term payoffs in a real dynamic training environment.

Federated Learning Multi-agent Reinforcement Learning

Predicting Molecule-Target Interaction by Learning Biomedical Network and Molecule Representations

no code implementations 2 Feb 2023 Jinjiang Guo, Jie Li

Most existing methodologies utilize either biomedical network information or molecule structural features to predict potential interaction links.

Drug Discovery

Multi-scale multi-modal micro-expression recognition algorithm based on transformer

no code implementations 8 Jan 2023 Fengping Wang, Jie Li, Chun Qi, Lin Wang, Pan Wang

A micro-expression is a spontaneous unconscious facial muscle movement that can reveal the true emotions people attempt to hide.

Contrastive Learning Micro Expression Recognition +2

A Novel Improved Mask RCNN for Multiple Targets Detection in the Indoor Complex Scenes

no code implementations 7 Jan 2023 Zongmin Liu, Jirui Wang, Jie Li, Pengda Liu, Kai Ren

However, indoor scenes are usually complex and contain many types of interference factors, which makes multiple-target detection highly challenging.

Breaking the "Object" in Video Object Segmentation

no code implementations CVPR 2023 Pavel Tokmakov, Jie Li, Adrien Gaidon

Yet, this important phenomenon is largely absent from existing video object segmentation (VOS) benchmarks.

Object Semantic Segmentation +2

ShaSTA: Modeling Shape and Spatio-Temporal Affinities for 3D Multi-Object Tracking

no code implementations 8 Nov 2022 Tara Sadjadpour, Jie Li, Rares Ambrus, Jeannette Bohg

To address these issues in a unified framework, we propose to learn shape and spatio-temporal affinities between tracks and detections in consecutive frames.

3D Multi-Object Tracking Autonomous Vehicles +1

Automatic Change-Point Detection in Time Series via Deep Learning

1 code implementation 7 Nov 2022 Jie Li, Paul Fearnhead, Piotr Fryzlewicz, Tengyao Wang

We show how to automatically generate new offline detection methods based on training a neural network.

Change Point Detection Time Series +1

Reconstruction of compressed spectral imaging based on global structure and spectral correlation

no code implementations 27 Oct 2022 Pan Wang, Jie Li, Jieru Chen, Lin Wang, Chun Qi

To fully exploit the constraints between spectra, the coefficients corresponding to the convolution kernel are constrained by the L_{2,1} norm to improve spectral accuracy.

SSIM

Depth Is All You Need for Monocular 3D Detection

no code implementations 5 Oct 2022 Dennis Park, Jie Li, Dian Chen, Vitor Guizilini, Adrien Gaidon

Our methods leverage commonly available LiDAR or RGB videos during training time to fine-tune the depth representation, which leads to improved 3D detectors.

Depth Prediction Monocular Depth Estimation +1

Seen to Unseen: When Fuzzy Inference System Predicts IoT Device Positioning Labels That Had Not Appeared in Training Phase

no code implementations 21 Sep 2022 Han Xu, Zheming Zuo, Jie Li, Victor Chang

Situated at the core of Artificial Intelligence (AI), Machine Learning (ML), and more specifically Deep Learning (DL), has enjoyed great success in the past two decades.

feature selection

Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

no code implementations 17 Sep 2022 Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang

Experimental results show that the proposed model achieves competitive performance with 1/3 of the parameters of the encoder, compared with the full-parameter model.

Knowledge Distillation speech-recognition +1

Seeking Subjectivity in Visual Emotion Distribution Learning

no code implementations 25 Jul 2022 Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao

In psychology, the Object-Appraisal-Emotion model has demonstrated that each individual's emotion is affected by his/her subjective appraisal, which is further formed by the affective memory.

Emotion Recognition

SpOT: Spatiotemporal Modeling for 3D Object Tracking

no code implementations 12 Jul 2022 Colton Stearns, Davis Rempe, Jie Li, Rares Ambrus, Sergey Zakharov, Vitor Guizilini, Yanchao Yang, Leonidas J Guibas

In this work, we develop a holistic representation of traffic scenes that leverages both spatial and temporal information of the actors in the scene.

3D Multi-Object Tracking 3D Object Tracking +1

TransFA: Transformer-based Representation for Face Attribute Evaluation

1 code implementation 12 Jul 2022 Decheng Liu, Weijie He, Chunlei Peng, Nannan Wang, Jie Li, Xinbo Gao

The multi-branch transformer is employed to explore the inter-correlation between different attributes in similar semantic regions for attribute feature learning.

Attribute Multi-Label Classification +1

Digital-twin-enhanced metal tube bending forming real-time prediction method based on Multi-source-input MTL

1 code implementation 3 Jul 2022 Chang Sun, Zili Wang, Shuyou Zhang, Taotao Zhou, Jie Li, Jianrong Tan

To address this issue, a digital-twin-enhanced (DT-enhanced) metal tube bending forming real-time prediction method based on multi-source-input multi-task learning (MTL) is proposed.

Multi-Task Learning

Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?

1 code implementation 16 Jun 2022 Adam W. Harley, Zhaoyuan Fang, Jie Li, Rares Ambrus, Katerina Fragkiadaki

Building 3D perception systems for autonomous vehicles that do not rely on high-density LiDAR is a critical research problem because of the expense of LiDAR systems compared to cameras and other sensors.

Autonomous Vehicles Bird's-Eye View Semantic Segmentation +1

An Indoor Environment Sensing and Localization System via mmWave Phased Array

no code implementations 7 Jun 2022 Yifei Sun, Jie Li, Tong Zhang, Rui Wang, Xiaohui Peng, Tony Xiao Han, Haisheng Tan

Finally, we show that the reconstructed room layout can be utilized to locate a mobile device according to its AoA spectrum, even with a single access point.

FairGAN: GANs-based Fairness-aware Learning for Recommendations with Implicit Feedback

1 code implementation Proceedings of the ACM Web Conference 2022 Jie Li, Yongli Ren, Ke Deng

To fill this gap, we propose FairGAN, a learning algorithm based on Generative Adversarial Networks (GANs) that maps the exposure fairness issue to the problem of negative preferences in implicit feedback data.

Exposure Fairness Recommendation Systems

Object Permanence Emerges in a Random Walk along Memory

1 code implementation 4 Apr 2022 Pavel Tokmakov, Allan Jabri, Jie Li, Adrien Gaidon

This paper proposes a self-supervised objective for learning representations that localize objects under occlusion - a property known as object permanence.

Object

Passive Motion Detection via mmWave Communication System

no code implementations 28 Mar 2022 Jie Li, Chao Yu, Yan Luo, Yifei Sun, Rui Wang

Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively.

Hand Gesture Recognition Hand-Gesture Recognition +1

On Understanding and Mitigating the Dimensional Collapse of Graph Contrastive Learning: a Non-Maximum Removal Approach

no code implementations 24 Mar 2022 Jiawei Sun, Ruoxin Chen, Jie Li, Chentao Wu, Yue Ding, Junchi Yan

Graph Contrastive Learning (GCL) has shown promising performance in graph representation learning (GRL) without the supervision of manual annotations.

Contrastive Learning Graph Classification +1

Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

1 code implementation 29 Jan 2022 Jie Li, Ling Han, Cong Zhang, Qiyue Li, Zhi Liu

Most of the current prediction methods combining saliency detection and FoV information neither take into account that the distortion of projected 360-degree videos can invalidate the weight sharing of traditional convolutional networks, nor do they adequately consider the difficulty of obtaining complete multi-user FoV information, which degrades the prediction performance.

Saliency Detection Time Series +1

One-Bit Active Query With Contrastive Pairs

no code implementations CVPR 2022 Yuhang Zhang, Xiaopeng Zhang, Lingxi Xie, Jie Li, Robert C. Qiu, Hengtong Hu, Qi Tian

The Yes query is treated as positive pairs of the queried category for contrastive pulling, while the No query is treated as hard negative pairs for contrastive repelling.

Active Learning Contrastive Learning

Learning to Learn Transferable Attack

1 code implementation 10 Dec 2021 Shuman Fang, Jie Li, Xianming Lin, Rongrong Ji

By treating the attack of both specific data and a modified model as a task, we expect the adversarial perturbations to adopt enough tasks for generalization.

Adversarial Attack Data Augmentation +1

Fully Attentional Network for Semantic Segmentation

1 code implementation 8 Dec 2021 Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang

Recent non-local self-attention methods have proven to be effective in capturing long-range dependencies for semantic segmentation.

Computational Efficiency Segmentation +1

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

1 code implementation 4 Dec 2021 Feng Xu, Chuang Zhu, Wenqi Tang, Ying Wang, Yu Zhang, Jie Li, Hongchuan Jiang, Zhongyue Shi, Jun Liu, Mulan Jin

Conclusion: Our study provides a novel DL-based biomarker on primary tumor CNB slides to predict the metastatic status of ALN preoperatively for patients with EBC.

Multiple Instance Learning Specificity +1

Image-specific Convolutional Kernel Modulation for Single Image Super-resolution

1 code implementation 16 Nov 2021 Yuanfei Huang, Jie Li, Yanting Hu, Xinbo Gao, Hua Huang

Recently, deep-learning-based super-resolution methods have achieved excellent performances, but mainly focus on training a single generalized deep network by feeding numerous samples.

Image Super-Resolution

Denoised Non-Local Neural Network for Semantic Segmentation

no code implementations 27 Oct 2021 Qi Song, Jie Li, Hao Guo, Rui Huang

Without any external training data, our proposed Denoised NL can achieve state-of-the-art performance of 83.5% and 46.69% mIoU on Cityscapes and ADE20K, respectively.

Semantic Segmentation

Bone Marrow Cell Recognition: Training Deep Object Detection with A New Loss Function

no code implementations 25 Oct 2021 Dehao Huang, Jintao Cheng, Rui Fan, Zhihao Su, Qiongxiong Ma, Jie Li

Therefore, it is crucial to study a robust bone marrow cell detection algorithm for a quantitative automatic analysis system.

Cell Detection object-detection +1

FedIPR: Ownership Verification for Federated Deep Neural Network Models

1 code implementation 27 Sep 2021 Bowen Li, Lixin Fan, Hanlin Gu, Jie Li, Qiang Yang

To address these risks, the ownership verification of federated learning models is a prerequisite that protects federated learning model intellectual property rights (IPR), i.e., FedIPR.

Federated Learning

Geometry-Based Stochastic Line-of-Sight Probability Model for A2G Channels under Urban Scenarios

no code implementations 6 Sep 2021 Qiuming Zhu, Fei Bai, Minghui Pang, Jie Li, Weizhi Zhong, Xiaomin Chen, Kai Mao

A line-of-sight (LoS) path is essential for the reliability of air-to-ground (A2G) communications, but the existence of an LoS path is difficult to predict due to random obstacles on the ground.

Stimuli-Aware Visual Emotion Analysis

no code implementations 4 Sep 2021 Jingyuan Yang, Jie Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao

Then, we design three specific networks, i.e., Global-Net, Semantic-Net and Expression-Net, to extract distinct emotional features from different stimuli simultaneously.

Emotion Recognition

An Integrated Framework for the Heterogeneous Spatio-Spectral-Temporal Fusion of Remote Sensing Images

no code implementations 1 Sep 2021 Menghui Jiang, Huanfeng Shen, Jie Li, Liangpei Zhang

Images from many remote sensing satellites, including MODIS, Landsat-8, Sentinel-1, and Sentinel-2, are utilized in the experiments.

LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision

no code implementations ICCV 2021 Zhijian Liu, Simon Stent, Jie Li, John Gideon, Song Han

Computer vision tasks such as object detection and semantic/instance segmentation rely on the painstaking annotation of large training datasets.

Image Classification Instance Segmentation +3

Coupling Model-Driven and Data-Driven Methods for Remote Sensing Image Restoration and Fusion

no code implementations 13 Aug 2021 Huanfeng Shen, Menghui Jiang, Jie Li, Chenxia Zhou, Qiangqiang Yuan, Liangpei Zhang

In this paper, we systematically investigate the coupling of model-driven and data-driven methods, which has rarely been considered in the remote sensing image restoration and fusion communities.

Image Restoration

Is Pseudo-Lidar needed for Monocular 3D Object detection?

2 code implementations ICCV 2021 Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon

Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors.

Ranked #1 on Monocular 3D Object Detection on KITTI Pedestrian Moderate (using extra training data)

Monocular 3D Object Detection Monocular Depth Estimation +2

A Dynamic 3D Spontaneous Micro-expression Database: Establishment and Evaluation

no code implementations 31 Jul 2021 Fengping Wang, Jie Li, Siqi Zhang, Chun Qi, Yun Zhang, Danmin Miao

Micro-expressions are spontaneous, unconscious facial movements that show people's true inner emotions and have great potential in related fields of psychological testing.

Real-time Keypoints Detection for Autonomous Recovery of the Unmanned Ground Vehicle

no code implementations 27 Jul 2021 Jie Li, Sheng Zhang, Kai Han, Xia Yuan, Chunxia Zhao, Yu Liu

UGV-KPNet is computationally efficient with a small number of parameters and provides pixel-level accurate keypoints detection results in real-time.

Keypoint Detection

Wideband photonic blind source separation with optical pulse sampling

no code implementations21 Jul 2021 Taichu Shi, Yang Qi, Weipeng Zhang, Paul R. Prucnal, Jie Li, Ben Wu

The ultra-fast optical pulse functions as a tweezer that collects samples of the signals at very low sampling rates, and each sample is short enough to maintain the statistical properties of the signals.

blind source separation

Integrated Sensing and Communication from Learning Perspective: An SDP3 Approach

no code implementations20 Jul 2021 Guoliang Li, Shuai Wang, Jie Li, Rui Wang, Fan Liu, Xiaohui Peng, Tony Xiao Han, Chengzhong Xu

Characterizing the sensing and communication performance tradeoff in integrated sensing and communication (ISAC) systems is challenging in the applications of learning-based human motion recognition.

Fully Polarimetric SAR and Single-Polarization SAR Image Fusion Network

no code implementations18 Jul 2021 Liupeng Lin, Jie Li, Huanfeng Shen, Lingli Zhao, Qiangqiang Yuan, Xinghua Li

The data fusion technology aims to aggregate the characteristics of different data and obtain products with multiple data advantages.

IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement

no code implementations29 Jun 2021 Jie Li, Laiyan Ding, Rui Huang

3D semantic scene completion and 2D semantic segmentation are two tightly correlated tasks that are both essential for indoor scene understanding, because they predict the same semantic classes, using positively correlated high-level features.

2D Semantic Segmentation 3D Semantic Scene Completion +3

A Circular-Structured Representation for Visual Emotion Distribution Learning

no code implementations CVPR 2021 Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Xinbo Gao

Visual Emotion Analysis (VEA) has attracted increasing attention recently with the prevalence of sharing images on social networks.

Emotion Recognition

Hierarchical Lovasz Embeddings for Proposal-Free Panoptic Segmentation

no code implementations CVPR 2021 Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon

We use a hierarchical Lovasz hinge loss to learn a low-dimensional embedding space structured into a unified semantic and instance hierarchy without requiring separate network branches or object proposals.

Instance Segmentation Panoptic Segmentation +1

Learning the Non-Differentiable Optimization for Blind Super-Resolution

no code implementations CVPR 2021 Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao

Instead of considering an iterative strategy, we make the blur kernel predictor trainable in the whole blind SR model, in which AMNet is well-trained.

Blind Super-Resolution Super-Resolution

Inverse Simulation: Reconstructing Dynamic Geometry of Clothed Humans via Optimal Control

no code implementations CVPR 2021 Jingfan Guo, Jie Li, Rahul Narain, Hyun Soo Park

Inspired by the theory of optimal control, we optimize the body states such that the simulated cloth motion is matched to the point cloud measurements, and the analytic gradient of the simulator is back-propagated to update the body states.

Friction

GPLA-12: An Acoustic Signal Dataset of Gas Pipeline Leakage

2 code implementations19 Jun 2021 Jie Li, Lizhong Yao

In this paper, we introduce a new acoustic leakage dataset of gas pipelines, called GPLA-12, which has 12 categories over 684 training/testing acoustic signals.

Fault Detection Time Series +1

Hierarchical Lovász Embeddings for Proposal-free Panoptic Segmentation

no code implementations8 Jun 2021 Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon

We use a hierarchical Lovász hinge loss to learn a low-dimensional embedding space structured into a unified semantic and instance hierarchy without requiring separate network branches or object proposals.

Instance Segmentation Panoptic Segmentation +1

Exploring Multi-dimensional Data via Subset Embedding

no code implementations24 Apr 2021 Peng Xie, Wenyuan Tao, Jie Li, Wentao Huang, Siming Chen

The core of the approach is a subset embedding network (SEN) that represents a group of subsets as uniformly-formatted embeddings.

Wireless Sensing With Deep Spectrogram Network and Primitive Based Autoregressive Hybrid Channel Model

no code implementations21 Apr 2021 Guoliang Li, Shuai Wang, Jie Li, Rui Wang, Xiaohui Peng, Tony Xiao Han

Although wireless channel models can be adopted for dataset generation, current channel models are mostly designed for communication rather than sensing.

Scene Understanding

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer

2 code implementations CVPR 2021 Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao

Inspired by the common painting process of drawing a draft and revising the details, we introduce a novel feed-forward method named Laplacian Pyramid Network (LapStyle).

Style Transfer

Geometric Unsupervised Domain Adaptation for Semantic Segmentation

no code implementations ICCV 2021 Vitor Guizilini, Jie Li, Rares Ambrus, Adrien Gaidon

Simulators can efficiently generate large amounts of labeled synthetic data with perfect supervision for hard-to-label tasks like semantic segmentation.

Depth Prediction Monocular Depth Estimation +3

Transitional Learning: Exploring the Transition States of Degradation for Blind Super-resolution

1 code implementation29 Mar 2021 Yuanfei Huang, Jie Li, Yanting Hu, Xinbo Gao, Hua Huang

Being extremely dependent on iterative estimation of the degradation prior or optimization of the model from scratch, the existing blind super-resolution (SR) methods are generally time-consuming and less effective, as the estimation of degradation proceeds from a blind initialization and lacks interpretable degradation priors.

Blind Super-Resolution Super-Resolution

Learning to Track with Object Permanence

1 code implementation ICCV 2021 Pavel Tokmakov, Jie Li, Wolfram Burgard, Adrien Gaidon

In this work, we introduce an end-to-end trainable approach for joint object detection and tracking that is capable of such reasoning.

Multi-Object Tracking Object +3

Approximate Optimal Filter for Linear Gaussian Time-invariant Systems

no code implementations9 Mar 2021 Kaiming Tang, Shengbo Eben Li, Yuming Yin, Yang Guan, Jingliang Duan, Wenhan Cao, Jie Li

The equivalence holds given certain conditions about initial state distributions and policy formats, in which the system state is the estimation error, control input is the filter gain, and control objective function is the accumulated estimation error.

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

2 code implementations23 Feb 2021 Yang Guan, Jingliang Duan, Shengbo Eben Li, Jie Li, Jianyu Chen, Bo Cheng

Formally, MPG is constructed as a weighted average of the data-driven and model-driven PGs, where the former is the derivative of the learned Q-value function, and the latter is that of the model-predictive return.

Decision Making Reinforcement Learning (RL)

DPointNet: A Density-Oriented PointNet for 3D Object Detection in Point Clouds

no code implementations7 Feb 2021 Jie Li, Yu Hu

In this paper, we put forward a novel density-oriented PointNet (DPointNet) for 3D object detection in point clouds, in which the density of points increases layer by layer.

3D Object Detection Object +1

Long time-series NDVI reconstruction in cloud-prone regions via spatio-temporal tensor completion

no code implementations4 Feb 2021 Dong Chu, Huanfeng Shen, Xiaobin Guan, Jing M. Chen, Xinghua Li, Jie Li, Liangpei Zhang

The applications of Normalized Difference Vegetation Index (NDVI) time-series data are inevitably hampered by cloud-induced gaps and noise.

Time Series Time Series Analysis

Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction

no code implementations28 Jan 2021 Jinjiang Guo, Jie Li, Dawei Leng, Lurong Pan

Multi-scale biomedical knowledge networks are expanding with emerging experimental technologies that generate multi-scale biomedical big data.

Link Prediction

Curvature-based Feature Selection with Application in Classifying Electronic Health Records

1 code implementation10 Jan 2021 Zheming Zuo, Jie Li, Han Xu, Noura Al Moubayed

Disruptive technologies provide unparalleled opportunities to contribute to the identification of many aspects in pervasive healthcare, from the adoption of the Internet of Things through to Machine Learning (ML) techniques.

Breast Cancer Detection Breast Tissue Identification +4

Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models

1 code implementation ICCV 2021 Jie Li, Rongrong Ji, Peixian Chen, Baochang Zhang, Xiaopeng Hong, Ruixin Zhang, Shaoxin Li, Jilin Li, Feiyue Huang, Yongjian Wu

A common practice is to start from a large perturbation and then iteratively reduce it with a deterministic direction and a random one while keeping it adversarial.

Dimensionality Reduction

Probabilistic 3D Multi-Modal, Multi-Object Tracking for Autonomous Driving

1 code implementation26 Dec 2020 Hsu-kuang Chiu, Jie Li, Rares Ambrus, Jeannette Bohg

Second, we propose to learn a metric that combines the Mahalanobis and feature distances when comparing a track and a new detection in data association.

Autonomous Driving Management +5

Two-Dimensional Multifunctional Materials from Endohedral Fullerenes

no code implementations23 Dec 2020 Jie Li, Ruqian Wu

A new multifunctional 2D material is theoretically predicted based on systematic ab-initio calculations and model simulations for the honeycomb lattice of endohedral fullerene W@C28 molecules.

Materials Science Computational Physics

Skeleton-based Approaches based on Machine Vision: A Survey

no code implementations23 Dec 2020 Jie Li, Binglin Li, Min Gao

Recently, skeleton-based approaches have achieved rapid progress on the basis of great success in skeleton representation.

object-detection Object Detection

Coherent mechanical noise cancellation and cooperativity competition in optomechanical arrays

no code implementations21 Dec 2020 Matthijs H. J. de Jong, Jie Li, Claus Gärtner, Richard A. Norte, Simon Gröblacher

Studying the interplay between multiple coupled mechanical resonators is a promising new direction in the field of optomechanics.

Optics Mesoscale and Nanoscale Physics

6 GHz hyperfast rotation of an optically levitated nanoparticle in vacuum

no code implementations17 Dec 2020 Yuanbin Jin, Jiangwei Yan, Shah Jee Rahman, Jie Li, Xudong Yu, Jing Zhang

We measure a highest rotation frequency of about 4.3 GHz for the trapped nanoparticle without feedback cooling and a 6 GHz rotation with feedback cooling, which is the fastest mechanical rotation reported to date.

Optics Mesoscale and Nanoscale Physics Quantum Physics

Anatomy of Multipath BGP Deployment in a Large ISP Network

1 code implementation14 Dec 2020 Jie Li, Vasileios Giotsas, Shi Zhou

Our work provides insights into the latest deployment of M-BGP in a major ISP network and it highlights the characteristics and effectiveness of M-BGP as a means to realize load sharing.

Networking and Internet Architecture

Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion

2 code implementations7 Dec 2020 Xu Yan, Jiantao Gao, Jie Li, Ruimao Zhang, Zhen Li, Rui Huang, Shuguang Cui

In practice, an initial semantic segmentation (SS) of a single sweep point cloud can be achieved by any appealing network and then flows into the semantic scene completion (SSC) module as the input.

3D Semantic Scene Completion from a single RGB image 3D Semantic Segmentation +3

Generative and Discriminative Learning for Distorted Image Restoration

no code implementations11 Nov 2020 Yi Gu, Yuting Gao, Jie Li, Chentao Wu, Weijia Jia

Due to the uncertainty in the distortion variation, restoring distorted images caused by the liquify filter is a challenging task.

Image Restoration

The Occurrence of Rocky Habitable Zone Planets Around Solar-Like Stars from Kepler Data

1 code implementation28 Oct 2020 Steve Bryson, Michelle Kunimoto, Ravi K. Kopparapu, Jeffrey L. Coughlin, William J. Borucki, David Koch, Victor Silva Aguirre, Christopher Allen, Geert Barentsen, Natalie. M. Batalha, Travis Berger, Alan Boss, Lars A. Buchhave, Christopher J. Burke, Douglas A. Caldwell, Jennifer R. Campbell, Joseph Catanzarite, Hema Chandrasekharan, William J. Chaplin, Jessie L. Christiansen, Jorgen Christensen-Dalsgaard, David R. Ciardi, Bruce D. Clarke, William D. Cochran, Jessie L. Dotson, Laurance R. Doyle, Eduardo Seperuelo Duarte, Edward W. Dunham, Andrea K. Dupree, Michael Endl, James L. Fanson, Eric B. Ford, Maura Fujieh, Thomas N. Gautier III, John C. Geary, Ronald L Gilliland, Forrest R. Girouard, Alan Gould, Michael R. Haas, Christopher E. Henze, Matthew J. Holman, Andrew Howard, Steve B. Howell, Daniel Huber, Roger C. Hunter, Jon M. Jenkins, Hans Kjeldsen, Jeffery Kolodziejczak, Kipp Larson, David W. Latham, Jie Li, Savita Mathur, Soren Meibom, Chris Middour, Robert L. Morris, Timothy D. Morton, Fergal Mullally, Susan E. Mullally, David Pletcher, Andrej Prsa, Samuel N. Quinn, Elisa V. Quintana, Darin Ragozzine, Solange V. Ramirez, Dwight T. Sanderfer, Dimitar Sasselov, Shawn E. Seader, Megan Shabram, Avi Shporer, Jeffrey C. Smith, Jason H. Steffen, Martin Still, Guillermo Torres, John Troeltzsch, Joseph D. Twicken, Akm Kamal Uddin, Jeffrey E. Van Cleve, Janice Voss, Lauren Weiss, William F. Welsh, Bill Wohler, Khadeejah A Zamudio

We present occurrence rates for rocky planets in the habitable zones (HZ) of main-sequence dwarf stars based on the Kepler DR25 planet candidate catalog and Gaia-based stellar properties.

Earth and Planetary Astrophysics Solar and Stellar Astrophysics

Interpretable Detail-Fidelity Attention Network for Single Image Super-Resolution

1 code implementation28 Sep 2020 Yuanfei Huang, Jie Li, Xinbo Gao, Yanting Hu, Wen Lu

To solve them, we propose a purposeful and interpretable detail-fidelity attention network that progressively processes these smooth regions and details in a divide-and-conquer manner, a novel and specific approach to image super-resolution aimed at improving detail fidelity, instead of blindly designing or employing deep CNN architectures merely for feature representation in local receptive fields.

Image Super-Resolution

A Framework of Randomized Selection Based Certified Defenses Against Data Poisoning Attacks

no code implementations18 Sep 2020 Ruoxin Chen, Jie Li, Chentao Wu, Bin Sheng, Ping Li

Random selection based defenses can achieve certified robustness by averaging the classifiers' predictions on the sub-datasets sampled from the training set.

Data Poisoning

Extending Label Smoothing Regularization with Self-Knowledge Distillation

no code implementations11 Sep 2020 Ji-Yue Wang, Pei Zhang, Wen-feng Pang, Jie Li

The experimental results confirm that the TC can help LsrKD and MrKD boost training, especially on the networks where they fail.

Self-Knowledge Distillation

A Density-Aware PointRCNN for 3D Object Detection in Point Clouds

no code implementations11 Sep 2020 Jie Li, Yu Hu

We present an improved version of PointRCNN for 3D object detection, in which a multi-branch backbone network is adopted to handle the non-uniform density of point clouds.

3D Object Detection object-detection

A Light-Weight Object Detection Framework with FPA Module for Optical Remote Sensing Imagery

no code implementations7 Sep 2020 Xi Gu, Lingbin Kong, Zhicheng Wang, Jie Li, Zhaohui Yu, Gang Wei

On the DOTA dataset, CenterFPANet achieves an mAP of 64.00% at 22.2 FPS, which is close to the accuracy of the anchor-based methods currently in use while being much faster.

Object object-detection +1

Align Deep Features for Oriented Object Detection

3 code implementations21 Aug 2020 Jiaming Han, Jian Ding, Jie Li, Gui-Song Xia

However, most existing methods rely on heuristically defined anchors with different scales, angles and aspect ratios, and usually suffer from severe misalignment between anchor boxes and axis-aligned convolutional features, which leads to a common inconsistency between the classification score and localization accuracy.

Ranked #22 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +2

PillarFlow: End-to-end Birds-eye-view Flow Estimation for Autonomous Driving

no code implementations3 Aug 2020 Kuan-Hui Lee, Matthew Kliemann, Adrien Gaidon, Jie Li, Chao Fang, Sudeep Pillai, Wolfram Burgard

In autonomous driving, accurately estimating the state of surrounding obstacles is critical for safe and robust path planning.

Autonomous Driving

Giant magnetic anisotropy energy and long coherence time of uranium substitution on defected Al2O3(0001)

no code implementations14 Jul 2020 Jie Li, Lei Gu, Ruqian Wu

Nanomagnets with giant magnetic anisotropy energy and long coherence time are desired for various technological innovations such as quantum information processing and storage.

Materials Science Computational Physics

Ternary Policy Iteration Algorithm for Nonlinear Robust Control

no code implementations14 Jul 2020 Jie Li, Shengbo Eben Li, Yang Guan, Jingliang Duan, Wenyu Li, Yuming Yin

The simulation results show that the TPI algorithm can converge to the optimal solution for the linear plant, and has high resistance to disturbances for the nonlinear plant.

Balanced Symmetric Cross Entropy for Large Scale Imbalanced and Noisy Data

no code implementations3 Jul 2020 Feifei Huang, Jie Li, Xuelin Zhu

Deep convolutional neural networks have attracted much attention in large-scale visual classification tasks and achieve significant performance improvements over traditional visual analysis methods.

Collaborative Boundary-aware Context Encoding Networks for Error Map Prediction

no code implementations25 Jun 2020 Zhenxi Zhang, Chunna Tian, Jie Li, Zhusi Zhong, Zhicheng Jiao, Xinbo Gao

Further, we propose a context encoding module to utilize the global predictor from the error map to enhance the feature representation and regularize the networks.

Image Segmentation Medical Image Segmentation +2

Multi-Margin based Decorrelation Learning for Heterogeneous Face Recognition

no code implementations25 May 2020 Bing Cao, Nannan Wang, Xinbo Gao, Jie Li, Zhifeng Li

Heterogeneous face recognition (HFR) refers to matching face images acquired from different domains with wide applications in security scenarios.

Face Recognition Heterogeneous Face Recognition +1

Projection & Probability-Driven Black-Box Attack

1 code implementation CVPR 2020 Jie Li, Rongrong Ji, Hong Liu, Jianzhuang Liu, Bineng Zhong, Cheng Deng, Qi Tian

For reducing the solution space, we first model the adversarial perturbation optimization problem as a process of recovering frequency-sparse perturbations with compressed sensing, under the setting that random noise in the low-frequency space is more likely to be adversarial.

Anisotropic Convolutional Networks for 3D Semantic Scene Completion

1 code implementation CVPR 2020 Jie Li, Kai Han, Peng Wang, Yu Liu, Xia Yuan

In contrast to the standard 3D convolution that is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wisely.

3D Semantic Scene Completion from a single RGB image

A Big Data Enabled Channel Model for 5G Wireless Communication Systems

no code implementations28 Feb 2020 Jie Huang, Cheng-Xiang Wang, Lu Bai, Jian Sun, Yang Yang, Jie Li, Olav Tirkkonen, Ming-Tuo Zhou

This paper investigates various applications of big data analytics, especially machine learning algorithms in wireless communications and channel modeling.

BIG-bench Machine Learning

Semantically-Guided Representation Learning for Self-Supervised Monocular Depth

1 code implementation ICLR 2020 Vitor Guizilini, Rui Hou, Jie Li, Rares Ambrus, Adrien Gaidon

Instead of using semantic labels and proxy losses in a multi-task approach, we propose a new architecture leveraging fixed pretrained semantic segmentation networks to guide self-supervised representation learning via pixel-adaptive convolutions.

Depth Prediction Monocular Depth Estimation +3

3D Gated Recurrent Fusion for Semantic Scene Completion

no code implementations17 Feb 2020 Yu Liu, Jie Li, Qingsen Yan, Xia Yuan, Chunxia Zhao, Ian Reid, Cesar Cadena

This paper tackles the problem of data fusion in the semantic scene completion (SSC) task, which can simultaneously deal with semantic labeling and scene completion.

3D Semantic Scene Completion Scene Understanding

Facial Attribute Capsules for Noise Face Super Resolution

no code implementations16 Feb 2020 Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Xinbo Gao, Zhifeng Li

In the SR processing, we first generated a group of FACs from the input LR face, and then reconstructed the HR face from this group of FACs.

Attribute Hallucination +1

Video Face Super-Resolution with Motion-Adaptive Feedback Cell

no code implementations15 Feb 2020 Jingwei Xin, Nannan Wang, Jie Li, Xinbo Gao, Zhifeng Li

Current state-of-the-art CNN methods usually treat the VSR problem as a large number of separate multi-frame super-resolution tasks, in which a batch of low resolution (LR) frames is used to generate a single high resolution (HR) frame, and a sliding window that selects LR frames over the entire video yields a series of HR frames.

Motion Compensation Motion Estimation +2

Image Fine-grained Inpainting

3 code implementations7 Feb 2020 Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao

Besides, we devise a geometrical alignment constraint item to compensate for the pixel-based distance between prediction features and ground-truth ones.

Facial Inpainting Fine-Grained Image Inpainting

Cloud Removal with Fusion of High Resolution Optical and SAR Images Using Generative Adversarial Networks

no code implementations MDPI Remote Sensing 2020 Jianhao Gao, Qiangqiang Yuan, Jie Li, Hai Zhang, Xin Su

The approach can be roughly divided into two steps: in the first step, a specially designed convolutional neural network (CNN) translates the synthetic aperture radar (SAR) images into simulated optical images in an object-to-object manner; in the second step, the simulated optical image, together with the SAR image and the optical image corrupted by clouds, is fused to reconstruct the corrupted area by a generative adversarial network (GAN) with a particular loss function.

Cloud Removal Earth Observation +2

Direct and indirect reinforcement learning

no code implementations23 Dec 2019 Yang Guan, Shengbo Eben Li, Jingliang Duan, Jie Li, Yangang Ren, Qi Sun, Bo Cheng

Reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision making and control tasks.

Decision Making reinforcement-learning +1

Real-Time Panoptic Segmentation from Dense Detections

no code implementations CVPR 2020 Rui Hou, Jie Li, Arjun Bhargava, Allan Raventos, Vitor Guizilini, Chao Fang, Jerome Lynch, Adrien Gaidon

Panoptic segmentation is a complex full scene parsing task requiring simultaneous instance and semantic segmentation at high resolution.

Clustering object-detection +4

HighEr-Resolution Network for Image Demosaicing and Enhancing

1 code implementation19 Nov 2019 Kangfu Mei, Juncheng Li, Jiajie Zhang, Hao-Yu Wu, Jie Li, Rui Huang

However, plenty of studies have shown that global information is crucial for image restoration tasks like image demosaicing and enhancing.

Demosaicking

Trident Segmentation CNN: A Spatiotemporal Transformation CNN for Punctate White Matter Lesions Segmentation in Preterm Neonates

1 code implementation22 Oct 2019 Yalong Liu, Jie Li, Miaomiao Wang, Zhicheng Jiao, Jian Yang, Xianjun Li

In this paper, a novel spatiotemporal transformation deep learning method called Trident Segmentation CNN (TS-CNN) is proposed to segment PWML in MR images.

Segmentation Specificity

Robust Semi-Supervised Monocular Depth Estimation with Reprojected Distances

no code implementations4 Oct 2019 Vitor Guizilini, Jie Li, Rares Ambrus, Sudeep Pillai, Adrien Gaidon

Dense depth estimation from a single image is a key problem in computer vision, with exciting applications in a multitude of robotic tasks.

Monocular Depth Estimation valid

Two Stream Networks for Self-Supervised Ego-Motion Estimation

no code implementations4 Oct 2019 Rares Ambrus, Vitor Guizilini, Jie Li, Sudeep Pillai, Adrien Gaidon

Learning depth and camera ego-motion from raw unlabeled RGB video streams is seeing exciting progress through self-supervision from strong geometric cues.

Data Augmentation Motion Estimation +2

ViLiVO: Virtual LiDAR-Visual Odometry for an Autonomous Vehicle with a Multi-Camera System

no code implementations30 Sep 2019 Zhenzhen Xiang, Jingrui Yu, Jie Li, Jianbo Su

As for the pose tracker, we propose a visual odometry system fusing both the feature matching and the virtual LiDAR scan matching results.

Monocular Visual Odometry Pose Estimation +1

Boosting Real-Time Driving Scene Parsing with Shared Semantics

1 code implementation16 Sep 2019 Zhenzhen Xiang, Anbo Bao, Jie Li, Jianbo Su

On the other hand, feature fusion modules are designed to combine different modalities of semantic features, leveraging the information from both inputs for better accuracy.

Autonomous Driving Scene Parsing +1

Relaxed Actor-Critic with Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems

no code implementations11 Sep 2019 Jingliang Duan, Jie Li, Qiang Ge, Shengbo Eben Li, Monimoy Bujarbaruah, Fei Ma, Dezhao Zhang

The warm-up phase minimizes the square of the Hamiltonian to achieve admissibility, while the generalized policy iteration phase relaxes the update termination conditions for faster convergence.