Search Results for author: Jie Li

Found 243 papers, 73 papers with code

Multi-modal Anchor Gated Transformer with Knowledge Distillation for Emotion Recognition in Conversation

1 code implementation23 Jun 2025 Jie Li, Shifei Ding, Lili Guo, Xuan Li

Furthermore, we introduce a multi-modal anchor gated transformer to effectively integrate utterance-level representations across modalities.

A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation

no code implementations20 Jun 2025 Penglong Zhai, Yifang Yuan, Fanyi Di, Jie Li, Yue Liu, Chen Li, Jie Huang, Sicong Wang, Yao Xu, Xin Li

Specifically, different from existing reconstruction-based strategies, SimCIT propose to use a learnable residual quantization module to align with the signals from different modalities of the items, which combines multi-modal knowledge alignment and semantic tokenization in a mutually beneficial contrastive learning framework.

Probing the Robustness of Large Language Models Safety to Latent Perturbations

no code implementations19 Jun 2025 Tianle Gu, Kexin Huang, Zongqi Wang, Yixu Wang, Jie Li, Yuanqi Yao, Yang Yao, Yujiu Yang, Yan Teng, Yingchun Wang

Consequently, small shifts in hidden activations can re-trigger harmful behaviors embedded in the latent space.

FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution

no code implementations13 Jun 2025 Zhaoyang Wang, Jie Li, Wen Lu, Lihuo He, Maoguo Gong, Xinbo Gao

State-of-the-art (SOTA) compressed video super-resolution (CVSR) models face persistent challenges, including prolonged inference time, complex training pipelines, and reliance on auxiliary information.

Dimensionality Reduction Video Super-Resolution

EyeSim-VQA: A Free-Energy-Guided Eye Simulation Framework for Video Quality Assessment

no code implementations13 Jun 2025 Zhaoyang Wang, Wen Lu, Jie Li, Lihuo He, Maoguo Gong, Xinbo Gao

Free-energy-guided self-repair mechanisms have shown promising results in image quality assessment (IQA), but remain under-explored in video quality assessment (VQA), where temporal dynamics and model constraints pose unique challenges.

Image Quality Assessment Video Quality Assessment +1

FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training

no code implementations10 Jun 2025 Fuhan Cai, Yong Guo, Jie Li, Wenbo Li, Xiangzhong Fang, Jian Chen

Recent advancements in text-to-image (T2I) generation have led to the emergence of highly expressive models such as diffusion transformers (DiTs), exemplified by FLUX.

Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?

no code implementations3 Jun 2025 Yang Yao, Lingyu Li, Jiaxin Song, Chiyu Chen, Zhenqi He, Yixu Wang, Xin Wang, Tianle Gu, Jie Li, Yan Teng, Yingchun Wang

As Multimodal Large Language Models (MLLMs) continue to evolve, their cognitive and reasoning capabilities have seen remarkable progress.

Causal Inference

JailBound: Jailbreaking Internal Safety Boundaries of Vision-Language Models

no code implementations26 May 2025 Jiaxin Song, Yixu Wang, Jie Li, Rui Yu, Yan Teng, Xingjun Ma, Yingchun Wang

Vision-Language Models (VLMs) exhibit impressive performance, yet the integration of powerful vision encoders has significantly broadened their attack surface, rendering them increasingly susceptible to jailbreak attacks.

Towards Generalized Proactive Defense against Face Swapping with Contour-Hybrid Watermark

no code implementations25 May 2025 Ruiyang Xia, Dawei Zhou, Decheng Liu, Lin Yuan, Jie Li, Nannan Wang, Xinbo Gao

Face swapping, recognized as a privacy and security concern, has prompted considerable defensive research.

Face Swapping

Dim and Small Target Detection for Drone Broadcast Frames Based on Time-Frequency Analysis

no code implementations14 May 2025 Jie Li, Jing Li, Zhanyu Ju, Fengkui Gong, Lu Lv

As the sampling duration increases, the detection speed improves while the detection accuracy of broadcast frames termed as small targets decreases.

Pure Component Property Estimation Framework Using Explainable Machine Learning Methods

no code implementations14 May 2025 Jianfeng Jiao, Xi Gao, Jie Li

In this work, an enhanced framework for pure component property prediction by using explainable machine learning methods is proposed.

molecular representation Property Prediction

Reinforcement Learning (RL) Meets Urban Climate Modeling: Investigating the Efficacy and Impacts of RL-Based HVAC Control

no code implementations11 May 2025 Junjie Yu, John S. Schreck, David John Gagne, Keith W. Oleson, Jie Li, Yongtu Liang, Qi Liao, Mingfei Sun, David O. Topping, Zhonghua Zheng

This study proposes an integrated framework combining RL with an urban climate model that incorporates a building energy model, aiming to evaluate the efficacy of RL-based HVAC control across different background climates, impacts of RL strategies on indoor climate and local urban climate, and the transferability of RL strategies across cities.

Reinforcement Learning (RL)

ALFEE: Adaptive Large Foundation Model for EEG Representation

no code implementations7 May 2025 Wei Xiong, Junming Lin, Jiangtong Li, Jie Li, Changjun Jiang

A channel encoder adaptively compresses variable channel information, a temporal encoder captures task-guided evolution, and a hybrid decoder reconstructs signals in both temporal and frequency domains.

EEG

Global Stress Generation and Spatiotemporal Super-Resolution Physics-Informed Operator under Dynamic Loading for Two-Phase Random Materials

no code implementations26 Apr 2025 Tengfei Xing, Xiaodan Ren, Jie Li

In this study, we propose a framework for global stress generation and spatiotemporal super-resolution in TRMs under dynamic loading.

STS Super-Resolution

AlphaZero-Edu: Making AlphaZero Accessible to Everyone

1 code implementation20 Apr 2025 Binjie Guo, Hanyu Zheng, Guowei Su, Ru Zhang, Haohan Jiang, Xurong Lin, Hongyan Wei, Aisheng Mo, Jie Li, Zhiyuan Qian, Zhuhao Zhang, Xiaoyuan Cheng

Recent years have witnessed significant progress in reinforcement learning, especially with Zero-like paradigms, which have greatly boosted the generalization and reasoning abilities of large-scale language models.

GOAT-TTS: Expressive and Realistic Speech Generation via A Dual-Branch LLM

no code implementations15 Apr 2025 Yaodong Song, Hongjie Chen, Jie Lian, Yuxin Zhang, Guangmin Xia, Zehan Li, Genliang Zhao, Jian Kang, Jie Li, Yongxiang Li, Xuelong Li

While large language models (LLMs) have revolutionized text-to-speech (TTS) synthesis through discrete tokenization paradigms, current architectures exhibit fundamental tensions between three critical dimensions: 1) irreversible loss of acoustic characteristics caused by quantization of speech prompts; 2) stringent dependence on precisely aligned prompt speech-text pairs that limit real-world deployment; and 3) catastrophic forgetting of the LLM's native text comprehension during optimization for speech token generation.

Quantization Reading Comprehension +2

Bridging the Gap between Continuous and Informative Discrete Representations by Random Product Quantization

no code implementations7 Apr 2025 Xueqing Li, Zehan Li, Boyu Zhu, Ruihao Jing, Jian Kang, Jie Li, Xiao-Lei Zhang, Xuelong Li

Its quantization error is lower-bounded by the product of rho and epsilon-kms, where epsilon-kms denotes the quantization error of a single K-means quantizer.

Quantization Self-Supervised Learning

WonderTurbo: Generating Interactive 3D World in 0.72 Seconds

no code implementations3 Apr 2025 Chaojun Ni, XiaoFeng Wang, Zheng Zhu, Weijie Wang, Haoyun Li, Guosheng Zhao, Jie Li, Wenkang Qin, Guan Huang, Wenjun Mei

Interactive 3D generation is gaining momentum and capturing extensive attention for its potential to create immersive virtual experiences.

3D Generation Depth Completion +1

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation

no code implementations CVPR 2025 Kun Liu, Qi Liu, Xinchen Liu, Jie Li, Yongdong Zhang, Jiebo Luo, Xiaodong He, Wu Liu

However, human-object interaction (HOI) often cannot be precisely generated by current T2V models due to the lack of large-scale videos with accurate captions for HOI.

Hallucination Human-Object Interaction Detection +2

Drone Remote Identification Based on Zadoff-Chu Sequences and Time-Frequency Images

no code implementations19 Mar 2025 Jie Li, Jing Li, Lu Lv, Peixin Zhang, Fengkui Gong

Cross-correlation is performed between locally generated ZC sequences and drone signals to derive ZC sequence-based features.

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

no code implementations4 Mar 2025 Yuhao Yang, Zhi Ji, Zhaopeng Li, Yi Li, Zhonglin Mo, Yue Ding, Kai Chen, Zijian Zhang, Jie Li, Shuanglong Li, Lin Liu

To address this, we introduce the Cascaded Organized Bi-Represented generAtive retrieval (COBRA) framework, which innovatively integrates sparse semantic IDs and dense vectors through a cascading process.

Quantization Recommendation Systems +1

Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning

no code implementations26 Feb 2025 Hongyi Cal, Jie Li, Wenzhen Dong

The effectiveness of instruction fine-tuning for Large Language Models is fundamentally constrained by the quality and efficiency of training datasets.

Diversity

A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method

no code implementations13 Dec 2024 Jing Sun, Qiangqiang Yuan, Huanfeng Shen, Jie Li, Liangpei Zhang

In this paper, we propose a novel two-step image super resolution method concatenating multi-frame super-resolution (MFSR) with single-frame super-resolution (SFSR), to progressively upsample images to the desired resolution.

Multi-Frame Super-Resolution

Performance-Driven QUBO for Recommender Systems on Quantum Annealers

no code implementations20 Oct 2024 Jiayang Niu, Jie Li, Ke Deng, Mark Sanderson, Yongli Ren

We propose Counterfactual Analysis Quadratic Unconstrained Binary Optimization (CAQUBO) to solve QUBO problems for feature selection in recommender systems.

counterfactual feature selection +1

MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction

no code implementations26 Sep 2024 Jingyu Song, Xudong Chen, Liupei Lu, Jie Li, Katherine A. Skinner

We propose MemFusionMap, a novel temporal fusion model with enhanced temporal reasoning capabilities for online HD map construction.

Autonomous Driving Online Vectorized HD Map Construction

Infrared and Visible Image Fusion with Hierarchical Human Perception

no code implementations14 Sep 2024 Guang Yang, Jie Li, Xin Liu, Zhusi Zhong, Xinbo Gao

Existing methods take pixel intensity, texture and high-level vision task information as the standards to determine preservation of information, lacking enhancement for human perception.

Infrared And Visible Image Fusion Language Modeling +1

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

1 code implementation13 Sep 2024 Xinxu Ge, Xin Liu, Zitong Yu, Jingang Shi, Chun Qi, Jie Li, Heikki Kälviäinen

Based on our analysis, we propose DiffFAS framework, which quantifies quality as prior information input into the network to counter image quality shift, and performs diffusion-based high-fidelity cross-domain and cross-attack types generation to counter image style shift.

Face Anti-Spoofing Face Recognition

SDformer: Efficient End-to-End Transformer for Depth Completion

1 code implementation12 Sep 2024 Jian Qian, Miao Sun, Ashley Lee, Jie Li, Shenglong Zhuo, Patrick Yin Chiang

The network consists of an input module for the depth map and RGB image features extraction and concatenation, a U-shaped encoder-decoder Transformer for extracting deep features, and a refinement module.

Decoder Depth Completion

AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration

no code implementations10 Sep 2024 Hongyi Cai, Mohammad Mahdinur Rahman, Mohammad Shahid Akhtar, Jie Li, Jingyu Wu, Zhili Fang

Thus, we introduce AgileIR, group shifted attention mechanism along with window attention, which sparsely simplifies the model in architecture.

Image Restoration Quantization

Spectrum Prediction With Deep 3D Pyramid Vision Transformer Learning

1 code implementation13 Aug 2024 Guangliang Pan, Qihui Wu, Bo Zhou, Jie Li, Wei Wang, Guoru Ding, David K. Y. Yau

Based on the Deep- SPred, we first propose a novel 3D spectrum prediction method combining a flow processing strategy with 3D vision Transformer (ViT, i. e., Swin) and a pyramid to serve possible applications such as spectrum monitoring task, named 3D-SwinSTB.

Transfer Learning

Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation

1 code implementation4 Jul 2024 Laiyan Ding, Hualie Jiang, Jie Li, Yongquan Chen, Rui Huang

This paper proposes an efficient and consistent pose estimation design and two loss functions to enhance cross-view consistency for SSSDE.

Autonomous Driving Depth Estimation +1

CRUISE on Quantum Computing for Feature Selection in Recommender Systems

no code implementations3 Jul 2024 Jiayang Niu, Jie Li, Ke Deng, Yongli Ren

In this paper, we use Quantum Annealers to address the feature selection problem in recommendation algorithms.

counterfactual feature selection +1

Self and Cross-Model Distillation for LLMs: Effective Methods for Refusal Pattern Alignment

no code implementations17 Jun 2024 Jie Li, Yi Liu, Chongyang Liu, Xiaoning Ren, Ling Shi, Weisong Sun, Yinxing Xue

Our results show that these methods significantly improve refusal rates and reduce unsafe content, with cross-model distilling achieving refusal rates close to Claude3's 94. 51%.

Text Generation

An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition

no code implementations2 Jun 2024 Haojun Xu, Yan Gao, Jie Li, Xinbo Gao

Significant action recognition performance is achieved when evaluated on the challenging NTU RGB+D, NTU RGB+D 120, and PKU-MMD benchmarks and validate that multi-granularity semantic features facilitate the differentiation of action clusters with similar visual features.

Action Recognition Ensemble Learning +3

Multi-Channel Multi-Step Spectrum Prediction Using Transformer and Stacked Bi-LSTM

no code implementations29 May 2024 Guangliang Pan, Jie Li, Minglei Li

The advantage of this fusion mode is that it can deeply capture the long-term dependence of multichannel spectrum data.

Decoder Prediction

MindSemantix: Deciphering Brain Visual Experiences with a Brain-Language Model

no code implementations29 May 2024 Ziqi Ren, Jie Li, Xuetong Xue, Xin Li, Fan Yang, Zhicheng Jiao, Xinbo Gao

MindSemantix generates high-quality captions that are deeply rooted in the visual and semantic information derived from brain activity.

Brain Decoding Language Modeling +2

Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

1 code implementation23 May 2024 Zhusi Zhong, Jie Li, John Sollee, Scott Collins, Harrison Bai, Paul Zhang, Terrence Healey, Michael Atalay, Xinbo Gao, Zhicheng Jiao

In response to the worldwide COVID-19 pandemic, advanced automated technologies have emerged as valuable tools to aid healthcare professionals in managing an increased workload by improving radiology report generation and prognostic analysis.

Image to text Sentence +1

Region-specific Risk Quantification for Interpretable Prognosis of COVID-19

1 code implementation5 May 2024 Zhusi Zhong, Jie Li, Zhuoqi Ma, Scott Collins, Harrison Bai, Paul Zhang, Terrance Healey, Xinbo Gao, Michael K. Atalay, Zhicheng Jiao

The COVID-19 pandemic has strained global public health, necessitating accurate diagnosis and intervention to control disease spread and reduce mortality rates.

COVID-19 Diagnosis Decision Making +4

Towards Balanced RGB-TSDF Fusion for Consistent Semantic Scene Completion by 3D RGB Feature Completion and a Classwise Entropy Loss Function

no code implementations25 Mar 2024 Laiyan Ding, Panwen Hu, Jie Li, Rui Huang

To address this RGB-TSDF distribution difference, we propose a two-stage network with a 3D RGB feature completion module that completes RGB features with meaningful values for occluded areas.

TBI Image/Text (TBI-IT): Comprehensive Text and Image Datasets for Traumatic Brain Injury Research

no code implementations14 Mar 2024 Jie Li, Jiaying Wen, Tongxin Yang, Fenglin Cai, Miao Wei, Zhiwei Zhang, Li Jiang

In this paper, we introduce a new dataset in the medical field of Traumatic Brain Injury (TBI), called TBI-IT, which includes both electronic medical records (EMRs) and head CT images.

Image Segmentation named-entity-recognition +2

LocalGCL: Local-aware Contrastive Learning for Graphs

no code implementations27 Feb 2024 Haojun Jiang, Jiawei Sun, Jie Li, Chentao Wu

Graph representation learning (GRL) makes considerable progress recently, which encodes graphs with topological structures into low-dimensional embeddings.

Contrastive Learning Graph Representation Learning +1

Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment

1 code implementation22 Feb 2024 Zhaoyang Wang, Bo Hu, Mingyang Zhang, Jie Li, Leida Li, Maoguo Gong, Xinbo Gao

Firstly, we devise a new diffusion restoration network that leverages the produced enhanced image and noise-containing images, incorporating nonlinear features obtained during the denoising process of the diffusion model, as high-level visual information.

Denoising NR-IQA

Advancing GenAI Assisted Programming--A Comparative Study on Prompt Efficiency and Code Quality Between GPT-4 and GLM-4

no code implementations20 Feb 2024 Angus Yang, Zehan Li, Jie Li

Our GenAI Coding Workshop highlights the effectiveness and accessibility of the prompting methodology developed in this study.

Code Generation

Contrasting Adversarial Perturbations: The Space of Harmless Perturbations

no code implementations3 Feb 2024 Lu Chen, Shaofeng Li, Benhao Huang, Fan Yang, Zheng Li, Jie Li, Yuan Luo

However, in this work, we reveal the existence of a harmless perturbation space, in which perturbations drawn from this space, regardless of their magnitudes, leave the network output unchanged when applied to inputs.

Privacy Preserving

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models

no code implementations30 Jan 2024 Jie Li, Yi Liu, Chongyang Liu, Ling Shi, Xiaoning Ren, Yaowen Zheng, Yang Liu, Yinxing Xue

To address this research gap, we conducted an extensive empirical study on Multilingual Jailbreak attacks.

Text Generation

Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human

no code implementations5 Jan 2024 Song Bai, Jie Li

Since the year 2023 an abundant amount of research papers has emerged in the domain of 3D generation.

3D Generation 3DGS +2

DeLR: Active Learning for Detection with Decoupled Localization and Recognition Query

no code implementations28 Dec 2023 Yuhang Zhang, Yuang Deng, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Qi Tian

In DeLR, the query is based on region-level, and we only annotate the object region that is queried; 2) Instead of directly providing both localization and recognition annotations, we separately query the two components, and thus reduce the recognition budget with the pseudo class labels provided by the model.

Active Learning Object +2

A Dual Domain Multi-exposure Image Fusion Network based on the Spatial-Frequency Integration

1 code implementation17 Dec 2023 Guang Yang, Jie Li, Xinbo Gao

Specifically, we introduce a Spatial-Frequency Fusion Block to facilitate efficient interaction between dual domains and capture complementary information from input images with different exposures.

Multi-Exposure Image Fusion

Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking

no code implementations14 Dec 2023 Yan Gao, Haojun Xu, Nannan Wang, Jie Li, Xinbo Gao

In addition to the previous method of treating objects as nodes, the network innovatively treats object trajectories as nodes for information interaction, improving the graph neural network's feature representation capability.

Multi-Object Tracking Multiple Object Tracking +1

A Multi-scale Information Integration Framework for Infrared and Visible Image Fusion

1 code implementation7 Dec 2023 Guang Yang, Jie Li, Hanxiao Lei, Xinbo Gao

In this study, we propose a multi-scale dual attention (MDA) framework for infrared and visible image fusion, which is designed to measure and integrate complementary information in both structure and loss function at the image and patch level.

Infrared And Visible Image Fusion

EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model

no code implementations5 Dec 2023 Guozhang Li, Xinpeng Ding, De Cheng, Jie Li, Nannan Wang, Xinbo Gao

To further clarify the noise of expanded boundaries, we combine mutual learning with a tailored proposal-level contrastive objective to use a learnable approach to harmonize a balance between incomplete yet clean (initial) and comprehensive yet noisy (expanded) boundaries for more precise ones.

Boundary Detection Language Modeling +4

Joint Design of ISAC Waveform under PAPR Constraints

no code implementations20 Nov 2023 Yating Chen, Cai Wen, Yan Huang, Le Liang, Jie Li, HUI ZHANG, Wei Hong

In this paper, we formulate the precoding problem of integrated sensing and communication (ISAC) waveform as a non-convex quadratically constrainted quadratic program (QCQP), in which the weighted sum of communication multi-user interference (MUI) and the gap between dual-use waveform and ideal radar waveform is minimized with peak-to-average power ratio (PAPR) constraints.

Integrated sensing and communication ISAC

Audio-visual Saliency for Omnidirectional Videos

no code implementations9 Nov 2023 Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai

Visual saliency prediction for omnidirectional videos (ODVs) has shown great significance and necessity for omnidirectional videos to help ODV coding, ODV transmission, ODV rendering, etc..

Prediction Saliency Prediction

Robust and Communication-Efficient Federated Domain Adaptation via Random Features

1 code implementation8 Nov 2023 Zhanbo Feng, Yuanjie Wang, Jie Li, Fan Yang, Jiong Lou, Tiebin Mi, Robert. C. Qiu, Zhenyu Liao

As a result, there is a growing trend to leverage federated learning (FL) techniques to train large ML models in a distributed and collaborative manner.

Domain Adaptation Federated Learning

Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

1 code implementation7 Nov 2023 Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

We propose a novel framework to integrate SD maps into online map prediction and propose a Transformer-based encoder, SD Map Encoder Representations from transFormers, to leverage priors in SD maps for the lane-topology prediction task.

Autonomous Driving Lane Detection +1

Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback

no code implementations29 Oct 2023 Jingliang Duan, Jie Li, Xuyang Chen, Kai Zhao, Shengbo Eben Li, Lin Zhao

Despite the absence of convexity, we leverage these properties to derive novel findings regarding convergence (and nearly dimension-free rate) to stationary points for three policy gradient methods, including the vanilla policy gradient method, the natural policy gradient method, and the Gauss-Newton method.

Policy Gradient Methods

Bridging the Gap between Newton-Raphson Method and Regularized Policy Iteration

no code implementations11 Oct 2023 Zeyang Li, Chuxiong Hu, Yunan Wang, Guojian Zhan, Jie Li, Shengbo Eben Li

We also show that a modified version of regularized policy iteration, i. e., with finite-step policy evaluation, is equivalent to inexact Newton method where the Newton iteration formula is solved with truncated iterations.

User Experience Design Professionals' Perceptions of Generative Artificial Intelligence

no code implementations26 Sep 2023 Jie Li, Hancheng Cao, Laura Lin, Youyang Hou, Ruihao Zhu, Abdallah El Ali

They emphasized the unique human factors of "enjoyment" and "agency", where humans remain the arbiters of "AI alignment".

Learning Optimal Robust Control of Connected Vehicles in Mixed Traffic Flow

no code implementations18 Sep 2023 Jie Li, Jiawei Wang, Shengbo Eben Li, Keqiang Li

Connected and automated vehicles (CAVs) technologies promise to attenuate undesired traffic disturbances.

A boundary-aware point clustering approach in Euclidean and embedding spaces for roof plane segmentation

1 code implementation7 Sep 2023 Li Li, Qingqing Li, Guozheng Xu, Pengwei Zhou, Jingmin Tu, Jie Li, Mingming Li, Jian Yao

To solve this problem, we propose a boundary-aware point clustering approach in Euclidean and embedding spaces constructed by a multi-task deep network for roof plane segmentation.

Segmentation

HODN: Disentangling Human-Object Feature for HOI Detection

no code implementations20 Aug 2023 Shuman Fang, Zhiwen Lin, Ke Yan, Jie Li, Xianming Lin, Rongrong Ji

However, these methods ignore the relationship among humans, objects, and interactions: 1) human features are more contributive than object ones to interaction prediction; 2) interactive information disturbs the detection of objects but helps human detection.

Decoder Human Detection +4

ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023

no code implementations16 Aug 2023 Mengjie Du, Xiang Fang, Jie Li

This technical report describes ChinaTelecom system for Track 1 (closed) of the VoxCeleb2023 Speaker Recognition Challenge (VoxSRC 2023).

Speaker Recognition

Improving Human-Object Interaction Detection via Virtual Image Learning

no code implementations4 Aug 2023 Shuman Fang, Shuai Liu, Jie Li, Guannan Jiang, Xianming Lin, Rongrong Ji

Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects, which plays a curtail role in high-level semantic understanding tasks.

Human-Object Interaction Detection Object

UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming

no code implementations CVPR 2025 Hao Lin, Ke wu, Jie Li, Jun Li, Wu-Jun Li

To the best of our knowledge, UniAP is the first parallel method that can jointly optimize the two categories of parallel strategies to find an optimal solution.

A novel integrated method of detection-grasping for specific object based on the box coordinate matching

no code implementations20 Jul 2023 Zongmin Liu, Jirui Wang, Jie Li, Zufeng Li, Kai Ren, Peng Shi

Furthermore, a detection-grasping integrated algorithm based on box coordinate matching (DG-BCM) is proposed to obtain the fusion model of object detection and grasp estimation.

Instance Segmentation Object +3

MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection

no code implementations6 Jul 2023 Ruiyang Xia, Decheng Liu, Jie Li, Lin Yuan, Nannan Wang, Xinbo Gao

Advanced manipulation techniques have provided criminals with opportunities to make social panic or gain illicit profits through the generation of deceptive media, such as forged face images.

DeepFake Detection Face Swapping

Quantum-Enhanced Diamond Molecular Tension Microscopy for Quantifying Cellular Forces

no code implementations28 Jun 2023 Feng Xu, Shuxiang Zhang, Linjie Ma, Yong Hou, Jie Li, Andrej Denisenko, Zifu Li, Joachim Spatz, Jörg Wrachtrup, Qiang Wei, Zhiqin Chu

The constant interplay and information exchange between cells and their micro-environment are essential to their survival and ability to execute biological functions.

Temporal Gradient Inversion Attacks with Robust Optimization

no code implementations13 Jun 2023 Bowen Li, Hanlin Gu, Ruoxin Chen, Jie Li, Chentao Wu, Na Ruan, Xueming Si, Lixin Fan

We investigate a Temporal Gradient Inversion Attack with a Robust Optimization framework, called TGIAs-RO, which recovers private data without any prior knowledge by leveraging multiple temporal gradients.

Federated Learning Privacy Preserving

Language Knowledge-Assisted Representation Learning for Skeleton-Based Action Recognition

1 code implementation21 May 2023 Haojun Xu, Yan Gao, Zheng Hui, Jie Li, Xinbo Gao

Also, humans have brain regions dedicated to understanding the minds of others and analyzing their intentions, such as the medial prefrontal cortex of the temporal lobe.

Ranked #3 on Skeleton Based Action Recognition on NTU RGB+D 120 (using extra training data)

Action Recognition GPR +2

Tracking through Containers and Occluders in the Wild

1 code implementation CVPR 2023 Basile Van Hoorick, Pavel Tokmakov, Simon Stent, Jie Li, Carl Vondrick

Tracking objects with persistence in cluttered and dynamic environments remains a difficult challenge for computer vision systems.

Visual Tracking

Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint

1 code implementation25 Apr 2023 Guozhang Li, De Cheng, Xinpeng Ding, Nannan Wang, Jie Li, Xinbo Gao

The proposed Bi-SCC firstly adopts a temporal context augmentation to generate an augmented video that breaks the correlation between positive actions and their co-scene actions in the inter-video; Then, a semantic consistency constraint (SCC) is used to enforce the predictions of the original video and augmented video to be consistent, hence suppressing the co-scene actions.

BS-GAT Behavior Similarity Based Graph Attention Network for Network Intrusion Detection

no code implementations7 Apr 2023 Yalu Wang, Zhijie Han, Jie Li, Xin He

To address the above issue, this paper proposes a graph neural network algorithm based on behavior similarity (BS-GAT) using graph attention network.

Graph Attention graph construction +2

What's in a Name? Beyond Class Indices for Image Recognition

no code implementations5 Apr 2023 Kai Han, Xiaohu Huang, Yandong Li, Sagar Vaze, Jie Li, Xuhui Jia

In this paper, we reconsider the recognition problem and task a vision-language model with assigning class names to images given only a large (essentially unconstrained) vocabulary of categories as prior information.

Clustering Language Modelling +1

MRCN: A Novel Modality Restitution and Compensation Network for Visible-Infrared Person Re-identification

no code implementations26 Mar 2023 Yukang Zhang, Yan Yan, Jie Li, Hanzi Wang

Furthermore, to better disentangle the modality-relevant features and the modality-irrelevant features, we propose a novel Center-Quadruplet Causal (CQC) loss to encourage the network to effectively learn the modality-relevant features and the modality-irrelevant features.

Person Re-Identification

Viewpoint Equivariance for Multi-View 3D Object Detection

1 code implementation CVPR 2023 Dian Chen, Jie Li, Vitor Guizilini, Rares Ambrus, Adrien Gaidon

We design view-conditioned queries at the output level, which enables the generation of multiple virtual frames during training to learn viewpoint equivariance by enforcing multi-view consistency.

3D Object Detection Object +2

Adaptive incentive for cross-silo federated learning: A multi-agent reinforcement learning approach

no code implementations15 Feb 2023 Shijing Yuan, Hongze Liu, Hongtao Lv, Zhanbo Feng, Jie Li, Hongyang Chen, Chentao Wu

To overcome these limitations, we propose a novel adaptive mechanism for cross-silo FL, towards incentivizing organizations to contribute data to maximize their long-term payoffs in a real dynamic training environment.

Federated Learning Multi-agent Reinforcement Learning

Predicting Molecule-Target Interaction by Learning Biomedical Network and Molecule Representations

no code implementations2 Feb 2023 Jinjiang Guo, Jie Li

Most existing methodologies utilize either biomedical network information or molecule structural features to predict potential interaction link.

Drug Discovery Graph Neural Network

Multi-scale multi-modal micro-expression recognition algorithm based on transformer

no code implementations8 Jan 2023 Fengping Wang, Jie Li, Chun Qi, Lin Wang, Pan Wang

A micro-expression is a spontaneous unconscious facial muscle movement that can reveal the true emotions people attempt to hide.

Contrastive Learning Micro Expression Recognition +2

A Novel Improved Mask RCNN for Multiple Targets Detection in the Indoor Complex Scenes

no code implementations7 Jan 2023 Zongmin Liu, Jirui Wang, Jie Li, Pengda Liu, Kai Ren

However, indoor scenes are usually complex and there are many types of interference factors, leading to great challenges in the multiple targets detection.

Breaking the "Object" in Video Object Segmentation

no code implementations CVPR 2023 Pavel Tokmakov, Jie Li, Adrien Gaidon

Yet, this important phenomenon is largely absent from existing video object segmentation (VOS) benchmarks.

Object Semantic Segmentation +2

ShaSTA: Modeling Shape and Spatio-Temporal Affinities for 3D Multi-Object Tracking

no code implementations8 Nov 2022 Tara Sadjadpour, Jie Li, Rares Ambrus, Jeannette Bohg

To address these issues in a unified framework, we propose to learn shape and spatio-temporal affinities between tracks and detections in consecutive frames.

3D Multi-Object Tracking Autonomous Vehicles +1

Automatic Change-Point Detection in Time Series via Deep Learning

1 code implementation7 Nov 2022 Jie Li, Paul Fearnhead, Piotr Fryzlewicz, Tengyao Wang

We show how to automatically generate new offline detection methods based on training a neural network.

Change Point Detection Deep Learning +2

Reconstruction of compressed spectral imaging based on global structure and spectral correlation

no code implementations27 Oct 2022 Pan Wang, Jie Li, Jieru Chen, Lin Wang, Chun Qi

To take full exploration of the constraints between spectra, the coefficients corresponding to the convolution kernel are constrained by the L_(2, 1)norm to improve spectral accuracy.

SSIM

Depth Is All You Need for Monocular 3D Detection

no code implementations5 Oct 2022 Dennis Park, Jie Li, Dian Chen, Vitor Guizilini, Adrien Gaidon

Our methods leverage commonly available LiDAR or RGB videos during training time to fine-tune the depth representation, which leads to improved 3D detectors.

All Depth Prediction +2

Seen to Unseen: When Fuzzy Inference System Predicts IoT Device Positioning Labels That Had Not Appeared in Training Phase

no code implementations21 Sep 2022 Han Xu, Zheming Zuo, Jie Li, Victor Chang

Situating at the core of Artificial Intelligence (AI), Machine Learning (ML), and more specifically, Deep Learning (DL) have embraced great success in the past two decades.

feature selection

Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

no code implementations17 Sep 2022 Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang

Experimental results show that the proposed model achieves competitive performance with 1/3 of the parameters of the encoder, compared with the full-parameter model.

Knowledge Distillation Mixture-of-Experts +2

Seeking Subjectivity in Visual Emotion Distribution Learning

no code implementations25 Jul 2022 Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao

In psychology, the \textit{Object-Appraisal-Emotion} model has demonstrated that each individual's emotion is affected by his/her subjective appraisal, which is further formed by the affective memory.

Emotion Recognition

TransFA: Transformer-based Representation for Face Attribute Evaluation

1 code implementation12 Jul 2022 Decheng Liu, Weijie He, Chunlei Peng, Nannan Wang, Jie Li, Xinbo Gao

The multiple branches transformer is employed to explore the inter-correlation between different attributes in similar semantic regions for attribute feature learning.

Attribute Multi-Label Classification +2

SpOT: Spatiotemporal Modeling for 3D Object Tracking

no code implementations12 Jul 2022 Colton Stearns, Davis Rempe, Jie Li, Rares Ambrus, Sergey Zakharov, Vitor Guizilini, Yanchao Yang, Leonidas J Guibas

In this work, we develop a holistic representation of traffic scenes that leverages both spatial and temporal information of the actors in the scene.

3D Multi-Object Tracking 3D Object Tracking +1

Digital-twin-enhanced metal tube bending forming real-time prediction method based on Multi-source-input MTL

1 code implementation3 Jul 2022 Chang Sun, Zili Wang, Shuyou Zhang, Taotao Zhou, Jie Li, Jianrong Tan

To address this issue, a digital-twin-enhanced (DT-enhanced) metal tube bending forming real-time prediction method based on multi-source-input multi-task learning (MTL) is proposed.

Multi-Task Learning Prediction

Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?

1 code implementation16 Jun 2022 Adam W. Harley, Zhaoyuan Fang, Jie Li, Rares Ambrus, Katerina Fragkiadaki

Building 3D perception systems for autonomous vehicles that do not rely on high-density LiDAR is a critical research problem because of the expense of LiDAR systems compared to cameras and other sensors.

Autonomous Vehicles Bird's-Eye View Semantic Segmentation +1

An Indoor Environment Sensing and Localization System via mmWave Phased Array

no code implementations7 Jun 2022 Yifei Sun, Jie Li, Tong Zhang, Rui Wang, Xiaohui Peng, Tony Xiao Han, Haisheng Tan

At the end, we show that the reconstructed room layout can be utilized to locate a mobile device according to its AoA spectrum, even with single access point.

FairGAN: GANs-based Fairness-aware Learning for Recommendations with Implicit Feedback

1 code implementation Proceedings of the ACM Web Conference 2022 Jie Li, Yongli Ren, Ke Deng

To fill this gap, we propose a Generative Adversarial Networks (GANs) based learning algorithm FairGAN mapping the exposure fairness issue to the problem of negative preferences in implicit feedback data.

Exposure Fairness Recommendation Systems

Object Permanence Emerges in a Random Walk along Memory

1 code implementation4 Apr 2022 Pavel Tokmakov, Allan Jabri, Jie Li, Adrien Gaidon

This paper proposes a self-supervised objective for learning representations that localize objects under occlusion - a property known as object permanence.

Object

Passive Motion Detection via mmWave Communication System

no code implementations28 Mar 2022 Jie Li, Chao Yu, Yan Luo, Yifei Sun, Rui Wang

Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively.

Hand Gesture Recognition Hand-Gesture Recognition +1

On Understanding and Mitigating the Dimensional Collapse of Graph Contrastive Learning: a Non-Maximum Removal Approach

no code implementations24 Mar 2022 Jiawei Sun, Ruoxin Chen, Jie Li, Chentao Wu, Yue Ding, Junchi Yan

Graph Contrastive Learning (GCL) has shown promising performance in graph representation learning (GRL) without the supervision of manual annotations.

Contrastive Learning Graph Classification +1

Spherical Convolution empowered FoV Prediction in 360-degree Video Multicast with Limited FoV Feedback

1 code implementation29 Jan 2022 Jie Li, Ling Han, Cong Zhang, Qiyue Li, Zhi Liu

Most of the current prediction methods combining saliency detection and FoV information neither take into account that the distortion of projected 360-degree videos can invalidate the weight sharing of traditional convolutional networks, nor do they adequately consider the difficulty of obtaining complete multi-user FoV information, which degrades the prediction performance.

Prediction Saliency Detection +2

One-Bit Active Query With Contrastive Pairs

no code implementations CVPR 2022 Yuhang Zhang, Xiaopeng Zhang, Lingxi Xie, Jie Li, Robert C. Qiu, Hengtong Hu, Qi Tian

The Yes query is treated as positive pairs of the queried category for contrastive pulling, while the No query is treated as hard negative pairs for contrastive repelling.

Active Learning Contrastive Learning

Learning to Learn Transferable Attack

1 code implementation10 Dec 2021 Shuman Fang, Jie Li, Xianming Lin, Rongrong Ji

By treating the attack of both specific data and a modified model as a task, we expect the adversarial perturbations to adopt enough tasks for generalization.

Adversarial Attack Data Augmentation +1

Fully Attentional Network for Semantic Segmentation

1 code implementation8 Dec 2021 Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang

Recent non-local self-attention methods have proven to be effective in capturing long-range dependencies for semantic segmentation.

Computational Efficiency Segmentation +1

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

1 code implementation4 Dec 2021 Feng Xu, Chuang Zhu, Wenqi Tang, Ying Wang, Yu Zhang, Jie Li, Hongchuan Jiang, Zhongyue Shi, Jun Liu, Mulan Jin

Conclusion: Our study provides a novel DL-based biomarker on primary tumor CNB slides to predict the metastatic status of ALN preoperatively for patients with EBC.

Multiple Instance Learning Specificity +1

Image-specific Convolutional Kernel Modulation for Single Image Super-resolution

1 code implementation16 Nov 2021 Yuanfei Huang, Jie Li, Yanting Hu, Xinbo Gao, Hua Huang

Recently, deep-learning-based super-resolution methods have achieved excellent performances, but mainly focus on training a single generalized deep network by feeding numerous samples.

Image Super-Resolution

Denoised Non-Local Neural Network for Semantic Segmentation

no code implementations27 Oct 2021 Qi Song, Jie Li, Hao Guo, Rui Huang

Without any external training data, our proposed Denoised NL can achieve the state-of-the-art performance of 83. 5\% and 46. 69\% mIoU on Cityscapes and ADE20K, respectively.

Semantic Segmentation

Bone Marrow Cell Recognition: Training Deep Object Detection with A New Loss Function

no code implementations25 Oct 2021 Dehao Huang, Jintao Cheng, Rui Fan, Zhihao Su, Qiongxiong Ma, Jie Li

Therefore, it is crucial to study a robust bone marrow cell detection algorithm for a quantitative automatic analysis system.

Cell Detection object-detection +1

FedIPR: Ownership Verification for Federated Deep Neural Network Models

1 code implementation27 Sep 2021 Bowen Li, Lixin Fan, Hanlin Gu, Jie Li, Qiang Yang

To address these risks, the ownership verification of federated learning models is a prerequisite that protects federated learning model intellectual property rights (IPR) i. e., FedIPR.

Federated Learning

Geometry-Based Stochastic Line-of-Sight Probability Model for A2G Channels under Urban Scenarios

no code implementations6 Sep 2021 Qiuming Zhu, Fei Bai, Minghui Pang, Jie Li, Weizhi Zhong, Xiaomin Chen, Kai Mao

Line-of-sight (LoS) path is essential for the reliability of air-to-ground (A2G) communications, but the existence of LoS path is difficult to predict due to random obstacles on the ground.

Stimuli-Aware Visual Emotion Analysis

no code implementations4 Sep 2021 Jingyuan Yang, Jie Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao

Then, we design three specific networks, i. e., Global-Net, Semantic-Net and Expression-Net, to extract distinct emotional features from different stimuli simultaneously.

Emotion Recognition

An Integrated Framework for the Heterogeneous Spatio-Spectral-Temporal Fusion of Remote Sensing Images

no code implementations1 Sep 2021 Menghui Jiang, Huanfeng Shen, Jie Li, Liangpei Zhang

Images from many remote sensing satellites, including MODIS, Landsat-8, Sentinel-1, and Sentinel-2, are utilized in the experiments.

LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision

no code implementations ICCV 2021 Zhijian Liu, Simon Stent, Jie Li, John Gideon, Song Han

Computer vision tasks such as object detection and semantic/instance segmentation rely on the painstaking annotation of large training datasets.

image-classification Image Classification +4

Coupling Model-Driven and Data-Driven Methods for Remote Sensing Image Restoration and Fusion

no code implementations13 Aug 2021 Huanfeng Shen, Menghui Jiang, Jie Li, Chenxia Zhou, Qiangqiang Yuan, Liangpei Zhang

In this paper, we systematically investigate the coupling of model-driven and data-driven methods, which has rarely been considered in the remote sensing image restoration and fusion communities.

Image Restoration

Is Pseudo-Lidar needed for Monocular 3D Object detection?

2 code implementations ICCV 2021 Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon

Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors.

 Ranked #1 on Monocular 3D Object Detection on KITTI Pedestrian Moderate (using extra training data)

Monocular 3D Object Detection Monocular Depth Estimation +2

A Dynamic 3D Spontaneous Micro-expression Database: Establishment and Evaluation

no code implementations31 Jul 2021 Fengping Wang, Jie Li, Siqi Zhang, Chun Qi, Yun Zhang, Danmin Miao

Micro-expressions are spontaneous, unconscious facial movements that show people's true inner emotions and have great potential in related fields of psychological testing.

Real-time Keypoints Detection for Autonomous Recovery of the Unmanned Ground Vehicle

no code implementations27 Jul 2021 Jie Li, Sheng Zhang, Kai Han, Xia Yuan, Chunxia Zhao, Yu Liu

UGV-KPNet is computationally efficient with a small number of parameters and provides pixel-level accurate keypoints detection results in real-time.

Keypoint Detection

Wideband photonic blind source separation with optical pulse sampling

no code implementations21 Jul 2021 Taichu Shi, Yang Qi, Weipeng Zhang, Paul R. Prucnal, Jie Li, Ben Wu

The ultra-fast optical pulse functions as a tweezer that collects samples of the signals at very low sampling rates, and each sample is short enough to maintain the statistical properties of the signals.

blind source separation

Integrated Sensing and Communication from Learning Perspective: An SDP3 Approach

no code implementations20 Jul 2021 Guoliang Li, Shuai Wang, Jie Li, Rui Wang, Fan Liu, Xiaohui Peng, Tony Xiao Han, Chengzhong Xu

Characterizing the sensing and communication performance tradeoff in integrated sensing and communication (ISAC) systems is challenging in the applications of learning-based human motion recognition.

Integrated sensing and communication ISAC

Fully Polarimetric SAR and Single-Polarization SAR Image Fusion Network

no code implementations18 Jul 2021 Liupeng Lin, Jie Li, Huanfeng Shen, Lingli Zhao, Qiangqiang Yuan, Xinghua Li

The data fusion technology aims to aggregate the characteristics of different data and obtain products with multiple data advantages.

IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement

no code implementations29 Jun 2021 Jie Li, Laiyan Ding, Rui Huang

3D semantic scene completion and 2D semantic segmentation are two tightly correlated tasks that are both essential for indoor scene understanding, because they predict the same semantic classes, using positively correlated high-level features.

2D Semantic Segmentation 3D Semantic Scene Completion +3

A Circular-Structured Representation for Visual Emotion Distribution Learning

no code implementations CVPR 2021 Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Xinbo Gao

Visual Emotion Analysis (VEA) has attracted increasing attention recently with the prevalence of sharing images on social networks.

Emotion Recognition

Learning the Non-Differentiable Optimization for Blind Super-Resolution

no code implementations CVPR 2021 Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao

Instead of considering iterative strategy, we make the blur kernel predictor trainable in the whole blind SR model, in which AMNet is well-trained.

Blind Super-Resolution Deep Reinforcement Learning +1

Inverse Simulation: Reconstructing Dynamic Geometry of Clothed Humans via Optimal Control

no code implementations CVPR 2021 Jingfan Guo, Jie Li, Rahul Narain, Hyun Soo Park

Inspired by the theory of optimal control, we optimize the body states such that the simulated cloth motion is matched to the point cloud measurements, and the analytic gradient of the simulator is back-propagated to update the body states.

Friction

Hierarchical Lovasz Embeddings for Proposal-Free Panoptic Segmentation

no code implementations CVPR 2021 Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon

We use a hierarchical Lovasz hinge loss to learn a low-dimensional embedding space structured into a unified semantic and instance hierarchy without requiring separate network branches or object proposals.

Instance Segmentation Panoptic Segmentation +1

GPLA-12: An Acoustic Signal Dataset of Gas Pipeline Leakage

2 code implementations19 Jun 2021 Jie Li, Lizhong Yao

In this paper, we introduce a new acoustic leakage dataset of gas pipelines, called as GPLA-12, which has 12 categories over 684 training/testing acoustic signals.

Fault Detection Fault Diagnosis +2

Hierarchical Lovász Embeddings for Proposal-free Panoptic Segmentation

no code implementations8 Jun 2021 Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon

We use a hierarchical Lov\'asz hinge loss to learn a low-dimensional embedding space structured into a unified semantic and instance hierarchy without requiring separate network branches or object proposals.

Instance Segmentation Panoptic Segmentation +1

Exploring Multi-dimensional Data via Subset Embedding

no code implementations24 Apr 2021 Peng Xie, Wenyuan Tao, Jie Li, Wentao Huang, Siming Chen

The core of the approach is a subset embedding network (SEN) that represents a group of subsets as uniformly-formatted embeddings.

Wireless Sensing With Deep Spectrogram Network and Primitive Based Autoregressive Hybrid Channel Model

no code implementations21 Apr 2021 Guoliang Li, Shuai Wang, Jie Li, Rui Wang, Xiaohui Peng, Tony Xiao Han

Although wireless channel models can be adopted for dataset generation, current channel models are mostly designed for communication rather than sensing.

Dataset Generation Scene Understanding

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer

2 code implementations CVPR 2021 Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao

Inspired by the common painting process of drawing a draft and revising the details, we introduce a novel feed-forward method named Laplacian Pyramid Network (LapStyle).

Style Transfer

Geometric Unsupervised Domain Adaptation for Semantic Segmentation

no code implementations ICCV 2021 Vitor Guizilini, Jie Li, Rares Ambrus, Adrien Gaidon

Simulators can efficiently generate large amounts of labeled synthetic data with perfect supervision for hard-to-label tasks like semantic segmentation.

Depth Prediction Monocular Depth Estimation +3

Transitional Learning: Exploring the Transition States of Degradation for Blind Super-resolution

1 code implementation29 Mar 2021 Yuanfei Huang, Jie Li, Yanting Hu, Xinbo Gao, Hua Huang

Being extremely dependent on iterative estimation of the degradation prior or optimization of the model from scratch, the existing blind super-resolution (SR) methods are generally time-consuming and less effective, as the estimation of degradation proceeds from a blind initialization and lacks interpretable degradation priors.

Blind Super-Resolution Super-Resolution

Learning to Track with Object Permanence

1 code implementation ICCV 2021 Pavel Tokmakov, Jie Li, Wolfram Burgard, Adrien Gaidon

In this work, we introduce an end-to-end trainable approach for joint object detection and tracking that is capable of such reasoning.

Multi-Object Tracking Object +3

Approximate Optimal Filter for Linear Gaussian Time-invariant Systems

no code implementations9 Mar 2021 Kaiming Tang, Shengbo Eben Li, Yuming Yin, Yang Guan, Jingliang Duan, Wenhan Cao, Jie Li

The equivalence holds given certain conditions about initial state distributions and policy formats, in which the system state is the estimation error, control input is the filter gain, and control objective function is the accumulated estimation error.

State Estimation

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

2 code implementations23 Feb 2021 Yang Guan, Jingliang Duan, Shengbo Eben Li, Jie Li, Jianyu Chen, Bo Cheng

Formally, MPG is constructed as a weighted average of the data-driven and model-driven PGs, where the former is the derivative of the learned Q-value function, and the latter is that of the model-predictive return.

Decision Making Reinforcement Learning (RL) +1

DPointNet: A Density-Oriented PointNet for 3D Object Detection in Point Clouds

no code implementations7 Feb 2021 Jie Li, Yu Hu

In this paper, we put forward a novel density-oriented PointNet (DPointNet) for 3D object detection in point clouds, in which the density of points increases layer by layer.

3D Object Detection Object +1

Long time-series NDVI reconstruction in cloud-prone regions via spatio-temporal tensor completion

no code implementations4 Feb 2021 Dong Chu, Huanfeng Shen, Xiaobin Guan, Jing M. Chen, Xinghua Li, Jie Li, Liangpei Zhang

The applications of Normalized Difference Vegetation Index (NDVI) time-series data are inevitably hampered by cloud-induced gaps and noise.

Time Series Time Series Analysis

Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction

no code implementations28 Jan 2021 Jinjiang Guo, Jie Li, Dawei Leng, Lurong Pan

Multi-scale biomedical knowledge networks are expanding with emerging experimental technologies that generates multi-scale biomedical big data.

Deep Learning Link Prediction

Curvature-based Feature Selection with Application in Classifying Electronic Health Records

1 code implementation10 Jan 2021 Zheming Zuo, Jie Li, Han Xu, Noura Al Moubayed

Disruptive technologies provides unparalleled opportunities to contribute to the identifications of many aspects in pervasive healthcare, from the adoption of the Internet of Things through to Machine Learning (ML) techniques.

Breast Cancer Detection Breast Tissue Identification +4

Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models

1 code implementation ICCV 2021 Jie Li, Rongrong Ji, Peixian Chen, Baochang Zhang, Xiaopeng Hong, Ruixin Zhang, Shaoxin Li, Jilin Li, Feiyue Huang, Yongjian Wu

A common practice is to start from a large perturbation and then iteratively reduce it with a deterministic direction and a random one while keeping it adversarial.

Dimensionality Reduction

Probabilistic 3D Multi-Modal, Multi-Object Tracking for Autonomous Driving

1 code implementation26 Dec 2020 Hsu-kuang Chiu, Jie Li, Rares Ambrus, Jeannette Bohg

Second, we propose to learn a metric that combines the Mahalanobis and feature distances when comparing a track and a new detection in data association.

3D Pedestrian Tracking Management +5

Skeleton-based Approaches based on Machine Vision: A Survey

no code implementations23 Dec 2020 Jie Li, Binglin Li, Min Gao

Recently, skeleton-based approaches have achieved rapid progress on the basis of great success in skeleton representation.

object-detection Object Detection +1

Two-Dimensional Multifunctional Materials from Endohedral Fullerenes

no code implementations23 Dec 2020 Jie Li, Ruqian Wu

A new multifunctional 2D material is theoretically predicted based on systematic ab-initio calculations and model simulations for the honeycomb lattice of endohedral fullerene W@C28 molecules.

Materials Science Computational Physics

Coherent mechanical noise cancellation and cooperativity competition in optomechanical arrays

no code implementations21 Dec 2020 Matthijs H. J. de Jong, Jie Li, Claus Gärtner, Richard A. Norte, Simon Gröblacher

Studying the interplay between multiple coupled mechanical resonators is a promising new direction in the field of optomechanics.

Optics Mesoscale and Nanoscale Physics

6 GHz hyperfast rotation of an optically levitated nanoparticle in vacuum

no code implementations17 Dec 2020 Yuanbin Jin, Jiangwei Yan, Shah Jee Rahman, Jie Li, Xudong Yu, Jing Zhang

We measure a highest rotation frequency about 4. 3 GHz of the trapped nanoparticle without feedback cooling and a 6 GHz rotation with feedback cooling, which is the fastest mechanical rotation ever reported to date.

Optics Mesoscale and Nanoscale Physics Quantum Physics

Anatomy of Multipath BGP Deployment in a Large ISP Network

1 code implementation14 Dec 2020 Jie Li, Vasileios Giotsas, Shi Zhou

Our work provides insights into the latest deployment of M-BGP in a major ISP network and it highlights the characteristics and effectiveness of M-BGP as a means to realize load sharing.

Networking and Internet Architecture

Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion

2 code implementations7 Dec 2020 Xu Yan, Jiantao Gao, Jie Li, Ruimao Zhang, Zhen Li, Rui Huang, Shuguang Cui

In practice, an initial semantic segmentation (SS) of a single sweep point cloud can be achieved by any appealing network and then flows into the semantic scene completion (SSC) module as the input.

3D Semantic Scene Completion from a single RGB image 3D Semantic Segmentation +3

Generative and Discriminative Learning for Distorted Image Restoration

no code implementations11 Nov 2020 Yi Gu, Yuting Gao, Jie Li, Chentao Wu, Weijia Jia

Due to the uncertainty in the distortion variation, restoring distorted images caused by liquify filter is a challenging task.

Image Restoration

The Occurrence of Rocky Habitable Zone Planets Around Solar-Like Stars from Kepler Data

1 code implementation28 Oct 2020 Steve Bryson, Michelle Kunimoto, Ravi K. Kopparapu, Jeffrey L. Coughlin, William J. Borucki, David Koch, Victor Silva Aguirre, Christopher Allen, Geert Barentsen, Natalie. M. Batalha, Travis Berger, Alan Boss, Lars A. Buchhave, Christopher J. Burke, Douglas A. Caldwell, Jennifer R. Campbell, Joseph Catanzarite, Hema Chandrasekharan, William J. Chaplin, Jessie L. Christiansen, Jorgen Christensen-Dalsgaard, David R. Ciardi, Bruce D. Clarke, William D. Cochran, Jessie L. Dotson, Laurance R. Doyle, Eduardo Seperuelo Duarte, Edward W. Dunham, Andrea K. Dupree, Michael Endl, James L. Fanson, Eric B. Ford, Maura Fujieh, Thomas N. Gautier III, John C. Geary, Ronald L Gilliland, Forrest R. Girouard, Alan Gould, Michael R. Haas, Christopher E. Henze, Matthew J. Holman, Andrew Howard, Steve B. Howell, Daniel Huber, Roger C. Hunter, Jon M. Jenkins, Hans Kjeldsen, Jeffery Kolodziejczak, Kipp Larson, David W. Latham, Jie Li, Savita Mathur, Soren Meibom, Chris Middour, Robert L. Morris, Timothy D. Morton, Fergal Mullally, Susan E. Mullally, David Pletcher, Andrej Prsa, Samuel N. Quinn, Elisa V. Quintana, Darin Ragozzine, Solange V. Ramirez, Dwight T. Sanderfer, Dimitar Sasselov, Shawn E. Seader, Megan Shabram, Avi Shporer, Jeffrey C. Smith, Jason H. Steffen, Martin Still, Guillermo Torres, John Troeltzsch, Joseph D. Twicken, Akm Kamal Uddin, Jeffrey E. Van Cleve, Janice Voss, Lauren Weiss, William F. Welsh, Bill Wohler, Khadeejah A Zamudio

We present occurrence rates for rocky planets in the habitable zones (HZ) of main-sequence dwarf stars based on the Kepler DR25 planet candidate catalog and Gaia-based stellar properties.

Earth and Planetary Astrophysics Solar and Stellar Astrophysics

Interpretable Detail-Fidelity Attention Network for Single Image Super-Resolution

1 code implementation28 Sep 2020 Yuanfei Huang, Jie Li, Xinbo Gao, Yanting Hu, Wen Lu

To solve them, we propose a purposeful and interpretable detail-fidelity attention network to progressively process these smoothes and details in divide-and-conquer manner, which is a novel and specific prospect of image super-resolution for the purpose on improving the detail fidelity, instead of blindly designing or employing the deep CNNs architectures for merely feature representation in local receptive fields.

Image Super-Resolution

A Framework of Randomized Selection Based Certified Defenses Against Data Poisoning Attacks

no code implementations18 Sep 2020 Ruoxin Chen, Jie Li, Chentao Wu, Bin Sheng, Ping Li

Random selection based defenses can achieve certified robustness by averaging the classifiers' predictions on the sub-datasets sampled from the training set.

Data Poisoning

A Density-Aware PointRCNN for 3D Object Detection in Point Clouds

no code implementations11 Sep 2020 Jie Li, Yu Hu

We present an improved version of PointRCNN for 3D object detection, in which a multi-branch backbone network is adopted to handle the non-uniform density of point clouds.

3D Object Detection object-detection

Extending Label Smoothing Regularization with Self-Knowledge Distillation

no code implementations11 Sep 2020 Ji-Yue Wang, Pei Zhang, Wen-feng Pang, Jie Li

The experiment results confirm that the TC can help LsrKD and MrKD to boost training, especially on the networks they are failed.

Self-Knowledge Distillation

A Light-Weight Object Detection Framework with FPA Module for Optical Remote Sensing Imagery

no code implementations7 Sep 2020 Xi Gu, Lingbin Kong, Zhicheng Wang, Jie Li, Zhaohui Yu, Gang Wei

On the DOTA dataset, CenterFPANet mAP is 64. 00%, and FPS is 22. 2, which is close to the accuracy of the anchor-based methods currently used and much faster than them.

Object object-detection +1

Align Deep Features for Oriented Object Detection

3 code implementations21 Aug 2020 Jiaming Han, Jian Ding, Jie Li, Gui-Song Xia

However most of existing methods rely on heuristically defined anchors with different scales, angles and aspect ratios and usually suffer from severe misalignment between anchor boxes and axis-aligned convolutional features, which leads to the common inconsistency between the classification score and localization accuracy.

Ranked #26 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +2

PillarFlow: End-to-end Birds-eye-view Flow Estimation for Autonomous Driving

no code implementations3 Aug 2020 Kuan-Hui Lee, Matthew Kliemann, Adrien Gaidon, Jie Li, Chao Fang, Sudeep Pillai, Wolfram Burgard

In autonomous driving, accurately estimating the state of surrounding obstacles is critical for safe and robust path planning.

Autonomous Driving

Giant magnetic anisotropy energy and long coherence time of uranium substitution on defected Al2O3(0001)

no code implementations14 Jul 2020 Jie Li, Lei Gu, Ruqian Wu

Nanomagnets with giant magnetic anisotropy energy and long coherence time are desired for various technological innovations such as quantum information procession and storage.

Materials Science Computational Physics

Ternary Policy Iteration Algorithm for Nonlinear Robust Control

no code implementations14 Jul 2020 Jie Li, Shengbo Eben Li, Yang Guan, Jingliang Duan, Wenyu Li, Yuming Yin

The simulation results show that the TPI algorithm can converge to the optimal solution for the linear plant, and has high resistance to disturbances for the nonlinear plant.

Balanced Symmetric Cross Entropy for Large Scale Imbalanced and Noisy Data

no code implementations3 Jul 2020 Feifei Huang, Jie Li, Xuelin Zhu

Deep convolution neural network has attracted many attentions in large-scale visual classification task, and achieves significant performance improvement compared to traditional visual analysis methods.

Weakly Supervised Temporal Action Localization with Segment-Level Labels

no code implementations3 Jul 2020 Xinpeng Ding, Nannan Wang, Xinbo Gao, Jie Li, Xiaoyu Wang, Tongliang Liu

Specifically, we devise a partial segment loss regarded as a loss sampling to learn integral action parts from labeled segments.

Collaborative Boundary-aware Context Encoding Networks for Error Map Prediction

no code implementations25 Jun 2020 Zhenxi Zhang, Chunna Tian, Jie Li, Zhusi Zhong, Zhicheng Jiao, Xinbo Gao

Further, we propose a context encoding module to utilize the global predictor from the error map to enhance the feature representation and regularize the networks.

Image Segmentation Medical Image Segmentation +2

Multi-Margin based Decorrelation Learning for Heterogeneous Face Recognition

no code implementations25 May 2020 Bing Cao, Nannan Wang, Xinbo Gao, Jie Li, Zhifeng Li

Heterogeneous face recognition (HFR) refers to matching face images acquired from different domains with wide applications in security scenarios.

Face Recognition Heterogeneous Face Recognition +1

Projection & Probability-Driven Black-Box Attack

1 code implementation CVPR 2020 Jie Li, Rongrong Ji, Hong Liu, Jianzhuang Liu, Bineng Zhong, Cheng Deng, Qi Tian

For reducing the solution space, we first model the adversarial perturbation optimization problem as a process of recovering frequency-sparse perturbations with compressed sensing, under the setting that random noise in the low-frequency space is more likely to be adversarial.

compressed sensing

Anisotropic Convolutional Networks for 3D Semantic Scene Completion

1 code implementation CVPR 2020 Jie Li, Kai Han, Peng Wang, Yu Liu, Xia Yuan

In contrast to the standard 3D convolution that is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wisely.

3D Semantic Scene Completion from a single RGB image

A Big Data Enabled Channel Model for 5G Wireless Communication Systems

no code implementations28 Feb 2020 Jie Huang, Cheng-Xiang Wang, Lu Bai, Jian Sun, Yang Yang, Jie Li, Olav Tirkkonen, Ming-Tuo Zhou

This paper investigates various applications of big data analytics, especially machine learning algorithms in wireless communications and channel modeling.

BIG-bench Machine Learning

Semantically-Guided Representation Learning for Self-Supervised Monocular Depth

1 code implementation ICLR 2020 Vitor Guizilini, Rui Hou, Jie Li, Rares Ambrus, Adrien Gaidon

Instead of using semantic labels and proxy losses in a multi-task approach, we propose a new architecture leveraging fixed pretrained semantic segmentation networks to guide self-supervised representation learning via pixel-adaptive convolutions.

Depth Prediction Monocular Depth Estimation +3

3D Gated Recurrent Fusion for Semantic Scene Completion

no code implementations17 Feb 2020 Yu Liu, Jie Li, Qingsen Yan, Xia Yuan, Chunxia Zhao, Ian Reid, Cesar Cadena

This paper tackles the problem of data fusion in the semantic scene completion (SSC) task, which can simultaneously deal with semantic labeling and scene completion.

3D Semantic Scene Completion Scene Understanding

Facial Attribute Capsules for Noise Face Super Resolution

no code implementations16 Feb 2020 Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Xinbo Gao, Zhifeng Li

In the SR processing, we first generated a group of FACs from the input LR face, and then reconstructed the HR face from this group of FACs.

Attribute Hallucination +1

Video Face Super-Resolution with Motion-Adaptive Feedback Cell

no code implementations15 Feb 2020 Jingwei Xin, Nannan Wang, Jie Li, Xinbo Gao, Zhifeng Li

Current state-of-the-art CNN methods usually treat the VSR problem as a large number of separate multi-frame super-resolution tasks, at which a batch of low resolution (LR) frames is utilized to generate a single high resolution (HR) frame, and running a slide window to select LR frames over the entire video would obtain a series of HR frames.

Motion Compensation Motion Estimation +2

Image Fine-grained Inpainting

3 code implementations7 Feb 2020 Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao

Besides, we devise a geometrical alignment constraint item to compensate for the pixel-based distance between prediction features and ground-truth ones.

Facial Inpainting Fine-Grained Image Inpainting

Cloud Removal with Fusion of High Resolution Optical and SAR Images Using Generative Adversarial Networks

no code implementations MDPI Remote Sensing 2020 Jianhao Gao, Qiangqiang Yuan, Jie Li, Hai Zhang, Xin Su

The approach can be roughly divided into two steps: in the first step, a specially designed convolutional neural network (CNN) translates the synthetic aperture radar (SAR) images into simulated optical images in an object-to-object manner; in the second step, the simulated optical image, together with the SAR image and the optical image corrupted by clouds, is fused to reconstruct the corrupted area by a generative adversarial network (GAN) with a particular loss function.

Cloud Removal Earth Observation +2

Direct and indirect reinforcement learning

no code implementations23 Dec 2019 Yang Guan, Shengbo Eben Li, Jingliang Duan, Jie Li, Yangang Ren, Qi Sun, Bo Cheng

Reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision making and control tasks.

Decision Making reinforcement-learning +3

Real-Time Panoptic Segmentation from Dense Detections

no code implementations CVPR 2020 Rui Hou, Jie Li, Arjun Bhargava, Allan Raventos, Vitor Guizilini, Chao Fang, Jerome Lynch, Adrien Gaidon

Panoptic segmentation is a complex full scene parsing task requiring simultaneous instance and semantic segmentation at high resolution.

Clustering object-detection +4

HighEr-Resolution Network for Image Demosaicing and Enhancing

1 code implementation19 Nov 2019 Kangfu Mei, Juncheng Li, Jiajie Zhang, Hao-Yu Wu, Jie Li, Rui Huang

However, plenty of studies have shown that global information is crucial for image restoration tasks like image demosaicing and enhancing.

Demosaicking

Trident Segmentation CNN: A Spatiotemporal Transformation CNN for Punctate White Matter Lesions Segmentation in Preterm Neonates

1 code implementation22 Oct 2019 Yalong Liu, Jie Li, Miaomiao Wang, Zhicheng Jiao, Jian Yang, Xianjun Li

In this paper, a novel spatiotemporal transformation deep learning method called Trident Segmentation CNN (TS-CNN) is proposed to segment PWML in MR images.

Segmentation Specificity

Robust Semi-Supervised Monocular Depth Estimation with Reprojected Distances

no code implementations4 Oct 2019 Vitor Guizilini, Jie Li, Rares Ambrus, Sudeep Pillai, Adrien Gaidon

Dense depth estimation from a single image is a key problem in computer vision, with exciting applications in a multitude of robotic tasks.

Monocular Depth Estimation valid

Cannot find the paper you are looking for? You can Submit a new open access paper.