Search Results for author: Kai Li

Found 176 papers, 77 papers with code

Instance-hiding Schemes for Private Distributed Learning

no code implementations ICML 2020 Yangsibo Huang, Zhao Song, Sanjeev Arora, Kai Li

The new ideas in the current paper are: (a) new variants of mixup with negative as well as positive coefficients, and an extension of sample-wise mixup to be pixel-wise.

Federated Learning

EAGLE: An Efficient Global Attention Lesion Segmentation Model for Hepatic Echinococcosis

no code implementations 25 Jun 2025 Jiayan Chen, Kai Li, Yulu Zhao, Jianqiang Huang, Zhan Wang

Hepatic echinococcosis (HE) is a widespread parasitic disease in underdeveloped pastoral areas with limited medical resources.

Image Segmentation Lesion Segmentation +3

Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization

no code implementations 11 Jun 2025 Shengda Gu, Kai Li, Junliang Xing, Yifan Zhang, Jian Cheng

Combinatorial optimization problems are notoriously challenging due to their discrete structure and exponentially large solution space.

Combinatorial Optimization Deep Reinforcement Learning +2

Segment Concealed Objects with Incomplete Supervision

no code implementations 10 Jun 2025 Chunming He, Kai Li, Yachao Zhang, Ziyun Yang, Youwei Pang, Longxiang Tang, Chengyu Fang, Yulun Zhang, Linghe Kong, Xiu Li, Sina Farsiu

To mitigate the effect of low-quality segmentation masks, we introduce a series of strategies for pseudo-label generation, storage, and supervision.

Pseudo Label Segmentation +1

DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience

no code implementations 4 Jun 2025 Runxiang Wang, Boxiao Wang, Kai Li, Yifan Zhang, Jian Cheng

Symbolic regression is a fundamental tool for discovering interpretable mathematical expressions from data, with broad applications across scientific and engineering domains.

Efficient Exploration Equation Discovery +1

Zero-Trust Foundation Models: A New Paradigm for Secure and Collaborative Artificial Intelligence for Internet of Things

no code implementations 26 May 2025 Kai Li, Conggai Li, Xin Yuan, Shenghong Li, Sai Zou, Syed Sohail Ahmed, Wei Ni, Dusit Niyato, Abbas Jamalipour, Falko Dressler, Ozgur B. Akan

This paper focuses on Zero-Trust Foundation Models (ZTFMs), a novel paradigm that embeds zero-trust security principles into the lifecycle of foundation models (FMs) for Internet of Things (IoT) systems.

Anomaly Detection Federated Learning +1

Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation

no code implementations 19 May 2025 Guo Chen, Kai Li, Runxuan Yang, Xiaolin Hu

Existing causal speech separation models often underperform compared to non-causal models due to difficulties in retaining historical information.

Speech Separation

DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking

no code implementations 18 May 2025 Jirong Zha, Yuxuan Fan, Kai Li, Han Li, Chen Gao, Xinlei Chen, Yong Li

First, DIMM extends the model combination solution space of conventional IMM from a hyperplane to a hypercube by designing a 3D-decoupled multi-hierarchy filter bank, which describes the target's motion with various-order linear models.

3D Object Tracking Object Tracking +1

SepPrune: Structured Pruning for Efficient Deep Speech Separation

1 code implementation 17 May 2025 Yuqi Li, Kai Li, Xin Yin, Zhifei Yang, Junhao Dong, Zeyu Dong, Chuanguang Yang, YingLi Tian, Yao Lu

Although deep learning has substantially advanced speech separation in recent years, most existing studies continue to prioritize separation quality while overlooking computational efficiency, an essential factor for low-latency speech processing in real-time applications.

channel selection Computational Efficiency +1

Undermining Federated Learning Accuracy in EdgeIoT via Variational Graph Auto-Encoders

no code implementations 14 Apr 2025 Kai Li, Shuyan Hu, Bochun Wu, Sai Zou, Wei Ni, Falko Dressler

EdgeIoT represents an approach that brings together mobile edge computing with Internet of Things (IoT) devices, allowing for data processing close to the data source.

Edge-computing Federated Learning

Using machine learning method for variable star classification using the TESS Sectors 1-57 data

no code implementations 1 Apr 2025 Li-Heng Wang, Kai Li, Xiang Gao, Ya-Ni Guo, Guo-You Sun

The Transiting Exoplanet Survey Satellite (TESS) is a wide-field all-sky survey mission designed to detect Earth-sized exoplanets.

Classification Of Variable Stars Survey

Automatic Operator-level Parallelism Planning for Distributed Deep Learning -- A Mixed-Integer Programming Approach

no code implementations 12 Mar 2025 Ruifeng She, Bowen Pang, Kai Li, Zehua Liu, Tao Zhong

We propose a bi-level solution framework balancing optimality with computational efficiency, automatically generating effective distributed plans that capture both the heterogeneous structure of modern neural networks and the underlying hardware constraints.

Computational Efficiency Mixture-of-Experts +1

Dynamic Dictionary Learning for Remote Sensing Image Segmentation

1 code implementation 9 Mar 2025 Xuechao Zou, Yue Li, Shun Zhang, Kai Li, Shiying Wang, Pin Tao, Junliang Xing, Congyan Lang

Remote sensing image segmentation faces persistent challenges in distinguishing morphologically similar categories and adapting to diverse scene variations.

Dictionary Learning Image Segmentation +2

Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction

no code implementations 8 Mar 2025 Kai Li, Junhao Wang, William Han, Ding Zhao

On the EndoNeRF dataset, FEG achieves superior performance (SSIM of 0.97, PSNR of 39.08, and LPIPS of 0.03) compared to leading methods.

3DGS image-classification +6

ARS: Automatic Routing Solver with Large Language Models

1 code implementation 21 Feb 2025 Kai Li, Fei Liu, Zhenkun Wang, Xialiang Tong, Xiongwei Han, Mingxuan Yuan, Qingfu Zhang

Real-world Vehicle Routing Problems (VRPs) are characterized by a variety of practical constraints, making manual solver design both knowledge-intensive and time-consuming.

Language Modeling Language Modelling +1

Hybrid Offline-online Scheduling Method for Large Language Model Inference Optimization

no code implementations 14 Feb 2025 Bowen Pang, Kai Li, Ruifeng She, Feifan Wang

A 100-case study shows that our method consistently outperforms the baseline method and improves the utilization rate by 8.0% on average.

GSM8K Inference Optimization +4

RUN: Reversible Unfolding Network for Concealed Object Segmentation

no code implementations 30 Jan 2025 Chunming He, Rihan Zhang, Fengyang Xiao, Chenyu Fang, Longxiang Tang, Yulun Zhang, Linghe Kong, Deng-Ping Fan, Kai Li, Sina Farsiu

To address this, we propose the Reversible Unfolding Network (RUN), which applies reversible strategies across both mask and RGB domains through a theoretically grounded framework, enabling accurate segmentation.

Object Segmentation +1

Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification

no code implementations 25 Jan 2025 Zhongqi Wang, Jia Dai, Kai Li, Xu Li, Yanmeng Guo, MaoSheng Xiang

We conduct comparison experiments between the proposed Comp-LoRA method and other PEFT methods on fine-tuning VLMs for few-shot classification.

Few-Shot Learning parameter-efficient fine-tuning

Diffusion Models for Smarter UAVs: Decision-Making and Modeling

no code implementations 10 Jan 2025 Yousef Emami, Hao Zhou, Luis Almeida, Kai Li

By combining the data generation capabilities of DMs with the decision-making framework of RL and the modeling accuracy of DT, the integration improves the adaptability and real-time performance of UAV communication.

Decision Making Reinforcement Learning (RL)

GDSR: Global-Detail Integration through Dual-Branch Network with Wavelet Losses for Remote Sensing Image Super-Resolution

no code implementations 31 Dec 2024 Qiwei Zhu, Kai Li, Guojing Zhang, Xiaoying Wang, Jianqiang Huang, Xilai Li

In addition, we propose Wavelet Loss, a loss function that effectively captures high-frequency detail information in images, thereby enhancing the visual quality of SR, particularly in terms of detail reconstruction.

Image Super-Resolution State Space Models

HES-UNet: A U-Net for Hepatic Echinococcosis Lesion Segmentation

no code implementations 9 Dec 2024 Jiayan Chen, Kai Li, Zhanjin Wang, Zhan Wang, Jianqiang Huang

Due to the distinct regional characteristics of HE, there is currently no publicly available high-quality dataset for training our model.

Lesion Segmentation

InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences

1 code implementation 2 Dec 2024 Chenyang Zhu, Kai Li, Yue Ma, Longxiang Tang, Chengyu Fang, Chubin Chen, Qifeng Chen, Xiu Li

They struggle to maintain consistency in both the foreground and background during concept swapping, especially when the shape difference is large between objects.

Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images

1 code implementation 20 Nov 2024 Xuechao Zou, Shun Zhang, Kai Li, Shiying Wang, Junliang Xing, Lei Jin, Congyan Lang, Pin Tao

Cloud segmentation is a critical challenge in remote sensing image interpretation, as its accuracy directly impacts the effectiveness of subsequent data processing and analysis.

Segmentation

Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection

1 code implementation 17 Nov 2024 Wentao Bao, Kai Li, Yuxiao Chen, Deep Patel, Martin Renqiang Min, Yu Kong

Existing approaches focus on the closed-set setting where an action detector is trained and tested on videos from a fixed set of action categories.

Action Detection Open Vocabulary Action Detection

Adapter-dependent Adapter Methylation Assay

no code implementations 5 Oct 2024 Jia Zhang, Peng Qi, Li Xiao, Mengxi Yuan, Jun Chuan, Yaling Zeng, Li-mei Lin, Yue Gu, Yan Zhang, Duan-fang Liao, Kai Li

A sensitive and reliable methylation assay is important for oncogenic studies and clinical applications.

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

2 code implementations 2 Oct 2024 Kai Li, Wendi Sang, Chang Zeng, Runxuan Yang, Guo Chen, Xiaolin Hu

Additionally, to investigate the differences between synthetic and real-world data, we selected 5 hours of raw, non-reverberant data from the SonicSet validation set and recorded a real-world speech separation dataset, providing a reference for comparing SonicSet with other synthetic datasets.

Speech Enhancement Speech Separation

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

no code implementations 2 Oct 2024 Mohan Xu, Kai Li, Guo Chen, Xiaolin Hu

On EchoSet and real-world data, TIGER significantly reduces the number of parameters by 94.3% and the MACs by 95.3% while achieving performance surpassing the state-of-the-art (SOTA) model TF-GridNet.

Speech Separation

A Novel Framework of Horizontal-Vertical Hybrid Federated Learning for EdgeIoT

no code implementations 2 Oct 2024 Kai Li, Yilei Liang, Xin Yuan, Wei Ni, Jon Crowcroft, Chau Yuen, Ozgur B. Akan

This letter puts forth a new hybrid horizontal-vertical federated learning (HoVeFL) for mobile edge computing-enabled Internet of Things (EdgeIoT).

Edge-computing Vertical Federated Learning

Subjective and Objective Quality-of-Experience Evaluation Study for Live Video Streaming

no code implementations 26 Sep 2024 Zehao Zhu, Wei Sun, Jun Jia, Wei Wu, Sibin Deng, Kai Li, Ying Chen, Xiongkuo Min, Jia Wang, Guangtao Zhai

For the subjective QoE study, we introduce the first live video streaming QoE dataset, TaoLive QoE, which consists of $42$ source videos collected from real live broadcasts and $1,155$ corresponding distorted ones degraded by a variety of streaming distortions, including conventional streaming distortions such as compression and stalling, as well as live streaming-specific distortions like frame skipping, variable frame rate, etc.

Optical Flow Estimation

Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

no code implementations 22 Sep 2024 Yuxiao Chen, Kai Li, Wentao Bao, Deep Patel, Yu Kong, Martin Renqiang Min, Dimitris N. Metaxas

Learning to localize temporal boundaries of procedure steps in instructional videos is challenging due to the limited availability of annotated large-scale training videos.

Contrastive Learning cross-modal alignment +4

SafeEar: Content Privacy-Preserving Audio Deepfake Detection

1 code implementation 14 Sep 2024 Xinfeng Li, Kai Li, Yifan Zheng, Chen Yan, Xiaoyu Ji, Wenyuan Xu

To overcome the challenge of identifying diverse deepfake audio without semantic clues, we enhance our deepfake detector with real-world codec augmentation.

Audio Deepfake Detection Face Swapping +4

Apollo: Band-sequence Modeling for High-Quality Audio Restoration

1 code implementation 13 Sep 2024 Kai Li, Yi Luo

Audio restoration has become increasingly significant in modern society, not only due to the demand for high-quality auditory experiences enabled by advanced playback devices, but also because the growing capabilities of generative audio models necessitate high-fidelity audio.

Computational Efficiency Speech Enhancement

Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks

no code implementations 9 Sep 2024 Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki

The quantification results from the training set of the Malfunctioning Industrial Machine Investigation and Inspection dataset with a signal-to-noise ratio (SNR) of 6 dB reveal that the distinguishing information between normal and anomalous sounds of different machines is encoded non-uniformly in the frequency domain.

Enhancing Multi-Stream Beamforming Through CQIs For 5G NR FDD Massive MIMO Communications: A Tuning-Free Scheme

no code implementations 1 Sep 2024 Kai Li, Ying Li, Lei Cheng, Zhi-Quan Luo

In such schemes, the performance of downlink beamforming is determined by the codebook design and the codebook indicator feedback.

Quantization

Extracting polygonal footprints in off-nadir images with Segment Anything Model

2 code implementations 16 Aug 2024 Kai Li, Yupeng Deng, Jingbo Chen, Yu Meng, Zhihao Xi, Junxian Ma, Chenhao Wang, Xiangyu Zhao

Building Footprint Extraction (BFE) from off-nadir aerial images often involves roof segmentation and offset prediction to adjust roof boundaries to the building footprint.

Prediction RAG

A Course Shared Task on Evaluating LLM Output for Clinical Questions

1 code implementation 31 Jul 2024 Yufang Hou, Thy Thy Tran, Doan Nam Long Vu, Yiwen Cao, Kai Li, Lukas Rohde, Iryna Gurevych

This paper presents a shared task that we organized at the Foundations of Language Technology (FoLT) course in 2023/2024 at the Technical University of Darmstadt, which focuses on evaluating the output of Large Language Models (LLMs) in generating harmful answers to health-related clinical questions.

Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data

no code implementations 18 Jul 2024 Wufei Ma, Kai Li, Zhongshi Jiang, Moustafa Meshry, Qihao Liu, Huiyu Wang, Christian Häne, Alan Yuille

In order to narrow the gap between video-text models and human performance on RCAD, we identify a key limitation of current contrastive approaches on video-text data and introduce LLM-teacher, a more effective approach to learn action semantics by leveraging knowledge obtained from a pretrained large language model.

Language Modelling Large Language Model +2

Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions

no code implementations 17 Jul 2024 Alam Noor, Kai Li, Eduardo Tovar, Pei Zhang, Bo Wei

Recognizing unauthorized Unmanned Aerial Vehicles (UAVs) within designated no-fly zones throughout the day and night is of paramount importance, where the unauthorized UAVs pose a substantial threat to both civil and military aviation safety.

Graph Neural Network Optical Flow Estimation

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

1 code implementation 7 Jul 2024 Longxiang Tang, Zhuotao Tian, Kai Li, Chunming He, Hantao Zhou, Hengshuang Zhao, Xiu Li, Jiaya Jia

To address this problem efficiently, we propose the Distribution-aware Interference-free Knowledge Integration (DIKI) framework, retaining pre-trained knowledge of VLMs from a perspective of avoiding information interference.

class-incremental learning Class Incremental Learning +2

Timely Requesting for Time-Critical Content Users in Decentralized F-RANs

no code implementations 3 Jul 2024 Xingran Chen, Kai Li, Kun Yang

We study two general classes of policies: (i) oblivious policies, where decision-making is independent of historical information, and (ii) non-oblivious policies, where decisions are influenced by historical information.

Evaluating Copyright Takedown Methods for Language Models

no code implementations 26 Jun 2024 Boyi Wei, Weijia Shi, Yangsibo Huang, Noah A. Smith, Chiyuan Zhang, Luke Zettlemoyer, Kai Li, Peter Henderson

Language models (LMs) derive their capabilities from extensive training on diverse data, including potentially copyrighted material.

A Novel Defense Against Poisoning Attacks on Federated Learning: LayerCAM Augmented with Autoencoder

1 code implementation 2 Jun 2024 Jingjing Zheng, Xin Yuan, Kai Li, Wei Ni, Eduardo Tovar, Jon Crowcroft

The autoencoder is designed to process the LayerCAM heat maps from the local model updates, improving their distinctiveness and thereby increasing the accuracy in spotting anomalous maps and malicious local models.

Federated Learning Model Poisoning

BERP: A Blind Estimator of Room Parameters for Single-Channel Noisy Speech Signals

1 code implementation 7 May 2024 Lijun Wang, Yixian Lu, Ziyan Gao, Kai Li, Jianqiang Huang, Yuntao Kong, Shogo Okada

In this paper, we propose a new universal blind estimation framework called the blind estimator of room parameters (BERP) to estimate RAPs, RGPs and occupancy level via a unified methodology.

Room Impulse Response (RIR)

Age of Information Minimization using Multi-agent UAVs based on AI-Enhanced Mean Field Resource Allocation

no code implementations 24 Apr 2024 Yousef Emami, Hao Gao, Kai Li, Luis Almeida, Eduardo Tovar, Zhu Han

Unmanned Aerial Vehicle (UAV) swarms play an effective role in timely data collection from ground sensors in remote and hostile areas.

Q-Learning Scheduling

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent

1 code implementation 22 Apr 2024 Hang Xu, Kai Li, Bingyun Liu, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng

Counterfactual regret minimization (CFR) is a family of algorithms for effectively solving imperfect-information games.

counterfactual

MultiBooth: Towards Generating All Your Concepts in an Image from Text

1 code implementation 22 Apr 2024 Chenyang Zhu, Kai Li, Yue Ma, Chunming He, Xiu Li

MultiBooth addresses these issues by dividing the multi-concept generation process into two phases: a single-concept learning phase and a multi-concept integration phase.

All Computational Efficiency +1

SPMamba: State-space model is all you need in speech separation

1 code implementation 2 Apr 2024 Kai Li, Guo Chen, Runxuan Yang, Xiaolin Hu

Existing CNN-based speech separation models face local receptive field limitations and cannot effectively capture long time dependencies.

All Mamba +1

DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

2 code implementations CVPR 2024 Muyang Li, Tianle Cai, Jiaxin Cao, Qinsheng Zhang, Han Cai, Junjie Bai, Yangqing Jia, Ming-Yu Liu, Kai Li, Song Han

To overcome this dilemma, we observe the high similarity between the input from adjacent diffusion steps and propose displaced patch parallelism, which takes advantage of the sequential nature of the diffusion process by reusing the pre-computed feature maps from the previous timestep to provide context for the current step.

BitDelta: Your Fine-Tune May Only Be Worth One Bit

1 code implementation 15 Feb 2024 James Liu, Guangxuan Xiao, Kai Li, Jason D. Lee, Song Han, Tri Dao, Tianle Cai

Large Language Models (LLMs) are typically trained in two phases: pre-training on large internet-scale datasets, and fine-tuning for downstream tasks.

Failure Analysis in Next-Generation Critical Cellular Communication Infrastructures

no code implementations 6 Feb 2024 Siguo Bi, Xin Yuan, Shuyan Hu, Kai Li, Wei Ni, Ekram Hossain, Xin Wang

The advent of communication technologies marks a transformative phase in critical infrastructure construction, where the meticulous analysis of failures becomes paramount in achieving the fundamental objectives of continuity, security, and availability.

Survey

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion

1 code implementation 25 Jan 2024 Samuel Pegg, Kai Li, Xiaolin Hu

TDANet serves as the architectural foundation for the auditory and visual networks within TDFNet, offering an efficient model with fewer parameters.

speech-recognition Speech Recognition +1

Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

1 code implementation 8 Dec 2023 Yinwei Dai, Rui Pan, Anand Iyer, Kai Li, Ravi Netravali

Machine learning (ML) inference platforms are tasked with balancing two competing goals: ensuring high throughput given many requests, and delivering low-latency responses to support interactive applications.

Subnetwork-to-go: Elastic Neural Network with Dynamic Training and Customizable Inference

no code implementations 6 Dec 2023 Kai Li, Yi Luo

Deploying neural networks to different devices or platforms is in general challenging, especially when the model size is large or model complexity is high.

Dynamic neural networks Music Source Separation

Detection and Mitigation of Position Spoofing Attacks on Cooperative UAV Swarm Formations

no code implementations 6 Dec 2023 Siguo Bi, Kai Li, Shuyan Hu, Wei Ni, Cong Wang, Xin Wang

Detecting spoofing attacks on the positions of unmanned aerial vehicles (UAVs) within a swarm is challenging.

Position

Data-Agnostic Model Poisoning against Federated Learning: A Graph Autoencoder Approach

no code implementations 30 Nov 2023 Kai Li, Jingjing Zheng, Xin Yuan, Wei Ni, Ozgur B. Akan, H. Vincent Poor

The attacker then adversarially regenerates the graph structural correlations while maximizing the FL training loss, and subsequently generates malicious local models using the adversarial graph structure and the training data features of the benign ones.

Federated Learning Model Poisoning

PanBench: Towards High-Resolution and High-Performance Pansharpening

no code implementations 20 Nov 2023 Shiying Wang, Xuechao Zou, Kai Li, Junliang Xing, Pin Tao

Pansharpening, a pivotal task in remote sensing, involves integrating low-resolution multispectral images with high-resolution panchromatic images to synthesize an image that is both high-resolution and retains multispectral information.

Change Detection Land Cover Classification +1

Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model

1 code implementation 20 Nov 2023 Chunming He, Chengyu Fang, Yulun Zhang, Tian Ye, Kai Li, Longxiang Tang, Zhenhua Guo, Xiu Li, Sina Farsiu

These priors are subsequently utilized by RGformer to guide the decomposition of image features into their respective reflectance and illumination components.

Image Restoration

Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model

no code implementations 25 Oct 2023 Kai Li, Yupeng Deng, Yunlong Kong, Diyou Liu, Jingbo Chen, Yu Meng, Junxian Ma

More accurate extraction of invisible building footprints from very-high-resolution (VHR) aerial images relies on roof segmentation and roof-to-footprint offset extraction.

Instance Segmentation Region Proposal +1

Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation

2 code implementations 10 Oct 2023 Yangsibo Huang, Samyak Gupta, Mengzhou Xia, Kai Li, Danqi Chen

Finally, we propose an effective alignment method that explores diverse generation strategies, which can reasonably reduce the misalignment rate under our attack.

Red Teaming

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation

1 code implementation 29 Sep 2023 Samuel Pegg, Kai Li, Xiaolin Hu

This is the first time-frequency domain audio-visual speech separation method to outperform all contemporary time-domain counterparts.

Audio-Visual Speech Recognition speech-recognition +2

Gastro-Intestinal Tract Segmentation Using an Explainable 3D Unet

no code implementations 25 Sep 2023 Kai Li, Jonathan Chan

This paper proposes a deep learning pipeline that incorporates XAI to address the challenges of organ segmentation.

Organ Segmentation Position

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation

1 code implementation 16 Aug 2023 Kai Li, Runxuan Yang, Fuchun Sun, Xiaolin Hu

Recent research has made significant progress in designing fusion modules for audio-visual speech separation.

Speech Separation

High-Fidelity Lake Extraction via Two-Stage Prompt Enhancement: Establishing a Novel Baseline and Benchmark

1 code implementation 16 Aug 2023 Ben Chen, Xuechao Zou, Kai Li, Yu Zhang, Junliang Xing, Pin Tao

Lake extraction from remote sensing imagery is a complex challenge due to the varied lake shapes and data noise.

Decoder

DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images

1 code implementation 8 Aug 2023 Xuechao Zou, Kai Li, Junliang Xing, Yu Zhang, Shiying Wang, Lei Jin, Pin Tao

Optical satellite images are a critical data source; however, cloud cover often compromises their quality, hindering image applications and analysis.

Cloud Removal Image Generation

Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects

1 code implementation 6 Aug 2023 Chunming He, Kai Li, Yachao Zhang, Yulun Zhang, Zhenhua Guo, Xiu Li, Martin Danelljan, Fisher Yu

On the prey side, we propose an adversarial training framework, Camouflageator, which introduces an auxiliary generator to generate more camouflaged objects that are harder for a COD method to detect.

object-detection Object Detection

Consistency Regularization for Generalizable Source-free Domain Adaptation

no code implementations 3 Aug 2023 Longxiang Tang, Kai Li, Chunming He, Yulun Zhang, Xiu Li

In this paper, we propose a consistency regularization framework to develop a more generalizable SFDA method, which simultaneously boosts model performance on both target training and testing datasets.

Pseudo Label Source-Free Domain Adaptation

How do software citation formats evolve over time? A longitudinal analysis of R programming language packages

no code implementations 17 Jul 2023 Yuzhuo Wang, Kai Li

However, software is hardly consistently cited: one software entity can be cited as different objects, and the citations can change over time.

Articles

HQG-Net: Unpaired Medical Image Enhancement with High-Quality Guidance

no code implementations 15 Jul 2023 Chunming He, Kai Li, Guoxia Xu, Jiangpeng Yan, Longxiang Tang, Yulun Zhang, Xiu Li, YaoWei Wang

Specifically, we extract features from an HQ image and explicitly insert the features, which are expected to encode HQ cues, into the enhancement network to guide the LQ enhancement with the variational normalization module.

Image Enhancement Medical Image Enhancement

Towards Ubiquitous Semantic Metaverse: Challenges, Approaches, and Opportunities

no code implementations 13 Jul 2023 Kai Li, Billy Pik Lik Lau, Xin Yuan, Wei Ni, Mohsen Guizani, Chau Yuen

In recent years, ubiquitous semantic Metaverse has been studied to revolutionize immersive cyber-virtual experiences for augmented reality (AR) and virtual reality (VR) users, which leverages advanced semantic understanding and representation to enable seamless, context-aware interactions within mixed-reality environments.

Marketing Mixed Reality +1

PCG-based Static Underground Garage Scenario Generation

no code implementations 8 Jul 2023 Wenjin Li, Kai Li

The key to crossing these levels lies in training the autonomous driving model.

Autonomous Driving

Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model

1 code implementation 31 May 2023 Héctor Martel, Julius Richter, Kai Li, Xiaolin Hu, Timo Gerkmann

We propose Audio-Visual Lightweight ITerative model (AVLIT), an effective and lightweight neural network that uses Progressive Learning (PL) to perform audio-visual speech separation in noisy environments.

Speech Separation

A Neural State-Space Model Approach to Efficient Speech Separation

1 code implementation 26 May 2023 Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng

In this work, we introduce S4M, a new efficient speech separation framework based on neural state-space models (SSM).

Representation Learning Speech Separation +1

Privacy Implications of Retrieval-Based Language Models

1 code implementation 24 May 2023 Yangsibo Huang, Samyak Gupta, Zexuan Zhong, Kai Li, Danqi Chen

Crucially, we find that $k$NN-LMs are more susceptible to leaking private information from their private datastore than parametric models.

Retrieval

PruMUX: Augmenting Data Multiplexing with Model Compression

1 code implementation 24 May 2023 Yushan Su, Vishvak Murahari, Karthik Narasimhan, Kai Li

As language models increase in size by the day, methods for efficient inference are critical to leveraging their capabilities for various applications.

Knowledge Distillation model +1

Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping

no code implementations NeurIPS 2023 Chunming He, Kai Li, Yachao Zhang, Guoxia Xu, Longxiang Tang, Yulun Zhang, Zhenhua Guo, Xiu Li

It remains a challenging task since (1) it is hard to distinguish concealed objects from the background due to the intrinsic similarity and (2) the sparsely-annotated training data only provide weak supervision for model learning.

Segmentation Semantic Segmentation

Exploring Compositional Visual Generation with Latent Classifier Guidance

no code implementations 25 Apr 2023 Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min

We show that this paradigm based on latent classifier guidance is agnostic to pre-trained generative models, and present competitive results for both image generation and sequential manipulation of real and synthetic images.

Image Generation

Towards Realizing the Value of Labeled Target Samples: a Two-Stage Approach for Semi-Supervised Domain Adaptation

no code implementations 21 Apr 2023 Mengqun Jin, Kai Li, Shuyan Li, Chunming He, Xiu Li

We further propose a consistency learning based mean teacher model to effectively adapt the learned UDA model using labeled and unlabeled target samples.

Semi-supervised Domain Adaptation Unsupervised Domain Adaptation

PMAA: A Progressive Multi-scale Attention Autoencoder Model for High-performance Cloud Removal from Multi-temporal Satellite Imagery

2 code implementations 29 Mar 2023 Xuechao Zou, Kai Li, Junliang Xing, Pin Tao, Yachao Cui

Satellite imagery analysis plays a pivotal role in remote sensing; however, information loss due to cloud cover significantly impedes its application.

Cloud Detection Cloud Removal

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

1 code implementation CVPR 2023 Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

In this paper, we propose an approach for cI2V using novel latent flow diffusion models (LFDM) that synthesize an optical flow sequence in the latent space based on the given condition to warp the given image.

Image to Video Generation Motion Generation +1

Learning Trustworthy Model from Noisy Labels based on Rough Set for Surface Defect Detection

no code implementations25 Jan 2023 Tongzhi Niu, Bin Li, Kai Li, Yufeng Lin, Yuwei Li, Weifeng Li, Zhenrong Wang

In surface defect detection, some suspicious regions cannot be uniquely classified as abnormal or normal.

Defect Detection

Adversarial Alignment for Source Free Object Detection

no code implementations11 Jan 2023 Qiaosong Chu, Shuyan Li, Guangyi Chen, Kai Li, Xiu Li

Source-free object detection (SFOD) aims to transfer a detector pre-trained on a label-rich source domain to an unlabeled target domain without seeing source data.

Object object-detection +2

Degradation-Resistant Unfolding Network for Heterogeneous Image Fusion

no code implementations ICCV 2023 Chunming He, Kai Li, Guoxia Xu, Yulun Zhang, Runze Hu, Zhenhua Guo, Xiu Li

Heterogeneous image fusion (HIF) techniques aim to enhance image quality by merging complementary information from images captured by different sensors.

Few-Shot Video Classification via Representation Fusion and Promotion Learning

no code implementations ICCV 2023 Haifeng Xia, Kai Li, Martin Renqiang Min, Zhengming Ding

This operation maximizes the contribution of discriminative frames to further capture the similarity of support and query samples from the same category.

Video Classification

Personalized Semantics Excitation for Federated Image Classification

no code implementations ICCV 2023 Haifeng Xia, Kai Li, Zhengming Ding

Federated learning casts a light on the collaboration of distributed local clients with privacy protected to attain a more generic global model.

Classification image-classification +3

Camouflaged Object Detection With Feature Decomposition and Edge Reconstruction

no code implementations CVPR 2023 Chunming He, Kai Li, Yachao Zhang, Longxiang Tang, Yulun Zhang, Zhenhua Guo, Xiu Li

COD is a challenging task due to the intrinsic similarity of camouflaged objects with the background, as well as their ambiguous boundaries.

object-detection Object Detection

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits

2 code implementations21 Dec 2022 Kai Li, Fenghua Xie, Hang Chen, Kexin Yuan, Xiaolin Hu

Then, inspired by the large number of connections between cortical regions and the thalamus, the model fuses the auditory and visual information in a thalamic subnetwork through top-down connections.

Speech Separation

SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields

no code implementations7 Dec 2022 Siddhant Ranade, Christoph Lassner, Kai Li, Christian Haene, Shen-Chi Chen, Jean-Charles Bazin, Sofien Bouaziz

Neural Radiance Fields (NeRFs) encode the radiance in a scene parameterized by the scene's plenoptic function.

Video Editing

Machine Learning-Aided Operations and Communications of Unmanned Aerial Vehicles: A Contemporary Survey

no code implementations7 Nov 2022 Harrison Kurunathan, Hailong Huang, Kai Li, Wei Ni, Ekram Hossain

It is also unveiled that the reliability and trust of ML in UAV operations and applications require significant attention before full automation of UAVs and potential cooperation between UAVs and humans come to fruition.

Survey

An efficient encoder-decoder architecture with top-down attention for speech separation

1 code implementation30 Sep 2022 Kai Li, Runxuan Yang, Xiaolin Hu

In addition, a large-size version of TDANet obtained SOTA results on three datasets, with MACs still only 10% of Sepformer and the CPU inference time only 24% of Sepformer.

Decoder +1

A Review on Method Entities in the Academic Literature: Extraction, Evaluation, and Application

no code implementations8 Sep 2022 Yuzhuo Wang, Chengzhi Zhang, Kai Li

With the advancement of sciences, many scientific methods are being proposed, modified, and used in academic literature.

A Multi-scale Video Denoising Algorithm for Raw Image

no code implementations5 Sep 2022 Bin Ma, Yueli Hu, Xianxian Lv, Kai Li

Video denoising for raw images has long been a difficulty in camera image processing.

Image Denoising Motion Estimation +1

Designs, Motion Mechanism, Motion Coordination, and Communication of Bionic Robot Fishes: A Survey

no code implementations30 Jun 2022 Zhiwei Yu, Kai Li, Yu Ji, Simon X. Yang

Various methods are needed to narrow the gap in swimming performance between robot fishes and fish.

Multi-Agent Feedback Enabled Neural Networks for Intelligent Communications

1 code implementation22 May 2022 Fanglei Sun, Yang Li, Ying Wen, Jingchen Hu, Jun Wang, Yang Yang, Kai Li

The design of the MAFENN framework and algorithm is dedicated to enhancing the learning capability of feedforward DL networks or their variations with simple data feedback.

Denoising Intelligent Communication

Recovering Private Text in Federated Learning of Language Models

1 code implementation17 May 2022 Samyak Gupta, Yangsibo Huang, Zexuan Zhong, Tianyu Gao, Kai Li, Danqi Chen

For the first time, we show the feasibility of recovering text from large batch sizes of up to 128 sentences.

Federated Learning Word Embeddings

Fast Few-shot Debugging for NLU Test Suites

1 code implementation DeeLIO (ACL) 2022 Christopher Malon, Kai Li, Erik Kruus

We study few-shot debugging of transformer based natural language understanding models, using recently popularized test suites to not just diagnose but correct a problem.

Natural Language Understanding

Downlink Channel Covariance Matrix Reconstruction for FDD Massive MIMO Systems with Limited Feedback

no code implementations2 Apr 2022 Kai Li, Ying Li, Lei Cheng, Qingjiang Shi, Zhi-Quan Luo

The downlink channel covariance matrix (CCM) acquisition is the key step for the practical performance of massive multiple-input and multiple-output (MIMO) systems, including beamforming, channel tracking, and user scheduling.

Scheduling

StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis

1 code implementation CVPR 2022 Zhiheng Li, Martin Renqiang Min, Kai Li, Chenliang Xu

Based on the identified latent directions of attributes, we propose Compositional Attribute Adjustment to adjust the latent code, resulting in better compositionality of image synthesis.

Attribute Fairness +2

Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Persevering EdgeIoT

1 code implementation15 Feb 2022 Jingjing Zheng, Kai Li, Naram Mhaisen, Wei Ni, Eduardo Tovar, Mohsen Guizani

Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Things (EdgeIoT).

Deep Reinforcement Learning Edge-computing +1

Evaluating Gradient Inversion Attacks and Defenses in Federated Learning

1 code implementation NeurIPS 2021 Yangsibo Huang, Samyak Gupta, Zhao Song, Kai Li, Sanjeev Arora

Gradient inversion attack (or input recovery from gradient) is an emerging threat to the security and privacy preservation of Federated learning, whereby malicious eavesdroppers or participants in the protocol can recover (partially) the clients' private data.

Federated Learning

Rethinking Lightweight Convolutional Neural Networks for Efficient and High-quality Pavement Crack Detection

2 code implementations13 Sep 2021 Kai Li, Jie Yang, Siwei Ma, Bo Wang, Shanshe Wang, Yingjie Tian, Zhiquan Qi

For the second issue, we reconsider how to improve detection efficiency with excellent performance, and then propose our lightweight encoder-decoder architecture termed CarNet.

Decoder

EMA: Auditing Data Removal from Trained Models

1 code implementation8 Sep 2021 Yangsibo Huang, Xiaoxiao Li, Kai Li

In this paper, we propose a new method called Ensembled Membership Auditing (EMA) for auditing data removal to overcome these limitations.

Plot2Spectra: an Automatic Spectra Extraction Tool

1 code implementation6 Jul 2021 Weixin Jiang, Eric Schwenker, Trevor Spreadbury, Kai Li, Maria K. Y. Chan, Oliver Cossairt

In scientific literature, XANES/Raman data are usually plotted in line graphs which is a visually appropriate way to represent the information when the end-user is a human reader.

Optical Flow Estimation Semantic Segmentation

Fast and Accurate Road Crack Detection Based on Adaptive Cost-Sensitive Loss Function

no code implementations29 Jun 2021 Kai Li, Bo Wang, Yingjie Tian, Zhiquan Qi

Numerous detection problems in computer vision, including road crack detection, suffer from extreme foreground-background imbalance.

MR Image Super-Resolution With Squeeze and Excitation Reasoning Attention Network

no code implementations CVPR 2021 Yulun Zhang, Kai Li, Kunpeng Li, Yun Fu

They also fail to sense the entire space of the input, which is critical for high-quality MR image SR. To address those problems, we propose squeeze and excitation reasoning attention networks (SERAN) for accurate MR image SR. We propose to squeeze attention from global spatial information of the input and obtain global descriptors.

Image Super-Resolution

Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment

no code implementations1 Jun 2021 Tianze Zhou, Fubiao Zhang, Kun Shao, Kai Li, Wenhan Huang, Jun Luo, Weixun Wang, Yaodong Yang, Hangyu Mao, Bin Wang, Dong Li, Wulong Liu, Jianye Hao

In addition, we use a novel agent network named Population Invariant agent with Transformer (PIT) to realize the coordination transfer in more varieties of scenarios.

Management Multi-agent Reinforcement Learning +3

Cooperative Multi-Agent Reinforcement Learning with Sequential Credit Assignment

1 code implementation NeurIPS 2021 Yifan Zang, Jinmin He, Kai Li, Lily Cao, Haobo Fu, Qiang Fu, Junliang Xing

In this paper, we propose a cooperative MARL method with sequential credit assignment (SeCA) that deduces each agent's contribution to the team's success one by one to learn better cooperation.

counterfactual Multi-agent Reinforcement Learning +5

ECACL: A Holistic Framework for Semi-Supervised Domain Adaptation

1 code implementation ICCV 2021 Kai Li, Chang Liu, Handong Zhao, Yulun Zhang, Yun Fu

This paper studies Semi-Supervised Domain Adaptation (SSDA), a practical yet under-investigated research topic that aims to learn a model of good performance using unlabeled samples and a few labeled samples in the target domain, with the help of labeled samples from a source domain.

Data Augmentation Domain Adaptation +1

RPCL: A Framework for Improving Cross-Domain Detection with Auxiliary Tasks

no code implementations18 Apr 2021 Kai Li, Curtis Wigington, Chris Tensmeyer, Vlad I. Morariu, Handong Zhao, Varun Manjunatha, Nikolaos Barmpalios, Yun Fu

Contrasted with prior work, this paper provides a complementary solution to align domains by learning the same auxiliary tasks in both domains simultaneously.

L2E: Learning to Exploit Your Opponent

no code implementations18 Feb 2021 Zhe Wu, Kai Li, Enmin Zhao, Hang Xu, Meng Zhang, Haobo Fu, Bo An, Junliang Xing

In this work, we propose a novel Learning to Exploit (L2E) framework for implicit opponent modeling.

VINS: Visual Search for Mobile User Interface Design

1 code implementation10 Feb 2021 Sara Bunian, Kai Li, Chaima Jemmali, Casper Harteveld, Yun Fu, Magy Seif El-Nasr

By utilizing this dataset, we propose an object-detection based image retrieval framework that models the UI context and hierarchical structure.

Image Retrieval object-detection +2

Bridge the Vision Gap from Field to Command: A Deep Learning Network Enhancing Illumination and Details

no code implementations20 Jan 2021 Zhuqing Jiang, Chang Liu, Ya'nan Wang, Kai Li, Aidong Men, Haiying Wang, Haiyong Luo

With the goal of tuning up the brightness, low-light image enhancement enjoys numerous applications, such as surveillance, remote sensing and computational photography.

Low-Light Image Enhancement

Shed Various Lights on a Low-Light Image: Multi-Level Enhancement Guided by Arbitrary References

no code implementations4 Jan 2021 Ya'nan Wang, Zhuqing Jiang, Chang Liu, Kai Li, Aidong Men, Haiying Wang

This paper proposes a neural network for multi-level low-light image enhancement, which is user-friendly to meet various requirements by selecting different images as brightness reference.

Low-Light Image Enhancement Style Transfer

Video Matting via Consistency-Regularized Graph Neural Networks

no code implementations ICCV 2021 Tiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang

In this paper, we propose to enhance the temporal coherence by Consistency-Regularized Graph Neural Networks (CRGNN) with the aid of a synthesized video matting dataset.

Image Matting Optical Flow Estimation +1

Temperature Regret Matching for Imperfect-Information Games

no code implementations1 Jan 2021 Enmin Zhao, Kai Li, Junliang Xing

Regret matching (RM) plays a crucial role in CFR and its variants to approach Nash equilibrium.

counterfactual
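
For readers unfamiliar with the regret matching rule mentioned in the entry above, the following is a minimal sketch of the plain, textbook version: each action is played with probability proportional to its positive cumulative regret, with a uniform fallback when no regret is positive. It is not the temperature variant proposed in the paper, and the function name `regret_matching` is a hypothetical choice for illustration.

```python
def regret_matching(cumulative_regret):
    """Plain regret matching (illustrative sketch, not the paper's variant).

    Returns a probability distribution over actions that is proportional
    to the positive part of each action's cumulative regret.
    """
    # Negative regrets are clipped to zero.
    positive = [max(r, 0.0) for r in cumulative_regret]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No action has positive regret: fall back to the uniform strategy.
    n = len(cumulative_regret)
    return [1.0 / n] * n
```

In CFR-style algorithms this rule is applied independently at every information set, and the average of the resulting strategies over iterations is the quantity that approaches equilibrium.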

Neighbor Class Consistency on Unsupervised Domain Adaptation

no code implementations1 Jan 2021 Chang Liu, Kai Li, Yun Fu

Unsupervised domain adaptation (UDA) aims to make predictions for unlabeled data in a target domain, given labeled data from a source domain.

Clustering image-classification +2

OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research

no code implementations11 Dec 2020 Kai Li, Hang Xu, Enmin Zhao, Zhe Wu, Junliang Xing

Owing to the unremitting efforts by a few institutes, significant progress has recently been made in designing superhuman AIs in No-limit Texas Hold'em (NLTH), the primary testbed for large-scale imperfect-information game research.

DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identification

no code implementations12 Nov 2020 Jiangtao Kong, Yu Cheng, Benjia Zhou, Kai Li, Junliang Xing

To obtain a high-performance vehicle ReID model, we present a novel Distance Shrinking with Angular Marginalizing (DSAM) loss function to perform hybrid learning in both the Original Feature Space (OFS) and the Feature Angular Space (FAS) using the local verification and the global identification information.

Person Re-Identification Vehicle Re-Identification

Beyond the Deep Metric Learning: Enhance the Cross-Modal Matching with Adversarial Discriminative Domain Regularization

no code implementations23 Oct 2020 Li Ren, Kai Li, Liqiang Wang, Kien Hua

In this paper, we address this limitation with an efficient learning objective that considers the discriminative feature distributions between the visual objects and sentence words.

Metric Learning Sentence

Pushing The Limit of Type I Codebook For FDD Massive MIMO Beamforming: A Channel Covariance Reconstruction Approach

no code implementations22 Oct 2020 Kai Li, Ying Li, Lei Cheng, Qingjiang Shi, Zhi-Quan Luo

There is a fundamental trade-off between the channel representation resolution of codebooks and the overheads of feedback communications in the fifth generation new radio (5G NR) frequency division duplex (FDD) massive multiple-input and multiple-output (MIMO) systems.

Vocal Bursts Type Prediction

MixCon: Adjusting the Separability of Data Representations for Harder Data Recovery

no code implementations22 Oct 2020 Xiaoxiao Li, Yangsibo Huang, Binghui Peng, Zhao Song, Kai Li

To address the issue that deep neural networks (DNNs) are vulnerable to model inversion attacks, we design an objective function, which adjusts the separability of the hidden data representations, as a way to control the trade-off between data utility and vulnerability to inversion attacks.

InstaHide: Instance-hiding Schemes for Private Distributed Learning

3 code implementations6 Oct 2020 Yangsibo Huang, Zhao Song, Kai Li, Sanjeev Arora

This paper introduces InstaHide, a simple encryption of training images, which can be plugged into existing distributed deep learning pipelines.

Light Field View Synthesis via Aperture Disparity and Warping Confidence Map

no code implementations7 Sep 2020 Nan Meng, Kai Li, Jianzhuang Liu, Edmund Y. Lam

This paper presents a learning-based approach to synthesize the view from an arbitrary camera position given a sparse set of images.

Novel View Synthesis Position

Cross-Domain Document Object Detection: Benchmark Suite and Method

1 code implementation CVPR 2020 Kai Li, Curtis Wigington, Chris Tensmeyer, Handong Zhao, Nikolaos Barmpalios, Vlad I. Morariu, Varun Manjunatha, Tong Sun, Yun Fu

We establish a benchmark suite consisting of different types of PDF document datasets that can be utilized for cross-domain DOD model training and evaluation.

object-detection Object Detection

Adversarial Feature Hallucination Networks for Few-Shot Learning

1 code implementation CVPR 2020 Kai Li, Yulun Zhang, Kunpeng Li, Yun Fu

The recent flourish of deep learning in various tasks is largely accredited to the rich and accessible labeled data.

Data Augmentation Diversity +2

Privacy-preserving Learning via Deep Net Pruning

no code implementations4 Mar 2020 Yangsibo Huang, Yushan Su, Sachin Ravi, Zhao Song, Sanjeev Arora, Kai Li

This paper attempts to answer the question whether neural network pruning can be used as a tool to achieve differential privacy without losing much data utility.

Network Pruning Privacy Preserving

Learning Relaxed Belady for Content Distribution Network Caching

1 code implementation25 Feb 2020 Zhenyu Song, Daniel S. Berger, Kai Li, Wyatt Lloyd

This paper presents a new approach for caching in CDNs that uses machine learning to approximate the Belady MIN algorithm.

BIG-bench Machine Learning
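
As background for the entry above, the Belady MIN algorithm it refers to is the offline-optimal eviction rule: on a miss with a full cache, evict the object whose next request lies farthest in the future. The sketch below illustrates only that classical rule, under the simplifying assumptions of uniform object sizes and `capacity >= 1`. It is not the paper's learned approximation, and the function name `belady_min_misses` is hypothetical.

```python
def belady_min_misses(requests, capacity):
    """Count cache misses under Belady's MIN rule (illustrative sketch).

    Assumes every object has the same size and capacity >= 1.
    """
    cache = set()
    misses = 0
    for i, key in enumerate(requests):
        if key in cache:
            continue  # hit: nothing to do
        misses += 1
        if len(cache) >= capacity:
            def next_use(k):
                # Index of the next request for k, or infinity if none.
                for j in range(i + 1, len(requests)):
                    if requests[j] == k:
                        return j
                return float("inf")
            # Evict the cached object requested farthest in the future.
            cache.remove(max(cache, key=next_use))
        cache.add(key)
    return misses
```

Because the rule needs the full future request sequence, it cannot be run online, which is why an approximation is needed in practice.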

Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective

1 code implementation ICCV 2019 Kai Li, Martin Renqiang Min, Yun Fu

We instead reformulate ZSL as a conditioned visual classification problem, i.e., classifying visual features based on the classifiers learned from the semantic descriptions.

Classification General Classification +1

Visual Semantic Reasoning for Image-Text Matching

2 code implementations ICCV 2019 Kunpeng Li, Yulun Zhang, Kai Li, Yuanyuan Li, Yun Fu

It outperforms the current best method by 6.8% relatively for image retrieval and 4.8% relatively for caption retrieval on MS-COCO (Recall@1 using 1K test set).

Image Retrieval Image-text matching +3

On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection

no code implementations4 Jun 2019 Kai Li, Wei Ni, Eduardo Tovar

A key challenge is online MPT and data collection in the presence of on-board control of a UAV (e.g., patrolling velocity) for preventing battery drainage and data queue overflow of the sensing devices, while up-to-date knowledge on battery level and data queue of the devices is not available at the UAV.

Deep Reinforcement Learning Q-Learning +3

Authenticated Key-Value Stores with Hardware Enclaves

no code implementations26 Apr 2019 Yuzhe Tang, Ju Chen, Kai Li, Jianliang Xu, Qi Zhang

To circumvent the limited enclave memory (128 MB with the latest Intel CPUs), we propose to place the memory buffer of the eLSM store outside the enclave and protect the buffer using a new authenticated data structure by digesting individual LSM-tree levels.

Cryptography and Security Databases Distributed, Parallel, and Cluster Computing Data Structures and Algorithms

Residual Non-local Attention Networks for Image Restoration

2 code implementations ICLR 2019 Yulun Zhang, Kunpeng Li, Kai Li, Bineng Zhong, Yun Fu

To address this issue, we design local and non-local attention blocks to extract features that capture the long-range dependencies between pixels and pay more attention to the challenging parts.

Demosaicking Image Denoising +1

PZnet: Efficient 3D ConvNet Inference on Manycore CPUs

no code implementations18 Mar 2019 Sergiy Popovych, Davit Buniatyan, Aleksandar Zlateski, Kai Li, H. Sebastian Seung

Convolutional nets have been shown to achieve state-of-the-art accuracy in many biomedical image analysis tasks.

Fast 3D Line Segment Detection From Unorganized Point Cloud

3 code implementations8 Jan 2019 Xiaohu Lu, Yahui Liu, Kai Li

This paper presents a very simple but efficient algorithm for 3D line segment detection from large-scale unorganized point clouds.

Line Detection Line Segment Detection +1

Dynamic Spatio-temporal Graph-based CNNs for Traffic Prediction

no code implementations5 Dec 2018 Ken Chen, Fei Chen, Baisheng Lai, Zhongming Jin, Yong Liu, Kai Li, Long Wei, Pengfei Wang, Yandong Tang, Jianqiang Huang, Xian-Sheng Hua

To capture the graph dynamics, we use the graph prediction stream to predict the dynamic graph structures, and the predicted structures are fed into the flow prediction stream.

Prediction Traffic Prediction

Support Neighbor Loss for Person Re-Identification

1 code implementation18 Aug 2018 Kai Li, Zhengming Ding, Kunpeng Li, Yulun Zhang, Yun Fu

To ensure scalability and separability, a softmax-like function is formulated to push apart the positive and negative support sets.

Person Re-Identification

Payoff Control in the Iterated Prisoner's Dilemma

no code implementations17 Jul 2018 Dong Hao, Kai Li, Tao Zhou

Repeated game has long been the touchstone model for agents' long-run relationships.

Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation

no code implementations CVPR 2018 Kai Li, Junliang Xing, Chi Su, Weiming Hu, Yundong Zhang, Stephen Maybank

First, a novel cost-sensitive multi-task loss function is designed to learn transferable aging features by training on the source population.

Age Estimation

Deep Ordinal Hashing with Spatial Attention

no code implementations7 May 2018 Lu Jin, Xiangbo Shu, Kai Li, Zechao Li, Guo-Jun Qi, Jinhui Tang

However, most existing deep hashing methods directly learn the hash functions by encoding the global semantic information, while ignoring the local spatial information of images.

Deep Hashing Image Retrieval

Large Linear Multi-output Gaussian Process Learning

1 code implementation30 May 2017 Vladimir Feinberg, Li-Fang Cheng, Kai Li, Barbara E. Engelhardt

Gaussian processes (GPs), or distributions over arbitrary functions in a continuous domain, can be generalized to the multi-output case: a linear model of coregionalization (LMC) is one approach.

Gaussian Processes Time Series +1

Sparse Multi-Output Gaussian Processes for Medical Time Series Prediction

1 code implementation27 Mar 2017 Li-Fang Cheng, Gregory Darnell, Bianca Dumitrascu, Corey Chivers, Michael E Draugelis, Kai Li, Barbara E. Engelhardt

In the scenario of real-time monitoring of hospital patients, high-quality inference of patients' health status using all information available from clinical covariates and lab tests is essential to enable successful medical interventions and improve patient outcomes.

Gaussian Processes Time Series +1

Quantum cluster approach to the topological invariants in correlated Chern insulators

no code implementations16 Dec 2015 Zhao-Long Gu, Kai Li, Jian-Xin Li

We detect the topological properties of Chern insulators with strong Coulomb interactions by use of cluster perturbation theory and variational cluster approach.

Strongly Correlated Electrons

From rules to runs: A dynamic epistemic take on imperfect information games

no code implementations7 Dec 2015 Kai Li, Yanjing Wang

We argue that the problem lies in the mix-up of two interpretations of the extensive form game structures: game rules or game runs which do not always coincide.

First-Take-All: Temporal Order-Preserving Hashing for 3D Action Videos

no code implementations6 Jun 2015 Jun Ye, Hao Hu, Kai Li, Guo-Jun Qi, Kien A. Hua

With the prevalence of commodity depth cameras, the new paradigm of user interfaces based on 3D motion capturing and recognition has dramatically changed the way humans and computers interact.

3D Action Recognition All

Line-Based Multi-Label Energy Optimization for Fisheye Image Rectification and Calibration

no code implementations CVPR 2015 Mi Zhang, Jian Yao, Menghan Xia, Kai Li, Yi Zhang, Yaping Liu

Fisheye image rectification and estimation of intrinsic parameters for real scenes have been addressed in the literature by using line information on the distorted images.

Rank Subspace Learning for Compact Hash Codes

no code implementations19 Mar 2015 Kai Li, Guo-Jun Qi, Jun Ye, Kien A. Hua

In this work, we propose a novel hash learning framework that encodes features' rank orders instead of numeric values in a number of optimal low-dimensional ranking subspaces.
