Search Results for author: Chen Chen

Found 400 papers, 176 papers with code

Semi-Dynamic Load Balancing: Efficient Distributed Learning in Non-Dedicated Environments

no code implementations7 Jun 2018 Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, Bo Li

Efficient model training requires eliminating such stragglers, yet for modern ML workloads, existing load balancing strategies are inefficient and even infeasible.

Key Person Aided Re-identification in Partially Ordered Pedestrian Set

no code implementations25 May 2018 Chen Chen, Min Cao, Xiyuan Hu, Silong Peng

Ideally person re-identification seeks for perfect feature representation and metric model that re-identify all various pedestrians well in non-overlapping views at different locations with different camera configurations, which is very challenging.

Person Re-Identification

Gabor Convolutional Networks

no code implementations3 May 2017 Shangzhen Luan, Baochang Zhang, Chen Chen, Xian-Bin Cao, Jungong Han, Jianzhuang Liu

Steerable properties dominate the design of traditional filters, e. g., Gabor filters, and endow features the capability of dealing with spatial transformations.

Robust 3D Action Recognition through Sampling Local Appearances and Global Distributions

no code implementations4 Dec 2017 Mengyuan Liu, Hong Liu, Chen Chen

Then, motion and shape cues are jointly used to generate robust and distinctive spatial-temporal interest points (STIPs): motion-based STIPs and shape-based STIPs.

3D Action Recognition

An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos

no code implementations30 Nov 2017 Rui Hou, Chen Chen, Mubarak Shah

A video is first divided into equal length clips and next for each clip a set of tube proposals are generated based on 3D CNN features.

Action Detection Action Segmentation +4

Latent Constrained Correlation Filter

no code implementations11 Nov 2017 Baochang Zhang, Shangzhen Luan, Chen Chen, Jungong Han, Wei Wang, Alessandro Perina, Ling Shao

In this paper, we introduce an intermediate step -- solution sampling -- after the data sampling step to form a subspace, in which an optimal solution can be estimated.

Object Recognition Object Tracking

Block building programming for symbolic regression

no code implementations22 May 2017 Chen Chen, Changtong Luo, Zonglin Jiang

Symbolic regression that aims to detect underlying data-driven models has become increasingly important for industrial data analysis.

Computational Efficiency regression +1

Multi-modal Aggregation for Video Classification

no code implementations27 Oct 2017 Chen Chen, Xiaowei Zhao, Yang Liu

In this paper, we present a solution to Large-Scale Video Classification Challenge (LSVC2017) [1] that ranked the 1st place.

Classification General Classification +1

Manifold Constrained Low-Rank Decomposition

no code implementations6 Aug 2017 Chen Chen, Baochang Zhang, Alessio Del Bue, Vittorio Murino

Low-rank decomposition (LRD) is a state-of-the-art method for visual data reconstruction and modelling.

Fast Modeling Methods for Complex System with Separable Features

no code implementations5 Aug 2017 Chen Chen, Changtong Luo, Zonglin Jiang

In this paper, we analyze different types of separability of some real-world engineering equations and establish a mathematical model of generalized separable system (GS system).

A divide and conquer method for symbolic regression

no code implementations23 May 2017 Changtong Luo, Chen Chen, Zonglin Jiang

This feature motivated us to develop a new method, divide and conquer (D&C), for symbolic regression, in which the target function is divided into a number of sub-functions and the sub-functions are then determined by any of a GP algorithm.

regression Symbolic Regression

Latent Constrained Correlation Filters for Object Localization

no code implementations7 Jun 2016 Shangzhen Luan, Baochang Zhang, Jungong Han, Chen Chen, Ling Shao, Alessandro Perina, Linlin Shen

There is a neglected fact in the traditional machine learning methods that the data sampling can actually lead to the solution sampling.

Object Object Localization

Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

no code implementations24 Apr 2017 Chen Chen, Changtong Luo, Zonglin Jiang

In this paper, a new non-evolutionary real-time algorithm for symbolic regression, Elite Bases Regression (EBR), is proposed.

regression Symbolic Regression

Measuring and Predicting Tag Importance for Image Retrieval

no code implementations28 Feb 2016 Shang-Wen Li, Sanjay Purushotham, Chen Chen, Yuzhuo Ren, C. -C. Jay Kuo

Textual data such as tags, sentence descriptions are combined with visual cues to reduce the semantic gap for image retrieval applications in today's Multimodal Image Retrieval (MIR) systems.

Image Retrieval Retrieval +2

GAL: A Global-Attributes Assisted Labeling System for Outdoor Scenes

no code implementations3 Apr 2016 Yuzhuo Ren, Chen Chen, Shang-Wen Li, C. -C. Jay Kuo

The proposed Global-attributes Assisted Labeling (GAL) system exploits both local features and global attributes.

Fast Iteratively Reweighted Least Squares Algorithms for Analysis-Based Sparsity Reconstruction

no code implementations18 Nov 2014 Chen Chen, Junzhou Huang, Lei He, Hongsheng Li

The convergence rate of the proposed algorithm is almost the same as that of the traditional IRLS algorithms, that is, exponentially fast.

Compressive Sensing

SIRF: Simultaneous Image Registration and Fusion in A Unified Framework

no code implementations18 Nov 2014 Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang

In this paper, we propose a novel method for image fusion with a high-resolution panchromatic image and a low-resolution multispectral image at the same geographical location.

Image Registration

Forest Sparsity for Multi-channel Compressive Sensing

no code implementations20 Nov 2012 Chen Chen, Yeqing Li, Junzhou Huang

In this paper, we investigate a new compressive sensing model for multi-channel sparse data where each channel can be represented as a hierarchical tree and different channels are highly correlated.

Compressive Sensing

Rain Removal By Image Quasi-Sparsity Priors

no code implementations20 Dec 2018 Yinglong Wang, Shuaicheng Liu, Chen Chen, Dehua Xie, Bing Zeng

We present a novel rain removal method in this paper, which consists of two steps, i. e., detection of rain streaks and reconstruction of the rain-removed image.

Rain Removal

Compressive Sensing MRI with Wavelet Tree Sparsity

no code implementations NeurIPS 2012 Chen Chen, Junzhou Huang

On the other side, some algorithms have been proposed for tree sparsity regularization, but few of them has validated the benefit of tree structure in CS-MRI.

Compressive Sensing

Image Fusion with Local Spectral Consistency and Dynamic Gradient Sparsity

no code implementations CVPR 2014 Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang

In this paper, we propose a novel method for image fusion from a high resolution panchromatic image and a low resolution multispectral image at the same geographical location.

Binary Coding for Partial Action Analysis With Limited Observation Ratios

no code implementations CVPR 2017 Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang

Extensive experiments on four realistic action datasets in terms of three tasks (i. e., partial action retrieval, recognition and prediction) clearly show the superiority of PRBC over the state-of-the-art methods, along with significantly reduced memory load and computational costs during the online test.

Action Analysis Action Recognition +3

GeoCapsNet: Aerial to Ground view Image Geo-localization using Capsule Network

no code implementations12 Apr 2019 Bin Sun, Chen Chen, Yingying Zhu, Jianmin Jiang

The task of cross-view image geo-localization aims to determine the geo-location (GPS coordinates) of a query ground-view image by matching it with the GPS-tagged aerial (satellite) images in a reference dataset.

Image Retrieval Retrieval

Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction

no code implementations5 Jul 2019 Wenjia Bai, Chen Chen, Giacomo Tarroni, Jinming Duan, Florian Guitton, Steffen E. Petersen, Yike Guo, Paul M. Matthews, Daniel Rueckert

In the recent years, convolutional neural networks have transformed the field of medical image analysis due to their capacity to learn discriminative image features for a variety of classification and regression tasks.

Image Segmentation Position +4

Unsupervised Multi-modal Style Transfer for Cardiac MR Segmentation

no code implementations20 Aug 2019 Chen Chen, Cheng Ouyang, Giacomo Tarroni, Jo Schlemper, Huaqi Qiu, Wenjia Bai, Daniel Rueckert

In this work, we present a fully automatic method to segment cardiac structures from late-gadolinium enhanced (LGE) images without using labelled LGE data for training, but instead by transferring the anatomical knowledge and features learned on annotated balanced steady-state free precession (bSSFP) images, which are easier to acquire.

Image Segmentation Segmentation +3

Renyi Differentially Private ADMM for Non-Smooth Regularized Optimization

no code implementations18 Sep 2019 Chen Chen, Jaewoo Lee

In this paper we consider the problem of minimizing composite objective functions consisting of a convex differentiable loss function plus a non-smooth regularization term, such as $L_1$ norm or nuclear norm, under R\'enyi differential privacy (RDP).

feature selection

MGHRL: Meta Goal-generation for Hierarchical Reinforcement Learning

no code implementations30 Sep 2019 Haotian Fu, Hongyao Tang, Jianye Hao, Wulong Liu, Chen Chen

Most meta reinforcement learning (meta-RL) methods learn to adapt to new tasks by directly optimizing the parameters of policies over primitive action space.

Hierarchical Reinforcement Learning Meta-Learning +3

Multilingual Dialogue Generation with Shared-Private Memory

no code implementations6 Oct 2019 Chen Chen, Lisong Qiu, Zhenxin Fu, Dongyan Zhao, Junfei Liu, Rui Yan

Existing dialog systems are all monolingual, where features shared among different languages are rarely explored.

Cross-Lingual Transfer Dialogue Generation

HMTNet:3D Hand Pose Estimation from Single Depth Image Based on Hand Morphological Topology

no code implementations12 Nov 2019 Weiguo Zhou, Xin Jiang, Chen Chen, Sijia Mei, Yun-hui Liu

In this paper, we propose a method that takes advantage of human hand morphological topology (HMT) structure to improve the pose estimation performance.

Robotics Human-Computer Interaction

Approximating Trajectory Constraints with Machine Learning -- Microgrid Islanding with Frequency Constraints

no code implementations16 Jan 2020 Yichen Zhang, Chen Chen, Guodong Liu, Tianqi Hong, Feng Qiu

In this paper, we introduce a deep learning aided constraint encoding method to tackle the frequency-constraint microgrid scheduling problem.

BIG-bench Machine Learning Scheduling

Video Anomaly Detection for Smart Surveillance

no code implementations1 Apr 2020 Sijie Zhu, Chen Chen, Waqas Sultani

Temporal localization (i. e. indicating the start and end frames of the anomaly event in a video) is referred to as frame-level detection.

Anomaly Detection Temporal Localization +1

Enhancing Review Comprehension with Domain-Specific Commonsense

no code implementations6 Apr 2020 Aaron Traylor, Chen Chen, Behzad Golshan, Xiaolan Wang, Yuliang Li, Yoshihiko Suhara, Jinfeng Li, Cagatay Demiralp, Wang-Chiew Tan

In this paper, we introduce xSense, an effective system for review comprehension using domain-specific commonsense knowledge bases (xSense KBs).

Aspect Extraction Knowledge Distillation +3

Action recognition in real-world videos

no code implementations22 Apr 2020 Waqas Sultani, Qazi Ammar Arshad, Chen Chen

Temporal localization (i. e. indicating the start and end frames of the action in a video) is referred to as frame-level detection.

Action Recognition Temporal Action Localization +1

PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution

no code implementations1 May 2020 Hao Dou, Chen Chen, Xiyuan Hu, Zuxing Xuan, Zhisen Hu, Silong Peng

Generative Adversarial Networks (GAN) have been employed for face super resolution but they bring distorted facial details easily and still have weakness on recovering realistic texture.

Super-Resolution

CP-NAS: Child-Parent Neural Architecture Search for Binary Neural Networks

no code implementations30 Apr 2020 Li'an Zhuo, Baochang Zhang, Hanlin Chen, Linlin Yang, Chen Chen, Yanjun Zhu, David Doermann

To this end, a Child-Parent (CP) model is introduced to a differentiable NAS to search the binarized architecture (Child) under the supervision of a full-precision model (Parent).

Neural Architecture Search

Recognizing Exercises and Counting Repetitions in Real Time

no code implementations7 May 2020 Talal Alatiah, Chen Chen

Artificial intelligence technology has made its way absolutely necessary in a variety of industries including the fitness industry.

BIG-bench Machine Learning Pose Estimation

Revisiting Street-to-Aerial View Image Geo-localization and Orientation Estimation

no code implementations23 May 2020 Sijie Zhu, Taojiannan Yang, Chen Chen

Street-to-aerial image geo-localization, which matches a query street-view image to the GPS-tagged aerial images in a reference set, has attracted increasing attention recently.

Metric Learning

Twitter discussions and emotions about COVID-19 pandemic: a machine learning approach

no code implementations26 May 2020 Jia Xue, Junxiang Chen, Ran Hu, Chen Chen, Chengda Zheng, Xiaoqian Liu, Tingshao Zhu

Across all identified topics, the dominant sentiments for the spread of coronavirus are anticipation that measures that can be taken, followed by a mixed feeling of trust, anger, and fear for different topics.

BIG-bench Machine Learning

Robust Federated Recommendation System

no code implementations15 Jun 2020 Chen Chen, Jingfeng Zhang, Anthony K. H. Tung, Mohan Kankanhalli, Gang Chen

We argue that the key to Byzantine detection is monitoring of gradients of the model parameters of clients.

Recommendation Systems

Deep Generative Model-based Quality Control for Cardiac MRI Segmentation

no code implementations23 Jun 2020 Shuo Wang, Giacomo Tarroni, Chen Qin, Yuanhan Mo, Chengliang Dai, Chen Chen, Ben Glocker, Yike Guo, Daniel Rueckert, Wenjia Bai

Our approach provides a real-time and model-agnostic quality control for cardiac MRI segmentation, which has the potential to be integrated into clinical image analysis workflows.

Image Segmentation MRI segmentation +2

Go Wide, Then Narrow: Efficient Training of Deep Thin Networks

no code implementations ICML 2020 Denny Zhou, Mao Ye, Chen Chen, Tianjian Meng, Mingxing Tan, Xiaodan Song, Quoc Le, Qiang Liu, Dale Schuurmans

This is achieved by layerwise imitation, that is, forcing the thin network to mimic the intermediate outputs of the wide network from layer to layer.

Computational Efficiency Model Compression

Stochastic Adaptive Line Search for Differentially Private Optimization

no code implementations18 Aug 2020 Chen Chen, Jaewoo Lee

In this paper, we introduce a stochastic variant of classic backtracking line search algorithm that satisfies R\'enyi differential privacy.

LEMMA

On the Transformer Growth for Progressive BERT Training

no code implementations NAACL 2021 Xiaotao Gu, Liyuan Liu, Hongkun Yu, Jing Li, Chen Chen, Jiawei Han

Due to the excessive cost of large-scale language model pre-training, considerable efforts have been made to train BERT progressively -- start from an inferior but low-cost model and gradually grow the model to increase the computational complexity.

Language Modelling

Nonvolatile electric control of exciton complex in monolayer MoSe$_2$ with two dimensional ferroelectric CuInP$_2$S$_6$

no code implementations10 Nov 2020 Xiaoyu Mao, Jun Fu, Chen Chen, Yue Li, Heng Liu, Ming Gong, Hualing Zeng

With the saturated ferroelectric polarization of CIPS, electron-doped or hole-doped MoSe$_2$ is realized in a single device with a large carrier density tunability up to $5\times 10^{12}$cm$^{-2}$.

Materials Science

Robust Synthesis of Wind Turbine Generators to Support Microgrid Frequency Considering Linearization-Induced Uncertainty

no code implementations5 Mar 2020 Yichen Zhang, Chen Chen, Tianqi Hong, Bai Cui, Bo Chen, Feng Qiu

The capability to switch between grid-connected and islanded modes has promoted adoption of microgrid technology for powering remote locations.

Scheduling

A3D: Adaptive 3D Networks for Video Action Recognition

no code implementations24 Nov 2020 Sijie Zhu, Taojiannan Yang, Matias Mendieta, Chen Chen

Even under the same computational constraints, the performance of our adaptive networks can be significantly boosted over the baseline counterparts by the mutual training along three dimensions.

Action Recognition Temporal Action Localization

NOMA for Energy-Efficient LiFi-Enabled Bidirectional IoT Communication

no code implementations20 May 2020 Chen Chen, Shu Fu, Xin Jian, Min Liu, Xiong Deng, Zhiguo Ding

In order to improve the energy efficiency (EE) of the bidirectional LiFi-IoT system, non-orthogonal multiple access (NOMA) with a quality-of-service (QoS)-guaranteed optimal power allocation (OPA) strategy is applied to maximize the EE of the system.

Privacy and Robustness in Federated Learning: Attacks and Defenses

no code implementations7 Dec 2020 Lingjuan Lyu, Han Yu, Xingjun Ma, Chen Chen, Lichao Sun, Jun Zhao, Qiang Yang, Philip S. Yu

Besides training powerful global models, it is of paramount importance to design FL systems that have privacy guarantees and are resistant to different types of adversaries.

Federated Learning Privacy Preserving

Towards a category-extended object detector with limited data

no code implementations28 Dec 2020 Bowen Zhao, Chen Chen, Xi Xiao, Shutao Xia

Object detectors are typically learned on fully-annotated training data with fixed predefined categories.

Object

A Transferable Anti-Forensic Attack on Forensic CNNs Using A Generative Adversarial Network

no code implementations23 Jan 2021 Xinwei Zhao, Chen Chen, Matthew C. Stamm

In this paper, we propose a new anti-forensic attack framework designed to remove forensic traces left by a variety of manipulation operations.

Generative Adversarial Network

Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation

no code implementations30 Jan 2021 Chengli Peng, Jiayi Ma, Chen Chen, Xiaojie Guo

To verify the efficiency of the proposed bilateral attention decoder, we adopt a lightweight network as the backbone and compare our proposed method with other state-of-the-art real-time semantic segmentation methods on the Cityscapes and Camvid datasets.

Real-Time Semantic Segmentation Segmentation

Guided Interpolation for Adversarial Training

no code implementations15 Feb 2021 Chen Chen, Jingfeng Zhang, Xilie Xu, Tianlei Hu, Gang Niu, Gang Chen, Masashi Sugiyama

To enhance adversarial robustness, adversarial training learns deep neural networks on the adversarial variants generated by their natural data.

Adversarial Robustness

A Dataset and Benchmark for Malaria Life-Cycle Classification in Thin Blood Smear Images

no code implementations17 Feb 2021 Qazi Ammar Arshad, Mohsen Ali, Saeed-Ul Hassan, Chen Chen, Ayisha Imran, Ghulam Rasul, Waqas Sultani

Malaria microscopy, microscopic examination of stained blood slides to detect parasite Plasmodium, is considered to be a gold-standard for detecting life-threatening disease malaria.

General Classification

Addressing Action Oscillations through Learning Policy Inertia

no code implementations3 Mar 2021 Chen Chen, Hongyao Tang, Jianye Hao, Wulong Liu, Zhaopeng Meng

We propose Nested Policy Iteration as a general training algorithm for PIC-augmented policy which ensures monotonically non-decreasing updates under some mild conditions.

Atari Games Autonomous Driving +1

Insta-RS: Instance-wise Randomized Smoothing for Improved Robustness and Accuracy

no code implementations7 Mar 2021 Chen Chen, Kezhi Kong, Peihong Yu, Juan Luque, Tom Goldstein, Furong Huang

Randomized smoothing (RS) is an effective and scalable technique for constructing neural network classifiers that are certifiably robust to adversarial perturbations.

Optimal Scheduling of Integrated Demand Response-Enabled Integrated Energy Systems with Uncertain Renewable Generations: A Stackelberg Game Approach

no code implementations8 Mar 2021 Yang Li, Chunling Wang, Guoqing Li, Chen Chen

In order to balance the interests of integrated energy operator (IEO) and users, a novel Stackelberg game-based optimization framework is proposed for the optimal scheduling of integrated demand response (IDR)-enabled integrated energy systems with uncertain renewable generations, where the IEO acts as the leader who pursues the maximization of his profits by setting energy prices, while the users are the follower who adjusts energy consumption plans to minimize their energy costs.

Scheduling

Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs

no code implementations23 May 2021 Chen Chen, Xuanli He, Lingjuan Lyu, Fangzhao Wu

In this work, we bridge this gap by first presenting an effective model extraction attack, where the adversary can practically steal a BERT-based API (the target/victim model) by only querying a limited number of queries.

Attribute Inference Attack +4

Fairness-Aware Unsupervised Feature Selection

no code implementations4 Jun 2021 Xiaoying Xing, Hongfu Liu, Chen Chen, Jundong Li

Feature selection is a prevalent data preprocessing paradigm for various learning tasks.

Fairness feature selection

Energy Aligning for Biased Models

no code implementations7 Jun 2021 Bowen Zhao, Chen Chen, Qi Ju, Shutao Xia

Training on class-imbalanced data usually results in biased models that tend to predict samples into the majority classes, which is a common and notorious problem.

Class Incremental Learning Incremental Learning

Deep Learning-Aided OFDM-Based Generalized Optical Quadrature Spatial Modulation

no code implementations24 Jun 2021 Chen Chen, Lin Zeng, Xin Zhong, Shu Fu, Min Liu, Pengfei Du

In this paper, we propose an orthogonal frequency division multiplexing (OFDM)-based generalized optical quadrature spatial modulation (GOQSM) technique for multiple-input multiple-output optical wireless communication (MIMO-OWC) systems.

Joint Motion Correction and Super Resolution for Cardiac Segmentation via Latent Optimisation

no code implementations8 Jul 2021 Shuo Wang, Chen Qin, Nicolo Savioli, Chen Chen, Declan O'Regan, Stuart Cook, Yike Guo, Daniel Rueckert, Wenjia Bai

In cardiac magnetic resonance (CMR) imaging, a 3D high-resolution segmentation of the heart is essential for detailed description of its anatomical structures.

Anatomy Cardiac Segmentation +2

A Novel Attribute Reconstruction Attack in Federated Learning

no code implementations16 Aug 2021 Lingjuan Lyu, Chen Chen

We perform the first systematic evaluation of attribute reconstruction attack (ARA) launched by the malicious server in the FL system, and empirically demonstrate that the shared epoch-averaged local model gradients can reveal sensitive attributes of local training data of any victim participant.

Attribute Federated Learning +1

CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation

no code implementations24 Aug 2021 Xidong Feng, Chen Chen, Dong Li, Mengchen Zhao, Jianye Hao, Jun Wang

Meta learning, especially gradient based one, can be adopted to tackle this problem by learning initial parameters of the model and thus allowing fast adaptation to a specific task from limited data examples.

Computational Efficiency Meta-Learning +1

Uncertainty-Aware Training for Cardiac Resynchronisation Therapy Response Prediction

no code implementations22 Sep 2021 Tareen Dawood, Chen Chen, Robin Andlauer, Baldeep S. Sidhu, Bram Ruijsink, Justin Gould, Bradley Porter, Mark Elliott, Vishal Mehta, C. Aldo Rinaldi, Esther Puyol-Antón, Reza Razavi, Andrew P. King

Evaluation of predictive deep learning (DL) models beyond conventional performance metrics has become increasingly important for applications in sensitive environments like healthcare.

DeepGOMIMO: Deep Learning-Aided Generalized Optical MIMO with CSI-Free Blind Detection

no code implementations8 Oct 2021 Xin Zhong, Chen Chen, Shu Fu, Zhihong Zeng, Min Liu

Generalized optical multiple-input multiple-output (GOMIMO) techniques have been recently shown to be promising for high-speed optical wireless communication (OWC) systems.

Learning Pseudometric-based Action Representations for Offline Reinforcement Learning

no code implementations29 Sep 2021 Pengjie Gu, Mengchen Zhao, Chen Chen, Dong Li, Jianye Hao, Bo An

Offline reinforcement learning is a promising approach for practical applications since it does not require interactions with real-world environments.

Offline RL Recommendation Systems +4

Label-Occurrence-Balanced Mixup for Long-tailed Recognition

no code implementations11 Oct 2021 Shaoyu Zhang, Chen Chen, Xiujuan Zhang, Silong Peng

When applying mixup to long-tailed data, a label suppression issue arises, where the frequency of label occurrence for each class is imbalanced and most of the new examples will be completely or partially assigned with head labels.

Data Augmentation

Sim-to-Real Transfer in Multi-agent Reinforcement Networking for Federated Edge Computing

no code implementations18 Oct 2021 Pinyarash Pinyoanuntapong, Tagore Pothuneedi, Ravikumar Balakrishnan, Minwoo Lee, Chen Chen, Pu Wang

Federated Learning (FL) over wireless multi-hop edge computing networks, i. e., multi-hop FL, is a cost-effective distributed on-device deep learning paradigm.

Edge-computing Federated Learning +3

CycleFlow: Purify Information Factors by Cycle Loss

no code implementations18 Oct 2021 Haoran Sun, Chen Chen, Lantian Li, Dong Wang

SpeechFlow is a powerful factorization model based on information bottleneck (IB), and its effectiveness has been reported by several studies.

Voice Conversion

Self-learned Intelligence for Integrated Decision and Control of Automated Vehicles at Signalized Intersections

no code implementations24 Oct 2021 Yangang Ren, Jianhua Jiang, Dongjie Yu, Shengbo Eben Li, Jingliang Duan, Chen Chen, Keqiang Li

This paper develops the dynamic permutation state representation in the framework of integrated decision and control (IDC) to handle signalized intersections with mixed traffic flows.

Autonomous Driving Decision Making

Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines

no code implementations18 Nov 2021 Xuejing Zheng, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo

In this paper, we propose Lifelong reinforcement learning with Sequential linear temporal logic formulas and Reward Machines (LSRM), which enables an agent to leverage previously learned knowledge to fasten learning of logically specified tasks.

reinforcement-learning Reinforcement Learning (RL) +1

EdgeML: Towards Network-Accelerated Federated Learning over Wireless Edge

no code implementations14 Oct 2021 Pinyarash Pinyoanuntapong, Prabhu Janakaraj, Ravikumar Balakrishnan, Minwoo Lee, Chen Chen, Pu Wang

To solve such MDP, multi-agent reinforcement learning (MA-RL) algorithms along with domain-specific action space refining schemes are developed, which online learn the delay-minimum forwarding paths to minimize the model exchange latency between the edge devices (i. e., workers) and the remote server.

Edge-computing Federated Learning +1

Boost Distribution System Restoration with Emergency Communication Vehicles Considering Cyber-Physical Interdependence

no code implementations19 Nov 2021 Zhigang Ye, Chen Chen, Ruihuan Liu, Kai Wu, Zhaohong Bie, Guannan Lou, Wei Gu, Yubo Yuan

Enhancing restoration capabilities of distribution systems is one of the main strategies for resilient power systems to cope with extreme events.

A Deep-Learning Intelligent System Incorporating Data Augmentation for Short-Term Voltage Stability Assessment of Power Systems

no code implementations5 Dec 2021 Yang Li, Meng Zhang, Chen Chen

Facing the difficulty of expensive and trivial data collection and annotation, how to make a deep learning-based short-term voltage stability assessment (STVSA) model work well on a small training dataset is a challenging and urgent problem.

Data Augmentation valid

Hierarchical Stochastic Scheduling of Multi-Community Integrated Energy Systems in Uncertain Environments via Stackelberg Game

no code implementations14 Dec 2021 Yang Li, Bin Wang, Zhen Yang, Jiazheng Li, Chen Chen

An operating entity utilizing community-integrated energy systems with a large number of small-scale distributed energy sources can easily trade with existing distribution markets.

energy management Generative Adversarial Network +2

Creativity of AI: Hierarchical Planning Model Learning for Facilitating Deep Reinforcement Learning

no code implementations18 Dec 2021 Hankz Hankui Zhuo, Shuting Deng, Mu Jin, Zhihao Ma, Kebing Jin, Chen Chen, Chao Yu

Despite of achieving great success in real-world applications, Deep Reinforcement Learning (DRL) is still suffering from three critical issues, i. e., data efficiency, lack of the interpretability and transferability.

Montezuma's Revenge reinforcement-learning +1

Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads

no code implementations16 Feb 2022 Dharma Shukla, Muthian Sivathanu, Srinidhi Viswanatha, Bhargav Gulavani, Rimma Nehme, Amey Agrawal, Chen Chen, Nipun Kwatra, Ramachandran Ramjee, Pankaj Sharma, Atul Katiyar, Vipul Modi, Vaibhav Sharma, Abhishek Singh, Shreshth Singhal, Kaustubh Welankar, Lu Xun, Ravi Anupindi, Karthik Elangovan, Hasibur Rahman, Zhou Lin, Rahul Seetharaman, Cheng Xu, Eddie Ailijiang, Suresh Krishnappa, Mark Russinovich

At the heart of Singularity is a novel, workload-aware scheduler that can transparently preempt and elastically scale deep learning workloads to drive high utilization without impacting their correctness or performance, across a global fleet of AI accelerators (e. g., GPUs, FPGAs).

Scheduling

Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning

no code implementations29 Mar 2022 Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng

Automated Audio captioning (AAC) is a cross-modal task that generates natural language to describe the content of input audio.

Audio captioning Contrastive Learning

Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data

no code implementations29 Mar 2022 Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng

Noise-robust speech recognition systems require large amounts of training data including noisy speech data and corresponding transcripts to achieve state-of-the-art performances in face of various practical environments.

Generative Adversarial Network Robust Speech Recognition +1

Resilient Distribution System Restoration with Communication Recovery by Drone Small Cells

no code implementations31 Mar 2022 Haochen Zhang, Chen Chen, Shunbo Lei, Zhaohong Bie

Distribution system (DS) restoration after natural disasters often faces the challenge of communication failures to feeder automation (FA) facilities, resulting in prolonged load pick-up process.

GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing

no code implementations31 Mar 2022 Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen

To move a step further, this paper proposes GALA (Geometry-and-Lighting-Aware), a generic foreground object search method with discriminative modeling on geometry and lighting compatibility for open-world image compositing.

Object

Primal-dual Estimator Learning: an Offline Constrained Moving Horizon Estimation Method with Feasibility and Near-optimality Guarantees

no code implementations6 Apr 2022 Wenhan Cao, Jingliang Duan, Shengbo Eben Li, Chen Chen, Chang Liu, Yu Wang

Both the primal and dual estimators are learned from data using supervised learning techniques, and the explicit sample size is provided, which enables us to guarantee the quality of each learned estimator in terms of feasibility and optimality.

Self-critical Sequence Training for Automatic Speech Recognition

no code implementations13 Apr 2022 Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng

Although automatic speech recognition (ASR) task has gained remarkable success by sequence-to-sequence models, there are two main mismatches between its training and testing that might lead to performance degradation: 1) The typically used cross-entropy criterion aims to maximize log-likelihood of the training data, while the performance is evaluated by word error rate (WER), not log-likelihood; 2) The teacher-forcing method leads to the dependence on ground truth during training, which means that model has never been exposed to its own prediction before testing.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Rethinking Reinforcement Learning based Logic Synthesis

no code implementations16 May 2022 Chao Wang, Chen Chen, Dong Li, Bin Wang

Recently, reinforcement learning has been used to address logic synthesis by formulating the operator sequence optimization problem as a Markov decision process.

reinforcement-learning Reinforcement Learning (RL)

Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation

no code implementations14 Jun 2022 Wenxuan Wang, Chen Chen, Jing Wang, Sen Zha, Yan Zhang, Jiangyun Li

For 3D medical image (e. g. CT and MRI) segmentation, the difficulty of segmenting each slice in a clinical case varies greatly.

Brain Tumor Segmentation Image Segmentation +5

BANet: Motion Forecasting with Boundary Aware Network

no code implementations16 Jun 2022 Chen Zhang, Honglin Sun, Chen Chen, Yandong Guo

We propose a motion forecasting model called BANet, which means Boundary-Aware Network, and it is a variant of LaneGCN.

Motion Forecasting

Federated Graph Machine Learning: A Survey of Concepts, Techniques, and Applications

no code implementations24 Jul 2022 Xingbo Fu, Binchi Zhang, Yushun Dong, Chen Chen, Jundong Li

Federated Graph Machine Learning (FGML) is a promising solution to tackle this challenge by training graph machine learning models in a federated manner.

BIG-bench Machine Learning

RAIN: RegulArization on Input and Network for Black-Box Domain Adaptation

no code implementations22 Aug 2022 Qucheng Peng, Zhengming Ding, Lingjuan Lyu, Lichao Sun, Chen Chen

For the input-level, we design a new data augmentation technique as Phase MixUp, which highlights task-relevant objects in the interpolations, thus enhancing input-level regularization and class consistency for target models.

Data Augmentation Self-Knowledge Distillation +1

Binary Representation via Jointly Personalized Sparse Hashing

1 code implementation31 Aug 2022 Xiaoqin Wang, Chen Chen, Rushi Lan, Licheng Liu, Zhenbing Liu, Huiyu Zhou, Xiaonan Luo

Different personalized subspaces are constructed to reflect category-specific attributes for different clusters, adaptively mapping instances within the same cluster to the same Hamming space.

Representation Learning

An Analysis of Deep Reinforcement Learning Agents for Text-based Games

no code implementations9 Sep 2022 Chen Chen, Yue Dai, Josiah Poon, Caren Han

Text-based games(TBG) are complex environments which allow users or computer agents to make textual interactions and achieve game goals. In TBG agent design and training process, balancing the efficiency and performance of the agent models is a major challenge.

reinforcement-learning Reinforcement Learning (RL) +1

Conquering the Communication Constraints to Enable Large Pre-Trained Models in Federated Learning

no code implementations4 Oct 2022 Guangyu Sun, Umar Khalid, Matias Mendieta, Taojiannan Yang, Chen Chen

Recently, the use of small pre-trained models has been shown effective in federated learning optimization and improving convergence.

Federated Learning

Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model

no code implementations8 Oct 2022 Zeyu Gao, Yao Mu, Ruoyan Shen, Chen Chen, Yangang Ren, Jianyu Chen, Shengbo Eben Li, Ping Luo, YanFeng Lu

End-to-end autonomous driving provides a feasible way to automatically maximize overall driving system performance by directly mapping the raw pixels from a front-facing camera to control signals.

Autonomous Driving

Rule Based Event Extraction for Artificial Social Intelligence

no code implementations PANDL (COLING) 2022 Remo Nitschke, Yuwei Wang, Chen Chen, Adarsh Pyarelal, Rebecca Sharp

Natural language (as opposed to structured communication modes such as Morse code) is by far the most common mode of communication between humans, and can thus provide significant insight into both individual mental states and interpersonal dynamics.

Event Extraction

End-to-End Context-Aided Unicity Matching for Person Re-identification

no code implementations20 Oct 2022 Min Cao, Cong Ding, Chen Chen, Junchi Yan, Qi Tian

Based on a natural assumption that images belonging to the same person identity should not match with images belonging to multiple different person identities across views, called the unicity of person matching on the identity level, we propose an end-to-end person unicity matching architecture for learning and refining the person matching relations.

Graph Matching Person Re-Identification

Language-Assisted Deep Learning for Autistic Behaviors Recognition

no code implementations17 Nov 2022 Andong Deng, Taojiannan Yang, Chen Chen, Qian Chen, Leslie Neely, Sakiko Oyama

In such cases, automatic recognition systems based on computer vision and machine learning (in particular deep learning) technology can alleviate this issue to a large extent.

Action Recognition Multimodal Deep Learning +1

State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning

no code implementations28 Nov 2022 Chen Chen, Hongyao Tang, Yi Ma, Chao Wang, Qianli Shen, Dong Li, Jianye Hao

The key idea of SA-PP is leveraging discounted stationary state distribution ratios between the learning policy and the offline dataset to modulate the degree of behavior regularization in a state-wise manner, so that pessimism can be implemented in a more appropriate way.

Offline RL Q-Learning +2

When Do Curricula Work in Federated Learning?

no code implementations ICCV 2023 Saeed Vahidian, Sreevatsank Kadaveru, Woonjoon Baek, Weijia Wang, Vyacheslav Kungurtsev, Chen Chen, Mubarak Shah, Bill Lin

Specifically, we aim to investigate how ordered learning principles can contribute to alleviating the heterogeneity effects in FL.

Federated Learning

Machine Learning-Based Secret Key Generation for IRS-assisted Multi-antenna Systems

no code implementations19 Jan 2023 Chen Chen, Junqing Zhang, Tianyu Lu, Magnus Sandell, Liquan Chen

Different from most previous works that adopt the iterative optimization to solve the problem, the proposed DNN based algorithm directly obtains the BS precoding and IRS phase shifts as the output of the DNN.

The Exploration of Knowledge-Preserving Prompts for Document Summarisation

no code implementations27 Jan 2023 Chen Chen, Wei Emma Zhang, Alireza Seyed Shakeri, Makhmoor Fiza

Despite the great development of document summarisation techniques nowadays, factual inconsistencies between the generated summaries and the original texts still occur from time to time.

Document Summarization

DELTA: degradation-free fully test-time adaptation

no code implementations30 Jan 2023 Bowen Zhao, Chen Chen, Shu-Tao Xia

However, we find that two unfavorable defects are concealed in the prevalent adaptation methodologies like test-time batch normalization (BN) and self-learning.

Self-Learning Test-time Adaptation

Filtering Context Mitigates Scarcity and Selection Bias in Political Ideology Prediction

no code implementations1 Feb 2023 Chen Chen, Dylan Walker, Venkatesh Saligrama

We propose a novel supervised learning approach for political ideology prediction (PIP) that is capable of predicting out-of-distribution inputs.

Selection bias

Delving into the Adversarial Robustness of Federated Learning

no code implementations19 Feb 2023 Jie Zhang, Bo Li, Chen Chen, Lingjuan Lyu, Shuang Wu, Shouhong Ding, Chao Wu

In this work, we propose a novel algorithm called Decision Boundary based Federated Adversarial Training (DBFAT), which consists of two components (local re-weighting and global regularization) to improve both accuracy and robustness of FL systems.

Adversarial Robustness Federated Learning

Delving into Identify-Emphasize Paradigm for Combating Unknown Bias

no code implementations22 Feb 2023 Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia

For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples.

Unsupervised Noise adaptation using Data Simulation

no code implementations23 Feb 2023 Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng

Deep neural network based speech enhancement approaches aim to learn a noisy-to-clean transformation using a supervised learning paradigm.

Domain Adaptation Generative Adversarial Network +1

Metric-oriented Speech Enhancement using Diffusion Probabilistic Model

no code implementations23 Feb 2023 Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng

Deep neural network based speech enhancement technique focuses on learning a noisy-to-clean transformation supervised by paired training data.

Speech Enhancement

A Pathway Towards Responsible AI Generated Content

no code implementations2 Mar 2023 Chen Chen, Jie Fu, Lingjuan Lyu

AI Generated Content (AIGC) has received tremendous attention within the past few years, with content generated in the format of image, text, audio, video, etc.

Misinformation

Highly Efficient Architecture of NewHope-NIST on FPGA using Low-Complexity NTT/INTT

no code implementations IACR Transactions on Cryptographic Hardware and Embedded Systems 2020 Neng Zhang, Bohan Yang, Chen Chen, Shouyi Yin, Shaojun Wei and Leibo Liu*

First, both the pre-processing of NTT and the post-processing of INTT are merged into the fast Fourier transform (FFT) algorithm, which reduces N and 2N modular multiplications for N-point NTT and INTT, respectively.

DiffMesh: A Motion-aware Diffusion-like Framework for Human Mesh Recovery from Videos

no code implementations23 Mar 2023 Ce Zheng, Xianpeng Liu, Mengyuan Liu, Tianfu Wu, Guo-Jun Qi, Chen Chen

While image-based HMR methods have achieved impressive results, they often struggle to recover humans in dynamic scenarios, leading to temporal inconsistencies and non-smooth 3D motion predictions due to the absence of human motion.

3D Human Pose Estimation Human Mesh Recovery

Towards Adversarially Robust Continual Learning

no code implementations31 Mar 2023 Tao Bai, Chen Chen, Lingjuan Lyu, Jun Zhao, Bihan Wen

Recent studies show that models trained by continual learning can achieve the comparable performances as the standard supervised learning and the learning flexibility of continual learning models enables their wide applications in the real world.

Adversarial Robustness Continual Learning

Graph-Guided MLP-Mixer for Skeleton-Based Human Motion Prediction

no code implementations7 Apr 2023 Xinshun Wang, Qiongjie Cui, Chen Chen, Shen Zhao, Mengyuan Liu

In recent years, Graph Convolutional Networks (GCNs) have been widely used in human motion prediction, but their performance remains unsatisfactory.

Human motion prediction Human Pose Forecasting +1

$R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition

no code implementations6 Apr 2023 Sijie Zhu, Linjie Yang, Chen Chen, Mubarak Shah, Xiaohui Shen, Heng Wang

Visual Place Recognition (VPR) estimates the location of query images by matching them with images in a reference database.

Feature Correlation Retrieval +1

TopNet: Transformer-based Object Placement Network for Image Compositing

no code implementations CVPR 2023 Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen

Given a background image and a segmented object, the goal is to train a model to predict plausible placements (location and scale) of the object for compositing.

Object

Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR

no code implementations11 Apr 2023 Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng

Second, during finetuning we propose a Transformer-based code predictor to accurately predict clean codes by modeling the global dependency of input noisy representations, which enables discovery and restoration of high-quality clean representations with reduced distortions.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Med-Tuning: Parameter-Efficient Transfer Learning with Fine-Grained Feature Enhancement for Medical Volumetric Segmentation

no code implementations21 Apr 2023 Wenxuan Wang, Jiachen Shen, Chen Chen, Jianbo Jiao, Jing Liu, Yan Zhang, Shanshan Song, Jiangyun Li

In this paper, we present the study on parameter-efficient transfer learning for medical volumetric segmentation and propose a new framework named Med-Tuning based on intra-stage feature enhancement and inter-stage feature interaction.

Segmentation Transfer Learning

Secret Key Generation for IRS-Assisted Multi-Antenna Systems: A Machine Learning-Based Approach

no code implementations28 Apr 2023 Chen Chen, Junqing Zhang, Tianyu Lu, Magnus Sandell, Liquan Chen

Different from most previous works that adopt iterative optimisation to solve the problem, the proposed DNN-based algorithm directly obtains the BS precoding and IRS phase shifts as the output of the DNN.

Spatial-Temporal Networks for Antibiogram Pattern Prediction

no code implementations2 May 2023 Xingbo Fu, Chen Chen, Yushun Dong, Anil Vullikanti, Eili Klein, Gregory Madden, Jundong Li

In this paper, we propose a novel problem of antibiogram pattern prediction that aims to predict which patterns will appear in the future.

Dynamic Graph Learning With Content-Guided Spatial-Frequency Relation Reasoning for Deepfake Detection

no code implementations CVPR 2023 YuAn Wang, Kun Yu, Chen Chen, Xiyuan Hu, Silong Peng

To address this issue, we propose a Spatial-Frequency Dynamic Graph method to exploit the relation-aware features in spatial and frequency domains via dynamic graph learning.

DeepFake Detection Face Generation +3

Private Image Generation With Dual-Purpose Auxiliary Classifier

no code implementations CVPR 2023 Chen Chen, Daochang Liu, Siqi Ma, Surya Nepal, Chang Xu

However, apart from this standard utility, we identify the "reversed utility" as another crucial aspect, which computes the accuracy on generated data of a classifier trained using real data, dubbed as real2gen accuracy (r2g%).

Image Generation Privacy Preserving

CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation

no code implementations19 May 2023 Wenxuan Wang, Jing Liu, Xingjian He, Yisi Zhang, Chen Chen, Jiachen Shen, Yan Zhang, Jiangyun Li

Referring image segmentation (RIS) is a fundamental vision-language task that intends to segment a desired object from an image based on a given natural language expression.

Image Segmentation Segmentation +1

Text-based Person Search without Parallel Image-Text Data

no code implementations22 May 2023 Yang Bai, Jingyao Wang, Min Cao, Chen Chen, Ziqiang Cao, Liqiang Nie, Min Zhang

Text-based person search (TBPS) aims to retrieve the images of the target person from a large image gallery based on a given natural language description.

Image Captioning Language Modelling +4

DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models

no code implementations23 May 2023 Lijun Li, Li'an Zhuo, Bang Zhang, Liefeng Bo, Chen Chen

Hand mesh reconstruction from the monocular image is a challenging task due to its depth ambiguity and severe occlusion, there remains a non-unique mapping between the monocular image and hand mesh.

Denoising

CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition

no code implementations25 May 2023 Lantian Li, Xiaolou Li, Haoyu Jiang, Chen Chen, Ruihai Hou, Dong Wang

A comprehensive study was conducted to compare CN-Celeb-AV with two popular public AVPR benchmark datasets, and the results demonstrated that CN-Celeb-AV is more in line with real-world scenarios and can be regarded as a new benchmark dataset for AVPR research.

Person Recognition

Alteration-free and Model-agnostic Origin Attribution of Generated Images

no code implementations29 May 2023 Zhenting Wang, Chen Chen, Yi Zeng, Lingjuan Lyu, Shiqing Ma

To overcome this problem, we first develop an alteration-free and model-agnostic origin attribution method via input reverse-engineering on image generation models, i. e., inverting the input of a particular model for a specific image.

Image Generation

When Foundation Model Meets Federated Learning: Motivations, Challenges, and Future Directions

no code implementations27 Jun 2023 Weiming Zhuang, Chen Chen, Lingjuan Lyu

The intersection of the Foundation Model (FM) and Federated Learning (FL) provides mutual benefits, presents a unique opportunity to unlock new possibilities in AI research, and address critical challenges in AI and real-world applications.

Federated Learning Privacy Preserving

Pay Attention to the Atlas: Atlas-Guided Test-Time Adaptation Method for Robust 3D Medical Image Segmentation

no code implementations2 Jul 2023 Jingjie Guo, Weitong Zhang, Matthew Sinclair, Daniel Rueckert, Chen Chen

In addition, different from most existing TTA methods which restrict the adaptation to batch normalization blocks in the segmentation network only, we further exploit the use of channel and spatial attention blocks for improved adaptability at test time.

Image Segmentation Medical Image Segmentation +4

Separate-and-Aggregate: A Transformer-based Patch Refinement Model for Knowledge Graph Completion

no code implementations11 Jul 2023 Chen Chen, YuFei Wang, Yang Zhang, Quan Z. Sheng, Kwok-Yan Lam

Previous KGC methods typically represent knowledge graph entities and relations as trainable continuous embeddings and fuse the embeddings of the entity $h$ (or $t$) and relation $r$ into hidden representations of query $(h, r, ?

Inductive Bias Relation

Mystique: Deconstructing SVG Charts for Layout Reuse

no code implementations25 Jul 2023 Chen Chen, Bongshin Lee, Yunhai Wang, Yunjeong Chang, Zhicheng Liu

To facilitate the reuse of existing charts, previous research has examined how to obtain a semantic understanding of a chart by deconstructing its visual representation into reusable components, such as encodings.

Learning Snippet-to-Motion Progression for Skeleton-based Human Motion Prediction

no code implementations26 Jul 2023 Xinshun Wang, Qiongjie Cui, Chen Chen, Shen Zhao, Mengyuan Liu

Existing Graph Convolutional Networks to achieve human motion prediction largely adopt a one-step scheme, which output the prediction straight from history input, failing to exploit human motion patterns.

Human motion prediction motion prediction +1

A Safe DRL Method for Fast Solution of Real-Time Optimal Power Flow

no code implementations7 Aug 2023 Pengfei Wu, Chen Chen, Dexiang Lai, Jian Zhong

Instead of integrating the constraint violation penalty with the reward function, its actor gradients are estimated by a Lagrange advantage function which is derived from two critic systems based on economic reward and violation cost.

GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement

no code implementations18 Aug 2023 Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah

We attribute this deficiency to the lack of ability to extract the geometric layout of visual features and models' overfitting to low-level details.

Attribute Disentanglement

Uncertainty Aware Training to Improve Deep Learning Model Calibration for Classification of Cardiac MR Images

no code implementations29 Aug 2023 Tareen Dawood, Chen Chen, Baldeep S. Sidhua, Bram Ruijsink, Justin Goulda, Bradley Porter, Mark K. Elliott, Vishal Mehta, Christopher A. Rinaldi, Esther Puyol-Anton, Reza Razavi, Andrew P. King

The best-performing model in terms of both classification accuracy and the most common calibration measure, expected calibration error (ECE) was the Confidence Weight method, a novel approach that weights the loss of samples to explicitly penalise confident incorrect predictions.

MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers

no code implementations8 Sep 2023 Sijia Li, Chen Chen, Haonan Lu

In this work, we propose a method with a mixture-of-expert (MOE) controllers to align the text-guided capacity of diffusion models with different kinds of human instructions, enabling our model to handle various open-domain image manipulation tasks with natural language instructions.

Image Generation Image Manipulation

Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges

no code implementations25 Sep 2023 Tongtong Yuan, Xuange Zhang, Kun Liu, Bo Liu, Chen Chen, Jian Jin, Zhenzhen Jiao

Furthermore, we benchmark SOTA models for four multimodal tasks on this newly created dataset, which serve as new baselines for surveillance video-and-language understanding.

Anomaly Detection Dense Video Captioning +1

STAG: Enabling Low Latency and Low Staleness of GNN-based Services with Dynamic Graphs

no code implementations27 Sep 2023 Jiawen Wang, Quan Chen, Deze Zeng, Zhuo Song, Chen Chen, Minyi Guo

With the collaborative serving mechanism, only part of node representations are updated during the update phase, and the final representations are calculated in the inference phase.

Beyond Sharing Weights in Decoupling Feature Learning Network for UAV RGB-Infrared Vehicle Re-Identification

no code implementations12 Oct 2023 Xingyue Liu, Jiahao Qi, Chen Chen, Kangcheng Bin, Ping Zhong

Moreover, to meet cross-modality discrepancy and orientation discrepancy challenges, we present a hybrid weights decoupling network (HWDNet) to learn the shared discriminative orientation-invariant features.

Vehicle Re-Identification

Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

no code implementations15 Oct 2023 Chengwei Qin, Chen Chen, Shafiq Joty

Inspired by the learning paradigm of humans, we propose Dynamic Module Expansion and Adaptation (DMEA), which enables the model to dynamically determine the architecture for acquiring new knowledge based on task correlation and select the most similar previous tasks to facilitate adaptation to new tasks.

Continual Learning Transfer Learning

Adaptive Quantization for Key Generation in Low-Power Wide-Area Networks

no code implementations11 Oct 2023 Chen Chen, Junqing Zhang, Yingying Chen

Physical layer key generation based on reciprocal and random wireless channels has been an attractive solution for securing resource-constrained low-power wide-area networks (LPWANs).

Quantization

Knowledge Editing for Large Language Models: A Survey

no code implementations24 Oct 2023 Song Wang, Yaochen Zhu, Haochen Liu, Zaiyi Zheng, Chen Chen, Jundong Li

Afterward, we provide an innovative taxonomy of KME techniques based on how the new knowledge is introduced into pre-trained LLMs, and investigate existing KME strategies while analyzing key insights, advantages, and limitations of methods from each category.

knowledge editing

Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation

no code implementations28 Oct 2023 Haoran Shen, Yifu Zhang, Wenxuan Wang, Chen Chen, Jing Liu, Shanshan Song, Jiangyun Li

As a pioneering work, a dynamic architecture network for medical volumetric segmentation (i. e. Med-DANet) has achieved a favorable accuracy and efficiency trade-off by dynamically selecting a suitable 2D candidate model from the pre-defined model bank for different slices.

Computational Efficiency MRI segmentation +2

MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval

no code implementations30 Oct 2023 Youbo Lei, Feifei He, Chen Chen, Yingbin Mo, Si Jia Li, Defeng Xie, Haonan Lu

Due to the success of large-scale visual-language pretraining (VLP) models and the widespread use of image-text retrieval in industry areas, it is now critically necessary to reduce the model size and streamline their mobile-device deployment.

Retrieval Text Retrieval

Supported Trust Region Optimization for Offline Reinforcement Learning

no code implementations15 Nov 2023 Yixiu Mao, Hongchang Zhang, Chen Chen, Yi Xu, Xiangyang Ji

Offline reinforcement learning suffers from the out-of-distribution issue and extrapolation error.

reinforcement-learning

Decouple Content and Motion for Conditional Image-to-Video Generation

no code implementations24 Nov 2023 Cuifeng Shen, Yulu Gan, Chen Chen, Xiongwei Zhu, Lele Cheng, Tingting Gao, Jinzhi Wang

The goal of conditional image-to-video (cI2V) generation is to create a believable new video by beginning with the condition, i. e., one image and text. The previous cI2V generation methods conventionally perform in RGB pixel space, with limitations in modeling motion consistency and visual continuity.

Image to Video Generation

LucidDreaming: Controllable Object-Centric 3D Generation

no code implementations30 Nov 2023 Zhaoning Wang, Ming Li, Chen Chen

Nonetheless, achieving precise control over 3D generation continues to be an arduous task, as using text to control often leads to missing objects and imprecise locations.

3D Generation Benchmarking +4

FBChain: A Blockchain-based Federated Learning Model with Efficiency and Secure Communication

no code implementations21 Nov 2023 Yang Li, Chunhe Xia, Wei Liu, Weidong Zhou, Chen Chen, Tianbo Wang

This article proposes Blockchain-based Federated Learning (FBChain) model for federated learning parameter communication to overcome the above two problems.

Federated Learning

IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment

no code implementations10 Dec 2023 Letian Zhang, Ming Li, Chen Chen, Jie Xu

This poses a paradox as the necessary camera pose must be estimated from the entire dataset, even though the data arrives sequentially and future chunks are inaccessible.

Incremental Learning Knowledge Distillation

Free-Editor: Zero-shot Text-driven 3D Scene Editing

no code implementations21 Dec 2023 Nazmul Karim, Umar Khalid, Hasan Iqbal, Jing Hua, Chen Chen

To date, editing 3D scenes requires either re-training the model to adapt to various 3D edited scenes or design-specific methods for each special editing type.

3D scene Editing Style Transfer +1

NID-SLAM: Neural Implicit Representation-based RGB-D SLAM in dynamic environments

no code implementations2 Jan 2024 Ziheng Xu, Jianwei Niu, Qingfeng Li, Tao Ren, Chen Chen

In this paper we present NID-SLAM, which significantly improves the performance of neural SLAM in dynamic environments.

Enhanced Few-Shot Class-Incremental Learning via Ensemble Models

no code implementations14 Jan 2024 Mingli Zhu, Zihao Zhu, Sihong Chen, Chen Chen, Baoyuan Wu

To tackle overfitting challenge, we design a new ensemble model framework cooperated with data augmentation to boost generalization.

Data Augmentation Few-Shot Class-Incremental Learning +2

Dream360: Diverse and Immersive Outdoor Virtual Scene Creation via Transformer-Based 360 Image Outpainting

no code implementations19 Jan 2024 Hao Ai, Zidong Cao, Haonan Lu, Chen Chen, Jian Ma, Pengyuan Zhou, Tae-Kyun Kim, Pan Hui, Lin Wang

To this end, we propose a transformer-based 360 image outpainting framework called Dream360, which can generate diverse, high-fidelity, and high-resolution panoramas from user-selected viewports, considering the spherical properties of 360 images.

Image Outpainting

Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition

no code implementations4 Feb 2024 Mengyuan Liu, Chen Chen, Songtao Wu, Fanyang Meng, Hong Liu

Recognizing interactive actions, including hand-to-hand interaction and human-to-human interaction, has attracted increasing attention for various applications in the field of video analysis and human-robot interaction.

Action Recognition Human Interaction Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.