Search Results for author: Zhe Chen

Found 77 papers, 35 papers with code

Advancing Attack-Resilient Scheduling of Integrated Energy Systems with Demand Response via Deep Reinforcement Learning

no code implementations28 Nov 2023 Yang Li, Wenjie Ma, Yuanzheng Li, Sen Li, Zhe Chen

Simulation results demonstrate that our method is capable of adequately addressing the uncertainties resulting from RES and loads, mitigating the impact of cyber-attacks on the scheduling strategy, and ensuring a stable demand supply for various energy sources.

Scheduling

Breast Cancer classification by adaptive weighted average ensemble of previously trained models

no code implementations22 Nov 2023 Mosab S. M. Farea, Zhe Chen

Our approach is different because it used adaptive average ensemble after training which has increased the performance of evaluation metrics.

Interpretation of the Transformer and Improvement of the Extractor

1 code implementation21 Nov 2023 Zhe Chen

It has been over six years since the Transformer architecture was put forward.

FedSN: A General Federated Learning Framework over LEO Satellite Networks

no code implementations2 Nov 2023 Zheng Lin, Zhe Chen, Zihan Fang, Xianhao Chen, Xiong Wang, Yue Gao

To this end, we propose FedSN as a general FL framework to tackle the above challenges, and fully explore data diversity on LEO satellites.

Federated Learning Image Classification +1

HoloFed: Environment-Adaptive Positioning via Multi-band Reconfigurable Holographic Surfaces and Federated Learning

no code implementations10 Oct 2023 Jingzhi Hu, Zhe Chen, Tianyue Zheng, Robert Schober, Jun Luo

Our simulation results confirm that HoloFed achieves a 57% lower positioning error variance compared to a beam-scanning baseline and can effectively adapt to diverse environments.

Federated Learning Scheduling +1

Pushing Large Language Models to the 6G Edge: Vision, Challenges, and Opportunities

no code implementations28 Sep 2023 Zheng Lin, Guanqiao Qu, Qiyuan Chen, Xianhao Chen, Zhe Chen, Kaibin Huang

In both aspects, considering the inherent resource limitations at the edge, we discuss various cutting-edge techniques, including split learning/inference, parameter-efficient fine-tuning, quantization, and parameter-sharing inference, to facilitate the efficient deployment of LLMs.

Edge-computing Quantization

Traffic Flow Optimisation for Lifelong Multi-Agent Path Finding

no code implementations22 Aug 2023 Zhe Chen, Daniel Harabor, Jiaoyang Li, Peter J. Stuckey

To tackle this issue we propose a new approach for MAPF where agents are guided to their destination by following congestion-avoiding paths.

Multi-Agent Path Finding

OCHID-Fi: Occlusion-Robust Hand Pose Estimation in 3D via RF-Vision

1 code implementation ICCV 2023 Shujie Zhang, Tianyue Zheng, Zhe Chen, Jingzhi Hu, Abdelwahed Khamis, Jiajun Liu, Jun Luo

To overcome the challenge in labeling RF imaging given its human incomprehensible nature, OCHID-Fi employs a cross-modality and cross-domain training process.

3D Pose Estimation Hand Pose Estimation

Attention Is Not All You Need Anymore

2 code implementations15 Aug 2023 Zhe Chen

Experimental results show that replacing the self-attention mechanism with the SHE evidently improves the performance of the Transformer, whereas the simplified versions of the SHE, i. e., the HE, the WE, and the ME, perform close to or better than the self-attention mechanism with less computational and memory complexity.

Text Generation

AVSegFormer: Audio-Visual Segmentation with Transformer

1 code implementation3 Jul 2023 Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu

In this paper, we propose AVSegFormer, a novel framework for AVS tasks that leverages the transformer architecture.

Scene Understanding Segmentation

GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation

no code implementations7 Jun 2023 Kai Chen, Enze Xie, Zhe Chen, Yibo Wang, Lanqing Hong, Zhenguo Li, Dit-yan Yeung

However, the usage of diffusion models to generate the high-quality object detection data remains an underexplored area, where not only image-level perceptual quality but also geometric conditions such as bounding boxes and camera views are essential.

Image Classification Layout-to-Image Generation +2

Graph Propagation Transformer for Graph Representation Learning

1 code implementation19 May 2023 Zhe Chen, Hao Tan, Tao Wang, Tianrun Shen, Tong Lu, Qiuying Peng, Cheng Cheng, Yue Qi

The core insight of our method is to fully consider the information propagation among nodes and edges in a graph when building the attention module in the transformer blocks.

Ranked #2 on Graph Regression on PCQM4M-LSC (Validation MAE metric)

Graph Learning Graph Property Prediction +3

Tracking Progress in Multi-Agent Path Finding

no code implementations15 May 2023 Bojie Shen, Zhe Chen, Muhammad Aamir Cheema, Daniel D. Harabor, Peter J. Stuckey

Multi-Agent Path Finding (MAPF) is an important core problem for many new and emerging industrial applications.

Multi-Agent Path Finding

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

2 code implementations9 May 2023 Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, LiMin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Different from existing interactive systems that rely on pure language, by incorporating pointing instructions, the proposed iGPT significantly improves the efficiency of communication between users and chatbots, as well as the accuracy of chatbots in vision-centric tasks, especially in complicated visual scenarios where the number of objects is greater than 2.

Language Modelling

Noise-Resistant Multimodal Transformer for Emotion Recognition

no code implementations4 May 2023 Yuanyuan Liu, Haoyu Zhang, Yibing Zhan, Zijing Chen, Guanghao Yin, Lin Wei, Zhe Chen

To this end, we present a novel paradigm that attempts to extract noise-resistant features in its pipeline and introduces a noise-aware learning scheme to effectively improve the robustness of multimodal emotion understanding.

Multimodal Emotion Recognition

A Simulation-Augmented Benchmarking Framework for Automatic RSO Streak Detection in Single-Frame Space Images

no code implementations30 Apr 2023 Zhe Chen, Yang Yang, Anne Bettens, Youngho Eun, Xiaofeng Wu

In our framework, by making the best use of the hardware parameters of the sensor that captures real-world space images, we first develop a high-fidelity RSO simulator that can generate various realistic space images.

Benchmarking object-detection +1

Multi-band Reconfigurable Holographic Surface Based ISAC Systems: Design and Optimization

no code implementations28 Mar 2023 Jingzhi Hu, Zhe Chen, Jun Luo

Metamaterial-based reconfigurable holographic surfaces (RHSs) have been proposed as novel cost-efficient antenna arrays, which are promising for improving the positioning and communication performance of integrated sensing and communications (ISAC) systems.

AutoFed: Heterogeneity-Aware Federated Multimodal Learning for Robust Autonomous Driving

no code implementations17 Feb 2023 Tianyue Zheng, Ang Li, Zhe Chen, Hongbo Wang, Jun Luo

Object detection with on-board sensors (e. g., lidar, radar, and camera) play a crucial role in autonomous driving (AD), and these sensors complement each other in modalities.

Autonomous Driving Federated Learning +3

Champion Solution for the WSDM2023 Toloka VQA Challenge

1 code implementation22 Jan 2023 Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu

In this report, we present our champion solution to the WSDM2023 Toloka Visual Question Answering (VQA) Challenge.

Question Answering Visual Grounding +1

LegalRelectra: Mixed-domain Language Modeling for Long-range Legal Text Comprehension

no code implementations16 Dec 2022 Wenyue Hua, Yuchen Zhang, Zhe Chen, Josie Li, Melanie Weber

We show that our model improves over general-domain and single-domain medical and legal language models when processing mixed-domain (personal injury) text.

Language Modelling Reading Comprehension

FPGA-Based In-Vivo Calcium Image Decoding for Closed-Loop Feedback Applications

no code implementations9 Dec 2022 Zhe Chen, Garrett J. Blair, Chengdi Cao, Jim Zhou, Daniel Aharoni, Peyman Golshani, Hugh T. Blair, Jason Cong

Our FPGA implementation enables the real-time calcium image decoding with sub-ms processing latency for closed-loop feedback applications.

Pose-disentangled Contrastive Learning for Self-supervised Facial Representation

1 code implementation CVPR 2023 Yuanyuan Liu, Wenbin Wang, Yibing Zhan, Shaoze Feng, Kejun Liu, Zhe Chen

Self-supervised facial representation has recently attracted increasing attention due to its ability to perform face understanding without relying on large-scale annotated datasets heavily.

Contrastive Learning Data Augmentation +5

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

2 code implementations CVPR 2023 Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao

Compared to the great progress of large-scale vision transformers (ViTs) in recent years, large-scale models based on convolutional neural networks (CNNs) are still in an early state.

 Ranked #1 on Instance Segmentation on COCO test-dev (APS metric, using extra training data)

Classification Image Classification +3

Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection

no code implementations14 Jul 2022 Zhe Chen, Jing Zhang, Yufei Xu, DaCheng Tao

Current object detectors typically have a feature pyramid (FP) module for multi-level feature fusion (MFF) which aims to mitigate the gap between features from different levels and form a comprehensive object representation to achieve better detection performance.

object-detection Object Detection

CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose

1 code implementation CVPR 2023 Xu Zhang, Wen Wang, Zhe Chen, Yufei Xu, Jing Zhang, DaCheng Tao

Motivated by the progress of visual-language research, we propose that pre-trained language models (e. g., CLIP) can facilitate animal pose estimation by providing rich prior knowledge for describing animal keypoints in text.

Animal Pose Estimation Contrastive Learning

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search

1 code implementation19 May 2022 Xiao Wang, Zhe Chen, Bo Jiang, Jin Tang, Bin Luo, DaCheng Tao

To track the target in a video, current visual trackers usually adopt greedy search for target object localization in each frame, that is, the candidate region with the maximum response score will be selected as the tracking result of each frame.

Decision Making Image Captioning +5

Load-Flow Solvability under Security Constraints in DC Distribution Networks

no code implementations7 Mar 2022 Zhe Chen, Cong Wang

We present sufficient conditions for the load-flow solvability under security constraints in DC distribution networks.

Low-rank features based double transformation matrices learning for image classification

no code implementations28 Jan 2022 Yu-Hong Cai, Xiao-Jun Wu, Zhe Chen

However, methods based on this technique ignore the pressure on a single transformation matrix due to the complex information contained in the data.

Classification Image Classification +1

SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection

1 code implementation6 Jan 2022 Chen Chen, Zhe Chen, Jing Zhang, DaCheng Tao

We observe that the prevailing set abstraction design for down-sampling points may maintain too much unimportant background information that can affect feature learning for detecting objects.

3D Object Detection object-detection

Recurrent Glimpse-based Decoder for Detection with Transformer

1 code implementation CVPR 2022 Zhe Chen, Jing Zhang, DaCheng Tao

Then, a glimpse-based decoder is introduced to provide refined detection results based on both the glimpse features and the attention modeling outputs of the previous stage.

 Ranked #1 on Object Detection on COCO (GFlops metric)

Object Detection

Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation

no code implementations1 Dec 2021 Tianyue Zheng, Zhe Chen, Shuya Ding, Chao Cai, Jun Luo

Whereas adversarial training can be useful against specific adversarial perturbations, they have also proven ineffective in generalizing towards attacks deviating from those used for training.

Domain Adaptation

MoRe-Fi: Motion-robust and Fine-grained Respiration Monitoring via Deep-Learning UWB Radar

no code implementations16 Nov 2021 Tianyue Zheng, Zhe Chen, Shujie Zhang, Chao Cai, Jun Luo

Crucial for healthcare and biomedical applications, respiration monitoring often employs wearable sensors in practice, causing inconvenience due to their direct contact with human bodies.

Data Augmentation

RF-Net: a Unified Meta-learning Framework for RF-enabled One-shot Human Activity Recognition

1 code implementation29 Oct 2021 Shuya Ding, Zhe Chen, Tianyue Zheng, Jun Luo

Radio-Frequency (RF) based device-free Human Activity Recognition (HAR) rises as a promising solution for many applications.

Human Activity Recognition Meta-Learning

V2iFi: in-Vehicle Vital Sign Monitoring via Compact RF Sensing

no code implementations28 Oct 2021 Tianyue Zheng, Zhe Chen, Chao Cai, Jun Luo, Xu Zhang

Given the significant amount of time people spend in vehicles, health issues under driving condition have become a major concern.

Heart Rate Variability

Enhancing RF Sensing with Deep Learning: A Layered Approach

no code implementations28 Oct 2021 Tianyue Zheng, Zhe Chen, Shuya Ding, Jun Luo

To better understand this potential, this article takes a layered approach to summarize RF sensing enabled by deep learning.

SiWa: See into Walls via Deep UWB Radar

no code implementations27 Oct 2021 Tianyue Zheng, Zhe Chen, Jun Luo, Lin Ke, Chaoyang Zhao, Yaowen Yang

To this end, we equip SiWa with a deep learning pipeline to parse the rich sensory data.

Dropout Prediction Uncertainty Estimation Using Neuron Activation Strength

no code implementations13 Oct 2021 Haichao Yu, Zhe Chen, Dong Lin, Gil Shamir, Jie Han

Dropout has been commonly used to quantify prediction uncertainty, i. e, the variations of model predictions on a given input example.

Variational Component Decoder for Source Extraction from Nonlinear Mixture

no code implementations29 Sep 2021 Shujie Zhang, Tianyue Zheng, Zhe Chen, Jun Luo, Sinno Pan

In many practical scenarios of signal extraction from a nonlinear mixture, only one (signal) source is intended to be extracted.

EEG Electroencephalogram (EEG) +2

Expression Snippet Transformer for Robust Video-based Facial Expression Recognition

no code implementations17 Sep 2021 Yuanyuan Liu, Wenbin Wang, Chuanxu Feng, Haoyu Zhang, Zhe Chen, Yibing Zhan

To this end, we propose to decompose each video into a series of expression snippets, each of which contains a small number of facial movements, and attempt to augment the Transformer's ability for modeling intra-snippet and inter-snippet visual relations, respectively, obtaining the Expression snippet Transformer (EST).

Dynamic Facial Expression Recognition Facial Expression Recognition +1

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows

2 code implementations11 Aug 2021 Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, YaoWei Wang, Yonghong Tian, Feng Wu

Different from visible cameras which record intensity images frame by frame, the biologically inspired event camera produces a stream of asynchronous and sparse events with much lower latency.

Object Tracking

Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking

1 code implementation30 Mar 2021 Xiao Wang, Zhe Chen, Jin Tang, Bin Luo, YaoWei Wang, Yonghong Tian, Feng Wu

In this paper, we propose to introduce more dynamics by devising a dynamic attention-guided multi-trajectory tracking strategy.

Object Tracking

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

1 code implementation22 Mar 2021 Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo

(1) We divide input image into small patches and adopt TIN, successfully transferring image style with arbitrary high-resolution.

Style Transfer

Symmetry Breaking for k-Robust Multi-Agent Path Finding

no code implementations17 Feb 2021 Zhe Chen, Daniel Harabor, Jiaoyang Li, Peter J. Stuckey

During Multi-Agent Path Finding (MAPF) problems, agents can be delayed by unexpected events.

Multi-Agent Path Finding

Recent Progress in Appearance-based Action Recognition

no code implementations25 Nov 2020 Jack Humphreys, Zhe Chen, DaCheng Tao

Action recognition, which is formulated as a task to identify various human actions in a video, has attracted increasing interest from computer vision researchers due to its importance in various applications.

Action Recognition

CL-MAPF: Multi-Agent Path Finding for Car-Like Robots with Kinematic and Spatiotemporal Constraints

1 code implementation1 Nov 2020 Licheng Wen, Zhen Zhang, Zhe Chen, Xiangrui Zhao, Yong liu

In this paper, we give a mathematical formalization of Multi-Agent Path Finding for Car-Like robots (CL-MAPF) problem.

Robotics Multiagent Systems

Beyond Point Estimate: Inferring Ensemble Prediction Variation from Neuron Activation Strength in Recommender Systems

no code implementations17 Aug 2020 Zhe Chen, Yuyan Wang, Dong Lin, Derek Zhiyuan Cheng, Lichan Hong, Ed H. Chi, Claire Cui

Despite deep neural network (DNN)'s impressive prediction performance in various domains, it is well known now that a set of DNN models trained with the same model specification and the same data can produce very different prediction results.

Model-based Reinforcement Learning Recommendation Systems

Invertible Neural BRDF for Object Inverse Rendering

1 code implementation ECCV 2020 Zhe Chen, Shohei Nobuhara, Ko Nishino

We introduce a novel neural network-based BRDF model and a Bayesian framework for object inverse rendering, i. e., joint estimation of reflectance and natural illumination from a single image of an object of known geometry.

Inverse Rendering

Block Shuffle: A Method for High-resolution Fast Style Transfer with Limited Memory

1 code implementation9 Aug 2020 Weifeng Ma, Zhe Chen, Caoting Ji

This method can act as a plug-in for Fast Style Transfer without any modification to the network architecture.

Image Generation Style Transfer

Model-Free Voltage Regulation of Unbalanced Distribution Network Based on Surrogate Model and Deep Reinforcement Learning

no code implementations24 Jun 2020 Di Cao, Junbo Zhao, Weihao Hu, Fei Ding, Qi Huang, Zhe Chen, Frede Blaabjerg

Accurate knowledge of the distribution system topology and parameters is required to achieve good voltage controls, but this is difficult to obtain in practice.

Decision Making

Condensing Two-stage Detection with Automatic Object Key Part Discovery

1 code implementation10 Jun 2020 Zhe Chen, Jing Zhang, DaCheng Tao

Modern two-stage object detectors generally require excessively large models for their detection heads to achieve high accuracy.

Vocal Bursts Valence Prediction

Twisting operators and centralisers of Lie type groups over local rings

no code implementations3 Jun 2020 Zhe Chen

We extend the classical result asserting that the twisting operator preserves certain Deligne--Lusztig character values for truncated formal power series; along the way we discuss some properties of centralisers.

Representation Theory

Distributed Voltage Regulation of Active Distribution System Based on Enhanced Multi-agent Deep Reinforcement Learning

no code implementations31 May 2020 Di Cao, Junbo Zhao, Weihao Hu, Fei Ding, Qi Huang, Zhe Chen

This paper proposes a data-driven distributed voltage control approach based on the spectrum clustering and the enhanced multi-agent deep reinforcement learning (MADRL) algorithm.

Clustering

TextFuseNet: Scene Text Detection with Richer Fused Features

5 code implementations17 May 2020 Jian Ye, Zhe Chen, Juhua Liu, Bo Du

More specifically, we propose to perceive texts from three levels of feature representations, i. e., character-, word- and global-level, and then introduce a novel text representation fusion technique to help achieve robust arbitrary text detection.

Scene Text Detection Text Detection

Towards High Performance Human Keypoint Detection

1 code implementation3 Feb 2020 Jing Zhang, Zhe Chen, DaCheng Tao

Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance.

Human Detection Keypoint Detection +1

A Shape Transformation-based Dataset Augmentation Framework for Pedestrian Detection

no code implementations15 Dec 2019 Zhe Chen, Wanli Ouyang, Tongliang Liu, DaCheng Tao

Alternatively, to access much more natural-looking pedestrians, we propose to augment pedestrian detection datasets by transforming real pedestrians from the same dataset into different shapes.

Pedestrian Detection

Human Keypoint Detection by Progressive Context Refinement

1 code implementation27 Oct 2019 Jing Zhang, Zhe Chen, DaCheng Tao

Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance of person instances.

Human Detection Keypoint Detection +1

Transition Subspace Learning based Least Squares Regression for Image Classification

no code implementations14 May 2019 Zhe Chen, Xiao-Jun Wu, Josef Kittler

Only learning one projection matrix from original samples to the corresponding binary labels is too strict and will consequentlly lose some intrinsic geometric structures of data.

Classification General Classification +2

Progressive LiDAR Adaptation for Road Detection

1 code implementation2 Apr 2019 Zhe Chen, Jing Zhang, DaCheng Tao

To this end, LiDAR sensor data can be incorporated to improve the visual image-based road detection, because LiDAR data is less susceptible to visual noises.

Non-negative representation based discriminative dictionary learning for face recognition

no code implementations19 Mar 2019 Zhe Chen, Xiao-Jun Wu, Josef Kittler

In this paper, we propose a non-negative representation based discriminative dictionary learning algorithm (NRDL) for multicategory face classification.

Dictionary Learning Face Recognition +1

Fisher Discriminative Least Squares Regression for Image Classification

1 code implementation19 Mar 2019 Zhe Chen, Xiao-Jun Wu, Josef Kittler

On one hand, the Fisher criterion improves the intra-class compactness of the relaxed labels during relaxation learning.

Classification Face Recognition +3

Low-Rank Discriminative Least Squares Regression for Image Classification

no code implementations19 Mar 2019 Zhe Chen, Xiao-Jun Wu, Josef Kittler

To solve above problems, we propose a low-rank discriminative least squares regression model (LRDLSR) for multi-class image classification.

Classification General Classification +2

Pedestrian Attribute Recognition: A Survey

1 code implementation22 Jan 2019 Xiao Wang, Shaofei Zheng, Rui Yang, Aihua Zheng, Zhe Chen, Jin Tang, Bin Luo

We also review some popular network architectures which have been widely applied in the deep learning community.

Multi-Label Learning Multi-Task Learning +1

Context Refinement for Object Detection

no code implementations ECCV 2018 Zhe Chen, Shaoli Huang, DaCheng Tao

Current two-stage object detectors, which consists of a region proposal stage and a refinement stage, may produce unreliable results due to ill-localized proposed regions.

object-detection Object Detection +1

An Experimental Survey on Correlation Filter-based Tracking

no code implementations18 Sep 2015 Zhe Chen, Zhibin Hong, DaCheng Tao

We find that further improvements for correlation filter-based tracking can be made on estimating scales, applying part-based tracking strategy and cooperating with long-term tracking methods.

Visual Object Tracking

MUlti-Store Tracker (MUSTer): A Cognitive Psychology Inspired Approach to Object Tracking

no code implementations CVPR 2015 Zhibin Hong, Zhe Chen, Chaohui Wang, Xue Mei, Danil Prokhorov, DaCheng Tao

Variations in the appearance of a tracked object, such as changes in geometry/photometry, camera viewpoint, illumination, or partial occlusion, pose a major challenge to object tracking.

Object Tracking

Nonparametric Estimation of Band-limited Probability Density Functions

no code implementations20 Mar 2015 Rahul Agarwal, Zhe Chen, Sridevi V. Sarma

In this paper, a nonparametric maximum likelihood (ML) estimator for band-limited (BL) probability density functions (pdfs) is proposed.

Density Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.