1 code implementation • 30 Nov 2023 • Chen Long, Wenxiao Zhang, Zhe Chen, Haiping Wang, YuAn Liu, Zhen Cao, Zhen Dong, Bisheng Yang
The key contributions of SparseDC are two-fold.
no code implementations • 28 Nov 2023 • Yang Li, Wenjie Ma, Yuanzheng Li, Sen Li, Zhe Chen
Simulation results demonstrate that our method is capable of adequately addressing the uncertainties resulting from RES and loads, mitigating the impact of cyber-attacks on the scheduling strategy, and ensuring a stable demand supply for various energy sources.
no code implementations • 22 Nov 2023 • Mosab S. M. Farea, Zhe Chen
Our approach is different because it used adaptive average ensemble after training which has increased the performance of evaluation metrics.
1 code implementation • 21 Nov 2023 • Zhe Chen
It has been over six years since the Transformer architecture was put forward.
no code implementations • 2 Nov 2023 • Zheng Lin, Zhe Chen, Zihan Fang, Xianhao Chen, Xiong Wang, Yue Gao
To this end, we propose FedSN as a general FL framework to tackle the above challenges, and fully explore data diversity on LEO satellites.
no code implementations • 10 Oct 2023 • Jingzhi Hu, Zhe Chen, Tianyue Zheng, Robert Schober, Jun Luo
Our simulation results confirm that HoloFed achieves a 57% lower positioning error variance compared to a beam-scanning baseline and can effectively adapt to diverse environments.
no code implementations • 28 Sep 2023 • Zheng Lin, Guanqiao Qu, Qiyuan Chen, Xianhao Chen, Zhe Chen, Kaibin Huang
In both aspects, considering the inherent resource limitations at the edge, we discuss various cutting-edge techniques, including split learning/inference, parameter-efficient fine-tuning, quantization, and parameter-sharing inference, to facilitate the efficient deployment of LLMs.
no code implementations • 26 Sep 2023 • Hongcheng Liu, Zhe Chen, Hui Li, Pingjie Wang, Yanfeng Wang, Yu Wang
Generating dialogue grounded in videos requires a high level of understanding and reasoning about the visual scenes in the videos.
no code implementations • 22 Aug 2023 • Zhe Chen, Daniel Harabor, Jiaoyang Li, Peter J. Stuckey
To tackle this issue we propose a new approach for MAPF where agents are guided to their destination by following congestion-avoiding paths.
1 code implementation • ICCV 2023 • Shujie Zhang, Tianyue Zheng, Zhe Chen, Jingzhi Hu, Abdelwahed Khamis, Jiajun Liu, Jun Luo
To overcome the challenge in labeling RF imaging given its human incomprehensible nature, OCHID-Fi employs a cross-modality and cross-domain training process.
2 code implementations • 15 Aug 2023 • Zhe Chen
Experimental results show that replacing the self-attention mechanism with the SHE evidently improves the performance of the Transformer, whereas the simplified versions of the SHE, i. e., the HE, the WE, and the ME, perform close to or better than the self-attention mechanism with less computational and memory complexity.
1 code implementation • 3 Aug 2023 • Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao
We present the All-Seeing (AS) project: a large-scale data and model for recognizing and understanding everything in the open world.
1 code implementation • 3 Jul 2023 • Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu
In this paper, we propose AVSegFormer, a novel framework for AVS tasks that leverages the transformer architecture.
no code implementations • 7 Jun 2023 • Kai Chen, Enze Xie, Zhe Chen, Yibo Wang, Lanqing Hong, Zhenguo Li, Dit-yan Yeung
However, the usage of diffusion models to generate the high-quality object detection data remains an underexplored area, where not only image-level perceptual quality but also geometric conditions such as bounding boxes and camera views are essential.
1 code implementation • 19 May 2023 • Zhe Chen, Hao Tan, Tao Wang, Tianrun Shen, Tong Lu, Qiuying Peng, Cheng Cheng, Yue Qi
The core insight of our method is to fully consider the information propagation among nodes and edges in a graph when building the attention module in the transformer blocks.
Ranked #2 on
Graph Regression
on PCQM4M-LSC
(Validation MAE metric)
2 code implementations • NeurIPS 2023 • Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie zhou, Yu Qiao, Jifeng Dai
We hope this model can set a new baseline for generalist vision and language models.
no code implementations • 15 May 2023 • Bojie Shen, Zhe Chen, Muhammad Aamir Cheema, Daniel D. Harabor, Peter J. Stuckey
Multi-Agent Path Finding (MAPF) is an important core problem for many new and emerging industrial applications.
2 code implementations • 9 May 2023 • Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, LiMin Wang, Ping Luo, Jifeng Dai, Yu Qiao
Different from existing interactive systems that rely on pure language, by incorporating pointing instructions, the proposed iGPT significantly improves the efficiency of communication between users and chatbots, as well as the accuracy of chatbots in vision-centric tasks, especially in complicated visual scenarios where the number of objects is greater than 2.
no code implementations • 4 May 2023 • Yuanyuan Liu, Haoyu Zhang, Yibing Zhan, Zijing Chen, Guanghao Yin, Lin Wei, Zhe Chen
To this end, we present a novel paradigm that attempts to extract noise-resistant features in its pipeline and introduces a noise-aware learning scheme to effectively improve the robustness of multimodal emotion understanding.
no code implementations • 30 Apr 2023 • Zhe Chen, Yang Yang, Anne Bettens, Youngho Eun, Xiaofeng Wu
In our framework, by making the best use of the hardware parameters of the sensor that captures real-world space images, we first develop a high-fidelity RSO simulator that can generate various realistic space images.
1 code implementation • ICCV 2023 • Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo
We propose a simple, efficient, yet powerful framework for dense visual predictions based on the conditional diffusion pipeline.
Ranked #2 on
Monocular Depth Estimation
on SUN-RGBD
no code implementations • 28 Mar 2023 • Jingzhi Hu, Zhe Chen, Jun Luo
Metamaterial-based reconfigurable holographic surfaces (RHSs) have been proposed as novel cost-efficient antenna arrays, which are promising for improving the positioning and communication performance of integrated sensing and communications (ISAC) systems.
no code implementations • 17 Feb 2023 • Tianyue Zheng, Ang Li, Zhe Chen, Hongbo Wang, Jun Luo
Object detection with on-board sensors (e. g., lidar, radar, and camera) play a crucial role in autonomous driving (AD), and these sensors complement each other in modalities.
1 code implementation • 22 Jan 2023 • Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu
In this report, we present our champion solution to the WSDM2023 Toloka Visual Question Answering (VQA) Challenge.
no code implementations • 16 Dec 2022 • Wenyue Hua, Yuchen Zhang, Zhe Chen, Josie Li, Melanie Weber
We show that our model improves over general-domain and single-domain medical and legal language models when processing mixed-domain (personal injury) text.
no code implementations • 9 Dec 2022 • Zhe Chen, Garrett J. Blair, Chengdi Cao, Jim Zhou, Daniel Aharoni, Peyman Golshani, Hugh T. Blair, Jason Cong
Our FPGA implementation enables the real-time calcium image decoding with sub-ms processing latency for closed-loop feedback applications.
1 code implementation • CVPR 2023 • Yuanyuan Liu, Wenbin Wang, Yibing Zhan, Shaoze Feng, Kejun Liu, Zhe Chen
Self-supervised facial representation has recently attracted increasing attention due to its ability to perform face understanding without relying on large-scale annotated datasets heavily.
2 code implementations • 17 Nov 2022 • Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei HUANG, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, LiMin Wang, Yu Qiao
In this report, we present our champion solutions to five tracks at Ego4D challenge.
Ranked #1 on
State Change Object Detection
on Ego4D
2 code implementations • CVPR 2023 • Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao
Compared to the great progress of large-scale vision transformers (ViTs) in recent years, large-scale models based on convolutional neural networks (CNNs) are still in an early state.
Ranked #1 on
Instance Segmentation
on COCO test-dev
(APS metric, using extra
training data)
no code implementations • 14 Jul 2022 • Zhe Chen, Jing Zhang, Yufei Xu, DaCheng Tao
Current object detectors typically have a feature pyramid (FP) module for multi-level feature fusion (MFF) which aims to mitigate the gap between features from different levels and form a comprehensive object representation to achieve better detection performance.
1 code implementation • CVPR 2023 • Xu Zhang, Wen Wang, Zhe Chen, Yufei Xu, Jing Zhang, DaCheng Tao
Motivated by the progress of visual-language research, we propose that pre-trained language models (e. g., CLIP) can facilitate animal pose estimation by providing rich prior knowledge for describing animal keypoints in text.
1 code implementation • 19 May 2022 • Xiao Wang, Zhe Chen, Bo Jiang, Jin Tang, Bin Luo, DaCheng Tao
To track the target in a video, current visual trackers usually adopt greedy search for target object localization in each frame, that is, the candidate region with the maximum response score will be selected as the tracking result of each frame.
1 code implementation • 17 May 2022 • Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao
This work investigates a simple yet powerful dense prediction task adapter for Vision Transformer (ViT).
Ranked #4 on
Semantic Segmentation
on PASCAL Context
1 code implementation • CVPR 2022 • Liyao Tang, Yibing Zhan, Zhe Chen, Baosheng Yu, DaCheng Tao
Point cloud segmentation is fundamental in understanding 3D environments.
Ranked #15 on
Semantic Segmentation
on S3DIS Area5
no code implementations • 7 Mar 2022 • Zhe Chen, Cong Wang
We present sufficient conditions for the load-flow solvability under security constraints in DC distribution networks.
no code implementations • 28 Jan 2022 • Yu-Hong Cai, Xiao-Jun Wu, Zhe Chen
However, methods based on this technique ignore the pressure on a single transformation matrix due to the complex information contained in the data.
1 code implementation • 6 Jan 2022 • Chen Chen, Zhe Chen, Jing Zhang, DaCheng Tao
We observe that the prevailing set abstraction design for down-sampling points may maintain too much unimportant background information that can affect feature learning for detecting objects.
1 code implementation • CVPR 2022 • Zhe Chen, Jing Zhang, DaCheng Tao
Then, a glimpse-based decoder is introduced to provide refined detection results based on both the glimpse features and the attention modeling outputs of the previous stage.
Ranked #1 on
Object Detection
on COCO
(GFlops metric)
no code implementations • 1 Dec 2021 • Tianyue Zheng, Zhe Chen, Shuya Ding, Chao Cai, Jun Luo
Whereas adversarial training can be useful against specific adversarial perturbations, they have also proven ineffective in generalizing towards attacks deviating from those used for training.
no code implementations • 16 Nov 2021 • Tianyue Zheng, Zhe Chen, Shujie Zhang, Chao Cai, Jun Luo
Crucial for healthcare and biomedical applications, respiration monitoring often employs wearable sensors in practice, causing inconvenience due to their direct contact with human bodies.
2 code implementations • 3 Nov 2021 • Zhe Chen, Jiahao Wang, Wenhai Wang, Guo Chen, Enze Xie, Ping Luo, Tong Lu
We propose an accurate and efficient scene text detection framework, termed FAST (i. e., faster arbitrarily-shaped text detector).
Ranked #2 on
Scene Text Detection
on MSRA-TD500
1 code implementation • 29 Oct 2021 • Shuya Ding, Zhe Chen, Tianyue Zheng, Jun Luo
Radio-Frequency (RF) based device-free Human Activity Recognition (HAR) rises as a promising solution for many applications.
no code implementations • 28 Oct 2021 • Tianyue Zheng, Zhe Chen, Chao Cai, Jun Luo, Xu Zhang
Given the significant amount of time people spend in vehicles, health issues under driving condition have become a major concern.
no code implementations • 28 Oct 2021 • Tianyue Zheng, Zhe Chen, Shuya Ding, Jun Luo
To better understand this potential, this article takes a layered approach to summarize RF sensing enabled by deep learning.
no code implementations • 27 Oct 2021 • Tianyue Zheng, Zhe Chen, Jun Luo, Lin Ke, Chaoyang Zhao, Yaowen Yang
To this end, we equip SiWa with a deep learning pipeline to parse the rich sensory data.
no code implementations • 13 Oct 2021 • Haichao Yu, Zhe Chen, Dong Lin, Gil Shamir, Jie Han
Dropout has been commonly used to quantify prediction uncertainty, i. e, the variations of model predictions on a given input example.
no code implementations • 29 Sep 2021 • Shujie Zhang, Tianyue Zheng, Zhe Chen, Jun Luo, Sinno Pan
In many practical scenarios of signal extraction from a nonlinear mixture, only one (signal) source is intended to be extracted.
no code implementations • 17 Sep 2021 • Yuanyuan Liu, Wenbin Wang, Chuanxu Feng, Haoyu Zhang, Zhe Chen, Yibing Zhan
To this end, we propose to decompose each video into a series of expression snippets, each of which contains a small number of facial movements, and attempt to augment the Transformer's ability for modeling intra-snippet and inter-snippet visual relations, respectively, obtaining the Expression snippet Transformer (EST).
Ranked #7 on
Dynamic Facial Expression Recognition
on DFEW
Dynamic Facial Expression Recognition
Facial Expression Recognition
+1
2 code implementations • 11 Aug 2021 • Xiao Wang, Jianing Li, Lin Zhu, Zhipeng Zhang, Zhe Chen, Xin Li, YaoWei Wang, Yonghong Tian, Feng Wu
Different from visible cameras which record intensity images frame by frame, the biologically inspired event camera produces a stream of asynchronous and sparse events with much lower latency.
Ranked #1 on
Object Tracking
on VisEvent
no code implementations • 30 Mar 2021 • Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli, Erik Nygren, Sharada Mohanty
However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner.
1 code implementation • 30 Mar 2021 • Xiao Wang, Zhe Chen, Jin Tang, Bin Luo, YaoWei Wang, Yonghong Tian, Feng Wu
In this paper, we propose to introduce more dynamics by devising a dynamic attention-guided multi-trajectory tracking strategy.
1 code implementation • 22 Mar 2021 • Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo
(1) We divide input image into small patches and adopt TIN, successfully transferring image style with arbitrary high-resolution.
no code implementations • 17 Feb 2021 • Zhe Chen, Daniel Harabor, Jiaoyang Li, Peter J. Stuckey
During Multi-Agent Path Finding (MAPF) problems, agents can be delayed by unexpected events.
no code implementations • 25 Nov 2020 • Jack Humphreys, Zhe Chen, DaCheng Tao
Action recognition, which is formulated as a task to identify various human actions in a video, has attracted increasing interest from computer vision researchers due to its importance in various applications.
1 code implementation • 1 Nov 2020 • Licheng Wen, Zhen Zhang, Zhe Chen, Xiangrui Zhao, Yong liu
In this paper, we give a mathematical formalization of Multi-Agent Path Finding for Car-Like robots (CL-MAPF) problem.
Robotics Multiagent Systems
no code implementations • 17 Aug 2020 • Zhe Chen, Yuyan Wang, Dong Lin, Derek Zhiyuan Cheng, Lichan Hong, Ed H. Chi, Claire Cui
Despite deep neural network (DNN)'s impressive prediction performance in various domains, it is well known now that a set of DNN models trained with the same model specification and the same data can produce very different prediction results.
1 code implementation • ECCV 2020 • Zhe Chen, Shohei Nobuhara, Ko Nishino
We introduce a novel neural network-based BRDF model and a Bayesian framework for object inverse rendering, i. e., joint estimation of reflectance and natural illumination from a single image of an object of known geometry.
1 code implementation • 9 Aug 2020 • Weifeng Ma, Zhe Chen, Caoting Ji
This method can act as a plug-in for Fast Style Transfer without any modification to the network architecture.
no code implementations • 24 Jun 2020 • Di Cao, Junbo Zhao, Weihao Hu, Fei Ding, Qi Huang, Zhe Chen, Frede Blaabjerg
Accurate knowledge of the distribution system topology and parameters is required to achieve good voltage controls, but this is difficult to obtain in practice.
1 code implementation • 10 Jun 2020 • Zhe Chen, Jing Zhang, DaCheng Tao
Modern two-stage object detectors generally require excessively large models for their detection heads to achieve high accuracy.
no code implementations • 3 Jun 2020 • Zhe Chen
We extend the classical result asserting that the twisting operator preserves certain Deligne--Lusztig character values for truncated formal power series; along the way we discuss some properties of centralisers.
Representation Theory
no code implementations • 31 May 2020 • Di Cao, Junbo Zhao, Weihao Hu, Fei Ding, Qi Huang, Zhe Chen
This paper proposes a data-driven distributed voltage control approach based on the spectrum clustering and the enhanced multi-agent deep reinforcement learning (MADRL) algorithm.
5 code implementations • 17 May 2020 • Jian Ye, Zhe Chen, Juhua Liu, Bo Du
More specifically, we propose to perceive texts from three levels of feature representations, i. e., character-, word- and global-level, and then introduce a novel text representation fusion technique to help achieve robust arbitrary text detection.
Ranked #1 on
Scene Text Detection
on ICDAR 2015
1 code implementation • 3 Feb 2020 • Jing Zhang, Zhe Chen, DaCheng Tao
Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance.
Ranked #5 on
Pose Estimation
on COCO test-dev
no code implementations • 15 Dec 2019 • Zhe Chen, Wanli Ouyang, Tongliang Liu, DaCheng Tao
Alternatively, to access much more natural-looking pedestrians, we propose to augment pedestrian detection datasets by transforming real pedestrians from the same dataset into different shapes.
1 code implementation • 27 Oct 2019 • Jing Zhang, Zhe Chen, DaCheng Tao
Human keypoint detection from a single image is very challenging due to occlusion, blur, illumination and scale variance of person instances.
no code implementations • 14 May 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
Only learning one projection matrix from original samples to the corresponding binary labels is too strict and will consequentlly lose some intrinsic geometric structures of data.
1 code implementation • 2 Apr 2019 • Zhe Chen, Jing Zhang, DaCheng Tao
To this end, LiDAR sensor data can be incorporated to improve the visual image-based road detection, because LiDAR data is less susceptible to visual noises.
no code implementations • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
In this paper, we propose a non-negative representation based discriminative dictionary learning algorithm (NRDL) for multicategory face classification.
1 code implementation • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
On one hand, the Fisher criterion improves the intra-class compactness of the relaxed labels during relaxation learning.
no code implementations • 19 Mar 2019 • Zhe Chen, Xiao-Jun Wu, Josef Kittler
To solve above problems, we propose a low-rank discriminative least squares regression model (LRDLSR) for multi-class image classification.
1 code implementation • 22 Jan 2019 • Xiao Wang, Shaofei Zheng, Rui Yang, Aihua Zheng, Zhe Chen, Jin Tang, Bin Luo
We also review some popular network architectures which have been widely applied in the deep learning community.
no code implementations • ECCV 2018 • Zhe Chen, Shaoli Huang, DaCheng Tao
Current two-stage object detectors, which consists of a region proposal stage and a refinement stage, may produce unreliable results due to ill-localized proposed regions.
no code implementations • 18 Sep 2015 • Zhe Chen, Zhibin Hong, DaCheng Tao
We find that further improvements for correlation filter-based tracking can be made on estimating scales, applying part-based tracking strategy and cooperating with long-term tracking methods.
no code implementations • CVPR 2015 • Zhibin Hong, Zhe Chen, Chaohui Wang, Xue Mei, Danil Prokhorov, DaCheng Tao
Variations in the appearance of a tracked object, such as changes in geometry/photometry, camera viewpoint, illumination, or partial occlusion, pose a major challenge to object tracking.
no code implementations • 20 Mar 2015 • Rahul Agarwal, Zhe Chen, Sridevi V. Sarma
In this paper, a nonparametric maximum likelihood (ML) estimator for band-limited (BL) probability density functions (pdfs) is proposed.
no code implementations • 27 Nov 2014 • Scott W. Linderman, Matthew J. Johnson, Matthew A. Wilson, Zhe Chen
Rodent hippocampal population codes represent important spatial information about the environment during navigation.