no code implementations • 7 Jun 2018 • Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, Bo Li
Efficient model training requires eliminating such stragglers, yet for modern ML workloads, existing load balancing strategies are inefficient and even infeasible.
no code implementations • 25 May 2018 • Chen Chen, Min Cao, Xiyuan Hu, Silong Peng
Ideally person re-identification seeks for perfect feature representation and metric model that re-identify all various pedestrians well in non-overlapping views at different locations with different camera configurations, which is very challenging.
no code implementations • 1 Apr 2018 • Baochang Zhang, Jiaxin Gu, Chen Chen, Jungong Han, Xiangbo Su, Xian-Bin Cao, Jianzhuang Liu
Compression artifacts reduction (CAR) is a challenging problem in the field of remote sensing.
no code implementations • 3 May 2017 • Shangzhen Luan, Baochang Zhang, Chen Chen, Xian-Bin Cao, Jungong Han, Jianzhuang Liu
Steerable properties dominate the design of traditional filters, e. g., Gabor filters, and endow features the capability of dealing with spatial transformations.
no code implementations • 4 Dec 2017 • Mengyuan Liu, Hong Liu, Chen Chen
Then, motion and shape cues are jointly used to generate robust and distinctive spatial-temporal interest points (STIPs): motion-based STIPs and shape-based STIPs.
no code implementations • 30 Nov 2017 • Rui Hou, Chen Chen, Mubarak Shah
A video is first divided into equal length clips and next for each clip a set of tube proposals are generated based on 3D CNN features.
no code implementations • 11 Nov 2017 • Baochang Zhang, Shangzhen Luan, Chen Chen, Jungong Han, Wei Wang, Alessandro Perina, Ling Shao
In this paper, we introduce an intermediate step -- solution sampling -- after the data sampling step to form a subspace, in which an optimal solution can be estimated.
no code implementations • 22 May 2017 • Chen Chen, Changtong Luo, Zonglin Jiang
Symbolic regression that aims to detect underlying data-driven models has become increasingly important for industrial data analysis.
no code implementations • 27 Oct 2017 • Chen Chen, Xiaowei Zhao, Yang Liu
In this paper, we present a solution to Large-Scale Video Classification Challenge (LSVC2017) [1] that ranked the 1st place.
no code implementations • 6 Aug 2017 • Chen Chen, Baochang Zhang, Alessio Del Bue, Vittorio Murino
Low-rank decomposition (LRD) is a state-of-the-art method for visual data reconstruction and modelling.
no code implementations • 5 Aug 2017 • Chen Chen, Changtong Luo, Zonglin Jiang
In this paper, we analyze different types of separability of some real-world engineering equations and establish a mathematical model of generalized separable system (GS system).
no code implementations • 23 May 2017 • Changtong Luo, Chen Chen, Zonglin Jiang
This feature motivated us to develop a new method, divide and conquer (D&C), for symbolic regression, in which the target function is divided into a number of sub-functions and the sub-functions are then determined by any of a GP algorithm.
no code implementations • 7 Jun 2016 • Shangzhen Luan, Baochang Zhang, Jungong Han, Chen Chen, Ling Shao, Alessandro Perina, Linlin Shen
There is a neglected fact in the traditional machine learning methods that the data sampling can actually lead to the solution sampling.
no code implementations • 24 Apr 2017 • Chen Chen, Changtong Luo, Zonglin Jiang
In this paper, a new non-evolutionary real-time algorithm for symbolic regression, Elite Bases Regression (EBR), is proposed.
no code implementations • 9 May 2017 • Ce Li, Chen Chen, Baochang Zhang, Qixiang Ye, Jungong Han, Rongrong Ji
Visual data such as videos are often sampled from complex manifold.
no code implementations • 28 Feb 2016 • Shang-Wen Li, Sanjay Purushotham, Chen Chen, Yuzhuo Ren, C. -C. Jay Kuo
Textual data such as tags, sentence descriptions are combined with visual cues to reduce the semantic gap for image retrieval applications in today's Multimodal Image Retrieval (MIR) systems.
no code implementations • 16 Dec 2016 • Baochang Zhang, Zhigang Li, Xian-Bin Cao, Qixiang Ye, Chen Chen, Linlin Shen, Alessandro Perina, Rongrong Ji
Kernelized Correlation Filter (KCF) is one of the state-of-the-art object trackers.
no code implementations • 3 Apr 2016 • Yuzhuo Ren, Chen Chen, Shang-Wen Li, C. -C. Jay Kuo
The proposed Global-attributes Assisted Labeling (GAL) system exploits both local features and global attributes.
no code implementations • 18 Nov 2014 • Chen Chen, Junzhou Huang, Lei He, Hongsheng Li
The convergence rate of the proposed algorithm is almost the same as that of the traditional IRLS algorithms, that is, exponentially fast.
no code implementations • 18 Nov 2014 • Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang
In this paper, we propose a novel method for image fusion with a high-resolution panchromatic image and a low-resolution multispectral image at the same geographical location.
no code implementations • 20 Nov 2012 • Chen Chen, Yeqing Li, Junzhou Huang
In this paper, we investigate a new compressive sensing model for multi-channel sparse data where each channel can be represented as a hierarchical tree and different channels are highly correlated.
no code implementations • 20 Dec 2018 • Yinglong Wang, Shuaicheng Liu, Chen Chen, Dehua Xie, Bing Zeng
We present a novel rain removal method in this paper, which consists of two steps, i. e., detection of rain streaks and reconstruction of the rain-removed image.
no code implementations • NeurIPS 2012 • Chen Chen, Junzhou Huang
On the other side, some algorithms have been proposed for tree sparsity regularization, but few of them has validated the benefit of tree structure in CS-MRI.
no code implementations • ECCV 2018 • Zhenfeng Fan, Xiyuan Hu, Chen Chen, Silong Peng
The dense correspondence goal is revisited in two perspectives: semantic and topological correspondence.
no code implementations • CVPR 2014 • Chen Chen, Junzhou Huang, Lei He, Hongsheng Li
In this paper, we propose a novel algorithm for structured sparsity reconstruction.
no code implementations • CVPR 2014 • Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang
In this paper, we propose a novel method for image fusion from a high resolution panchromatic image and a low resolution multispectral image at the same geographical location.
no code implementations • CVPR 2015 • Yeqing Li, Chen Chen, Fei Yang, Junzhou Huang
The definition of the similarity measure is an essential component in image registration.
no code implementations • CVPR 2017 • Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang
Extensive experiments on four realistic action datasets in terms of three tasks (i. e., partial action retrieval, recognition and prediction) clearly show the superiority of PRBC over the state-of-the-art methods, along with significantly reduced memory load and computational costs during the online test.
no code implementations • LREC 2014 • Chen Chen, Vincent Ng
Compared to entity coreference resolution, there is a relatively small amount of work on event coreference resolution.
no code implementations • 12 Apr 2019 • Bin Sun, Chen Chen, Yingying Zhu, Jianmin Jiang
The task of cross-view image geo-localization aims to determine the geo-location (GPS coordinates) of a query ground-view image by matching it with the GPS-tagged aerial (satellite) images in a reference dataset.
no code implementations • 5 Jul 2019 • Wenjia Bai, Chen Chen, Giacomo Tarroni, Jinming Duan, Florian Guitton, Steffen E. Petersen, Yike Guo, Paul M. Matthews, Daniel Rueckert
In the recent years, convolutional neural networks have transformed the field of medical image analysis due to their capacity to learn discriminative image features for a variety of classification and regression tasks.
no code implementations • 21 Jul 2019 • Rui Hou, Chen Chen, Rahul Sukthankar, Mubarak Shah
Convolutional Neural Network (CNN) based image segmentation has made great progress in recent years.
Ranked #64 on Semi-Supervised Video Object Segmentation on DAVIS 2016
no code implementations • 23 Jul 2019 • Chen Chen, Carlo Biffi, Giacomo Tarroni, Steffen Petersen, Wenjia Bai, Daniel Rueckert
Cardiac MR image segmentation is essential for the morphological and functional analysis of the heart.
no code implementations • 20 Aug 2019 • Chen Chen, Cheng Ouyang, Giacomo Tarroni, Jo Schlemper, Huaqi Qiu, Wenjia Bai, Daniel Rueckert
In this work, we present a fully automatic method to segment cardiac structures from late-gadolinium enhanced (LGE) images without using labelled LGE data for training, but instead by transferring the anatomical knowledge and features learned on annotated balanced steady-state free precession (bSSFP) images, which are easier to acquire.
no code implementations • Pattern Recognition 2017 • Mengyuan Liu, Hong Liu, Chen Chen
First, a sequence-based view invariant transform is developed to eliminate the effect of view variations on spatio-temporal locations of skeleton joints.
Ranked #2 on Skeleton Based Action Recognition on UWA3D
no code implementations • 18 Sep 2019 • Chen Chen, Jaewoo Lee
In this paper we consider the problem of minimizing composite objective functions consisting of a convex differentiable loss function plus a non-smooth regularization term, such as $L_1$ norm or nuclear norm, under R\'enyi differential privacy (RDP).
no code implementations • 30 Sep 2019 • Haotian Fu, Hongyao Tang, Jianye Hao, Wulong Liu, Chen Chen
Most meta reinforcement learning (meta-RL) methods learn to adapt to new tasks by directly optimizing the parameters of policies over primitive action space.
no code implementations • WS 2019 • Danni Ma, Chen Chen, Behzad Golshan, Wang-Chiew Tan
Paraphrases are important linguistic resources for a wide variety of NLP applications.
no code implementations • 6 Oct 2019 • Chen Chen, Lisong Qiu, Zhenxin Fu, Dongyan Zhao, Junfei Liu, Rui Yan
Existing dialog systems are all monolingual, where features shared among different languages are rarely explored.
no code implementations • 9 Nov 2019 • Chen Chen, Chen Qin, Huaqi Qiu, Giacomo Tarroni, Jinming Duan, Wenjia Bai, Daniel Rueckert
Deep learning has become the most widely used approach for cardiac image segmentation in recent years.
no code implementations • 12 Nov 2019 • Weiguo Zhou, Xin Jiang, Chen Chen, Sijia Mei, Yun-hui Liu
In this paper, we propose a method that takes advantage of human hand morphological topology (HMT) structure to improve the pose estimation performance.
Robotics Human-Computer Interaction
1 code implementation • 20 Nov 2019 • Chen Chen, Mengyuan Liu, Xiandong Meng, Wanpeng Xiao, Qi Ju
Therefore, high efficiency object detectors on CPU-only devices are urgently-needed in industry.
no code implementations • 16 Jan 2020 • Yichen Zhang, Chen Chen, Guodong Liu, Tianqi Hong, Feng Qiu
In this paper, we introduce a deep learning aided constraint encoding method to tackle the frequency-constraint microgrid scheduling problem.
no code implementations • 1 Apr 2020 • Sijie Zhu, Chen Chen, Waqas Sultani
Temporal localization (i. e. indicating the start and end frames of the anomaly event in a video) is referred to as frame-level detection.
no code implementations • 6 Apr 2020 • Aaron Traylor, Chen Chen, Behzad Golshan, Xiaolan Wang, Yuliang Li, Yoshihiko Suhara, Jinfeng Li, Cagatay Demiralp, Wang-Chiew Tan
In this paper, we introduce xSense, an effective system for review comprehension using domain-specific commonsense knowledge bases (xSense KBs).
no code implementations • 22 Apr 2020 • Waqas Sultani, Qazi Ammar Arshad, Chen Chen
Temporal localization (i. e. indicating the start and end frames of the action in a video) is referred to as frame-level detection.
no code implementations • 1 May 2020 • Hao Dou, Chen Chen, Xiyuan Hu, Zuxing Xuan, Zhisen Hu, Silong Peng
Generative Adversarial Networks (GAN) have been employed for face super resolution but they bring distorted facial details easily and still have weakness on recovering realistic texture.
no code implementations • 30 Apr 2020 • Li'an Zhuo, Baochang Zhang, Hanlin Chen, Linlin Yang, Chen Chen, Yanjun Zhu, David Doermann
To this end, a Child-Parent (CP) model is introduced to a differentiable NAS to search the binarized architecture (Child) under the supervision of a full-precision model (Parent).
no code implementations • 7 May 2020 • Talal Alatiah, Chen Chen
Artificial intelligence technology has made its way absolutely necessary in a variety of industries including the fitness industry.
no code implementations • 18 May 2020 • Jia Xue, Junxiang Chen, Chen Chen, Chengda Zheng, Sijia Li, Tingshao Zhu
The study aims to understand Twitter users' discourse and psychological reactions to COVID-19.
no code implementations • 23 May 2020 • Sijie Zhu, Taojiannan Yang, Chen Chen
Street-to-aerial image geo-localization, which matches a query street-view image to the GPS-tagged aerial images in a reference set, has attracted increasing attention recently.
no code implementations • 26 May 2020 • Jia Xue, Junxiang Chen, Ran Hu, Chen Chen, Chengda Zheng, Xiaoqian Liu, Tingshao Zhu
Across all identified topics, the dominant sentiments for the spread of coronavirus are anticipation that measures that can be taken, followed by a mixed feeling of trust, anger, and fear for different topics.
no code implementations • 15 Jun 2020 • Chen Chen, Jingfeng Zhang, Anthony K. H. Tung, Mohan Kankanhalli, Gang Chen
We argue that the key to Byzantine detection is monitoring of gradients of the model parameters of clients.
no code implementations • 22 Jun 2020 • Xiahai Zhuang, Jiahang Xu, Xinzhe Luo, Chen Chen, Cheng Ouyang, Daniel Rueckert, Victor M. Campello, Karim Lekadir, Sulaiman Vesal, Nishant Ravikumar, Yashu Liu, Gongning Luo, Jingkun Chen, Hongwei Li, Buntheng Ly, Maxime Sermesant, Holger Roth, Wentao Zhu, Jiexiang Wang, Xinghao Ding, Xinyue Wang, Sen yang, Lei LI
In addition, the paired MS-CMR images could enable algorithms to combine the complementary information from the other sequences for the segmentation of LGE CMR.
no code implementations • 23 Jun 2020 • Shuo Wang, Giacomo Tarroni, Chen Qin, Yuanhan Mo, Chengliang Dai, Chen Chen, Ben Glocker, Yike Guo, Daniel Rueckert, Wenjia Bai
Our approach provides a real-time and model-agnostic quality control for cardiac MRI segmentation, which has the potential to be integrated into clinical image analysis workflows.
no code implementations • 24 Jun 2020 • Esther Puyol-Antón, Chen Chen, James R. Clough, Bram Ruijsink, Baldeep S. Sidhu, Justin Gould, Bradley Porter, Mark Elliott, Vishal Mehta, Daniel Rueckert, Christopher A. Rinaldi, Andrew P. King
Our key contribution is that the VAE disentangles the latent space based on `explanations' drawn from existing clinical knowledge.
no code implementations • ICML 2020 • Denny Zhou, Mao Ye, Chen Chen, Tianjian Meng, Mingxing Tan, Xiaodan Song, Quoc Le, Qiang Liu, Dale Schuurmans
This is achieved by layerwise imitation, that is, forcing the thin network to mimic the intermediate outputs of the wide network from layer to layer.
no code implementations • 18 Aug 2020 • Chen Chen, Jaewoo Lee
In this paper, we introduce a stochastic variant of classic backtracking line search algorithm that satisfies R\'enyi differential privacy.
no code implementations • EMNLP 2020 • Xinyin Ma, Yongliang Shen, Gongfan Fang, Chen Chen, Chenghao Jia, Weiming Lu
To the best of our knowledge, our framework is the first data-free distillation framework designed for NLP tasks.
no code implementations • NAACL 2021 • Xiaotao Gu, Liyuan Liu, Hongkun Yu, Jing Li, Chen Chen, Jiawei Han
Due to the excessive cost of large-scale language model pre-training, considerable efforts have been made to train BERT progressively -- start from an inferior but low-cost model and gradually grow the model to increase the computational complexity.
no code implementations • 27 Oct 2020 • Yu Shen, Sijie Zhu, Taojiannan Yang, Chen Chen
Fast and effective responses are required when a natural disaster (e. g., earthquake, hurricane, etc.)
Ranked #3 on 2D Semantic Segmentation on xBD
no code implementations • 10 Nov 2020 • Xiaoyu Mao, Jun Fu, Chen Chen, Yue Li, Heng Liu, Ming Gong, Hualing Zeng
With the saturated ferroelectric polarization of CIPS, electron-doped or hole-doped MoSe$_2$ is realized in a single device with a large carrier density tunability up to $5\times 10^{12}$cm$^{-2}$.
Materials Science
no code implementations • 5 Mar 2020 • Yichen Zhang, Chen Chen, Tianqi Hong, Bai Cui, Bo Chen, Feng Qiu
The capability to switch between grid-connected and islanded modes has promoted adoption of microgrid technology for powering remote locations.
no code implementations • 24 Nov 2020 • Sijie Zhu, Taojiannan Yang, Matias Mendieta, Chen Chen
Even under the same computational constraints, the performance of our adaptive networks can be significantly boosted over the baseline counterparts by the mutual training along three dimensions.
no code implementations • 20 May 2020 • Chen Chen, Shu Fu, Xin Jian, Min Liu, Xiong Deng, Zhiguo Ding
In order to improve the energy efficiency (EE) of the bidirectional LiFi-IoT system, non-orthogonal multiple access (NOMA) with a quality-of-service (QoS)-guaranteed optimal power allocation (OPA) strategy is applied to maximize the EE of the system.
no code implementations • 7 Dec 2020 • Lingjuan Lyu, Han Yu, Xingjun Ma, Chen Chen, Lichao Sun, Jun Zhao, Qiang Yang, Philip S. Yu
Besides training powerful global models, it is of paramount importance to design FL systems that have privacy guarantees and are resistant to different types of adversaries.
no code implementations • 28 Dec 2020 • Bowen Zhao, Chen Chen, Xi Xiao, Shutao Xia
Object detectors are typically learned on fully-annotated training data with fixed predefined categories.
no code implementations • 5 Jan 2021 • Ian E. Jacobs, Gabriele D'Avino, Yue Lin, Vincent Lemaur, Yuxuan Huang, Xinglong Ren, Dimitrios Simatos, William Wood, Chen Chen, Thomas Harrelson, Tarig Mustafa, Christopher A. O'Keefe, Leszek Spalek, Dion Tjhe, Martin Statz, Lianglun Lai, Peter A. Finn, William G. Neal, Joseph Strzalka, Christian B. Nielsen, Jin-Kyun Lee, Stephen Barlow, Seth R. Marder, Iain McCulloch, Simone Fratini, David Beljonne, Henning Sirringhaus
Doped organic semiconductors are critical to emerging device applications, including thermoelectrics, bioelectronics, and neuromorphic computing devices.
Materials Science
no code implementations • 23 Jan 2021 • Xinwei Zhao, Chen Chen, Matthew C. Stamm
In this paper, we propose a new anti-forensic attack framework designed to remove forensic traces left by a variety of manipulation operations.
no code implementations • 30 Jan 2021 • Chengli Peng, Jiayi Ma, Chen Chen, Xiaojie Guo
To verify the efficiency of the proposed bilateral attention decoder, we adopt a lightweight network as the backbone and compare our proposed method with other state-of-the-art real-time semantic segmentation methods on the Cityscapes and Camvid datasets.
no code implementations • 15 Feb 2021 • Chen Chen, Jingfeng Zhang, Xilie Xu, Tianlei Hu, Gang Niu, Gang Chen, Masashi Sugiyama
To enhance adversarial robustness, adversarial training learns deep neural networks on the adversarial variants generated by their natural data.
no code implementations • 17 Feb 2021 • Qazi Ammar Arshad, Mohsen Ali, Saeed-Ul Hassan, Chen Chen, Ayisha Imran, Ghulam Rasul, Waqas Sultani
Malaria microscopy, microscopic examination of stained blood slides to detect parasite Plasmodium, is considered to be a gold-standard for detecting life-threatening disease malaria.
no code implementations • 3 Mar 2021 • Chen Chen, Hongyao Tang, Jianye Hao, Wulong Liu, Zhaopeng Meng
We propose Nested Policy Iteration as a general training algorithm for PIC-augmented policy which ensures monotonically non-decreasing updates under some mild conditions.
no code implementations • 7 Mar 2021 • Chen Chen, Kezhi Kong, Peihong Yu, Juan Luque, Tom Goldstein, Furong Huang
Randomized smoothing (RS) is an effective and scalable technique for constructing neural network classifiers that are certifiably robust to adversarial perturbations.
no code implementations • 8 Mar 2021 • Yang Li, Chunling Wang, Guoqing Li, Chen Chen
In order to balance the interests of integrated energy operator (IEO) and users, a novel Stackelberg game-based optimization framework is proposed for the optimal scheduling of integrated demand response (IDR)-enabled integrated energy systems with uncertain renewable generations, where the IEO acts as the leader who pursues the maximization of his profits by setting energy prices, while the users are the follower who adjusts energy consumption plans to minimize their energy costs.
no code implementations • 23 May 2021 • Chen Chen, Xuanli He, Lingjuan Lyu, Fangzhao Wu
In this work, we bridge this gap by first presenting an effective model extraction attack, where the adversary can practically steal a BERT-based API (the target/victim model) by only querying a limited number of queries.
no code implementations • 26 May 2021 • Yong Qian, Zhongqing Wang, Rong Xiao, Chen Chen, Haihong Tang
Previous studies show effective of pre-trained language models for sentiment analysis.
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1
no code implementations • 4 Jun 2021 • Xiaoying Xing, Hongfu Liu, Chen Chen, Jundong Li
Feature selection is a prevalent data preprocessing paradigm for various learning tasks.
no code implementations • 7 Jun 2021 • Bowen Zhao, Chen Chen, Qi Ju, Shutao Xia
Training on class-imbalanced data usually results in biased models that tend to predict samples into the majority classes, which is a common and notorious problem.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Jiale Yu, Yongliang Shen, Xinyin Ma, Chenghao Jia, Chen Chen, Weiming Lu
Extensive experiments on a real-world dataset show the effectiveness of our approach.
no code implementations • 24 Jun 2021 • Chen Chen, Lin Zeng, Xin Zhong, Shu Fu, Min Liu, Pengfei Du
In this paper, we propose an orthogonal frequency division multiplexing (OFDM)-based generalized optical quadrature spatial modulation (GOQSM) technique for multiple-input multiple-output optical wireless communication (MIMO-OWC) systems.
no code implementations • 8 Jul 2021 • Shuo Wang, Chen Qin, Nicolo Savioli, Chen Chen, Declan O'Regan, Stuart Cook, Yike Guo, Daniel Rueckert, Wenjia Bai
In cardiac magnetic resonance (CMR) imaging, a 3D high-resolution segmentation of the heart is essential for detailed description of its anatomical structures.
no code implementations • 16 Aug 2021 • Lingjuan Lyu, Chen Chen
We perform the first systematic evaluation of attribute reconstruction attack (ARA) launched by the malicious server in the FL system, and empirically demonstrate that the shared epoch-averaged local model gradients can reveal sensitive attributes of local training data of any victim participant.
no code implementations • 24 Aug 2021 • Xidong Feng, Chen Chen, Dong Li, Mengchen Zhao, Jianye Hao, Jun Wang
Meta learning, especially gradient based one, can be adopted to tackle this problem by learning initial parameters of the model and thus allowing fast adaptation to a specific task from limited data examples.
no code implementations • 22 Sep 2021 • Tareen Dawood, Chen Chen, Robin Andlauer, Baldeep S. Sidhu, Bram Ruijsink, Justin Gould, Bradley Porter, Mark Elliott, Vishal Mehta, C. Aldo Rinaldi, Esther Puyol-Antón, Reza Razavi, Andrew P. King
Evaluation of predictive deep learning (DL) models beyond conventional performance metrics has become increasingly important for applications in sensitive environments like healthcare.
no code implementations • 8 Oct 2021 • Xin Zhong, Chen Chen, Shu Fu, Zhihong Zeng, Min Liu
Generalized optical multiple-input multiple-output (GOMIMO) techniques have been recently shown to be promising for high-speed optical wireless communication (OWC) systems.
no code implementations • 29 Sep 2021 • Pengjie Gu, Mengchen Zhao, Chen Chen, Dong Li, Jianye Hao, Bo An
Offline reinforcement learning is a promising approach for practical applications since it does not require interactions with real-world environments.
no code implementations • 11 Oct 2021 • Shaoyu Zhang, Chen Chen, Xiujuan Zhang, Silong Peng
When applying mixup to long-tailed data, a label suppression issue arises, where the frequency of label occurrence for each class is imbalanced and most of the new examples will be completely or partially assigned with head labels.
no code implementations • 18 Oct 2021 • Pinyarash Pinyoanuntapong, Tagore Pothuneedi, Ravikumar Balakrishnan, Minwoo Lee, Chen Chen, Pu Wang
Federated Learning (FL) over wireless multi-hop edge computing networks, i. e., multi-hop FL, is a cost-effective distributed on-device deep learning paradigm.
no code implementations • 18 Oct 2021 • Haoran Sun, Chen Chen, Lantian Li, Dong Wang
SpeechFlow is a powerful factorization model based on information bottleneck (IB), and its effectiveness has been reported by several studies.
no code implementations • 24 Oct 2021 • Yangang Ren, Jianhua Jiang, Dongjie Yu, Shengbo Eben Li, Jingliang Duan, Chen Chen, Keqiang Li
This paper develops the dynamic permutation state representation in the framework of integrated decision and control (IDC) to handle signalized intersections with mixed traffic flows.
no code implementations • 18 Nov 2021 • Xuejing Zheng, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo
In this paper, we propose Lifelong reinforcement learning with Sequential linear temporal logic formulas and Reward Machines (LSRM), which enables an agent to leverage previously learned knowledge to fasten learning of logically specified tasks.
no code implementations • 14 Oct 2021 • Pinyarash Pinyoanuntapong, Prabhu Janakaraj, Ravikumar Balakrishnan, Minwoo Lee, Chen Chen, Pu Wang
To solve such MDP, multi-agent reinforcement learning (MA-RL) algorithms along with domain-specific action space refining schemes are developed, which online learn the delay-minimum forwarding paths to minimize the model exchange latency between the edge devices (i. e., workers) and the remote server.
no code implementations • 19 Nov 2021 • Zhigang Ye, Chen Chen, Ruihuan Liu, Kai Wu, Zhaohong Bie, Guannan Lou, Wei Gu, Yubo Yuan
Enhancing restoration capabilities of distribution systems is one of the main strategies for resilient power systems to cope with extreme events.
no code implementations • 25 Sep 2019 • Taojiannan Yang, Sijie Zhu, Yan Shen, Mi Zhang, Andrew Willis, Chen Chen
We propose a framework to mutually learn from different input resolutions and network widths.
no code implementations • 28 Sep 2020 • Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Wulong Liu, Yaodong Yang
The value function lies in the heart of Reinforcement Learning (RL), which defines the long-term evaluation of a policy in a given state.
no code implementations • 5 Dec 2021 • Yang Li, Meng Zhang, Chen Chen
Facing the difficulty of expensive and trivial data collection and annotation, how to make a deep learning-based short-term voltage stability assessment (STVSA) model work well on a small training dataset is a challenging and urgent problem.
no code implementations • 14 Dec 2021 • Yang Li, Bin Wang, Zhen Yang, Jiazheng Li, Chen Chen
An operating entity utilizing community-integrated energy systems with a large number of small-scale distributed energy sources can easily trade with existing distribution markets.
no code implementations • 18 Dec 2021 • Hankz Hankui Zhuo, Shuting Deng, Mu Jin, Zhihao Ma, Kebing Jin, Chen Chen, Chao Yu
Despite of achieving great success in real-world applications, Deep Reinforcement Learning (DRL) is still suffering from three critical issues, i. e., data efficiency, lack of the interpretability and transferability.
no code implementations • 16 Feb 2022 • Dharma Shukla, Muthian Sivathanu, Srinidhi Viswanatha, Bhargav Gulavani, Rimma Nehme, Amey Agrawal, Chen Chen, Nipun Kwatra, Ramachandran Ramjee, Pankaj Sharma, Atul Katiyar, Vipul Modi, Vaibhav Sharma, Abhishek Singh, Shreshth Singhal, Kaustubh Welankar, Lu Xun, Ravi Anupindi, Karthik Elangovan, Hasibur Rahman, Zhou Lin, Rahul Seetharaman, Cheng Xu, Eddie Ailijiang, Suresh Krishnappa, Mark Russinovich
At the heart of Singularity is a novel, workload-aware scheduler that can transparently preempt and elastically scale deep learning workloads to drive high utilization without impacting their correctness or performance, across a global fleet of AI accelerators (e. g., GPUs, FPGAs).
no code implementations • 29 Mar 2022 • Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng
Automated Audio captioning (AAC) is a cross-modal task that generates natural language to describe the content of input audio.
no code implementations • 29 Mar 2022 • Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng
Noise-robust speech recognition systems require large amounts of training data including noisy speech data and corresponding transcripts to achieve state-of-the-art performances in face of various practical environments.
no code implementations • 31 Mar 2022 • Haochen Zhang, Chen Chen, Shunbo Lei, Zhaohong Bie
Distribution system (DS) restoration after natural disasters often faces the challenge of communication failures to feeder automation (FA) facilities, resulting in prolonged load pick-up process.
no code implementations • 31 Mar 2022 • Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen
To move a step further, this paper proposes GALA (Geometry-and-Lighting-Aware), a generic foreground object search method with discriminative modeling on geometry and lighting compatibility for open-world image compositing.
no code implementations • 6 Apr 2022 • Shimin Chen, Wei Li, Chen Chen, Jianyang Gu, Jiaming Chu, Xunqiang Tao, Yandong Guo
SEAL consists of two kinds of annotations, SEAL Tubes and SEAL Clips.
no code implementations • 6 Apr 2022 • Shimin Chen, Chen Chen, Wei Li, Xunqiang Tao, Yandong Guo
In this paper, we propose a unified network for TAD, termed Faster-TAD, by re-purposing a Faster-RCNN like architecture.
no code implementations • 6 Apr 2022 • Wenhan Cao, Jingliang Duan, Shengbo Eben Li, Chen Chen, Chang Liu, Yu Wang
Both the primal and dual estimators are learned from data using supervised learning techniques, and the explicit sample size is provided, which enables us to guarantee the quality of each learned estimator in terms of feasibility and optimality.
no code implementations • 13 Apr 2022 • Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng
Although automatic speech recognition (ASR) task has gained remarkable success by sequence-to-sequence models, there are two main mismatches between its training and testing that might lead to performance degradation: 1) The typically used cross-entropy criterion aims to maximize log-likelihood of the training data, while the performance is evaluated by word error rate (WER), not log-likelihood; 2) The teacher-forcing method leads to the dependence on ground truth during training, which means that model has never been exposed to its own prediction before testing.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 16 May 2022 • Chao Wang, Chen Chen, Dong Li, Bin Wang
Recently, reinforcement learning has been used to address logic synthesis by formulating the operator sequence optimization problem as a Markov decision process.
no code implementations • 14 Jun 2022 • Wenxuan Wang, Chen Chen, Jing Wang, Sen Zha, Yan Zhang, Jiangyun Li
For 3D medical image (e. g. CT and MRI) segmentation, the difficulty of segmenting each slice in a clinical case varies greatly.
no code implementations • 16 Jun 2022 • Chen Zhang, Honglin Sun, Chen Chen, Yandong Guo
We propose a motion forecasting model called BANet, which means Boundary-Aware Network, and it is a variant of LaneGCN.
no code implementations • 16 Jul 2022 • Jiahao Qi, Zhiqiang Gong, Xingyue Liu, Kangcheng Bin, Chen Chen, YongQian Li, Wei Xue, Yu Zhang, Ping Zhong
Deep learning methodology contributes a lot to the development of hyperspectral image (HSI) analysis community.
no code implementations • 24 Jul 2022 • Xingbo Fu, Binchi Zhang, Yushun Dong, Chen Chen, Jundong Li
Federated Graph Machine Learning (FGML) is a promising solution to tackle this challenge by training graph machine learning models in a federated manner.
no code implementations • 22 Aug 2022 • Qucheng Peng, Zhengming Ding, Lingjuan Lyu, Lichao Sun, Chen Chen
For the input-level, we design a new data augmentation technique as Phase MixUp, which highlights task-relevant objects in the interpolations, thus enhancing input-level regularization and class consistency for target models.
1 code implementation • 31 Aug 2022 • Xiaoqin Wang, Chen Chen, Rushi Lan, Licheng Liu, Zhenbing Liu, Huiyu Zhou, Xiaonan Luo
Different personalized subspaces are constructed to reflect category-specific attributes for different clusters, adaptively mapping instances within the same cluster to the same Hamming space.
no code implementations • 1 Sep 2022 • Wenhan Wu, Yilei Hua, Ce Zheng, Shiqian Wu, Chen Chen, Aidong Lu
Given the unmasked skeleton sequence, the encoder is fine-tuned for the action recognition task.
no code implementations • 9 Sep 2022 • Chen Chen, Yue Dai, Josiah Poon, Caren Han
Text-based games(TBG) are complex environments which allow users or computer agents to make textual interactions and achieve game goals. In TBG agent design and training process, balancing the efficiency and performance of the agent models is a major challenge.
no code implementations • 4 Oct 2022 • Guangyu Sun, Umar Khalid, Matias Mendieta, Taojiannan Yang, Chen Chen
Recently, the use of small pre-trained models has been shown effective in federated learning optimization and improving convergence.
no code implementations • 8 Oct 2022 • Zeyu Gao, Yao Mu, Ruoyan Shen, Chen Chen, Yangang Ren, Jianyu Chen, Shengbo Eben Li, Ping Luo, YanFeng Lu
End-to-end autonomous driving provides a feasible way to automatically maximize overall driving system performance by directly mapping the raw pixels from a front-facing camera to control signals.
no code implementations • PANDL (COLING) 2022 • Remo Nitschke, Yuwei Wang, Chen Chen, Adarsh Pyarelal, Rebecca Sharp
Natural language (as opposed to structured communication modes such as Morse code) is by far the most common mode of communication between humans, and can thus provide significant insight into both individual mental states and interpersonal dynamics.
no code implementations • 12 Oct 2022 • Shuo Wang, Chen Qin, Chengyan Wang, Kang Wang, Haoran Wang, Chen Chen, Cheng Ouyang, Xutong Kuang, Chengliang Dai, Yuanhan Mo, Zhang Shi, Chenchen Dai, Xinrong Chen, He Wang, Wenjia Bai
The quality of cardiac magnetic resonance (CMR) imaging is susceptible to respiratory motion artifacts.
no code implementations • 20 Oct 2022 • Min Cao, Cong Ding, Chen Chen, Junchi Yan, Qi Tian
Based on a natural assumption that images belonging to the same person identity should not match with images belonging to multiple different person identities across views, called the unicity of person matching on the identity level, we propose an end-to-end person unicity matching architecture for learning and refining the person matching relations.
no code implementations • 17 Nov 2022 • Andong Deng, Taojiannan Yang, Chen Chen, Qian Chen, Leslie Neely, Sakiko Oyama
In such cases, automatic recognition systems based on computer vision and machine learning (in particular deep learning) technology can alleviate this issue to a large extent.
no code implementations • 28 Nov 2022 • Chen Chen, Hongyao Tang, Yi Ma, Chao Wang, Qianli Shen, Dong Li, Jianye Hao
The key idea of SA-PP is leveraging discounted stationary state distribution ratios between the learning policy and the offline dataset to modulate the degree of behavior regularization in a state-wise manner, so that pessimism can be implemented in a more appropriate way.
no code implementations • 10 Dec 2022 • Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng
Audio-visual speech recognition (AVSR) has gained remarkable success for ameliorating the noise-robustness of speech recognition.
no code implementations • ICCV 2023 • Saeed Vahidian, Sreevatsank Kadaveru, Woonjoon Baek, Weijia Wang, Vyacheslav Kungurtsev, Chen Chen, Mubarak Shah, Bill Lin
Specifically, we aim to investigate how ordered learning principles can contribute to alleviating the heterogeneity effects in FL.
no code implementations • 6 Jan 2023 • Zirou Qiu, Chen Chen, Madhav V. Marathe, S. S. Ravi, Daniel J. Rosenkrantz, Richard E. Stearns, Anil Vullikanti
Fixed points of such dynamical systems represent configurations to which the system converges.
no code implementations • 19 Jan 2023 • Chen Chen, Junqing Zhang, Tianyu Lu, Magnus Sandell, Liquan Chen
Different from most previous works that adopt the iterative optimization to solve the problem, the proposed DNN based algorithm directly obtains the BS precoding and IRS phase shifts as the output of the DNN.
no code implementations • 27 Jan 2023 • Chen Chen, Wei Emma Zhang, Alireza Seyed Shakeri, Makhmoor Fiza
Despite the great development of document summarisation techniques nowadays, factual inconsistencies between the generated summaries and the original texts still occur from time to time.
no code implementations • 30 Jan 2023 • Chen Chen, BoWen Zhang, Liangliang Cao, Jiguang Shen, Tom Gunter, Albin Madappally Jose, Alexander Toshev, Jonathon Shlens, Ruoming Pang, Yinfei Yang
We extend the CLIP model and build a sparse text and image representation (STAIR), where the image and text are mapped to a sparse token space.
no code implementations • 30 Jan 2023 • Bowen Zhao, Chen Chen, Shu-Tao Xia
However, we find that two unfavorable defects are concealed in the prevalent adaptation methodologies like test-time batch normalization (BN) and self-learning.
no code implementations • 1 Feb 2023 • Chen Chen, Dylan Walker, Venkatesh Saligrama
We propose a novel supervised learning approach for political ideology prediction (PIP) that is capable of predicting out-of-distribution inputs.
no code implementations • 19 Feb 2023 • Jie Zhang, Bo Li, Chen Chen, Lingjuan Lyu, Shuang Wu, Shouhong Ding, Chao Wu
In this work, we propose a novel algorithm called Decision Boundary based Federated Adversarial Training (DBFAT), which consists of two components (local re-weighting and global regularization) to improve both accuracy and robustness of FL systems.
no code implementations • 22 Feb 2023 • Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia
For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples.
no code implementations • 23 Feb 2023 • Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng
Deep neural network based speech enhancement approaches aim to learn a noisy-to-clean transformation using a supervised learning paradigm.
no code implementations • 23 Feb 2023 • Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng
Deep neural network based speech enhancement technique focuses on learning a noisy-to-clean transformation supervised by paired training data.
no code implementations • 2 Mar 2023 • Chen Chen, Jie Fu, Lingjuan Lyu
AI Generated Content (AIGC) has received tremendous attention within the past few years, with content generated in the format of image, text, audio, video, etc.
no code implementations • IACR Transactions on Cryptographic Hardware and Embedded Systems 2020 • Neng Zhang, Bohan Yang, Chen Chen, Shouyi Yin, Shaojun Wei and Leibo Liu*
First, both the pre-processing of NTT and the post-processing of INTT are merged into the fast Fourier transform (FFT) algorithm, which reduces N and 2N modular multiplications for N-point NTT and INTT, respectively.
no code implementations • 23 Mar 2023 • Ce Zheng, Xianpeng Liu, Mengyuan Liu, Tianfu Wu, Guo-Jun Qi, Chen Chen
While image-based HMR methods have achieved impressive results, they often struggle to recover humans in dynamic scenarios, leading to temporal inconsistencies and non-smooth 3D motion predictions due to the absence of human motion.
Ranked #56 on 3D Human Pose Estimation on 3DPW
no code implementations • 31 Mar 2023 • Tao Bai, Chen Chen, Lingjuan Lyu, Jun Zhao, Bihan Wen
Recent studies show that models trained by continual learning can achieve the comparable performances as the standard supervised learning and the learning flexibility of continual learning models enables their wide applications in the real world.
no code implementations • 7 Apr 2023 • Xinshun Wang, Qiongjie Cui, Chen Chen, Shen Zhao, Mengyuan Liu
In recent years, Graph Convolutional Networks (GCNs) have been widely used in human motion prediction, but their performance remains unsatisfactory.
Ranked #3 on Human Pose Forecasting on Human3.6M
no code implementations • 6 Apr 2023 • Sijie Zhu, Linjie Yang, Chen Chen, Mubarak Shah, Xiaohui Shen, Heng Wang
Visual Place Recognition (VPR) estimates the location of query images by matching them with images in a reference database.
no code implementations • CVPR 2023 • Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen
Given a background image and a segmented object, the goal is to train a model to predict plausible placements (location and scale) of the object for compositing.
no code implementations • 11 Apr 2023 • Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng
Second, during finetuning we propose a Transformer-based code predictor to accurately predict clean codes by modeling the global dependency of input noisy representations, which enables discovery and restoration of high-quality clean representations with reduced distortions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 18 Apr 2023 • Sicen Guo, Jiahang Li, Yi Feng, Dacheng Zhou, Denghuang Zhang, Chen Chen, Shuai Su, Xingyi Zhu, Qijun Chen, Rui Fan
To foster advancements in this burgeoning field, we have launched an online open-source benchmark suite, referred to as UDTIRI.
no code implementations • 21 Apr 2023 • Wenxuan Wang, Jiachen Shen, Chen Chen, Jianbo Jiao, Jing Liu, Yan Zhang, Shanshan Song, Jiangyun Li
In this paper, we present the study on parameter-efficient transfer learning for medical volumetric segmentation and propose a new framework named Med-Tuning based on intra-stage feature enhancement and inter-stage feature interaction.
no code implementations • 28 Apr 2023 • Chen Chen, Junqing Zhang, Tianyu Lu, Magnus Sandell, Liquan Chen
Different from most previous works that adopt iterative optimisation to solve the problem, the proposed DNN-based algorithm directly obtains the BS precoding and IRS phase shifts as the output of the DNN.
no code implementations • 2 May 2023 • Xingbo Fu, Chen Chen, Yushun Dong, Anil Vullikanti, Eili Klein, Gregory Madden, Jundong Li
In this paper, we propose a novel problem of antibiogram pattern prediction that aims to predict which patterns will appear in the future.
no code implementations • CVPR 2023 • YuAn Wang, Kun Yu, Chen Chen, Xiyuan Hu, Silong Peng
To address this issue, we propose a Spatial-Frequency Dynamic Graph method to exploit the relation-aware features in spatial and frequency domains via dynamic graph learning.
no code implementations • CVPR 2023 • Chen Chen, Daochang Liu, Siqi Ma, Surya Nepal, Chang Xu
However, apart from this standard utility, we identify the "reversed utility" as another crucial aspect, which computes the accuracy on generated data of a classifier trained using real data, dubbed as real2gen accuracy (r2g%).
no code implementations • 19 May 2023 • Wenxuan Wang, Jing Liu, Xingjian He, Yisi Zhang, Chen Chen, Jiachen Shen, Yan Zhang, Jiangyun Li
Referring image segmentation (RIS) is a fundamental vision-language task that intends to segment a desired object from an image based on a given natural language expression.
no code implementations • 22 May 2023 • Yang Bai, Jingyao Wang, Min Cao, Chen Chen, Ziqiang Cao, Liqiang Nie, Min Zhang
Text-based person search (TBPS) aims to retrieve the images of the target person from a large image gallery based on a given natural language description.
no code implementations • 23 May 2023 • Lijun Li, Li'an Zhuo, Bang Zhang, Liefeng Bo, Chen Chen
Hand mesh reconstruction from the monocular image is a challenging task due to its depth ambiguity and severe occlusion, there remains a non-unique mapping between the monocular image and hand mesh.
no code implementations • 25 May 2023 • Lantian Li, Xiaolou Li, Haoyu Jiang, Chen Chen, Ruihai Hou, Dong Wang
A comprehensive study was conducted to compare CN-Celeb-AV with two popular public AVPR benchmark datasets, and the results demonstrated that CN-Celeb-AV is more in line with real-world scenarios and can be regarded as a new benchmark dataset for AVPR research.
no code implementations • 29 May 2023 • Zhenting Wang, Chen Chen, Yi Zeng, Lingjuan Lyu, Shiqing Ma
To overcome this problem, we first develop an alteration-free and model-agnostic origin attribution method via input reverse-engineering on image generation models, i. e., inverting the input of a particular model for a specific image.
no code implementations • 14 Jun 2023 • Xuechen Mu, Hankz Hankui Zhuo, Chen Chen, Kai Zhang, Chao Yu, Jianye Hao
Exploring sparse reward multi-agent reinforcement learning (MARL) environments with traps in a collaborative manner is a complex task.
no code implementations • 14 Jun 2023 • Shirui Pan, Linhao Luo, YuFei Wang, Chen Chen, Jiapu Wang, Xindong Wu
In this article, we present a forward-looking roadmap for the unification of LLMs and KGs.
no code implementations • 27 Jun 2023 • Weiming Zhuang, Chen Chen, Lingjuan Lyu
The intersection of the Foundation Model (FM) and Federated Learning (FL) provides mutual benefits, presents a unique opportunity to unlock new possibilities in AI research, and address critical challenges in AI and real-world applications.
no code implementations • 2 Jul 2023 • Jingjie Guo, Weitong Zhang, Matthew Sinclair, Daniel Rueckert, Chen Chen
In addition, different from most existing TTA methods which restrict the adaptation to batch normalization blocks in the segmentation network only, we further exploit the use of channel and spatial attention blocks for improved adaptability at test time.
no code implementations • 11 Jul 2023 • Chen Chen, YuFei Wang, Yang Zhang, Quan Z. Sheng, Kwok-Yan Lam
Previous KGC methods typically represent knowledge graph entities and relations as trainable continuous embeddings and fuse the embeddings of the entity $h$ (or $t$) and relation $r$ into hidden representations of query $(h, r, ?
no code implementations • 17 Jul 2023 • Jing Ma, Chen Chen, Anil Vullikanti, Ritwick Mishra, Gregory Madden, Daniel Borrajo, Jundong Li
In this paper, we study the problem of causal effect estimation with treatment entangled in a graph.
no code implementations • 25 Jul 2023 • Chen Chen, Bongshin Lee, Yunhai Wang, Yunjeong Chang, Zhicheng Liu
To facilitate the reuse of existing charts, previous research has examined how to obtain a semantic understanding of a chart by deconstructing its visual representation into reusable components, such as encodings.
no code implementations • 26 Jul 2023 • Xinshun Wang, Qiongjie Cui, Chen Chen, Shen Zhao, Mengyuan Liu
Existing Graph Convolutional Networks to achieve human motion prediction largely adopt a one-step scheme, which output the prediction straight from history input, failing to exploit human motion patterns.
no code implementations • 7 Aug 2023 • Pengfei Wu, Chen Chen, Dexiang Lai, Jian Zhong
Instead of integrating the constraint violation penalty with the reward function, its actor gradients are estimated by a Lagrange advantage function which is derived from two critic systems based on economic reward and violation cost.
no code implementations • 18 Aug 2023 • Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah
We attribute this deficiency to the lack of ability to extract the geometric layout of visual features and models' overfitting to low-level details.
no code implementations • 29 Aug 2023 • Tareen Dawood, Chen Chen, Baldeep S. Sidhua, Bram Ruijsink, Justin Goulda, Bradley Porter, Mark K. Elliott, Vishal Mehta, Christopher A. Rinaldi, Esther Puyol-Anton, Reza Razavi, Andrew P. King
The best-performing model in terms of both classification accuracy and the most common calibration measure, expected calibration error (ECE) was the Confidence Weight method, a novel approach that weights the loss of samples to explicitly penalise confident incorrect predictions.
no code implementations • 8 Sep 2023 • Sijia Li, Chen Chen, Haonan Lu
In this work, we propose a method with a mixture-of-expert (MOE) controllers to align the text-guided capacity of diffusion models with different kinds of human instructions, enabling our model to handle various open-domain image manipulation tasks with natural language instructions.
no code implementations • 25 Sep 2023 • Tongtong Yuan, Xuange Zhang, Kun Liu, Bo Liu, Chen Chen, Jian Jin, Zhenzhen Jiao
Furthermore, we benchmark SOTA models for four multimodal tasks on this newly created dataset, which serve as new baselines for surveillance video-and-language understanding.
no code implementations • 26 Sep 2023 • Zhiqian Lan, YuXuan Jiang, Yao Mu, Chen Chen, Shengbo Eben Li
Motion prediction is crucial for autonomous vehicles to operate safely in complex traffic environments.
no code implementations • 27 Sep 2023 • Jiawen Wang, Quan Chen, Deze Zeng, Zhuo Song, Chen Chen, Minyi Guo
With the collaborative serving mechanism, only part of node representations are updated during the update phase, and the final representations are calculated in the inference phase.
no code implementations • 12 Oct 2023 • Xingyue Liu, Jiahao Qi, Chen Chen, Kangcheng Bin, Ping Zhong
Moreover, to meet cross-modality discrepancy and orientation discrepancy challenges, we present a hybrid weights decoupling network (HWDNet) to learn the shared discriminative orientation-invariant features.
no code implementations • 15 Oct 2023 • Chengwei Qin, Chen Chen, Shafiq Joty
Inspired by the learning paradigm of humans, we propose Dynamic Module Expansion and Adaptation (DMEA), which enables the model to dynamically determine the architecture for acquiring new knowledge based on task correlation and select the most similar previous tasks to facilitate adaptation to new tasks.
no code implementations • 11 Oct 2023 • Chen Chen, Junqing Zhang, Yingying Chen
Physical layer key generation based on reciprocal and random wireless channels has been an attractive solution for securing resource-constrained low-power wide-area networks (LPWANs).
no code implementations • 17 Oct 2023 • Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng
In this work, we propose to leverage large language models (LLMs) and lists of hypotheses generated by an ASR to address the CS problem.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 24 Oct 2023 • Song Wang, Yaochen Zhu, Haochen Liu, Zaiyi Zheng, Chen Chen, Jundong Li
Afterward, we provide an innovative taxonomy of KME techniques based on how the new knowledge is introduced into pre-trained LLMs, and investigate existing KME strategies while analyzing key insights, advantages, and limitations of methods from each category.
no code implementations • 28 Oct 2023 • Haoran Shen, Yifu Zhang, Wenxuan Wang, Chen Chen, Jing Liu, Shanshan Song, Jiangyun Li
As a pioneering work, a dynamic architecture network for medical volumetric segmentation (i. e. Med-DANet) has achieved a favorable accuracy and efficiency trade-off by dynamically selecting a suitable 2D candidate model from the pre-defined model bank for different slices.
no code implementations • 30 Oct 2023 • Youbo Lei, Feifei He, Chen Chen, Yingbin Mo, Si Jia Li, Defeng Xie, Haonan Lu
Due to the success of large-scale visual-language pretraining (VLP) models and the widespread use of image-text retrieval in industry areas, it is now critically necessary to reduce the model size and streamline their mobile-device deployment.
no code implementations • 1 Nov 2023 • Mengxia Wu, Min Cao, Yang Bai, Ziyin Zeng, Chen Chen, Liqiang Nie, Min Zhang
In this paper, we make the first empirical study of frame selection for TVR.
no code implementations • 15 Nov 2023 • Yixiu Mao, Hongchang Zhang, Chen Chen, Yi Xu, Xiangyang Ji
Offline reinforcement learning suffers from the out-of-distribution issue and extrapolation error.
no code implementations • 24 Nov 2023 • Cuifeng Shen, Yulu Gan, Chen Chen, Xiongwei Zhu, Lele Cheng, Tingting Gao, Jinzhi Wang
The goal of conditional image-to-video (cI2V) generation is to create a believable new video by beginning with the condition, i. e., one image and text. The previous cI2V generation methods conventionally perform in RGB pixel space, with limitations in modeling motion consistency and visual continuity.
no code implementations • 30 Nov 2023 • Zhaoning Wang, Ming Li, Chen Chen
Nonetheless, achieving precise control over 3D generation continues to be an arduous task, as using text to control often leads to missing objects and imprecise locations.
no code implementations • 21 Nov 2023 • Yang Li, Chunhe Xia, Wei Liu, Weidong Zhou, Chen Chen, Tianbo Wang
This article proposes Blockchain-based Federated Learning (FBChain) model for federated learning parameter communication to overcome the above two problems.
no code implementations • 10 Dec 2023 • Letian Zhang, Ming Li, Chen Chen, Jie Xu
This poses a paradox as the necessary camera pose must be estimated from the entire dataset, even though the data arrives sequentially and future chunks are inaccessible.
no code implementations • 21 Dec 2023 • Nazmul Karim, Umar Khalid, Hasan Iqbal, Jing Hua, Chen Chen
To date, editing 3D scenes requires either re-training the model to adapt to various 3D edited scenes or design-specific methods for each special editing type.
no code implementations • 2 Jan 2024 • Ziheng Xu, Jianwei Niu, Qingfeng Li, Tao Ren, Chen Chen
In this paper we present NID-SLAM, which significantly improves the performance of neural SLAM in dynamic environments.
no code implementations • 14 Jan 2024 • Mingli Zhu, Zihao Zhu, Sihong Chen, Chen Chen, Baoyuan Wu
To tackle overfitting challenge, we design a new ensemble model framework cooperated with data augmentation to boost generalization.
no code implementations • 19 Jan 2024 • Hao Ai, Zidong Cao, Haonan Lu, Chen Chen, Jian Ma, Pengyuan Zhou, Tae-Kyun Kim, Pan Hui, Lin Wang
To this end, we propose a transformer-based 360 image outpainting framework called Dream360, which can generate diverse, high-fidelity, and high-resolution panoramas from user-selected viewports, considering the spherical properties of 360 images.
no code implementations • 4 Feb 2024 • Mengyuan Liu, Chen Chen, Songtao Wu, Fanyang Meng, Hong Liu
Recognizing interactive actions, including hand-to-hand interaction and human-to-human interaction, has attracted increasing attention for various applications in the field of video analysis and human-robot interaction.