no code implementations • 7 Aug 2023 • Xin Chen, Hungpo Chao, Wenbo Shi, Na Li
This paper presents a comprehensive framework for advancing research and policy development related to decarbonization in electric power systems.
no code implementations • 20 Jun 2023 • Runyu Zhang, Yang Hu, Na Li
This paper focuses on reinforcement learning for the regularized robust Markov decision process (MDP) problem, an extension of the robust MDP framework.
no code implementations • 17 Jun 2023 • YingYing Li, Tianpeng Zhang, Subhro Das, Jeff Shamma, Na Li
This paper considers a single-trajectory system identification problem for linear systems under general nonlinear and/or time-varying policies with i. i. d.
no code implementations • 22 May 2023 • Na Li, Zied Bouraoui, Steven Schockaert
In this paper, we show that the performance of existing methods can be improved using a simple technique: we use pre-trained label embeddings to cluster the labels into semantic domains and then treat these domains as additional types.
1 code implementation • 16 May 2023 • Na Li, Hanane Kteich, Zied Bouraoui, Steven Schockaert
Second, concept embeddings should capture the semantic properties of concepts, whereas contextualised word vectors are also affected by other factors.
no code implementations • 8 Apr 2023 • Tongzheng Ren, Zhaolin Ren, Na Li, Bo Dai
Optimal control is notoriously difficult for stochastic nonlinear systems.
no code implementations • 16 Mar 2023 • Jiang Hu, Kangkang Deng, Na Li, Quanzheng Li
With a computationally efficient approximation of the second-order information, natural gradient methods have been successful in solving large-scale structured optimization problems.
no code implementations • 10 Mar 2023 • Haitong Ma, Tianpeng Zhang, Yixuan Wu, Flavio P. Calmon, Na Li
We focus on Entropy Search (ES), a sample-efficient BO algorithm that selects queries to maximize the mutual information about the maximum of the black-box function.
no code implementations • 17 Dec 2022 • Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai
Theoretically, we establish the sample complexity of the proposed approach in the online and offline settings.
Model-based Reinforcement Learning
reinforcement-learning
+1
no code implementations • 14 Nov 2022 • Hanjun Dai, Yuan Xue, Niao He, Bethany Wang, Na Li, Dale Schuurmans, Bo Dai
In real-world decision-making, uncertainty is important yet difficult to handle.
no code implementations • 15 Oct 2022 • Na Li
In the 1st stage, our SVMR should take into account the fact that: 1) a positive candidate long video can contain plenty of irrelevant clips which are also semantically meaningful.
no code implementations • 10 Oct 2022 • Bin Hu, Kaiqing Zhang, Na Li, Mehran Mesbahi, Maryam Fazel, Tamer Başar
Gradient-based methods have been widely used for system design and optimization in diverse application domains.
1 code implementation • 20 Sep 2022 • Tianpeng Zhang, Kasper Johansson, Na Li
The graph defines the agent's freedom in selecting the next available nodes at each step.
no code implementations • 8 Sep 2022 • Aoxiao Zhong, Hao He, Zhaolin Ren, Na Li, Quanzheng Li
To make sure the FL model is robust when facing heterogeneous data among FL clients, most efforts focus on personalizing models for clients.
1 code implementation • 19 Aug 2022 • Na Li, Robert Ross
Confusion is a mental state triggered by cognitive disequilibrium that can occur in many types of task-oriented interaction, including Human-Robot Interaction (HRI).
no code implementations • 16 Aug 2022 • Chulong Zhang, Yuming Jiang, Na Li, Zhicheng Zhang, Md Tauhidul Islam, Jingjing Dai, Lin Liu, Wenfeng He, Wenjian Qin, Jing Xiong, Yaoqin Xie, Xiaokun Liang
Deformable image registration is a necessary technique for fusing multi-modal pathology slices.
1 code implementation • 13 Jul 2022 • Xiaoyi Qin, Na Li, Chao Weng, Dan Su, Ming Li
In this paper, we mine cross-age test sets based on the VoxCeleb dataset and propose our age-invariant speaker representation(AISR) learning method.
no code implementations • 6 Jun 2022 • Runyu Zhang, Qinghua Liu, Huan Wang, Caiming Xiong, Na Li, Yu Bai
Next, we show that this framework instantiated with the Optimistic Follow-The-Regularized-Leader (OFTRL) algorithm at each state (and smooth value updates) can find an $\mathcal{\widetilde{O}}(T^{-5/6})$ approximate NE in $T$ iterations, and a similar algorithm with slightly modified value update rule achieves a faster $\mathcal{\widetilde{O}}(T^{-1})$ convergence rate.
no code implementations • 6 Jun 2022 • Na Li, John D. Kelleher, Robert Ross
To this end, in this paper, we present our initial research centred on a user-avatar dialogue scenario that we have developed to study the manifestation of confusion and in the long term its mitigation.
no code implementations • 3 Jun 2022 • Na Li, Robert Ross
Human-robot studies are expensive to conduct and difficult to control, and as such researchers sometimes turn to human-avatar interaction in the hope of faster and cheaper data collection that can be transferred to the robot domain.
no code implementations • 1 May 2022 • Xiuxian Li, Lihua Xie, Na Li
And the local cost function of each agent is often time-varying in dynamic and even adversarial environments.
no code implementations • 8 Apr 2022 • Linwei Zhu, Yun Zhang, Na Li, Gangyi Jiang, Sam Kwong
To further improve the performance of intra coding in Versatile Video Coding (VVC), an intelligent intra mode derivation method is proposed in this paper, termed as Deep Learning based Intra Mode Derivation (DLIMD).
no code implementations • 4 Feb 2022 • Naijun Zheng, Na Li, Xixin Wu, Lingwei Meng, Jiawen Kang, Haibin Wu, Chao Weng, Dan Su, Helen Meng
This paper describes our speaker diarization system submitted to the Multi-channel Multi-party Meeting Transcription (M2MeT) challenge, where Mandarin meeting data were recorded in multi-channel format for diarization and automatic speech recognition (ASR) tasks.
no code implementations • 15 Dec 2021 • Yujie Tang, Vikram Ramanathan, Junshan Zhang, Na Li
We consider large scale distributed optimization over a set of edge devices connected to a central server, where the limited communication bandwidth between the server and edge devices imposes a significant bottleneck for the optimization procedure.
no code implementations • 31 Oct 2021 • YingYing Li, Subhro Das, Jeff Shamma, Na Li
We study the adaptive control of an unknown linear system with a quadratic cost function subject to safety constraints on both the states and actions.
no code implementations • 9 Oct 2021 • Xiaolong Zheng, Deyun Zhou, Na Li, Yu Lei, Tao Wu, Maoguo Gong
In the focus search strategy, if there is no knowledge source benefit the optimization of a task, then all knowledge sources in the task's pool are forbidden to be utilized except the task, which helps to improve the performance of the proposed algorithm.
no code implementations • 19 Jul 2021 • Na Li, Xinbo Zhao
We collect a new gait recognition database called OG RGB+D database, which breaks through the limitation of other gait databases and includes multimodal gait data of various occlusions (self-occlusion, active occlusion, and passive occlusion) by our multiple synchronous Azure Kinect DK sensors data acquisition system (multi-Kinect SDAS) that can be also applied in security situations.
no code implementations • 21 Jun 2021 • Na Li, Yao Liu
We further apply our proposed methods on super resolution model, which is the first to propose a spherical super-resolution model that directly operates on a mesh representation of spherical pixels of 360-degree data.
no code implementations • 1 Jun 2021 • Runyu Zhang, Zhaolin Ren, Na Li
We show that Nash equilibria (NEs) and first-order stationary policies are equivalent in this setting, and give a local convergence rate around strict NEs.
no code implementations • 16 Feb 2021 • Junshan Zhang, Na Li, Mehmet Dedeoglu
We consider a many-to-one wireless architecture for federated learning at the network edge, where multiple edge devices collaboratively train a model using local data.
2 code implementations • 8 Feb 2021 • Yang Zheng, Yujie Tang, Na Li
This paper revisits the classical Linear Quadratic Gaussian (LQG) control from a modern optimization perspective.
Policy Gradient Methods
Optimization and Control
Systems and Control
Systems and Control
Dynamical Systems
no code implementations • 27 Jan 2021 • Xin Chen, Guannan Qu, Yujie Tang, Steven Low, Na Li
With large-scale integration of renewable generation and distributed energy resources, modern power systems are confronted with new operational challenges, such as growing complexity, increasing uncertainty, and aggravating volatility.
no code implementations • 6 Jan 2021 • Maryam Motamedi, Jessica Dawson, Na Li, Douglas G. Down, Nancy M. Heddle
Platelet products are both expensive and have very short shelf lives.
no code implementations • 1 Jan 2021 • Yizhou Chen, Dong Li, Na Li, TONG LIANG, Shizhuo Zhang, Bryan Kian Hsiang Low
This paper presents a novel implicit process-based meta-learning (IPML) algorithm that, in contrast to existing works, explicitly represents each task as a continuous latent vector and models its probabilistic belief within the highly expressive IP framework.
no code implementations • 4 Dec 2020 • Na Li, Zied Bouraoui, Jose Camacho Collados, Luis Espinosa-Anke, Qing Gu, Steven Schockaert
While the success of pre-trained language models has largely eliminated the need for high-quality static word vectors in many NLP applications, such vectors continue to play an important role in tasks where words need to be modelled in the absence of linguistic context.
1 code implementation • 29 Nov 2020 • Na Li, Renyu Zhu, Xiaoxu Zhou, Xiangnan He, Wenyuan Cai, Ming Gao, Aoying Zhou
In this paper, we model the author disambiguation as a collaboration network reconstruction problem, and propose an incremental and unsupervised author disambiguation method, namely IUAD, which performs in a bottom-up manner.
no code implementations • NeurIPS 2020 • YingYing Li, Na Li
To address this question, we introduce a gradient-based online algorithm, Receding Horizon Inexact Gradient (RHIG), and analyze its performance by dynamic regrets in terms of the temporal variation of the environment and the prediction errors.
no code implementations • 3 Nov 2020 • Zhaolin Ren, Aoxiao Zhong, Na Li
In this work, we consider the general case where the target is allowed to be arbitrary, which we refer to as the LQR tracking problem.
2 code implementations • 28 Oct 2020 • Xu Li, Na Li, Chao Weng, Xunying Liu, Dan Su, Dong Yu, Helen Meng
This multiple scaling mechanism significantly improves the countermeasure's generalizability to unseen spoofing attacks.
no code implementations • 21 Oct 2020 • Jiaying Zhou, Xun Xian, Na Li, Jie Ding
In this paper, we propose a method named ASCII for an agent to improve its classification performance through assistance from other agents.
no code implementations • 11 Oct 2020 • Xin Chen, YingYing Li, Jun Shimada, Na Li
This paper studies the automated control method for regulating air conditioner (AC) loads in incentive-based residential demand response (DR).
no code implementations • 10 Oct 2020 • YingYing Li, Subhro Das, Na Li
We show that OGD-BZ can achieve a policy regret upper bound that is the square root of the horizon length multiplied by some logarithmic terms of the horizon length under proper algorithm parameters.
no code implementations • 8 Oct 2020 • Cheng Shen, Jianghua Ying, Le Liu, Jianpeng Liu, Na Li, Shuopei Wang, Jian Tang, Yanchong Zhao, Yanbang Chu, Kenji Watanabe, Takashi Taniguchi, Rong Yang, Dongxia Shi, Fanming Qu, Li Lu, Wei Yang, Guangyu Zhang
For {\theta}=1. 25{\deg}, we observe an emergence of topological insulating states at hole side with a sequence of Chern number |C|=4-|v|, where v is the number of electrons (holes) in moir\'e unite cell.
Mesoscale and Nanoscale Physics Materials Science
1 code implementation • 7 Sep 2020 • Chao Zhu, Teng Miao, Tongyu Xu, Tao Yang, Na Li
However, automatic stem-leaf segmentation of maize shoots from three-dimensional (3D) point clouds remains challenging, especially for new emerging leaves that are very close and wrapped together during the seedling stage.
1 code implementation • 1 Sep 2020 • Yang Zheng, Na Li
For unstable systems, our results suggest that the Markov parameters are harder to estimate in the presence of process noise.
Optimization and Control Systems and Control Systems and Control Dynamical Systems
no code implementations • 11 Jun 2020 • Xu Li, Na Li, Jinghua Zhong, Xixin Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng
Orthogonal to prior approaches, this work proposes to defend ASV systems against adversarial attacks with a separate detection network, rather than augmenting adversarial data into ASV training.
no code implementations • NeurIPS 2020 • Guannan Qu, Yiheng Lin, Adam Wierman, Na Li
It has long been recognized that multi-agent reinforcement learning (MARL) faces significant scalability issues due to the fact that the size of the state and action spaces are exponentially large in the number of agents.
Multi-agent Reinforcement Learning
reinforcement-learning
+1
no code implementations • L4DC 2020 • Guannan Qu, Adam Wierman, Na Li
We study reinforcement learning (RL) in a setting with a network of agents whose states and actions interact in a local manner where the objective is to find localized policies such that the (discounted) global reward is maximized.
no code implementations • 7 May 2020 • Xin Chen, Na Li
This method is applicable to aggregate only the active (or reactive) power, and the joint active-reactive power domain.
no code implementations • 27 Apr 2020 • Na Li, Xinbo Zhao, Chong Ma
However, appearance features suffer from external factors which can cause drastic appearance variations, e. g. clothing.
no code implementations • 20 Mar 2020 • YingYing Li, Qinran Hu, Na Li
One challenge in the optimization and control of societal systems is to handle the unknown and uncertain user behavior.
no code implementations • 7 Mar 2020 • Xin Chen, Yutong Nie, Na Li
Residential loads have great potential to enhance the efficiency and reliability of electricity systems via demand response (DR) programs.
no code implementations • 20 Feb 2020 • Hanshu Cai, Yiwen Gao, Shuting Sun, Na Li, Fuze Tian, Han Xiao, Jianxiu Li, Zhengwu Yang, Xiaowei Li, Qinglin Zhao, Zhenyu Liu, Zhijun Yao, Minqiang Yang, Hong Peng, Jing Zhu, Xiaowei Zhang, Guoping Gao, Fang Zheng, Rui Li, Zhihua Guo, Rong Ma, Jing Yang, Lan Zhang, Xiping Hu, Yumin Li, Bin Hu
The EEG dataset includes not only data collected using traditional 128-electrodes mounted elastic cap, but also a novel wearable 3-electrode EEG collector for pervasive applications.
1 code implementation • L4DC 2020 • Ying-Ying Li, Yujie Tang, Runyu Zhang, Na Li
We propose a Zero-Order Distributed Policy Optimization algorithm (ZODPO) that learns linear local controllers in a distributed fashion, leveraging the ideas of policy gradient, zero-order optimization and consensus algorithms.
no code implementations • 5 Dec 2019 • Guannan Qu, Adam Wierman, Na Li
We study reinforcement learning (RL) in a setting with a network of agents whose states and actions interact in a local manner where the objective is to find localized policies such that the (discounted) global reward is maximized.
no code implementations • 15 Sep 2019 • Guannan Qu, Na Li
Further, under some special conditions, we prove that the gap between the approximated reward function and the true reward function is decaying exponentially fast as the length of the truncated Markov process gets longer.
1 code implementation • 29 Aug 2019 • Rui Guo, Ronghua Liu, Na Li, Wei Liu
Current VHR(Very High Resolution) satellite images enable the detailed monitoring of the earth and can capture the ongoing works of railway construction.
1 code implementation • NeurIPS 2019 • Ying-Ying Li, Xin Chen, Na Li
In addition, we provide a fundamental limit of the dynamic regret for any online algorithms by considering linear quadratic tracking problems.
Optimization and Control
no code implementations • 16 May 2019 • Jun Wang, Dan Su, Jie Chen, Shulin Feng, Dongpeng Ma, Na Li, Dong Yu
We propose a novel method which simultaneously models both the sequence discriminative training and the feature discriminative learning within a single network architecture, so that it can learn discriminative deep features in sequence training that obviates the need for presegmented training data.
no code implementations • 26 Feb 2019 • Sindri Magnússon, Hossein Shokri-Ghadikolaei, Na Li
The communication time of these algorithms follows a complex interplay between a) the algorithm's convergence properties, b) the compression scheme, and c) the transmission rate offered by the digital channel.
no code implementations • 9 Aug 2017 • Na Li, Arnaud Martin, Rémi Estival
Detection of surface water in natural environment via multi-spectral imagery has been widely utilized in many fields, such land cover identification.