Search Results for author: Chao Qu

Found 24 papers, 5 papers with code

Subequivariant Reinforcement Learning Framework for Coordinated Motion Control

no code implementations22 Mar 2024 Haoyu Wang, Xiaoyu Tan, Xihe Qiu, Chao Qu

Effective coordination is crucial for motion control with reinforcement learning, especially as the complexity of agents and their motions increases.

reinforcement-learning

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching

no code implementations9 Dec 2023 Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu, Yuan Qi

Instruction fine-tuning has conventionally been employed to adapt Large Language Models (LLMs) to a variety of tasks.

In-Context Learning

LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints

1 code implementation27 Sep 2023 Weidi Xu, Jingwei Wang, Lele Xie, Jianshan He, Hongting Zhou, Taifeng Wang, Xiaopei Wan, Jingdong Chen, Chao Qu, Wei Chu

Integrating first-order logic constraints (FOLCs) with neural networks is a crucial but challenging problem since it involves modeling intricate correlations to satisfy the constraints.

Variational Inference

Gram-based Attentive Neural Ordinary Differential Equations Network for Video Nystagmography Classification

1 code implementation ICCV 2023 Xihe Qiu, Shaojie Shi, Xiaoyu Tan, Chao Qu, Zhijun Fang, Hailing Wang, Yongbin Gao, Peixia Wu, Huawei Li

Video nystagmography (VNG) is the diagnostic gold standard of benign paroxysmal positional vertigo (BPPV), which requires medical professionals to examine the direction, frequency, intensity, duration, and variation in the strength of nystagmus on a VNG video.

Classification

A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud

1 code implementation31 May 2022 Siqiao Xue, Chao Qu, Xiaoming Shi, Cong Liao, Shiyi Zhu, Xiaoyu Tan, Lintao Ma, Shiyu Wang, Shijun Wang, Yun Hu, Lei Lei, Yangfei Zheng, Jianguo Li, James Zhang

Predictive autoscaling (autoscaling with workload forecasting) is an important mechanism that supports autonomous adjustment of computing resources in accordance with fluctuating workload demands in the Cloud.

Decision Making Management +3

Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes

1 code implementation29 Jan 2022 Chao Qu, Xiaoyu Tan, Siqiao Xue, Xiaoming Shi, James Zhang, Hongyuan Mei

We consider a sequential decision making problem where the agent faces the environment characterized by the stochastic discrete events and seeks an optimal intervention policy such that its long-term reward is maximized.

Decision Making Model-based Reinforcement Learning +3

Bayesian Deep Basis Fitting for Depth Completion with Uncertainty

no code implementations ICCV 2021 Chao Qu, Wenxin Liu, Camillo J. Taylor

By adopting a Bayesian treatment, our Bayesian Deep Basis Fitting (BDBF) approach is able to 1) predict high-quality uncertainty estimates and 2) enable depth completion with few or no sparse measurements.

Depth Completion Depth Estimation +1

Model Embedding Model-Based Reinforcement Learning

no code implementations16 Jun 2020 Xiaoyu Tan, Chao Qu, Junwu Xiong, James Zhang

Model-based reinforcement learning (MBRL) has shown its advantages in sample-efficiency over model-free reinforcement learning (MFRL).

Model-based Reinforcement Learning reinforcement-learning +1

Variational Policy Propagation for Multi-agent Reinforcement Learning

no code implementations19 Apr 2020 Chao Qu, Hui Li, Chang Liu, Junwu Xiong, James Zhang, Wei Chu, Weiqiang Wang, Yuan Qi, Le Song

We propose a \emph{collaborative} multi-agent reinforcement learning algorithm named variational policy propagation (VPP) to learn a \emph{joint} policy through the interactions over agents.

Multi-agent Reinforcement Learning reinforcement-learning +2

SLOAM: Semantic Lidar Odometry and Mapping for Forest Inventory

no code implementations29 Dec 2019 Steven W. Chen, Guilherme V. Nardari, Elijah S. Lee, Chao Qu, Xu Liu, Roseli A. F. Romero, Vijay Kumar

This paper describes an end-to-end pipeline for tree diameter estimation based on semantic segmentation and lidar odometry and mapping.

Segmentation Semantic Segmentation

Depth Completion via Deep Basis Fitting

no code implementations21 Dec 2019 Chao Qu, Ty Nguyen, Camillo J. Taylor

In this paper we consider the task of image-guided depth completion where our system must infer the depth at every pixel of an input image based on the image content and a sparse set of depth measurements.

Depth Completion

S2VG: Soft Stochastic Value Gradient method

no code implementations25 Sep 2019 Xiaoyu Tan, Chao Qu, Junwu Xiong, James Zhang

In this paper, we propose a simple and elegant model-based reinforcement learning algorithm called soft stochastic value gradient method (S2VG).

Model-based Reinforcement Learning reinforcement-learning +1

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

no code implementations NeurIPS 2019 Chao Qu, Shie Mannor, Huan Xu, Yuan Qi, Le Song, Junwu Xiong

To the best of our knowledge, it is the first MARL algorithm with convergence guarantee in the control, off-policy and non-linear function approximation setting.

Multi-agent Reinforcement Learning reinforcement-learning +1

Projection-Free Algorithms in Statistical Estimation

no code implementations20 May 2018 Yan Li, Chao Qu, Huan Xu

Recently people have reduced the gradient evaluation complexity of FW algorithm to $\log(\frac{1}{\epsilon})$ for the smooth and strongly convex objective.

Communication-Efficient Projection-Free Algorithm for Distributed Optimization

no code implementations20 May 2018 Yan Li, Chao Qu, Huan Xu

We demonstrate this advantage and show that the linear oracle complexity can be reduced to almost the same order of magnitude as the communication complexity, when the feasible set is polyhedral.

Distributed Optimization Matrix Completion

Robust Fruit Counting: Combining Deep Learning, Tracking, and Structure from Motion

no code implementations1 Apr 2018 Xu Liu, Steven W. Chen, Shreyas Aditya, Nivedha Sivakumar, Sandeep Dcunha, Chao Qu, Camillo J. Taylor, Jnaneshwar Das, Vijay Kumar

We present a novel fruit counting pipeline that combines deep segmentation, frame to frame tracking, and 3D localization to accurately count visible fruits across a sequence of images.

Fast Global Convergence via Landscape of Empirical Loss

no code implementations13 Feb 2018 Chao Qu, Yan Li, Huan Xu

While optimizing convex objective (loss) functions has been a powerhouse for machine learning for at least two decades, non-convex loss functions have attracted fast growing interests recently, due to many desirable properties such as superior robustness and classification accuracy, compared with their convex counterparts.

General Classification

Fast, Autonomous Flight in GPS-Denied and Cluttered Environments

no code implementations6 Dec 2017 Kartik Mohta, Michael Watterson, Yash Mulgaonkar, Sikang Liu, Chao Qu, Anurag Makineni, Kelsey Saulnier, Ke Sun, Alex Zhu, Jeffrey Delmerico, Konstantinos Karydis, Nikolay Atanasov, Giuseppe Loianno, Davide Scaramuzza, Kostas Daniilidis, Camillo Jose Taylor, Vijay Kumar

One of the most challenging tasks for a flying robot is to autonomously navigate between target locations quickly and reliably while avoiding obstacles in its path, and with little to no a-priori knowledge of the operating environment.

Robotics

SAGA and Restricted Strong Convexity

no code implementations19 Feb 2017 Chao Qu, Yan Li, Huan Xu

SAGA is a fast incremental gradient method on the finite sum problem and its effectiveness has been tested on a vast of applications.

regression

Linear convergence of SDCA in statistical estimation

no code implementations26 Jan 2017 Chao Qu, Huan Xu

In this paper, we consider stochastic dual coordinate (SDCA) {\em without} strongly convex assumption or convex assumption.

regression

Linear Convergence of SVRG in Statistical Estimation

no code implementations7 Nov 2016 Chao Qu, Yan Li, Huan Xu

SVRG and its variants are among the state of art optimization algorithms for large scale machine learning problems.

Subspace Clustering with Irrelevant Features via Robust Dantzig Selector

no code implementations NeurIPS 2015 Chao Qu, Huan Xu

This paper considers the subspace clustering problem where the data contains irrelevant or corrupted features.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.