Search Results for author: Yuhang Jiang

Found 10 papers, 3 papers with code

COVID-19 event extraction from Twitter via extractive question answering with continuous prompts

2 code implementations • 19 Mar 2023 • Yuhang Jiang, Ramakanth Kavuluru

As COVID-19 ravages the world, social media analytics could augment traditional surveys in assessing how the pandemic evolves and capturing consumer chatter that could help healthcare agencies in addressing it.

Benchmarking, Event Extraction, +2 more

Fast Hardware-Aware Neural Architecture Search

1 code implementation • 25 Oct 2019 • Li Lyna Zhang, Yuqing Yang, Yuhang Jiang, Wenwu Zhu, Yunxin Liu

Unlike previous approaches that apply search algorithms on a small, human-designed search space without considering hardware diversity, we propose HURRICANE that explores the automatic hardware-aware search over a much larger search space and a two-stage search algorithm, to efficiently generate tailored models for different types of hardware.

Hardware Aware Neural Architecture Search, Neural Architecture Search

Deep learning for video game genre classification

no code implementations • 21 Nov 2020 • Yuhang Jiang, Lukun Zheng

Video game genre classification based on its cover and textual description would be utterly beneficial to many modern identification, collocation, and retrieval systems.

Classification, Cultural Vocal Bursts Intensity Prediction, +3 more

Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning

no code implementations • 24 Feb 2021 • Jianzhun Shao, Hongchang Zhang, Yuhang Jiang, Shuncheng He, Xiangyang Ji

Reward decomposition is a critical problem in the centralized training with decentralized execution (CTDE) paradigm for multi-agent reinforcement learning.

Meta-Learning, Multi-agent Reinforcement Learning, +4 more

Reducing Conservativeness Oriented Offline Reinforcement Learning

no code implementations • 27 Feb 2021 • Hongchang Zhang, Jianzhun Shao, Yuhang Jiang, Shuncheng He, Xiangyang Ji

In offline reinforcement learning, a policy learns to maximize cumulative rewards with a fixed collection of data.

D4RL, reinforcement-learning, +1 more

Wasserstein Unsupervised Reinforcement Learning

no code implementations • 15 Oct 2021 • Shuncheng He, Yuhang Jiang, Hongchang Zhang, Jianzhun Shao, Xiangyang Ji

These pre-trained policies can accelerate learning when endowed with external reward, and can also be used as primitive options in hierarchical reinforcement learning.

Hierarchical Reinforcement Learning, reinforcement-learning, +2 more

Improved lightweight identification of agricultural diseases based on MobileNetV3

no code implementations • 19 Jul 2022 • Yuhang Jiang, Wenping Tong

At present, models for identifying agricultural pests and diseases are not lightweight enough and are difficult to apply.

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

no code implementations • 15 Oct 2022 • Zihan Zhang, Yuhang Jiang, Yuan Zhou, Xiangyang Ji

Meanwhile, we show that to achieve $\tilde{O}(\mathrm{poly}(S, A, H)\sqrt{K})$ regret, the number of batches is at least $\Omega\left(H/\log_A(K)+ \log_2\log_2(K) \right)$, which matches our upper bound up to logarithmic terms.

reinforcement-learning, Reinforcement Learning (RL)
