Search Results for author: Haochen Zhang

Found 16 papers, 1 papers with code

IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D Scenes

no code implementations20 Mar 2025 Haochen Zhang, Nader Zantout, Pujith Kachana, Ji Zhang, Wenshan Wang

With this benchmark, we aim to provide a resource for 3D scene understanding that aids the development of robust, interactive navigation systems.

Gap-Dependent Bounds for Federated $Q$-learning

no code implementations5 Feb 2025 Haochen Zhang, Zhong Zheng, Lingzhou Xue

We present the first gap-dependent analysis of regret and communication cost for on-policy federated $Q$-Learning in tabular episodic finite-horizon Markov decision processes (MDPs).

Q-Learning

Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition

no code implementations10 Oct 2024 Zhong Zheng, Haochen Zhang, Lingzhou Xue

To our knowledge, this paper presents the first gap-dependent regret analysis for Q-learning using variance estimators and reference-advantage decomposition and also provides the first gap-dependent analysis on policy switching cost for Q-learning.

Q-Learning

Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost

no code implementations29 May 2024 Zhong Zheng, Haochen Zhang, Lingzhou Xue

In this paper, we consider model-free federated reinforcement learning for tabular episodic Markov decision processes.

Q-Learning

Jellyfish: A Large Language Model for Data Preprocessing

no code implementations4 Dec 2023 Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada

We select a collection of datasets across four representative DP tasks and construct instruction tuning data using data configuration, knowledge injection, and reasoning data distillation techniques tailored to DP.

Imputation Language Modeling +2

Adaptive Liquidity Provision in Uniswap V3 with Deep Reinforcement Learning

no code implementations18 Sep 2023 Haochen Zhang, Xi Chen, Lin F. Yang

The DRL policy aims to optimize trading fees earned by LPs against associated costs, such as gas fees and hedging expenses, which is referred to as loss-versus-rebalancing (LVR).

Asset Management Deep Reinforcement Learning +1

Large Language Models as Data Preprocessors

no code implementations30 Aug 2023 Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada

Large Language Models (LLMs), typified by OpenAI's GPT, have marked a significant advancement in artificial intelligence.

feature selection Imputation +1

Resilient Distribution System Restoration with Communication Recovery by Drone Small Cells

no code implementations31 Mar 2022 Haochen Zhang, Chen Chen, Shunbo Lei, Zhaohong Bie

Distribution system (DS) restoration after natural disasters often faces the challenge of communication failures to feeder automation (FA) facilities, resulting in prolonged load pick-up process.

Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective

no code implementations ICCV 2021 Wei Wang, Haochen Zhang, Zehuan Yuan, Changhu Wang

A popular attempts towards the challenge is unpaired generative adversarial networks, which generate "real" LR counterparts from real HR images using image-to-image translation and then perform super-resolution from "real" LR->SR.

Domain Adaptation Image-to-Image Translation +1

Learning by Passing Tests, with Application to Neural Architecture Search

no code implementations30 Nov 2020 Xuefeng Du, Haochen Zhang, Pengtao Xie

We propose a multi-level optimization framework to formulate LPT, where the tester learns to create difficult and meaningful tests and the learner learns to pass these tests.

Neural Architecture Search

Is There Tradeoff between Spatial and Temporal in Video Super-Resolution?

no code implementations13 Mar 2020 Haochen Zhang, Dong Liu, Zhiwei Xiong

Recent advances of deep learning lead to great success of image and video super-resolution (SR) methods that are based on convolutional neural networks (CNN).

Video Super-Resolution

On The Classification-Distortion-Perception Tradeoff

no code implementations NeurIPS 2019 Dong Liu, Haochen Zhang, Zhiwei Xiong

In this paper, we extend the previous perception-distortion tradeoff to the case of classification-distortion-perception (CDP) tradeoff, where we introduced the classification error rate of the restored signal in addition to distortion and perceptual difference.

Classification General Classification

Two-Stream Action Recognition-Oriented Video Super-Resolution

1 code implementation ICCV 2019 Haochen Zhang, Dong Liu, Zhiwei Xiong

Tailored for two-stream action recognition networks, we propose two video SR methods for the spatial and temporal streams respectively.

Action Recognition Optical Flow Estimation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.