Search Results for author: Jian Hu

Found 17 papers, 6 papers with code

Discriminative Partial Domain Adversarial Network

no code implementations ECCV 2020 Jian Hu, Hongya Tuo, Chao Wang, Lingfeng Qiao, Haowen Zhong, Junchi Yan, Zhongliang Jing, Henry Leung

Previous methods typically match the whole source domain to target domain, which causes negative transfer due to the source-negative classes in source domain that does not exist in target domain.

Partial Domain Adaptation Transfer Learning

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

1 code implementation20 May 2024 Jian Hu, Xibin Wu, Weixun Wang, Xianyu, Dehao Zhang, Yu Cao

However, unlike pretraining or fine-tuning a single model, scaling reinforcement learning from human feedback (RLHF) for training large language models poses coordination challenges across four models.

reinforcement-learning Scheduling

Aligning Language Models with Offline Learning from Human Feedback

1 code implementation23 Aug 2023 Jian Hu, Li Tao, June Yang, Chandler Zhou

Learning from human preferences is crucial for language models (LMs) to effectively cater to human needs and societal values.

reinforcement-learning Reinforcement Learning (RL)

Pixel-wise Graph Attention Networks for Person Re-identification

1 code implementation18 Jul 2023 Wenyu Zhang, Qing Ding, Jian Hu, Yi Ma, Mingzhe Lu

Based on these two modules, we consulted the ResNet and design a pixel-wise graph attention network (PGANet).

Graph Attention Graph Generation +1

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

no code implementations1 Jul 2023 Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong Liu

Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations.

Benchmarking Data Augmentation +5

Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling

1 code implementation2 Jun 2022 Jian Hu, Haowen Zhong, Junchi Yan, Shaogang Gong, Guile Wu, Fei Yang

However, due to the significant imbalance between the amount of annotated data in the source and target domains, usually only the target distribution is aligned to the source domain, leading to adapting unnecessary source specific knowledge to the target domain, i. e., biased domain adaptation.

Domain Adaptation Pseudo Label +1

Feature-Distribution Perturbation and Calibration for Generalized Person ReID

no code implementations23 May 2022 Qilei Li, Jiabo Huang, Jian Hu, Shaogang Gong

In this work, we propose a Feature-Distribution Perturbation and Calibration (PECA) method to derive generic feature representations for person ReID, which is not only discriminative across cameras but also agnostic and deployable to arbitrary unseen target domains.

Person Re-Identification

Semi-supervised t-SNE for Millimeter-wave Wireless Localization

no code implementations26 Nov 2021 Junquan Deng, Wei Shi, Jian Hu, Xianlong Jiao

We consider the mobile localization problem in future millimeter-wave wireless networks with distributed Base Stations (BSs) based on multi-antenna channel state information (CSI).

Revisiting the Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning

no code implementations29 Sep 2021 Jian Hu, Siyang Jiang, Seth Austin Harding, Haibin Wu, Shih-wei Liao

QMIX, a popular MARL algorithm based on the monotonicity constraint, has been used as a baseline for the benchmark environments, such as Starcraft Multi-Agent Challenge (SMAC), Predator-Prey (PP).

reinforcement-learning Reinforcement Learning (RL) +2

Self-Adaptive Partial Domain Adaptation

no code implementations18 Sep 2021 Jian Hu, Hongya Tuo, Shizhao Zhang, Chao Wang, Haowen Zhong, Zhikang Zou, Zhongliang Jing, Henry Leung, Ruping Zou

Partial Domain adaptation (PDA) aims to solve a more practical cross-domain learning problem that assumes target label space is a subset of source label space.

Partial Domain Adaptation

Domain Adaptive YOLO for One-Stage Cross-Domain Detection

no code implementations26 Jun 2021 Shizhao Zhang, Hongya Tuo, Jian Hu, Zhongliang Jing

Multi-scale instance level features alignment is presented to reduce instance domain shift effectively , such as variations in object appearance and viewpoint.

Domain Adaptation

Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning

2 code implementations6 Feb 2021 Jian Hu, Siyang Jiang, Seth Austin Harding, Haibin Wu, Shih-wei Liao

Multi-Agent Reinforcement Learning (MARL) has seen revolutionary breakthroughs with its successful application to multi-agent cooperative tasks such as computer games and robot swarms.

reinforcement-learning Reinforcement Learning (RL) +3

QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning

no code implementations9 Sep 2020 Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao

Existing methods such as Value Decomposition Network (VDN) and QMIX estimate the value of long-term returns as a scalar that does not contain the information of randomness.

reinforcement-learning Reinforcement Learning (RL) +2

A hybrid model based on deep LSTM for predicting high-dimensional chaotic systems

no code implementations21 Jan 2020 Youming Lei, Jian Hu, Jianpeng Ding

The numerical results show that the proposed method can effectively avoid the rapid divergence of the multi-layer LSTM model when reconstructing chaotic attractors, and demonstrate the feasibility of the combination of deep learning based on the gradient descent method and the empirical model.

The Trajectory of Voice Onset Time with Vocal Aging

no code implementations15 Oct 2018 Xuanda Chen, Ziyu Xiong, Jian Hu

Vocal aging, a universal process of human aging, can largely affect one's language use, possibly including some subtle acoustic features of one's utterances like Voice Onset Time.

Human Aging

Cannot find the paper you are looking for? You can Submit a new open access paper.