no code implementations • 9 Jun 2025 • Ziwen Wang, Jiajun Fan, Ruihan Guo, Thao Nguyen, Heng Ji, Ge Liu
Protein generative models have shown remarkable promise in protein design but still face limitations in success rate, due to the scarcity of high-quality protein datasets for supervised pretraining.
no code implementations • 14 Apr 2025 • Chaoran Cheng, Jiahan Li, Jiajun Fan, Ge Liu
Recent efforts have extended the flow-matching framework to discrete generative modeling.
1 code implementation • 17 Aug 2024 • Hao Li, Hao Jiang, Jiajun Fan, Dongsheng Ye, Liang Du
This paper introduces the Dynamic Neural Dowker Network (DNDN), a novel framework specifically designed to approximate the results of dynamic Dowker filtration, aiming to capture the high-order topological features of dynamic directed graphs.
1 code implementation • 6 Jul 2024 • Ye Li, Chen Tang, Yuan Meng, Jiajun Fan, Zenghao Chai, Xinzhu Ma, Zhi Wang, Wenwu Zhu
We introduce PRANCE, a Vision Transformer compression framework that jointly optimizes the activated channels and reduces tokens, based on the characteristics of inputs.
no code implementations • 1 Jul 2024 • Hao Wang, Zhichao Chen, Yuan Shen, Jiajun Fan, Zhaoran Liu, Degui Yang, Xinggao Liu, Haoxuan Li
Heterogeneous treatment effect (HTE) estimation from observational data poses significant challenges due to treatment selection bias.
no code implementations • 5 Aug 2023 • Hao Wang, Jianxun Lian, Mingqi Wu, Haoxuan Li, Jiajun Fan, Wanyue Xu, Chaozhuo Li, Xing Xie
Sequential user modeling, a critical task in personalized recommender systems, focuses on predicting the next item a user would prefer, requiring a deep understanding of user behavior sequences.
no code implementations • 9 May 2023 • Jiajun Fan, Yuzheng Zhuang, Yuecheng Liu, Jianye Hao, Bin Wang, Jiangcheng Zhu, Hao Wang, Shu-Tao Xia
The exploration problem is one of the main challenges in deep reinforcement learning (RL).
Ranked #1 on Atari Games on Atari-57
no code implementations • 20 Oct 2022 • Hao Wang, Zhichao Chen, Jiajun Fan, Yuxin Huang, Weiming Liu, Xinggao Liu
As a basic research problem for building effective recommender systems, post-click conversion rate (CVR) estimation has long been plagued by sample selection bias and data sparsity issues.
no code implementations • 7 Jun 2022 • Jiajun Fan, Changnan Xiao
Then, we cast these two problems into the training data distribution optimization problem, namely to obtain desired training data within limited interactions, and address them concurrently via i) explicit modeling and control of the capacity and diversity of behavior policy and ii) more fine-grained and adaptive control of selective/sampling distribution of the behavior policy using a monotonic data distribution optimization.
Ranked #1 on Atari Games on atari game
no code implementations • 8 Dec 2021 • Jiajun Fan
From Deep Q-Networks (DQN) to Agent57, RL agents seem to achieve superhuman performance in ALE.
no code implementations • 11 Jun 2021 • Jiajun Fan, Changnan Xiao, Yue Huang
Deep Q-Network (DQN) first opened the door to deep reinforcement learning (DRL) by combining deep learning (DL) with reinforcement learning (RL), and observed that the distribution of the acquired data changes during the training process.
Ranked #1 on Atari Games on Atari 2600 Freeway
no code implementations • 1 Jun 2021 • Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng
We find that value-based reinforcement learning methods with an ε-greedy mechanism enjoy three characteristics, Closed-form Diversity, Objective-invariant Exploration, and Adaptive Trade-off, which help value-based methods avoid the policy collapse problem.
no code implementations • 9 May 2021 • Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng, Haiyan Yin
We study the problem of model-free reinforcement learning, which is often solved following the principle of Generalized Policy Iteration (GPI).
no code implementations • 13 Nov 2020 • Jiajun Fan, He Ba, Xian Guo, Jianye Hao
Extensive experiments demonstrate that Critic PI2 achieved a new state of the art in a range of challenging continuous domains.