Search Results for author: Xu He

Found 29 papers, 11 papers with code

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection

no code implementations5 Sep 2024 Yewen Li, Chaojie Wang, Xiaobo Xia, Xu He, Ruyi An, Dong Li, Tongliang Liu, Bo An, Xinrun Wang

Therefore, we appeal for more attention to incremental effectiveness on likelihood, i. e., whether a method could always surpass or at least match the performance of likelihood in U-OOD detection.

Out-of-Distribution Detection

MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

no code implementations26 Aug 2024 Xu He, Xiaoyu Li, Di Kang, Jiangnan Ye, Chaopeng Zhang, Liyang Chen, Xiangjun Gao, Han Zhang, Zhiyong Wu, Haolin Zhuang

Existing works in single-image human reconstruction suffer from weak generalizability due to insufficient training data or 3D inconsistencies for a lack of comprehensive multi-view knowledge.

3D Human Reconstruction Novel View Synthesis

Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines

1 code implementation20 Jun 2024 Xinyi Ying, Chao Xiao, Ruojing Li, Xu He, Boyang Li, Zhaoxu Li, Yingqian Wang, Mingyuan Hu, Qingyu Xu, Zaiping Lin, Miao Li, Shilin Zhou, Wei An, Weidong Sheng, Li Liu

Based on the proposed RGBT-Tiny dataset and SAFit measure, extensive evaluations have been conducted, including 23 recent state-of-the-art algorithms that cover four different types (i. e., visible generic detection, visible SOD, thermal SOD and RGBT object detection).

Diversity object-detection +1

MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time

no code implementations25 May 2024 Jikun Kang, Xin Zhe Li, Xi Chen, Amirreza Kazemi, Qianyi Sun, Boxing Chen, Dong Li, Xu He, Quan He, Feng Wen, Jianye Hao, Jun Yao

Although Large Language Models (LLMs) achieve remarkable performance across various tasks, they often struggle with complex reasoning tasks, such as answering mathematical questions.

GSM8K Math +1

iVideoGPT: Interactive VideoGPTs are Scalable World Models

1 code implementation24 May 2024 Jialong Wu, Shaofeng Yin, Ningya Feng, Xu He, Dong Li, Jianye Hao, Mingsheng Long

This work introduces Interactive VideoGPT (iVideoGPT), a scalable autoregressive transformer framework that integrates multimodal signals--visual observations, actions, and rewards--into a sequence of tokens, facilitating an interactive experience of agents via next-token prediction.

Decision Making Model-based Reinforcement Learning +3

Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

1 code implementation CVPR 2024 Xu He, Qiaochu Huang, Zhensong Zhang, Zhiwei Lin, Zhiyong Wu, Sicheng Yang, Minglei Li, Zhiyi Chen, Songcen Xu, Xiaofei Wu

While previous works mostly generate structural human skeletons, resulting in the omission of appearance information, we focus on the direct generation of audio-driven co-speech gesture videos in this work.

Video Generation

Improving Unsupervised Hierarchical Representation with Reinforcement Learning

1 code implementation CVPR 2024 Ruyi An, Yewen Li, Xu He, Pengjie Gu, Mengchen Zhao, Dong Li, Jianye Hao, Chaojie Wang, Bo An, Mingyuan Zhou

To address this issue we first analyze the shortcomings of existing methods for mitigating the "posterior collapse" from an information theory perspective then highlight the necessity of regularization for explicitly propagating data information to higher-level latent variables while maintaining the dependency between different levels.

reinforcement-learning Reinforcement Learning +1

Dynamic Embedding Size Search with Minimum Regret for Streaming Recommender System

no code implementations15 Aug 2023 Bowei He, Xu He, Renrui Zhang, Yingxue Zhang, Ruiming Tang, Chen Ma

The high-throughput data requires the model to be updated in a timely manner for capturing the user interest dynamics, which leads to the emergence of streaming recommender systems.

Recommendation Systems

Dynamically Expandable Graph Convolution for Streaming Recommendation

1 code implementation21 Mar 2023 Bowei He, Xu He, Yingxue Zhang, Ruiming Tang, Chen Ma

Personalized recommender systems have been widely studied and deployed to reduce information overload and satisfy users' diverse needs.

Graph Learning Recommendation Systems

DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities

no code implementations15 Dec 2021 Shuo Sun, Wanqi Xue, Rundong Wang, Xu He, Junlei Zhu, Jian Li, Bo An

Reinforcement learning (RL) techniques have shown great success in many challenging quantitative trading tasks, such as portfolio management and algorithmic trading.

Algorithmic Trading Decision Making +4

RMIX: Learning Risk-Sensitive Policies forCooperative Reinforcement Learning Agents

no code implementations NeurIPS 2021 Wei Qiu, Xinrun Wang, Runsheng Yu, Rundong Wang, Xu He, Bo An, Svetlana Obraztsova, Zinovi Rabinovich

Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE).

Multi-agent Reinforcement Learning quantile regression +5

RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents

no code implementations16 Feb 2021 Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich

Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE).

Multi-agent Reinforcement Learning quantile regression +5

RMIX: Risk-Sensitive Multi-Agent Reinforcement Learning

no code implementations1 Jan 2021 Wei Qiu, Xinrun Wang, Runsheng Yu, Xu He, Rundong Wang, Bo An, Svetlana Obraztsova, Zinovi Rabinovich

Centralized training with decentralized execution (CTDE) has become an important paradigm in multi-agent reinforcement learning (MARL).

Multi-agent Reinforcement Learning reinforcement-learning +4

Personalized Adaptive Meta Learning for Cold-start User Preference Prediction

no code implementations22 Dec 2020 Runsheng Yu, Yu Gong, Xu He, Bo An, Yu Zhu, Qingwen Liu, Wenwu Ou

Recently, many existing studies regard the cold-start personalized preference prediction as a few-shot learning problem, where each user is the task and recommended items are the classes, and the gradient-based meta learning method (MAML) is leveraged to address this challenge.

Few-Shot Learning

TB2J: a python package for computing magnetic interaction parameters

1 code implementation3 Sep 2020 Xu He, Nicole Helbig, Matthieu J. Verstraete, Eric Bousquet

We present TB2J, a Python package for the automatic computation of magnetic interactions, including exchange and Dzyaloshinskii-Moriya interactions, between atoms of magnetic crystals from the results of density functional calculations.

Materials Science

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

no code implementations21 Aug 2020 Xu He, Bo An, Yanghua Li, Haikai Chen, Qingyu Guo, Xin Li, Zhirong Wang

First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set.

Continual Learning from the Perspective of Compression

no code implementations ICML Workshop LifelongML 2020 Xu He, Min Lin

We compare these approaches in terms of both compression and forgetting and empirically study the reasons that limit the performance of continual learning methods based on variational posterior approximation.

Continual Learning

Learning Behaviors with Uncertain Human Feedback

1 code implementation7 Jun 2020 Xu He, Haipeng Chen, Bo An

However, previous works rarely consider the uncertainty when humans provide feedback, especially in cases that the optimal actions are not obvious to the trainers.

Learning from Naturalistic Driving Data for Human-like Autonomous Highway Driving

no code implementations23 May 2020 Donghao Xu, Zhezhang Ding, Xu He, Huijing Zhao, Mathieu Moze, François Aioun, Franck Guillemard

In this study, a method of learning cost parameters of a motion planner from naturalistic driving data is proposed.

Motion Planning

PyProcar: A Python library for electronic structure pre/post-processing

1 code implementation26 Jun 2019 Uthpala Herath, Pedram Tavadze, Xu He, Eric Bousquet, Sobhit Singh, Francisco Muñoz, Aldo H. Romero

A file with a specific property evaluated for each $k$-point in a $k-$mesh and for each band can be used to project other properties such as electron-phonon mean path, Fermi velocity, electron effective mass, etc.

Materials Science

Task Agnostic Continual Learning via Meta Learning

no code implementations ICML Workshop LifelongML 2020 Xu He, Jakub Sygnowski, Alexandre Galashov, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu

One particular formalism that studies learning under non-stationary distribution is provided by continual learning, where the non-stationarity is imposed by a sequence of distinct tasks.

Continual Learning Meta-Learning

Overcoming Catastrophic Interference by Conceptors

no code implementations16 Jul 2017 Xu He, Herbert Jaeger

Catastrophic interference has been a major roadblock in the research of continual learning.

Continual Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.