no code implementations • 9 Dec 2024 • Qifan Yu, Zhebei Shen, Zhongqi Yue, Yang Wu, Wenqiao Zhang, Yunfei Li, Juncheng Li, Siliang Tang, Yueting Zhuang
Instruction tuning fine-tunes pre-trained Multi-modal Large Language Models (MLLMs) to handle real-world tasks.
no code implementations • 30 Oct 2024 • Shuzhen Li, Yuxin Chen, Xuesong Chen, Ruiyang Gao, Yupeng Zhang, Chao Yu, Yunfei Li, Ziyi Ye, Weijun Huang, Hongliang Yi, Yue Leng, Yi Wu
However, reliable BCG-based sleep staging is challenging due to the limited sleep monitoring data available for BCG.
no code implementations • 10 Oct 2024 • Dongliang Zhang, Yunfei Li, Jiaran Zhou, Yuezun Li
These varying qualities diversify the pattern of forgery traces, significantly increasing the difficulty of DeepFake detection.
1 code implementation • 18 Jun 2024 • Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, Yunfei Li, Siliang Tang
Representation learning on text-attributed graphs (TAGs) is vital for real-world applications, as they combine semantic textual and contextual structural information.
no code implementations • 22 Apr 2024 • Yunfei Li, Yuezun Li, Xin Wang, Baoyuan Wu, Jiaran Zhou, Junyu Dong
Our method features four major improvements: \ding{182} we describe a new texture-aware branch that effectively captures subtle manipulation traces with a Diversiform Pixel Difference Attention module.
no code implementations • 6 Mar 2024 • Yunfei Li, Yiting Luo, Weiqiang Tan, ChunGuo Li, Shaodan Ma, Guanghua Yang
To surmount the intractable nonlinear and non-convex objective function inherent in the problem, we introduce a variational Bayesian learning-based framework that enables the joint optimization of localization, path loss exponent, and reference noise parameters by leveraging an effective approximation to the true posterior distribution.
no code implementations • 2 Mar 2024 • Yunfei Li, Yiting Luo, Xianda Wu, Zheng Shi, Shaodan Ma, Guanghua Yang
As opposed to existing works that treat channel estimation and localization independently, this paper exploits the intrinsic coupling and nonlinear relationships between the channel parameters and user location for enhancement of both localization and channel reconstruction.
no code implementations • 7 Oct 2023 • Jiayu Chen, Zelai Xu, Yunfei Li, Chao Yu, Jiaming Song, Huazhong Yang, Fei Fang, Yu Wang, Yi Wu
In this work, we present a novel subgame curriculum learning framework for zero-sum games.
2 code implementations • 9 Jan 2023 • Chao Yu, Xinyi Yang, Jiaxuan Gao, Jiayu Chen, Yunfei Li, Jijia Liu, Yunfei Xiang, Ruixin Huang, Huazhong Yang, Yi Wu, Yu Wang
Simply waiting for every robot being ready for the next action can be particularly time-inefficient.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 24 Jun 2022 • Yunfei Li, Tian Gao, Jiaqi Yang, Huazhe Xu, Yi Wu
It has been a recent trend to leverage the power of supervised learning (SL) towards more effective reinforcement learning (RL) methods.
1 code implementation • 12 Apr 2022 • Yunfei Li, Tao Kong, Lei LI, Yi Wu
Can a robot autonomously learn to design and construct a bridge from varying-sized blocks without a blueprint?
no code implementations • 13 Dec 2021 • Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu
A ubiquitous requirement in many practical reinforcement learning (RL) applications, including medical treatment, recommendation system, education and robotics, is that the deployed policy that actually interacts with the environment cannot change frequently.
no code implementations • 10 Oct 2021 • Yuyang Zhang, Dik Hin Leung, Min Guo, Yijia Xiao, Haoyue Liu, Yunfei Li, Jiyuan Zhang, Guan Wang, Zhen Chen
Matrix multiplication is the bedrock in Deep Learning inference application.
no code implementations • 5 Aug 2021 • Yunfei Li, Tao Kong, Lei LI, Yifeng Li, Yi Wu
In this task, the robot needs to first design a feasible bridge architecture for arbitrarily wide cliffs and then manipulate the blocks reliably to construct a stable bridge according to the proposed design.
1 code implementation • ICLR 2021 • Yunfei Li, Yilin Wu, Huazhe Xu, Xiaolong Wang, Yi Wu
We propose a novel learning paradigm, Self-Imitation via Reduction (SIR), for solving compositional reinforcement learning problems.
no code implementations • 25 Sep 2019 • Tao Du, Yunfei Li, Jie Xu, Andrew Spielberg, Kui Wu, Daniela Rus, Wojciech Matusik
Over the last decade, two competing control strategies have emerged for solving complex control tasks with high efficacy.
no code implementations • 9 May 2019 • Xiaoqin Zhang, Yunfei Li, Huimin Ma, Xiong Luo
Pretraining reinforcement learning methods with demonstrations has been an important concept in the study of reinforcement learning since a large amount of computing power is spent on online simulations with existing reinforcement learning algorithms.