1 code implementation • 15 Nov 2024 • Rang Meng, Xingyu Zhang, Yuming Li, Chenguang Ma
Recent work on human animation usually involves audio, pose, or movement map conditions, thereby achieving vivid animation quality.
1 code implementation • 29 Oct 2024 • Bo Jiang, Shaoyu Chen, Bencheng Liao, Xingyu Zhang, Wei Yin, Qian Zhang, Chang Huang, Wenyu Liu, Xinggang Wang
In contrast, Large Vision-Language Models (LVLMs) excel in scene understanding and reasoning.
no code implementations • 7 Oct 2024 • Junming Wang, Xingyu Zhang, Zebin Xing, Songen Gu, Xiaoyang Guo, Yang Hu, Ziying Song, Qian Zhang, Xiaoxiao Long, Wei Yin
In this paper, we propose HE-Drive: the first human-like-centric end-to-end autonomous driving system to generate trajectories that are both temporally consistent and comfortable.
no code implementations • 30 Sep 2024 • Junming Wang, Wei Yin, Xiaoxiao Long, Xingyu Zhang, Zebin Xing, Xiaoyang Guo, Qian Zhang
In this paper, we introduce OccRWKV, an efficient semantic occupancy network inspired by Receptance Weighted Key Value (RWKV).
no code implementations • 29 Jul 2024 • Sribala Vidyadhari Chinta, Zichong Wang, Xingyu Zhang, Thang Doan Viet, Ayesha Kashif, Monique Antoinette Smith, Wenbin Zhang
Artificial intelligence (AI) is rapidly advancing in healthcare, enhancing the efficiency and effectiveness of services across various specialties, including cardiology, ophthalmology, dermatology, and emergency medicine.
1 code implementation • 17 Jul 2024 • Xingyu Zhang, Siyu Zhao, Zeen Song, Huijie Guo, Jianqi Zhang, Changwen Zheng, Wenwen Qiang
Although Fourier analysis offers an alternative to effectively capture reusable and periodic patterns to achieve long-term forecasting in different scenarios, existing methods often assume high-frequency components represent noise and should be discarded in time series forecasting.
no code implementations • 24 May 2024 • Zeen Song, Siyu Zhao, Xingyu Zhang, Jiangmeng Li, Changwen Zheng, Wenwen Qiang
Large-scale pre-trained vision-language models such as CLIP have been widely applied to a variety of downstream scenarios.
no code implementations • 9 May 2024 • Fengyi Gao, Xingyu Zhang, Sonish Sivarajkumar, Parker Denny, Bayan Aldhahwani, Shyam Visweswaran, Ryan Shi, William Hogan, Allyn Bove, Yanshan Wang
In this study, we used statistical analysis and machine learning methods to examine whether rehabilitation exercises can improve patients' post-stroke functional abilities, and to forecast the improvement in functional abilities.
no code implementations • 24 Mar 2024 • Linzhi Wu, Xingyu Zhang, Yakun Zhang, Changyan Zheng, Tiejun Liu, Liang Xie, Ye Yan, Erwei Yin
Lip reading, the process of interpreting silent speech from visual lip movements, has gained rising attention for its wide range of realistic applications.
no code implementations • 16 Apr 2022 • Zijian Jin, Xingyu Zhang, Mo Yu, Lifu Huang
Script knowledge is critical for humans to understand the broad daily tasks and routine activities in the world.
no code implementations • 8 May 2019 • Xingyu Zhang, Namiko Mitarai
We propose a model that includes a simplified regulatory dynamics of the tumbling frequency in individual cells to clarify the role of finite response time.