1 code implementation • 13 Mar 2025 • Xinrang Ling, Chen Zhu, Meiqi Wu, Hangyu Li, Xiaokun Feng, Cundian Yang, Aiming Hao, Jiashu Zhu, JiaHong Wu, Xiangxiang Chu
Based on these findings, we introduce VMBench--a comprehensive Video Motion Benchmark that has perception-aligned motion metrics and features the most diverse types of motion.
no code implementations • 22 Nov 2024 • Dingyuan Shi, Yong Wang, Hangyu Li, Xiangxiang Chu
In this paper, we propose Denoised Distribution Estimation (DDE), a novel method for credit assignment.
no code implementations • 13 Sep 2024 • Hangyu Li, Yihan Xu, Jiangchao Yao, Nannan Wang, Xinbo Gao, Bo Han
Then, we transform the facial expression representation to a neutral representation by simulating the difference in text embeddings from textual facial expression to textual neutral.
Facial Expression Recognition
Facial Expression Recognition (FER)
no code implementations • 29 Aug 2024 • Kanghao Chen, Guoqiang Liang, Hangyu Li, Yunfan Lu, Lin Wang
This dataset was curated using a robotic arm that traces a consistent non-linear trajectory, achieving spatial alignment precision under 0. 03mm and temporal alignment with errors under 0. 01s for 90% of the dataset.
no code implementations • 9 Aug 2024 • Hangyu Li, Xiangxiang Chu, Dingyuan Shi, Wang Lin
In particular, with the pretrained diffusion models, existing methods predominantly use Score Distillation Sampling (SDS) to train 3D models such as Neural RaRecent advances in text-to-3D generation have made significant progress.
no code implementations • 2 Aug 2024 • Lutao Jiang, Hangyu Li, Lin Wang
Such a design enables each 3D Gaussian to assimilate the spatial information from other areas and semantic information from texts.
no code implementations • 8 Jul 2024 • Kanghao Chen, Hangyu Li, Jiazhou Zhou, Zeyu Wang, Lin Wang
However, due to diffusion models' inherent diversity and randomness, it is hardly possible to directly apply them to achieve spatial and temporal consistency for E2V reconstruction.
1 code implementation • 22 Apr 2024 • Zhengwei Tao, Ting-En Lin, Xiancai Chen, Hangyu Li, Yuchuan Wu, Yongbin Li, Zhi Jin, Fei Huang, DaCheng Tao, Jingren Zhou
To address this issue, self-evolution approaches that enable LLM to autonomously acquire, refine, and learn from experiences generated by the model itself are rapidly growing.
no code implementations • CVPR 2024 • Guoqiang Liang, Kanghao Chen, Hangyu Li, Yunfan Lu, Lin Wang
To this end, we propose a real-world (indoor and outdoor) dataset comprising over 30K pairs of images and events under both low and normal illumination conditions.
no code implementations • 16 Jan 2024 • Wei Jiang, Yongqi Zhai, Hangyu Li, Ronggang Wang
This short paper describes our method for the track of image compression.
no code implementations • 22 Sep 2023 • Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Yongbin Li
Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts.
2 code implementations • 2 Jun 2023 • Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang
Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields.
no code implementations • 26 May 2023 • Qichao Wang, Huan Ma, WenTao Wei, Hangyu Li, Liang Chen, Peilin Zhao, Binwen Zhao, Bo Hu, Shu Zhang, Zibin Zheng, Bingzhe Wu
The rapid development of digital economy has led to the emergence of various black and shadow internet industries, which pose potential risks that can be identified and managed through digital risk management (DRM) that uses different techniques such as machine learning and deep learning.
1 code implementation • NeurIPS 2023 • Shuzheng Si, Wentao Ma, Haoyu Gao, Yuchuan Wu, Ting-En Lin, Yinpei Dai, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li
SpokenWOZ further incorporates common spoken characteristics such as word-by-word processing and reasoning in spoken language.
2 code implementations • 14 Apr 2023 • Minghao Li, Yingxiu Zhao, Bowen Yu, Feifan Song, Hangyu Li, Haiyang Yu, Zhoujun Li, Fei Huang, Yongbin Li
(2) How can we enhance LLMs' ability to utilize tools?
no code implementations • 20 Oct 2022 • Zeyu Cao, Zhipeng Liang, Shu Zhang, Hangyu Li, Ouyang Wen, Yu Rong, Peilin Zhao, Bingzhe Wu
In this paper, we investigate a novel problem of building contextual bandits in the vertical federated setting, i. e., contextual information is vertically distributed over different departments.
1 code implementation • CVPR 2022 • Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao
In this paper, we learn an Adaptive Confidence Margin (Ada-CM) to fully leverage all unlabeled data for semi-supervised deep facial expression recognition.
Facial Expression Recognition
Facial Expression Recognition (FER)
no code implementations • ACL 2021 • Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si, Xiaodan Zhu
Existing dialog state tracking (DST) models are trained with dialog data in a random order, neglecting rich structural information in a dataset.
no code implementations • 1 Jun 2021 • Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si, Xiaodan Zhu
Existing dialog state tracking (DST) models are trained with dialog data in a random order, neglecting rich structural information in a dataset.
Ranked #1 on
Multi-domain Dialogue State Tracking
on MULTIWOZ 2.1
(using extra training data)
no code implementations • ACL 2020 • Yinpei Dai, Hangyu Li, Chengguang Tang, Yongbin Li, Jian Sun, Xiaodan Zhu
Existing end-to-end dialog systems perform less effectively when data is scarce.