Search Results for author: Hangyu Li

Found 21 papers, 6 papers with code

VMBench: A Benchmark for Perception-Aligned Video Motion Generation

1 code implementation • 13 Mar 2025 • Xinrang Ling, Chen Zhu, Meiqi Wu, Hangyu Li, Xiaokun Feng, Cundian Yang, Aiming Hao, Jiashu Zhu, JiaHong Wu, Xiangxiang Chu

Based on these findings, we introduce VMBench, a comprehensive Video Motion Benchmark that has perception-aligned motion metrics and features the most diverse types of motion.

Motion Generation Video Generation

Knowledge-Enhanced Facial Expression Recognition with Emotional-to-Neutral Transformation

no code implementations • 13 Sep 2024 • Hangyu Li, Yihan Xu, Jiangchao Yao, Nannan Wang, Xinbo Gao, Bo Han

Then, we transform the facial expression representation to a neutral representation by simulating the difference in text embeddings from textual facial expression to textual neutral.

Facial Expression Recognition Facial Expression Recognition (FER)

EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More

no code implementations • 29 Aug 2024 • Kanghao Chen, Guoqiang Liang, Hangyu Li, Yunfan Lu, Lin Wang

This dataset was curated using a robotic arm that traces a consistent non-linear trajectory, achieving spatial alignment precision under 0.03 mm and temporal alignment with errors under 0.01 s for 90% of the dataset.

feature selection Monocular Depth Estimation +2

FlowDreamer: Exploring High Fidelity Text-to-3D Generation via Rectified Flow

no code implementations • 9 Aug 2024 • Hangyu Li, Xiangxiang Chu, Dingyuan Shi, Wang Lin

Recent advances in text-to-3D generation have made significant progress. In particular, with the pretrained diffusion models, existing methods predominantly use Score Distillation Sampling (SDS) to train 3D models such as Neural Radiance Fields (NeRF).

3D Generation NeRF +1

A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness

no code implementations • 2 Aug 2024 • Lutao Jiang, Hangyu Li, Lin Wang

Such a design enables each 3D Gaussian to assimilate the spatial information from other areas and semantic information from texts.

3D Generation Text to 3D

LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction

no code implementations • 8 Jul 2024 • Kanghao Chen, Hangyu Li, Jiazhou Zhou, Zeyu Wang, Lin Wang

However, due to diffusion models' inherent diversity and randomness, it is hardly possible to directly apply them to achieve spatial and temporal consistency for E2V reconstruction.

Denoising Video Reconstruction

A Survey on Self-Evolution of Large Language Models

1 code implementation • 22 Apr 2024 • Zhengwei Tao, Ting-En Lin, Xiancai Chen, Hangyu Li, Yuchuan Wu, Yongbin Li, Zhi Jin, Fei Huang, DaCheng Tao, Jingren Zhou

To address this issue, self-evolution approaches that enable LLMs to autonomously acquire, refine, and learn from experiences generated by the model itself are rapidly growing.

Diversity Survey

Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models

no code implementations • 22 Sep 2023 • Haoyu Gao, Ting-En Lin, Hangyu Li, Min Yang, Yuchuan Wu, Wentao Ma, Yongbin Li

Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts.

Dialogue Understanding

MathChat: Converse to Tackle Challenging Math Problems with LLM Agents

2 code implementations • 2 Jun 2023 • Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields.

Elementary Mathematics Math +1

Attention Paper: How Generative AI Reshapes Digital Shadow Industry?

no code implementations • 26 May 2023 • Qichao Wang, Huan Ma, WenTao Wei, Hangyu Li, Liang Chen, Peilin Zhao, Binwen Zhao, Bo Hu, Shu Zhang, Zibin Zheng, Bingzhe Wu

The rapid development of the digital economy has led to the emergence of various black and shadow internet industries, which pose potential risks that can be identified and managed through digital risk management (DRM) using techniques such as machine learning and deep learning.

Management

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

1 code implementation • NeurIPS 2023 • Shuzheng Si, Wentao Ma, Haoyu Gao, Yuchuan Wu, Ting-En Lin, Yinpei Dai, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li

SpokenWOZ further incorporates common spoken characteristics such as word-by-word processing and reasoning in spoken language.

Vertical Federated Linear Contextual Bandits

no code implementations • 20 Oct 2022 • Zeyu Cao, Zhipeng Liang, Shu Zhang, Hangyu Li, Ouyang Wen, Yu Rong, Peilin Zhao, Bingzhe Wu

In this paper, we investigate a novel problem of building contextual bandits in the vertical federated setting, i.e., contextual information is vertically distributed over different departments.

Multi-Armed Bandits

Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin

1 code implementation • CVPR 2022 • Hangyu Li, Nannan Wang, Xi Yang, Xiaoyu Wang, Xinbo Gao

In this paper, we learn an Adaptive Confidence Margin (Ada-CM) to fully leverage all unlabeled data for semi-supervised deep facial expression recognition.

Facial Expression Recognition Facial Expression Recognition (FER)

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking

no code implementations • 1 Jun 2021 • Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si, Xiaodan Zhu

Existing dialog state tracking (DST) models are trained with dialog data in a random order, neglecting rich structural information in a dataset.

Ranked #1 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.1 (using extra training data)

dialog state tracking Multi-domain Dialogue State Tracking
