no code implementations • 3 Jul 2024 • Xia Hou, QiFeng Li, Jian Yang, Tongliang Li, Linzheng Chai, Xianjie Wu, Hangyuan Ji, Zhoujun Li, Jixuan Nie, Jingbo Dun, Wenfeng Song
In this paper, we present a novel framework named R2S that leverages the CoD (Chain of Dialogue) logic to guide large language models (LLMs) in generating knowledge-intensive multi-turn dialogues for instruction tuning.
no code implementations • CVPR 2024 • Wenfeng Song, Xinyu Zhang, Shuai Li, Yang Gao, Aimin Hao, Xia Hou, Chenglizhao Chen, Ning Li, Hong Qin
To date, the quest to rapidly and effectively produce human-object interaction (HOI) animations directly from textual descriptions stands at the forefront of computer vision research.
1 code implementation • CVPR 2024 • Wenfeng Song, Xingliang Jin, Shuai Li, Chenglizhao Chen, Aimin Hao, Xia Hou, Ning Li, Hong Qin
Our MCM-LDM's cornerstone lies in its ability first to disentangle and then intricately weave together motion's tripartite components: motion trajectory, motion content, and motion style.
1 code implementation • 6 Dec 2023 • Mengke Song, Linfeng Li, Dunquan Wu, Wenfeng Song, Chenglizhao Chen
To address this, this paper proposes a new paradigm for saliency ranking, which aims to focus completely on ranking salient objects by their "importance order".
no code implementations • ICCV 2023 • Shuai Li, Sisi Zhuang, Wenfeng Song, Xinyu Zhang, Hejia Chen, Aimin Hao
At the technical level, we explore the local-to-global semantic features of previous and current texts to extract relevant information.
1 code implementation • 20 Jun 2022 • Chenglizhao Chen, Mengke Song, Wenfeng Song, Li Guo, Muwei Jian
Video saliency detection (VSD) aims to quickly locate the most attractive objects/things/patterns in a given video clip.