1 code implementation • 16 Oct 2024 • Lingxiao Luo, Bingda Tang, Xuanzhong Chen, Rong Han, Ting Chen
For instance, most VLMs rely on a single method of visual grounding, whereas complex medical tasks demand more versatile approaches.
1 code implementation • 27 Jun 2024 • Chengwen Zhang, Yun Liu, Ruofan Xing, Bingda Tang, Li Yi
With 1K human-object-human motion sequences captured in the real world, we enrich CORE4D by contributing an iterative collaboration retargeting strategy to augment motions to a variety of novel objects.
Human-Object Interaction Detection Human-Object Interaction Generation +2
1 code implementation • 19 Jun 2024 • Yue Huang, Jingyu Tang, Dongping Chen, Bingda Tang, Yao Wan, Lichao Sun, Xiangliang Zhang
Recently, Large Language Models (LLMs) have garnered significant attention for their exceptional natural language processing capabilities.
1 code implementation • 12 Dec 2023 • Lingxiao Luo, Xuanzhong Chen, Bingda Tang, Xinsheng Chen, Rong Han, Chengpeng Hu, Yujiang Li, Ting Chen
In this work, we propose a universal foundation model for medical image analysis that processes images with heterogeneous spatial properties using a unified structure.