no code implementations • 10 Sep 2024 • Xiaoyu Liang, Jiayuan Yu, Lianrui Mu, Jiedong Zhuang, Jiaqi Hu, Yuchen Yang, Jiangnan Ye, Lu Lu, Jian Chen, Haoji Hu
Concurrently, the visual branch focuses on the selection of significant tokens, refining the attention mechanism to highlight the primary subject.
no code implementations • 26 Aug 2024 • Xu He, Xiaoyu Li, Di Kang, Jiangnan Ye, Chaopeng Zhang, Liyang Chen, Xiangjun Gao, Han Zhang, Zhiyong Wu, Haolin Zhuang
Existing works in single-image human reconstruction suffer from weak generalizability due to insufficient training data or 3D inconsistencies for a lack of comprehensive multi-view knowledge.
no code implementations • 1 Aug 2024 • Shiji Zhou, Lianzhe Wang, Jiangnan Ye, Yongliang Wu, Heng Chang
Generative AI (GenAI), which aims to synthesize realistic and diverse data samples from latent variables or other data modalities, has achieved remarkable results in various domains, such as natural language, images, audio, and graphs.
no code implementations • 8 Jul 2024 • Jiedong Zhuang, Jiaqi Hu, Lianrui Mu, Rui Hu, Xiaoyu Liang, Jiangnan Ye, Haoji Hu
CLIP has achieved impressive zero-shot performance after pre-training on a large-scale dataset consisting of paired image-text data.
no code implementations • 4 Jan 2024 • Heng Chang, Jiangnan Ye, Alejo Lopez Avila, Jinhua Du, Jia Li
Graph Neural Networks (GNNs) have achieved great success in Knowledge Graph Completion (KGC) by modelling how entities and relations interact in recent years.
no code implementations • 30 Nov 2023 • Lianrui Mu, Jianhong Bai, Xiaoxuan He, Jiangnan Ye, Xiaoyu Liang, Yuchen Yang, Jiedong Zhuang, Haoji Hu
Enhancing the domain generalization performance of Face Anti-Spoofing (FAS) techniques has emerged as a research focus.
no code implementations • 7 Aug 2023 • Wenqiang Lai, Qihan Yang, Ye Mao, Endong Sun, Jiangnan Ye
Voice disorders affect millions of people worldwide.