no code implementations • 26 Dec 2023 • Yuhang Liu, Daowan Peng, Wei Wei, Yuanyuan Fu, Wenfeng Xie, Dangyang Chen
Recently, neural module networks (NMNs) have yielded ongoing success in answering compositional visual questions, especially those involving multi-hop visual and logical reasoning.
no code implementations • 17 May 2023 • Daowan Peng, Wei Wei, Xian-Ling Mao, Yuanyuan Fu, Dangyang Chen
Generalization beyond in-domain experience to out-of-distribution data is of paramount significance in the AI domain.
1 code implementation • 5 May 2022 • Yuhang Liu, Wei Wei, Daowan Peng, Feida Zhu
In recent years, the pre-training-then-fine-tuning paradigm has yielded immense success on a wide spectrum of cross-modal tasks, such as visual question answering (VQA), in which a visual-language (VL) model is first optimized via self-supervised task objectives, e. g., masked language modeling (MLM) and image-text matching (ITM), and then fine-tuned to adapt to downstream task (e. g., VQA) via a brand-new objective function, e. g., answer prediction.
no code implementations • 2 May 2020 • Shuyin Xia, Daowan Peng, Deyu Meng, Changqing Zhang, Guoyin Wang, Zizhong Chen, Wei Wei
The assigned cluster of the points in the stable area is not changed in the current iteration while the points in the annulus area will be adjusted within a few neighbor clusters in the current iteration.